Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/82/7e3399ff93089af48e91d5b34147b7/.command.sh
Downloading: s3://natera-platform-sandbox/pipeline-resources/AIH/rna/GRCh38/ensembl/Homo_sapiens.GRCh38.102.cdna.all.fa.gz
Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/82/7e3399ff93089af48e91d5b34147b7/.command.run
==> STAGING COMPLETE (3 inputs)
[build] loading fasta file Homo_sapiens.GRCh38.102.cdna.all.fa.gz
[build] k-mer length: 31
[build] warning: clipped off poly-A tail (longer than 10) from 1490 target sequences
[build] warning: replaced 100005 non-ACGUT characters in the input sequence with pseudorandom nucleotides
KmerStream::KmerStream(): Start computing k-mer cardinality estimations (1/2)
KmerStream::KmerStream(): Start computing k-mer cardinality estimations (1/2)
KmerStream::KmerStream(): Finished
CompactedDBG::build(): Estimated number of k-mers occurring at least once: 112003378
CompactedDBG::build(): Estimated number of minimizer occurring at least once: 27349942
CompactedDBG::filter(): Processed 329784767 k-mers in 194360 reads
CompactedDBG::filter(): Found 111935243 unique k-mers
CompactedDBG::filter(): Number of blocks in Bloom filter is 765650
CompactedDBG::construct(): Extract approximate unitigs (1/2)
CompactedDBG::construct(): Extract approximate unitigs (2/2)
CompactedDBG::construct(): Closed all input files
CompactedDBG::construct(): Splitting unitigs (1/2)
CompactedDBG::construct(): Splitting unitigs (2/2)
CompactedDBG::construct(): Before split: 1041595 unitigs
CompactedDBG::construct(): After split (1/1): 1041596 unitigs
CompactedDBG::construct(): Unitigs split: 1786
CompactedDBG::construct(): Unitigs deleted: 0
CompactedDBG::construct(): Joining unitigs
CompactedDBG::construct(): After join: 983294 unitigs
CompactedDBG::construct(): Joined 58712 unitigs
[build] building MPHF
[build] creating equivalence classes ...
[build] target de Bruijn graph has k-mer length 31 and minimizer length 23
[build] target de Bruijn graph has 983294 contigs and contains 112035259 k-mers