Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/6c/bb7cc1d07e19f8d2500aee7658ea87/.command.sh
Downloading: s3://natera-platform-sandbox/pipeline-resources/AIH/rna/GRCh38/ensembl/Homo_sapiens.GRCh38.102.cdna.all.fa.gz
Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/6c/bb7cc1d07e19f8d2500aee7658ea87/.command.run
==> STAGING COMPLETE (3 inputs)
[build] loading fasta file Homo_sapiens.GRCh38.102.cdna.all.fa.gz
[build] k-mer length: 31
[build] warning: clipped off poly-A tail (longer than 10)
from 1490 target sequences
[build] warning: replaced 100005 non-ACGUT characters in the input sequence
with pseudorandom nucleotides
KmerStream::KmerStream(): Start computing k-mer cardinality estimations (1/2)
KmerStream::KmerStream(): Start computing k-mer cardinality estimations (1/2)
KmerStream::KmerStream(): Finished
CompactedDBG::build(): Estimated number of k-mers occurring at least once: 112003378
CompactedDBG::build(): Estimated number of minimizer occurring at least once: 27349942
CompactedDBG::filter(): Processed 329784767 k-mers in 194360 reads
CompactedDBG::filter(): Found 111935813 unique k-mers
CompactedDBG::filter(): Number of blocks in Bloom filter is 765650
CompactedDBG::construct(): Extract approximate unitigs (1/2)
CompactedDBG::construct(): Extract approximate unitigs (2/2)
CompactedDBG::construct(): Closed all input files
CompactedDBG::construct(): Splitting unitigs (1/2)
CompactedDBG::construct(): Splitting unitigs (2/2)
CompactedDBG::construct(): Before split: 1041391 unitigs
CompactedDBG::construct(): After split (1/1): 1041391 unitigs
CompactedDBG::construct(): Unitigs split: 1791
CompactedDBG::construct(): Unitigs deleted: 0
CompactedDBG::construct(): Joining unitigs
CompactedDBG::construct(): After join: 983294 unitigs
CompactedDBG::construct(): Joined 58503 unitigs
[build] building MPHF
[build] creating equivalence classes ...
[build] target de Bruijn graph has k-mer length 31 and minimizer length 23
[build] target de Bruijn graph has 983294 contigs and contains 112035259 k-mers