Downloading: s3://natera-platform-sandbox/pipeline-resources/ngi-igenomes/igenomes/Homo_sapiens/GATK/GRCh38/Sequence/WholeGenomeFasta/Homo_sapiens_assembly38.fasta
Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/f1/86f56559321f5b6238dd1b1f59c667/.command.sh
Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/8f/206c01ca9552d4dd451be025e684a4/positive_somatic_control_1.snv_indel.merged.vcf.gz
Downloading: s3://natera-platform-sandbox/pipeline-resources/ngi-igenomes/igenomes/Homo_sapiens/GATK/GRCh38/Sequence/WholeGenomeFasta/Homo_sapiens_assembly38.fasta.fai
Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/6f/3b8f493c4bbd324b6d78581c167e95/FFPE_HD789_01_p2_gDNA_01_A23MHWWLT4_1.recalibrated.bam
Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/8f/206c01ca9552d4dd451be025e684a4/positive_somatic_control_1.snv_indel.merged.vcf.gz.tbi
Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/f1/86f56559321f5b6238dd1b1f59c667/.command.run
Downloading: s3://natera-rnd-pltf-dev-nextflow-scratch-01/work/6f/3b8f493c4bbd324b6d78581c167e95/FFPE_HD789_01_p2_gDNA_01_A23MHWWLT4_1.recalibrated.bam.bai
==> STAGING COMPLETE (8 inputs)
This is WhatsHap 2.8 running under Python 3.9.23
[W::hts_idx_load3] The index file is older than the data file: FFPE_HD789_01_p2_gDNA_01_A23MHWWLT4_1.recalibrated.bam.bai
[W::hts_idx_load3] The index file is older than the data file: FFPE_HD789_01_p2_gDNA_01_A23MHWWLT4_1.recalibrated.bam.bai
[E::idx_find_and_load] Could not retrieve index file for 'input_filtered.vcf.gz'
[E::idx_find_and_load] Could not retrieve index file for 'input_filtered.vcf.gz'
[E::idx_find_and_load] Could not retrieve index file for 'input_filtered.vcf.gz'
Working on 2 samples from 2 families
WARNING: Skipping duplicated position 1612203 on chromosome 'chr6' Hiding further warnings of this type, use --debug to show
# Working on contig chr6 in individual FFPE_HD789_01_p2_gDNA_01_A23MHWWLT4_1
Found 6750 usable heterozygous variants (0 skipped due to missing genotypes)
[W::hts_idx_load3] The index file is older than the data file: FFPE_HD789_01_p2_gDNA_01_A23MHWWLT4_1.recalibrated.bam.bai
Number of supplementary alignments: 0
Number of non-singleton groups: 941395
Skipped 0 groups
Found 3216286 reads covering 6750 variants
Kept 1267225 reads that cover at least two variants each
Selected 16964 most phase-informative reads covering 4971 variants
Best-case phasing would result in 1012 non-singleton phased blocks (0 singletons).
Phasing 1 sample by solving the MEC problem ...
Largest block contains 830 variants (16.7% of accessible variants) between position 29722917 and 31357755
# Working on contig chr6 in individual reference-NA12878
Found 172 usable heterozygous variants (0 skipped due to missing genotypes)
WARNING: Sample 'reference-NA12878' not found in any BAM/CRAM file.
Found 0 reads covering 0 variants
Kept 0 reads that cover at least two variants each
Selected 0 most phase-informative reads covering 0 variants
Best-case phasing would result in 0 non-singleton phased blocks (0 singletons).
Phasing 1 sample by solving the MEC problem ...
No. of phased blocks: 0
WARNING: Ignoring existing phasing information found in input VCF (PS tag exists).
# Resource usage
Maximum memory usage: 5.330 GB
Time spent reading BAM/CRAM: 678.3 s
Time spent parsing VCF: 0.2 s
Time spent selecting reads: 113.4 s
Time spent phasing: 20.1 s
Time spent writing VCF: 0.4 s
Time spent finding components: 0.1 s
Time spent on rest: 18.0 s
Total elapsed time: 830.4 s