-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Long read bam processing #43
Comments
Hi, @SHuang-Broad It's known to be slow(potentially with large memory consumption) using default settings with htslib when reading long read bams. (Due to calibration of qualities) If possible, you can provide a subset of the pileup of the BAM file with "—Pileup" argument as the input(which bypassed BAM input processing step) Let me know if this workaround resolved your issue. |
Thanks for pointing me in that direction! One follow-up question: how do I generate the pileup file, with In the meantime, I also run the following experiment:
It took approximately 4 hours using 10k sites (1kg and hdp). Thanks! |
One more question out of curiosity: when you mention the slowness possibly comes from htslib parsing the long reads due to calibration of qualities, I wonder
Thank you! |
"—OutputPileup" is used for debugging purposes in VB2 when the input file is a bam file. I don't think there will be a difference when the linked htslib in samtools and VB2 are the same. I think you procedure is worth exploring, but you need to disable the quality calibration argument as disscussed in the referenced issue ticket #32 or refer to http://www.htslib.org/doc/samtools-mpileup.html BAQ related options. |
Hi Fan, Sorry, I forgot to provide an update. I ran an experiment with different options, and here's what I found
So based on this little experiment,
The WDL pipeline I used is coded here. Thanks, |
@SHuang-Broad Thank you very much! This profiling is useful for future reference. |
Hi,
We're experimenting with
VerifyBamID2
on long reads contamination estimations.The test sample is a 30X PacBio CCS BAM aligned to HG38.
We found that the process is taking a bit longer than expected:
I know this is an off-label use, so would love any suggestions on how to carry out this experiment (including submitting PRs).
Thank you!
Steve
The text was updated successfully, but these errors were encountered: