-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multi-threading #7
Comments
For others stumbling across this issue, here's a little snakemake template that mimmicks multi-threading by running badread multiple times and concatenating the fastq files at the end.
Just saw that there is already a wiki entry for doing exactly the same thing in bash. Anyway, maybe this is still useful for someone. |
Before anyone do the same thing that I did and follow blindly W-L's answer, note that doing so will in some occasion generate the same read name multiple times. This might affect your pipeline, especially if you're cleaning your reads later since minimap2 do not care if multiple reads with the same name appear, and will just map them individually, leading to secondary / chimeric alignments. |
I know but I'm not as lucky with the lottery sadly ... |
Is your feature request related to a problem? Please describe.
The simulator is very slow when it comes to
Both of these steps should have straighforward data parallelism
Describe the solution you'd like
Multithreading of the two steps (and possible others?)
Describe alternatives you've considered
Adding a program command to prepare the reference contigs and pickle the results so rerunning won't be slow. That won't really resolve the read generation speed thu
Additional context
I am building a wrapper around Badread for transcriptomic reads. It's still in the design stage. I plan to code the multithreading described above on a separate branch and make PR
The text was updated successfully, but these errors were encountered: