-
Notifications
You must be signed in to change notification settings - Fork 372
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merging of previously sorted BAM files #1990
Comments
MergeSamFiles should merge multiple sorted files into a single file with consistent sorting, IF the files are all sorted the same way. This should not require additional sorting afterwards. There is a log message when it identifies this case: "Input files are in same order as output so sorting to temp directory is not needed." There is also an argument SORT_ORDER which defaults to coordinate sorting. If your files are in query name order you should specify that with this argument or they will be resorted afterwards. If you're not seeing that message or if there is a conflict with SORT_ORDER you may have to specify ASSUME_SORTED=true but this can be dangerous. |
I tried it on three files sorted by query (
I guess this error is because the files have different headers, no?
And in the cases when it needs to sort, is it threaded? |
When sorting several pre-sorted BAM files (either by query or coordinate) with different headers, is it possible to merge them with
MergeSamFiles
keeping the original sorting?I am working with very large BAM files and sorting the BAM file after merging would take a considerable amount of time!
Thanks,
The text was updated successfully, but these errors were encountered: