-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
minimap error #1
Comments
Hi, Do you have a more precise list of barcodes? It would be great if you can narrow down this list to a few thousand. Not only for saving memory but also for better demultiplexing results. You can get a precise list of barcodes if you have the 10X genomics short-read sequencing data. |
No, unfortunately, I don't have the 10X data. That is why I am using all available barcodes to figure out the used ones. I see some of the threads got failed (4 out of 24). I re-ran the failed commands and they got finished successfully. Not sure what is next after having the missing paf files? |
If you don't know the barcodes, you'd better ask the person who performed the experiment. Usually, a subset of the 3 million barcodes was used in one experiment. By the way, are you sure the barcode upstream sequence is If you finished AmpBinner successfully, you will see 3 output files: |
Hi @fangli08, WRT to the barcode upstream sequence, I think the sequence should be the one that you've mentioned in your comment. I see more reads identified with that sequence compared with an alternative sequence identified by running some code on the reads (which seems to be the second half of the sequence in your comment).
Using the alternative sequence, ACACTCTTTCCCTACACGACGCTCTTCCGATCT.
Please let me know your thoughts on that. |
That looks correct. |
Hi @fangli08 , Some of the 10X barcodes I used in running "ampBinner_10X.py" were duplicates and so when I split the list, they randomly ended up in two different files. Now, I see a different number of reads for the duplicate barcodes. Any thoughts on that?
|
When binning the reads, AmpBinner will consider the best matched barcode and the second best matched barcode, to exclude potential misclasification due to sequencing error of the long reads. To confidently assign a read to a barcode, the following two criteria is required:
For example, if a barcode list has the following two barcodes: barcode-1: AACCATGAGCAGGTCA If a read has a sequence of Therefore, all the barcodes should be supplied in one file. I thought you split the list simply to test if the tool works in your environment. Since there are too many barcodes, which use too much resources, I think you can narrow down this list by collecting all barcodes from the |
Hi,
I am trying to use ampBinner_10X.py on my fastq and getting errors on the alignment step. Could you please comment on that?
Thanks!
Running code:
Error:
The text was updated successfully, but these errors were encountered: