Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using Isoquant for single-cell data and de-duplications #268

Open
Kyung-TaeLee opened this issue Dec 13, 2024 · 1 comment
Open

Using Isoquant for single-cell data and de-duplications #268

Kyung-TaeLee opened this issue Dec 13, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@Kyung-TaeLee
Copy link

Hi, thank you for providing such a wonderful tool and keeping constant updates.

I am currently trying to use Isoquant for single-cell Nanopore data (10X cDNA library -> genomic ligation sequencing).
I ran Flexiplex for barcode and UMI identification, minimap2 for alignment, and successfully ran Isoquant for quantification. I tried to do de-duplication myself using "OUT.read_assignments.tsv.gz" file but It is not clear to me which reads are actually used for quantification because even if read is assigned to transcript, there are many classification such as unique, inconsistent, etc.
Could you tell me which reads with certain classification are actually quantified for specific feature?

And you mentioned in other post that you have personal de-duplication code that you could share. Could you also share that script by e-mail?

Thank you once again and have a great day.

Best regards,
KyungTae Lee

@andrewprzh andrewprzh added the enhancement New feature or request label Dec 13, 2024
@andrewprzh
Copy link
Collaborator

Dear @Kyung-TaeLee

Thank you for the feedback!

Yes, I work on our own barcode calling and UMI deduplication.
In brief, it is currently in sc_v3 branch. It can be run the same way as IsoQuant with couple additional parameters, such as

Single-cell/spatial-related options::
  --mode {bulk,tenX,double}, -m {bulk,tenX,double}
                        IsoQuant modes: bulk, tenX, double; default:bulk
  --barcode_whitelist BARCODE_WHITELIST
                        file with barcode whitelist for barcode calling
  --barcoded_reads BARCODED_READS [BARCODED_READS ...]
                        file with barcoded reads; barcodes will be called automatically if not provided

Feel free to email me for further details - the pipeline is still in progress, but I already used it in several projects.

Best
Andrey

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants