Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make a file with unique protein sequences and associated IDs #60

Open
inodb opened this issue Aug 26, 2022 · 0 comments
Open

Make a file with unique protein sequences and associated IDs #60

inodb opened this issue Aug 26, 2022 · 0 comments
Labels
discussion enhancement New feature or request

Comments

@inodb
Copy link
Member

inodb commented Aug 26, 2022

Maybe we can create a file that has one row per unique sequence and associated IDs? Each column can e.g. be a database so you get something like

unique protein sequence ensembl_grch37_vxx_protein ensembl_grch37_vxx_transcript ensembl_grch38_vxx_protein ensembl_grch38_vxx_transcript uniprot
RRRRR ENSPxxx ENSTyyyyy ENSPxxx ENSTyyyyy Pzzzz

We can then reuse this file for uniprot, oncokb and hotspot transcript assignments. It also allows to easily add other potential protein resources

@inodb inodb added enhancement New feature or request discussion labels Aug 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discussion enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant