You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
run a compute
It currently queries BigQuery directly and submits a pipelines directly to GCP Life Sciences pipeline.
We want to convert the script so steps 1 and 3 use the equivalent GA4GH APIs.
Creating issue here to track the following
Modify FASPScript1.py to submit jobs via WES instead of Life Sciences Pipeline
Make BigQuery datasets searchable through Discovery Search
1000Genomes
COPDGene
GECCO
Modify FASPScript1.py to use Discovery Search API to search the data
Create scrambled/fake versions of COPDGene, GECCO
Put DiscoverySearch datasets under authentication and authorization – stretch?
The text was updated successfully, but these errors were encountered:
8/11 session WES
Worked through submitting a WES job (MD5 checksum) on a file specified as a URL obtained from DRS. See checksum.wdl in FASPclient. For debugging purposes we ended up doing this with a relatively small file. Submitted everything via postman. Discovery
Identified 1000 Genomes views in BigQuery which link subject and specimen data with ids for .bam files. These work as DRS ids for BioDataCatalyst DRS Server. The views were created as queries on a table which is an import of PFB (Avro) from BDC. Due to Presto's preferences for working with views created a table to be used for Discovery Search.
Additional tasks based on 8/11 session
Give Ian account in DNAStack WES - Max
Write python call to WES Server DNAStack - Ian
Add 1000 Genomes dataset to GA4GH Presto adapter - Jonathan
Check the MD5 workflow works on a larger file i.e. BAM - Ian
Created onek_genomes dataset with lower-cased name per Jonathan's request. Presto needs lower case names. The ssd_drs table is also lower-cased. Granted BigQuery Viewer role on the dataset to the DNAStask service accounts used for Presto.
https://github.com/ianfore/FASPclient has example python code to
It currently queries BigQuery directly and submits a pipelines directly to GCP Life Sciences pipeline.
We want to convert the script so steps 1 and 3 use the equivalent GA4GH APIs.
Creating issue here to track the following
The text was updated successfully, but these errors were encountered: