Skip to content

An extremely rudimentary gene-disease association search-tool, developed during a short educational project.

License

Notifications You must be signed in to change notification settings

jstrobaek/gdads

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gDADs

The central software of gDADs (Gene-Disease Association Database Search-tool) is written in Python script. It seeks to streamline the process of enrichment analysis of a set of genes awarded from a differential expression study.

To reference gDADs see the following faux article.

Getting Started

These instructions will get you a copy of the software, and associated files, on your local machine for testing purposes (see section Running tests).

Disclaimer: gDADs was developed on a Linux system (Ubuntu 18.04.4 LTS) and should preferably be run on a Linux distribution of similar architecture. It might work on macOS, but is not compatible with Windows (as of version 1.0.1).

Prerequisites

Regrettably, no Anaconda environment file can be supplied at the moment, due to local update issues.

The command-line executable was written in Python version 3.6.9, and only uses standard modules in the current iteration (version 1.0.1).

It was intended for GOATOOLS to be implemented in the software, but this fell outside the scope of the project. Instead, a rudimentary function was added to gDADs version 1.0.1, tested in subsection Running tests.

Installation

Start by either cloning the repository using git (code below), or downloading it directly from this page.

git clone https://github.com/yogogoba/gdads.git

This should be enough to run gDADs by calling the script located in the sub-directory /scr/. If it needs to be turned into an executable, use:

chmod /path/to/scr/gDADs +x

To link to your user bin folder, and run it as a command-line command:

ln -s ~/path/to/scr/gDADs ~/bin

To make sure that it is working, run it as follows to get some further information (where the first example requires linking it to your ~/bin):

gDADs -h

or,

~/path/to/scr/gDADs -h

or,

python3 ~/path/to/scr/gDADs -h

Running tests

Testing can be done by utilizing the files contained within /data/test_in/. These hold example IDs, and should be specified with [-i <input file>], potentially using [-I]. The latter is needed to specify ID format, unless it is of default type ("gene_symbol"). The attained output files can then be compared to the files found in /data/test_out/.

Example 1

Test basic functionality.

~/path/to/scr/gDADs -i ~/path/to/data/test_in/deg_entrez.txt -I entrez_id

Example 2

Test GO term function, where domain type is specified with [-g <ontology_domain>]. See [-h] for alternatives. By default this generates the regular gene-disease association output as well, but this can be silenced using [-silence_associations].

~/path/to/scr/gDADs -i ~/path/to/data/test_in/deg_symbol.txt -g "go_process"

Example 3

Test search function.

~/path/to/scr/gDADs -s "hear"

Built Using

The database source files were trimmed to limit the scope and size, and enable upload to GitHub. They were reduced by first limiting the curated_gene_disease_associations.tsv by only including genes which were associated with inflammation. This was then further trimmed by matching the leftover IDs with those of the other source files. These files can be found within /data/db_table_data/.

Author(s)

  • Joel Ströbaek

License

See LICENCE file for information.

About

An extremely rudimentary gene-disease association search-tool, developed during a short educational project.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages