-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
initial pipeline pseudo PR #17
Draft
vagkaratzas
wants to merge
163
commits into
TEMPLATE
Choose a base branch
from
dev
base: TEMPLATE
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+7,972
−151
Draft
Changes from 161 commits
Commits
Show all changes
163 commits
Select commit
Hold shift + click to select a range
7a768c5
modules installed
vagkaratzas 821f73e
input samplesheet update to fasta
vagkaratzas 84f8965
clustering modules chained
vagkaratzas e5bc24f
nf-core createtsv test init
vagkaratzas 1802206
debugged targetdb copy by staging
vagkaratzas 40e02f2
syncing samples by meta, module diffs
vagkaratzas d362d9e
EXECUTE_CLUSTERING subworkflow done
vagkaratzas af91b89
Merge pull request #1 from vagkaratzas/createtsv-tests
vagkaratzas 0c56473
chunk_clusters nf and py init
vagkaratzas ace2dbd
cluster chunkcs complete
vagkaratzas 65381a7
Merge pull request #2 from vagkaratzas/chunking_clusters
vagkaratzas b1309f5
seqera container for local python module
vagkaratzas a84dbe6
clustering parameters added, chunking fasta instead of tsv to then fe…
vagkaratzas 2e46012
removed pandas from chunking script, optimised for memory
vagkaratzas f2983bc
msa input formatted, famsa run properly
vagkaratzas 6f52683
meta updated to chunks level, hmmbuild chained to workflow
vagkaratzas 2d48faf
HMMER_HMMSEARCH chained, inputs combined successfully on meta id, met…
vagkaratzas 54f7a05
minor publish path change
vagkaratzas 5d9e7e9
hmmsearch evalue cutoff and output full MSA by default
vagkaratzas a39d649
generate_families converted into subworkflow
vagkaratzas e203af3
mafft alternative alignment tool
vagkaratzas e24a63b
hmmbuild updated
vagkaratzas 4c639f0
added support for .gz input fasta
vagkaratzas b4a054f
Preparing input channel to extract family representative sequences fo…
vagkaratzas 2b6b4a8
added clipkit module, fixing some linting warnings, updated subworkflows
vagkaratzas c136435
extract family reps script
vagkaratzas 5e068d1
removed prokka module
vagkaratzas b0ee92d
export fam to rep mapping
vagkaratzas 82742e7
EXTRACT_FAMILY_REPS publishDir
vagkaratzas 8b444df
Merge pull request #4 from vagkaratzas/prokka
vagkaratzas ea8e300
custom multiqc with family metadata
vagkaratzas dfaae41
added label and when blocks to custom modules to fix linting warnings
vagkaratzas 53ea42f
input samplesheet and test data hosted in nf-core test-datasets repo,…
vagkaratzas eb08a3c
updated mafft module to mafft-align
vagkaratzas 5283390
subworkflows updated
vagkaratzas 18be6da
updated readme and citations doc
vagkaratzas f7bc3f5
usage.md updated
vagkaratzas fb54976
clustering outputs updated and written in output.md
vagkaratzas c9ac121
aligners outputs updated
vagkaratzas 1aa92cd
hmmer outputs updated + documentation
vagkaratzas 619f701
gap_threshold aprameter for clipkit
vagkaratzas 39a037f
mmseqs suite version bump
vagkaratzas e95a1ff
clip_ends only local module
vagkaratzas cbd1292
Merge pull request #5 from vagkaratzas/clip-ends-only
vagkaratzas 86a950b
nf-core subworkflows update
vagkaratzas 2df0b34
hmmsearch results filtering by length, fasta instead of stockholm ref…
vagkaratzas 1a2d333
init redundancy_check subworkflow
vagkaratzas 98fb26c
execute_clustering subworkflow updated and reused in redundancy remov…
vagkaratzas 8ae6410
TODOs added to continue with redundancy mechanisms
vagkaratzas b42db21
remove_redundant_seqs module done, sequence align converted to subwor…
vagkaratzas 7aa1e3c
concat_hmms module
vagkaratzas 21c3d19
CONCAT_HMMS module, HMMER_HMMSEARCH for family redundancy checking, R…
vagkaratzas 39afbb9
remove_self_hits function
vagkaratzas 5ddbf77
filter_by_length
vagkaratzas 69b054e
remove_redundant_fams module done and container updated to pandas
vagkaratzas 480bc2a
filtered sequence names updated, results folder architecture updated
vagkaratzas e5e28dc
Merge pull request #6 from vagkaratzas/remove-family-redundancy
vagkaratzas 178ff6b
Merge pull request #7 from vagkaratzas/redundancy_check
vagkaratzas 2f0e75c
readme and output.md doc updates
vagkaratzas 7b16c7c
metro map updated
vagkaratzas dfe39fb
non redundant hmm filtering
vagkaratzas 940659c
Merge branch 'sync' into merge_template
vagkaratzas 9f22b4e
Merge pull request #8 from vagkaratzas/merge_template
vagkaratzas 92a3fdb
manifest update and modules config proper sourcing
vagkaratzas 8e6d4bd
Merge pull request #9 from vagkaratzas/sync
vagkaratzas baf48e9
future proofing multiqc numeric protein name bug (proper strings will…
vagkaratzas 97fcd22
smallest test skipping clipping of msas and redundancy removal mechanism
vagkaratzas ca3b434
make recruiting of sequences with hmmsearch optional, both tests runn…
vagkaratzas f64d5de
Merge pull request #11 from vagkaratzas/bug-fix--multiqc-numeric-prot…
vagkaratzas 0f2bda3
minor reformats
vagkaratzas 7835d65
Merge pull request #10 from vagkaratzas/new-test--minimal-test-withou…
vagkaratzas ec73bfa
This changeset included a WIP update workflow.
mberacochea 6d1083b
Fix a missing bit in the update
mberacochea 5ab2ce6
This changeset included a WIP update workflow.
mberacochea 42a6105
Fix a missing bit in the update
mberacochea deb184b
Merge branch 'update_subworkflow_vag' of https://github.com/vagkaratz…
vagkaratzas 185802e
correct and rearrange stuff in remove_redundancy subworkflow
vagkaratzas 1cb45ff
sync to latest template
vagkaratzas 8b36bd3
nf-core yml version updated
vagkaratzas 0a6b233
multiQC stuff undo linting
vagkaratzas 8437b96
update_families moved to its own subworkflow
vagkaratzas 2dd40f7
restructured update_families until hmmer_hmmsearch
vagkaratzas c7c3df6
branch hits fasta
vagkaratzas 71c39a1
cat_cat module config
vagkaratzas 7d0ee54
removed unused modules, binding proper family msas with recruited seq…
vagkaratzas c2e20f1
seqkit + cat fasta with family specific msa
vagkaratzas 7f1a7c6
prefixes up to clustering updated
vagkaratzas ae129a2
update_families subworkflow done
vagkaratzas 7aa8c0b
clipends file extension updated
vagkaratzas a723a42
metro map and output.md updated
vagkaratzas 3d17c4f
updated readme and citations
vagkaratzas 59b006a
validation schema updated to allow either existing hmm and msa folder…
vagkaratzas b71c210
untar logic implemented, removed support for simple folders due to co…
vagkaratzas e31cdc9
test configs updated
vagkaratzas 6c9008c
samplesheet description updated
vagkaratzas da80843
Merge pull request #14 from vagkaratzas/test_data_update
vagkaratzas 01502b7
Update README.md
vagkaratzas 27de55f
python errors to stderr
vagkaratzas e81cc77
Update workflows/proteinfamilies.nf
vagkaratzas 2ebf65a
Update docs/output.md
vagkaratzas 37a2c64
Update docs/output.md
vagkaratzas 3406075
Update docs/output.md
vagkaratzas 5d93651
syncing of seqs and clusters after clustering done inside the execute…
vagkaratzas 99a62a9
file.getSimpleName
vagkaratzas b542eb6
chunk in REMOVE_REDUNDANT_SEQS tag
vagkaratzas b98c514
Update docs/output.md
vagkaratzas c0cbae9
regex for parsed hit sequence name and range
vagkaratzas 81d0d67
validateMatchingFolders groovy function to check for valid existing H…
vagkaratzas bb82f71
Merge pull request #13 from vagkaratzas/update_subworkflow_vag
vagkaratzas de8421f
branch hits fasta updated to use env coords of domtbl
vagkaratzas ed1cc98
tryCatch block added in filter_recruited function of branch_hits_fasta
vagkaratzas 040464e
filter recruited updated -not using auto full alignment from hmmsearc…
vagkaratzas 7577bcc
HMMER_HMMALIGN added, checkpoint
vagkaratzas 85272bd
remove redundancy updated to work with sto alignments as well
vagkaratzas 7a74d6e
metro map updated with hmmalign
vagkaratzas 49acd97
metro updated with hmmalign
vagkaratzas 54980f7
output.md updated
vagkaratzas 51e0b9b
filter_non_redundant_hmms updated to work properly even without fishi…
vagkaratzas bf099ff
Merge pull request #1 from vagkaratzas/using-envelope-coords
vagkaratzas dc80ef6
Merge pull request #15 from vagkaratzas/dev
vagkaratzas f4849e9
usage.md samplesheet input description update
vagkaratzas 9f022f3
Merge pull request #18 from vagkaratzas/dev
vagkaratzas 36bc897
Merge branch 'dev' into nf-core-template-merge-3.1.2
vagkaratzas 6d4442c
Merge pull request #19 from nf-core/nf-core-template-merge-3.1.2
vagkaratzas dc5a884
modules update (mmseqs, multiqc, seqkit)
vagkaratzas f630ac0
Merge pull request #20 from vagkaratzas/dev
vagkaratzas 3b377ab
Update README.md
vagkaratzas 251f7e7
Merge branch 'dev' into nf-core-template-merge-3.2.0
vagkaratzas f3caa75
Merge pull request #21 from nf-core/nf-core-template-merge-3.2.0
vagkaratzas 6b877be
local modules mem reqs in base.config
vagkaratzas 906eec2
Merge pull request #22 from vagkaratzas/dev
vagkaratzas 704ebe4
metro map updated and svg added
vagkaratzas a5a8383
changelog updated for first release
vagkaratzas 02ec8aa
Biopython citation added
vagkaratzas 891706b
schema pattern updated to include faa and fas input files
vagkaratzas 228659e
Update assets/schema_input.json
vagkaratzas a777d52
gzipped tarball description specification
vagkaratzas e83d6b4
python script licensing
vagkaratzas 00c2c6f
Update conf/test_full.config
vagkaratzas e3db3aa
resourceLimits removed from test_full
vagkaratzas bdb1bec
removed unused process block from test_full config
vagkaratzas bc594f8
Parameter specifications in usage.md
vagkaratzas dd2cc97
Update subworkflows/local/utils_nfcore_proteinfamilies_pipeline/main.nf
vagkaratzas 07d52a3
checkIfExists: true added to file()
vagkaratzas f6cb471
Update nextflow_schema.json
vagkaratzas 0ce6ee0
Update nextflow.config
vagkaratzas f050d84
Update docs/output.md
vagkaratzas 1b2d81b
Update docs/usage.md
vagkaratzas 921cf06
Update docs/output.md
vagkaratzas bdc8627
batch commit suggestions from james
vagkaratzas 0ab063f
ch_ notation instead of set {}, .map changed to multiline everywhere,…
vagkaratzas 9be00f7
assets/samplesheet.csv added back
vagkaratzas e39a349
modules and subworkflows moved to respective folders + env
vagkaratzas f64c769
prefix instead of meta.id for output naming in two of the local modules
vagkaratzas d6e2e3e
only main author mail kept
vagkaratzas 86cdb61
removed duplicate includeConfig 'conf/modules.config'
vagkaratzas 99de435
interpretation of folder outputs in output.md, strict clustering thre…
vagkaratzas ada9368
help_text in most schema params
vagkaratzas acb9465
Merge pull request #23 from vagkaratzas/dev
vagkaratzas d3a9e21
first release version bump (1.0.0)
vagkaratzas 16c1ca9
Merge pull request #24 from vagkaratzas/dev
vagkaratzas 6ffb2df
Update CHANGELOG.md
jfy133 a380e49
[automated] Fix code linting
nf-core-bot File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vagkaratzas marked this conversation as resolved.
Show resolved
Hide resolved
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,2 @@ | ||
sample,fastq_1,fastq_2 | ||
SAMPLE_PAIRED_END,/path/to/fastq/files/AEG588A1_S1_L002_R1_001.fastq.gz,/path/to/fastq/files/AEG588A1_S1_L002_R2_001.fastq.gz | ||
vagkaratzas marked this conversation as resolved.
Show resolved
Hide resolved
|
||
SAMPLE_SINGLE_END,/path/to/fastq/files/AEG588A4_S4_L003_R1_001.fastq.gz, | ||
sample,fasta,existing_hmms_to_update,existing_msas_to_update | ||
mgnifams_test,https://github.com/nf-core/test-datasets/raw/proteinfamilies/test_data/mgnifams_input_small.fa,, |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.