| Step | Annotation |
|---|---|
|
Step 1: Input dataset
select at runtime
|
|
|
Step 2: Input dataset collection
select at runtime
|
|
|
Step 3: fastp
Paired Collection
Output dataset 'output' from step 2
Adapter Trimming Options:
False
Empty.
Empty.
Global trimming options:
Not available.
Not available.
Not available.
Not available.
Overrepresented Sequence Analysis:
True
Not available.
Filter Options:
Quality filtering options:
False
Not available.
Not available.
Not available.
Length filtering options:
False
Not available.
Not available.
Low complexity filtering options:
False
Not available.
Read Modification Options:
Automatic trimming for Illumina NextSeq/NovaSeq data
Not available.
Disable polyX trimming
UMI processing:
False
Empty.
Not available.
Empty.
Per read cutting by quality options:
False
False
Not available.
Not available.
Base correction by overlap analysis options:
False
Output Options:
True
True
|
|
|
Step 4: Apply rules
Output dataset 'output_paired_coll' from step 3
Rules:
- {'error': None, 'type': 'add_column_metadata', 'value': 'identifier0', 'warn': None}
- {'error': None, 'type': 'add_column_metadata', 'value': 'identifier1', 'warn': None}
- {'error': None, 'expression': '\\w+_(\\d+)_S\\d+_L\\d+.*', 'group_count': 1, 'target_column': 0, 'type': 'add_column_regex', 'warn': None}
Column Definitions:
- {'columns': [1], 'type': 'paired_identifier'}
- {'columns': [2], 'editing': False, 'type': 'list_identifiers'}
|
|
|
Step 5: Unzip collection
Output dataset 'output' from step 4
|
|
|
Step 6: FASTQ Groomer
Output dataset 'reverse' from step 5
Sanger & Illumina 1.8+
Hide Advanced Options
|
|
|
Step 7: FASTQ Groomer
Output dataset 'forward' from step 5
Sanger & Illumina 1.8+
Hide Advanced Options
|
|
|
Step 8: Bowtie2
Paired-end
Output dataset 'output_file' from step 7
Output dataset 'output_file' from step 6
False
False
Yes
0
500
--fr
False
False
False
False
False
Use a genome from the history and build index
Output dataset 'output' from step 1
Set read groups (SAM/BAM specification)
True
True
ILLUMINA
False
Empty.
Empty.
Empty.
Empty.
Empty.
Empty.
Empty.
Not available.
Empty.
1: Default setting only
Sensitive local (--sensitive-local)
No
True
|
|
|
Step 9: Samtools sort
Output dataset 'output' from step 8
coordinate
|
|
|
Step 10: MarkDuplicates
Output dataset 'output1' from step 9
Comments
True
True
SUM_OF_BASE_QUALITIES
Empty.
100
Empty.
Lenient
|
|
|
Step 11: MiModD Variant Calling
Use a genome from my history
Output dataset 'output' from step 1
Output dataset 'outFile' from step 10
False
More options:
True
250
|
|
|
Step 12: MultiQC
Results
Results 1
fastp
Output dataset 'report_json' from step 3
Results 2
Bowtie 2
Output dataset 'mapping_stats' from step 8
Results 3
Picard
Picard outputs
Picard output 1
Markdups
Output dataset 'metrics_file' from step 10
QC data for raw fastqs, trimming (fastp), alignment (bowtie2), and deduplication (picard)
Empty.
False
False
|
|
|
Step 13: MiModD Deletion Calling (for PE data)
Output dataset 'outFile' from step 10
Output dataset 'ofile' from step 11
False
False
4
100
|
|
|
Step 14: MiModD Extract Variant Sites
Output dataset 'ofile' from step 11
include information from pre-calculated vcf datasets
False
|
|
|
Step 15: SnpSift Variant Type
Output dataset 'output_vcf' from step 14
|
richardjacton
All published workflows
Published workflows by richardjacton