Workflows
What is a Workflow?Filters
ARA (Automated Record Analysis) : An automatic pipeline for exploration of SRA datasets with sequences as a query
Requirements
-
Docker
-
Please checkout the Docker installation guide.
or
-
Mamba package manager
-
Please checkout the mamba or micromamba official installation guide.
-
We prefer
mamba
overconda
since it is faster and uses ...
prepareChIPs
This is a simple snakemake
workflow template for preparing single-end ChIP-Seq data.
The steps implemented are:
- Download raw fastq files from SRA
- Trim and Filter raw fastq files using
AdapterRemoval
- Align to the supplied genome using
bowtie2
- Deduplicate Alignments using
Picard MarkDuplicates
- Call Macs2 Peaks using
macs2
A pdf of the rulegraph is available here
Full details for each step are given below. Any additional ...
BatchConvert 
A command line tool for converting image data into either of the standard file formats OME-TIFF or OME-Zarr.
The tool wraps the dedicated file converters bfconvert and bioformats2raw to convert into OME-TIFF or OME-Zarr, respectively. The workflow management system NextFlow is used to perform conversion in parallel for batches of images.
The tool also wraps s3 and Aspera clients (go-mc and aspera-cli, respectively). ...
A CWL-based pipeline for calling small germline variants, namely SNPs and small INDELs, by processing data from Whole-genome Sequencing (WGS) or Targeted Sequencing (e.g., Whole-exome sequencing; WES) experiments.
On the respective GitHub folder are available:
- The CWL wrappers and subworkflows for the workflow
- A pre-configured YAML template, based on validation analysis of publicly available HTS data
Briefly, the workflow performs the following steps:
- Quality control of Illumina reads ...
Type: Common Workflow Language
Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos
Submitter: Konstantinos Kyritsis
A CWL-based pipeline for calling small germline variants, namely SNPs and small INDELs, by processing data from Whole-genome Sequencing (WGS) or Targeted Sequencing (e.g., Whole-exome sequencing; WES) experiments.
On the respective GitHub folder are available:
- The CWL wrappers and subworkflows for the workflow
- A pre-configured YAML template, based on validation analysis of publicly available HTS data
Briefly, the workflow performs the following steps:
- Quality control of Illumina reads ...
Type: Common Workflow Language
Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos
Submitter: Konstantinos Kyritsis
A CWL-based pipeline for processing ChIP-Seq data (FASTQ format) and performing:
- Peak calling
- Consensus peak count table generation
- Detection of super-enhancer regions
- Differential binding analysis
On the respective GitHub folder are available:
- The CWL wrappers for the workflow
- A pre-configured YAML template, based on validation analysis of publicly available HTS data
- Tables of metadata (
EZH2_metadata_CLL.csv
andH3K27me3_metadata_CLL.csv
), based on the same validation ...
Type: Common Workflow Language
Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos
Submitter: Konstantinos Kyritsis
A CWL-based pipeline for processing RNA-Seq data (FASTQ format) and performing differential gene/transcript expression analysis.
On the respective GitHub folder are available:
- The CWL wrappers for the workflow
- A pre-configured YAML template, based on validation analysis of publicly available HTS data
- A table of metadata (
mrna_cll_subsets_phenotypes.csv
), based on the same validation analysis, to serve as an input example for the design of comparisons during differential expression ...
Type: Common Workflow Language
Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos
Submitter: Konstantinos Kyritsis
SARS-CoV-2 variant prediction using Read It And Keep, fastp, bbmap and iVar