Workflows
What is a Workflow?Filters
Workflow for the GTN training "Antibiotic resistance detection"
VGP Workflow #1
This workflow produces a Meryl database and Genomescope outputs that will be used to determine parameters for following workflows, and assess the quality of genome assemblies. Specifically, it provides information about the genomic complexity, such as the genome size and levels of heterozygosity and repeat content, as well about the data quality.
Inputs
- Collection of Hifi long reads in FASTQ format
Outputs
- Meryl Database of kmer counts
- GenomeScope
- Linear plot
...
VGP Workflow #1
This workflow collects the metrics on the properties of the genome under consideration by analyzing the k-mer frequencies. It provides information about the genomic complexity, such as the genome size and levels of heterozygosity and repeat content, as well about the data quality. It uses reads from two parental genomes to partition long reads from the offspring into haplotype-specific k-mer databases.
Inputs
- Collection of Hifi long reads in FASTQ format
- Paternal short-read ...
Generic variation analysis on WGS PE data
This workflows performs paired end read mapping with bwa-mem followed by sensitive variant calling across a wide range of AFs with lofreq and variant annotation with snpEff. The reference genome can be provided as a GenBank file.
Generic consensus building
This workflow generates consensus sequences using a list of variants generated by Variant Calling Workflow.
The workflow accepts a single input:
- A collection of VCF files
The workflow produces a single output:
- Consensus sequence for each input VCF file
The workflow can be accessed at usegalaxy.org
Generic variation analysis reporting
This workflow generates reports from a list of variants generated by Variant Calling Workflow.
The workflow accepts a single input:
- A collection of VCF files
The workflow produces two outputs (format description below):
- A list of variants grouped by Sample
- A list of variants grouped by Variant
Here is example of output by sample. In this table all varinats in all samples are epxlicitrly listed:
| Sample | ...
Generic variant calling
A generic workflow for identification of variants in a haploid genome such as genomes of bacteria or viruses. It can be readily used on MonkeyPox. The workflow accepts two inputs:
- A genbank file with the reference genomes
- A collection of paired fastqsanger files
The workflow outputs a collection of VCF files for each sample (each fastq pair). These VCF files serve as input to the Reporting workflow.
Workflow can be accessed ...
Performs Long Read assembly using PacBio data and Hifiasm. Part of VGP assembly pipeline. This workflow generate a phased assembly.
Performs scaffolding using HiC Data. Part of VGP assembly pipeline. The scaffolding can be performed on long read assembly contigs or on scaffolds (e.g.: Bionano scaffolds).
Performs scaffolding using Bionano Data. Part of VGP assembly pipeline.
Tests