Space: This Team is not associated with a Space
SEEK ID: https://workflowhub.eu/projects/240
Public web page: Not specified
Organisms: No Organisms specified
WorkflowHub PALs: No PALs for this Team
Team created: 1st May 2024
Related items
Teams: Intergalactic Workflow Commission (IWC), Vertebrate Genomes Pipelines in Galaxy, iwc
Organizations: Bots, European Galaxy Team
Purge Duplicate Contigs
Purge contigs marked as duplicates by purge_dups in a single haplotype(could be haplotypic duplication or overlap duplication) This workflow is the 6th workflow of the VGP pipeline. It is meant to be run after one of the contigging steps (Workflow 3, 4, or 5)
Inputs
- Genomescope model parameters [txt] (Generated by the k-mer profiling workflow)
- Hifi long reads - trimmed [fastq] (Generated by Cutadapt in the contigging workflow)
- Assembly to purge (e.g. hap1) ...
Assembly with Hifi reads and Trio Data
Generate phased assembly based on PacBio Hifi Reads using parental Illumina data for phasing
Inputs
- Hifi long reads [fastq]
- Concatenated Illumina reads : Paternal [fastq]
- Concatenated Illumina reads : Maternal [fastq]
- K-mer database [meryldb]
- Paternal hapmer database [meryldb]
- Maternal hapmer database [meryldb]
- Genome profile summary generated by Genomescope [txt]
- Genome model parameters generated by Genomescope [tabular]
...
Importing single-end multiplexed data (not demultiplexed yet)
Type: Galaxy
Creators: Debjyoti Ghosh, Helmholtz-Zentrum für Umweltforschung - UFZ
Submitter: WorkflowHub Bot
Use DADA2 for sequence quality control. DADA2 is a pipeline for detecting and correcting (where possible) Illumina amplicon sequence data. As implemented in the q2-dada2 plugin, this quality control process will additionally filter any phiX reads (commonly present in marker gene Illumina sequence data) that are identified in the sequencing data, and will filter chimeric sequences.
Type: Galaxy
Creators: Debjyoti Ghosh, Helmholtz-Zentrum für Umweltforschung - UFZ
Submitter: WorkflowHub Bot
This workflow takes as input SR BAM from ChIP-seq. It calls peaks on each replicate and intersect them. In parallel, each BAM is subsetted to smallest number of reads. Peaks are called using all subsets combined. Only peaks called using a combination of all subsets which have summits intersecting the intersection of at least x replicates will be kept.
Racon polish with long reads, x4
RepeatMasking Workflow
This workflow uses RepeatModeler and RepeatMasker for genome analysis.
-
RepeatModeler is a software package for identifying and modeling de novo families of transposable elements (TEs). At the heart of RepeatModeler are three de novo repeat search programs (RECON, RepeatScout and LtrHarvest/Ltr_retriever) which use complementary computational methods to identify repeat element boundaries and family relationships from sequence data.
-
RepeatMasker is a program that analyzes ...
We assume the identifiers of the input list are like: sample_name_replicateID. The identifiers of the output list will be: sample_name
The workflow for Illumina-sequenced ARTIC data builds on the RNASeq workflow for paired-end data using the same steps for mapping and variant calling, but adds extra logic for trimming ARTIC primer sequences off reads with the ivar package. In addition, this workflow uses ivar also to identify amplicons affected by ARTIC primer-binding site mutations and tries to exclude reads derived from such tainted amplicons when calculating allele-frequencies of other variants.
Automated inference of stable isotope incorporation rates in proteins for functional metaproteomics
Run baredSC in 1 dimension in logNorm for 1 to N gaussians and combine models.
VGP Workflow #1
This workflow produces a Meryl database and Genomescope outputs that will be used to determine parameters for following workflows, and assess the quality of genome assemblies. Specifically, it provides information about the genomic complexity, such as the genome size and levels of heterozygosity and repeat content, as well about the data quality.
Inputs
- A collection of Hifi long reads in FASTQ format
- k-mer length
- Ploidy
Outputs
- Meryl Database of kmer counts
...
Create Meryl Database used for the estimation of assembly parameters and quality control with Merqury. Part of the VGP pipeline.
This workflow is composed with the XCMS tool R package (Smith, C.A. 2006) able to extract, filter, align and fill gapand the possibility to annotate isotopes, adducts and fragments using the CAMERA R package (Kuhl, C 2012).
This workflow is composed with the XCMS tool R package (Smith, C.A. 2006) able to extract and the metaMS R package (Wehrens, R 2014) for the field of untargeted metabolomics.
This workflow processes the CMO fastqs with CITE-seq-Count and include the translation step required for cellPlex processing. In parallel it processes the Gene Expresion fastqs with STARsolo, filter cells with DropletUtils and reformat all outputs to be easily used by the function 'Read10X' from Seurat.
Type: Galaxy
Creators: Lucille Delisle, Mehmet Tekman, Hans-Rudolf Hotz, Daniel Blankenberg, Wendi Bacon
Submitter: WorkflowHub Bot
Run velocyto to get loom with counts of spliced and unspliced. It will extract the 'barcodes' from the bundled outputs.
Assemble long reads with Flye, then view assembly statistics and assembly graph