Workflows

What is a Workflow?
539 Workflows visible to you, out of a total of 581

Assembly with Hifi reads and Trio Data

Generate phased assembly based on PacBio Hifi Reads using parental Illumina data for phasing

Inputs

  1. Hifi long reads [fastq]
  2. Concatenated Illumina reads : Paternal [fastq]
  3. Concatenated Illumina reads : Maternal [fastq]
  4. K-mer database [meryldb]
  5. Paternal hapmer database [meryldb]
  6. Maternal hapmer database [meryldb]
  7. Genome profile summary generated by Genomescope [txt]
  8. Bloom Filter
  9. Name of first haplotype
  10. Name of second haplotype ...

Type: Galaxy

Creator: Galaxy, VGP

Submitter: WorkflowHub Bot

Purge Duplicate Contigs

Purge contigs marked as duplicates by purge_dups in a single haplotype(could be haplotypic duplication or overlap duplication) This workflow is the 6th workflow of the VGP pipeline. It is meant to be run after one of the contigging steps (Workflow 3, 4, or 5)

Inputs

  1. Genomescope model parameters [txt] (Generated by the k-mer profiling workflow)
  2. Hifi long reads - trimmed [fastq] (Generated by Cutadapt in the contigging workflow)
  3. Assembly to purge (e.g. hap1) ...

Type: Galaxy

Creator: Galaxy, VGP

Submitter: WorkflowHub Bot

Stable

Name: Matmul GPU Case 1 Cache-ON Contact Person: cristian.tatu@bsc.es Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4

Matmul running on the GPU leveraging COMPSs GPU Cache for deserialization speedup. Launched using 32 GPUs (16 nodes). Performs C = A @ B Where A: shape (320, 56_900_000) block_size (10, 11_380_000)             B: shape (56_900_000, 10)   block_size (11_380_000, 10)             C: shape (320, 10)                block_size ...

Type: COMPSs

Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)

Submitter: Cristian Tatu

DOI: 10.48546/workflowhub.workflow.798.1

Stable

Name: Matmul GPU Case 1 Cache-OFF Contact Person: cristian.tatu@bsc.es Access Level: public License Agreement: Apache2 Platform: COMPSs 3.3 Machine: Minotauro-MN4

Matmul running on the GPU without Cache. Launched using 32 GPUs (16 nodes). Performs C = A @ B Where A: shape (320, 56_900_000) block_size (10, 11_380_000)             B: shape (56_900_000, 10)   block_size (11_380_000, 10)             C: shape (320, 10)                block_size (10, 10) Total dataset size 291 ...

Type: COMPSs

Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)

Submitter: Cristian Tatu

DOI: 10.48546/workflowhub.workflow.797.1

Stable

Name: K-Means GPU Cache OFF Contact Person: cristian.tatu@bsc.es Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4

K-Means running on GPUs. Launched using 32 GPUs (16 nodes). Parameters used: K=40 and 32 blocks of size (1_000_000, 1200). It creates a block for each GPU. Total dataset shape is (32_000_000, 1200). Version dislib-0.9

Average task execution time: 194 seconds

Type: COMPSs

Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)

Submitter: Cristian Tatu

DOI: 10.48546/workflowhub.workflow.799.1

Stable

Name: K-Means GPU Cache ON Contact Person: cristian.tatu@bsc.es Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4

K-Means running on the GPU leveraging COMPSs GPU Cache for deserialization speedup. Launched using 32 GPUs (16 nodes). Parameters used: K=40 and 32 blocks of size (1_000_000, 1200). It creates a block for each GPU. Total dataset shape is (32_000_000, 1200). Version dislib-0.9

Average task execution time: 16 seconds

Type: COMPSs

Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)

Submitter: Cristian Tatu

DOI: 10.48546/workflowhub.workflow.800.1

Stable

Name: Dislib Distributed Training - Cache ON Contact Person: cristian.tatu@bsc.es Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4

PyTorch distributed training of CNN on GPU and leveraging COMPSs GPU Cache for deserialization speedup. Launched using 32 GPUs (16 nodes). Dataset: Imagenet Version dislib-0.9 Version PyTorch 1.7.1+cu101

Average task execution time: 36 seconds

Type: COMPSs

Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)

Submitter: Cristian Tatu

DOI: 10.48546/workflowhub.workflow.802.1

Stable

Name: Dislib Distributed Training - Cache OFF Contact Person: cristian.tatu@bsc.es Access Level: public License Agreement: Apache2 Platform: COMPSs Machine: Minotauro-MN4

PyTorch distributed training of CNN on GPU. Launched using 32 GPUs (16 nodes). Dataset: Imagenet Version dislib-0.9 Version PyTorch 1.7.1+cu101

Average task execution time: 84 seconds

Type: COMPSs

Creators: Cristian Tatu, The Workflows and Distributed Computing Team (https://www.bsc.es/discover-bsc/organisation/scientific-structure/workflows-and-distributed-computing/)

Submitter: Cristian Tatu

DOI: 10.48546/workflowhub.workflow.801.1

Work-in-progress

With this galaxy pipeline you can use Salmonella sp. next generation sequencing results to predict bacterial AMR phenotypes and compare the results against gold standard Salmonella sp. phenotypes obtained from food.

This pipeline is based on the work of the National Food Agency of Canada. Doi: 10.3389/fmicb.2020.00549

Type: Galaxy

Creators: None

Submitter: Dennis Dollée

No description specified

Type: Galaxy

Creators: None

Submitter: Dennis Dollée

Powered by
(v.1.14.1)
Copyright © 2008 - 2023 The University of Manchester and HITS gGmbH