nf-core/hlatyping
Version 1

Workflow Type: Nextflow

nfcore/hlatyping

Precision HLA typing from next-generation sequencing data using OptiType.

Build Status Nextflow DOI

install with bioconda Docker https://www.singularity-hub.org/static/img/hosted-singularity--hub-%23e32929.svg

Table of Contents

  1. Introduction
  2. Quick Start
  3. Documentation
  4. Pipeline DAG
  5. Credits

Introduction

OptiType is a HLA genotyping algorithm based on integer linear programming. Reads of whole exome/genome/transcriptome sequencing data are mapped against a reference of known MHC class I alleles. To produce accurate 4-digit HLA genotyping predictions, all major and minor HLA-I loci are considered simultaneously to find an allele combination that maximizes the number of explained reads.

Introduction

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible.

Quick Start

If you want to test with a single line, if the pipeline works on your system, follow the next commands, with pre-configured test data-sets.

Docker

bash nextflow run nf-core/hlatyping -profile docker,test --outdir $PWD/results

Singularity

bash nextflow run nf-core/hlatyping -profile singularity,test --outdir $PWD/results

Documentation

The nf-core/hlatyping pipeline comes with documentation about the pipeline, found in the docs/ directory:

  1. Installation
  2. Pipeline configuration
  3. Running the pipeline
  4. Output and how to interpret the results
  5. Troubleshooting

Pipeline DAG

The hlatyping pipeline can currently deal with two input formats: .fastq{.gz} or .bam, not both at the same time however. If the input file type is bam, than the pipeline extracts all reads from it and performs an mapping additional step with the yara mapper against the HLA reference sequence. Indices are provided in the ./data directory of this repository. Optitype uses razers3, which is very memory consuming. In order to avoid memory issues during pipeline execution, we reduce the mapping information on the relevant HLA regions on chromosome 6.

DAG with .fastq{.gz} as input

Creates a config file from the command line arguments, which is then passed to OptiType. In parallel, the fastqs are unzipped if they are passed as archives. OptiType is then used for the HLA typing.

DAG with `fastq.{gz}` files

DAG with .bam as input

Creates a config file from the command line arguments, which is then passed to OptiType. In parallel, the reads are extracted from the bam file and mapped again against the HLA reference sequence on chromosome 6. OptiType is then used for the HLA typing.

DAG with `.bam` file

Credits

This pipeline was originally written by:

Total size: 153 MB
help Creators and Submitter
Creators
Not specified
Submitter
License
Activity

Views: 652   Downloads: 29

Created: 25th Feb 2020 at 11:06

Last used: 28th Sep 2021 at 03:05

help Attributions

None

Version History

Version 1 (earliest) Created 25th Feb 2020 at 11:06 by Finn Bacall

Added/updated 63 files


Open master 06a17f1

Related items

Powered by
(v.1.12.0-master)
Copyright © 2008 - 2021 The University of Manchester and HITS gGmbH