qcif/taxapus is a modular, reproducible Nextflow workflow for the conservative taxonomy assignment to DNA sequences, designed for high-confidence, auditable results in biosecurity and biodiversity contexts. The workflow integrates multiple bioinformatics tools and databases, automates best-practice analysis steps, and produces detailed reports with supporting evidence for each taxonomic assignment.
Workflow Overview
The pipeline orchestrates a series of analytical steps, each encapsulated in a dedicated module or subworkflow. The main stages are:
-
Environment Configuration Sets up environment variables and paths required for downstream processes, ensuring reproducibility and portability.
-
Input Validation Checks the integrity and compatibility of input files (FASTA sequences, metadata, databases), preventing downstream errors.
-
Sequence Search
- BLAST Core Nucleotide Database (BLASTN): Queries input sequences against the NCBI nucleotide database using BLASTN.
- BOLD v4 (API): Queries input sequences against the Barcode of Life Data Systems. Taxonomic lineage included in the results.
-
Hit Extraction Parses BLAST results to extract relevant hits for each query.
-
Taxonomic ID Extraction Retrieves taxonomic IDs for BLAST hits.
-
Taxonomic Lineage Extraction Maps taxonomic IDs to full lineages, enabling downstream filtering and reporting.
-
Candidate Extraction Identifies candidate species for each query, applying user-defined thresholds for identity and coverage.
-
Supporting Evidence Evaluation
-
Multiple Sequence Alignment (MAFFT) Aligns candidate and query sequences to prepare for phylogenetic analysis.
-
Phylogenetic Tree Construction (FastMe) Builds a phylogenetic tree to visualise relationships among candidates and queries.
-
Comprehensive Reporting Generates detailed HTML and text reports, including sequence alignments, phylogenetic trees, database coverage, and all supporting evidence for each assignment.
Version History
main @ 0d8e125 (earliest) Created 1st Jul 2025 at 06:23 by Magdalena Antczak
corrections
Frozen
main
0d8e125

Creators
Submitter
Views: 57 Downloads: 8
Created: 1st Jul 2025 at 06:23
Last updated: 1st Jul 2025 at 07:31

This item has not yet been tagged.

None