Skip to content

Quickstart

Prerequisites

  • Nextflow >= 23.04
  • Conda or Mamba (for -profile conda)
  • Or: Docker / Apptainer (for container execution)

Run from GitHub

No installation needed — Nextflow pulls the pipeline automatically:

nextflow run rec3141/microscape-nf \
    --input /path/to/reads \
    --ref_databases "silva:/path/to/silva_train_set.fasta:Domain,Phylum,Class,Order,Family,Genus" \
    -profile conda \
    -resume

Run with Container

# Docker
nextflow run rec3141/microscape-nf \
    --input /path/to/reads \
    --ref_databases "silva:/path/to/silva.fasta:Domain,Phylum,Class,Order,Family,Genus" \
    -with-docker ghcr.io/rec3141/microscape-nf:latest \
    -resume

# Apptainer (HPC)
nextflow run rec3141/microscape-nf \
    --input /path/to/reads \
    --ref_databases "silva:/path/to/silva.fasta:Domain,Phylum,Class,Order,Family,Genus" \
    -with-singularity docker://ghcr.io/rec3141/microscape-nf:latest \
    -resume

Run from Local Clone

git clone https://github.com/rec3141/microscape-nf.git
cd microscape-nf

nextflow run main.nf \
    --input /path/to/reads \
    --ref_databases "silva:/path/to/silva.fasta:Domain,Phylum,Class,Order,Family,Genus" \
    -resume

Multi-Database Taxonomy

Classify against multiple reference databases in parallel:

nextflow run rec3141/microscape-nf \
    --input /path/to/reads \
    --ref_databases "silva:/db/silva.fasta:Domain,Phylum,Class,Order,Family,Genus;pr2:/db/pr2.fasta:Domain,Supergroup,Division,Class,Order,Family,Genus,Species" \
    --run_phylogeny \
    -profile conda -resume

Persistent Cache

For large projects, use --store_dir to skip completed steps across runs:

nextflow run rec3141/microscape-nf \
    --input /path/to/reads \
    --ref_databases "silva:/db/silva.fasta:Domain,Phylum,Class,Order,Family,Genus" \
    --store_dir /scratch/microscape_cache \
    -profile conda -resume

Outputs

Results are written to --outdir (default: results/):

results/
├── filtered/              Filtered FASTQ files
├── errors/                Error model plots
├── seqtab/                Per-plate sequence tables
├── merged/                Merged sequence table
├── chimeras/              Chimera-filtered table
├── filtered_seqtab/       QC-filtered final table
├── taxonomy/              Per-database classifications
├── phylogeny/             MSA + NJ tree (if --run_phylogeny)
├── renormalized/          Per-group proportional tables
├── ordination/            t-SNE/PCA coordinates
├── network/               Correlation edge lists
├── viz/                   JSON files for web viewer
└── pipeline_info/         Nextflow timeline, trace, DAG