Skip to content
Contact Support

Dorado

High-performance, easy-to-use, open source basecaller for Oxford Nanopore reads. Dorado Homepage

Available Modules

module load Dorado/1.4.0

Description

Dorado is a high-performance, easy-to-use, open source basecaller for Oxford Nanopore reads.

Features

  • One executable with sensible defaults, automatic hardware detection and configuration.
  • Nvidia GPUs including multi-GPU with linear scaling.
  • Modified basecalling (Remora models).
  • Duplex basecalling.
  • POD5 support for highest basecalling performance.
  • Based on libtorch, the C++ API for pytorch.
  • Multiple custom optimisations in CUDA and Metal for maximising inference performance.

License and Disclaimer

\(c\) 2022 Oxford Nanopore Technologies Ltd.

Dorado is distributed under the terms of the Oxford Nanopore Technologies, Ltd. Public License, v. 1.0. If a copy of the License was not distributed with this file, You can obtain one at http://nanoporetech.com

Dorado models

Dorado can fetch models for basecalling from the Oxford Nanopore Content Delivery Network (CDN) using dorado download. By default, Dorado will download and look for models in the current working directory; this can be changed using the --models-directory option for Dorado commands. You can also set this via the DORADO_MODELS_DIRECTORY environment variable.

Example Slurm script

  • The following Slurm script is a template to run Basecalling on the NVIDIA A100 GPUs. We do not recommend running Dorado jobs on CPUs.
  • --device 'cuda:all' will automatically pick up the GPU over CPU
  • We are not providing the models as part of the module yet. 
#!/bin/bash -e

#SBATCH --account        nesi12345
#SBATCH --job-name       dorado
#SBATCH --gpus-per-node  A100:1
#SBATCH --mem            6G
#SBATCH --cpus-per-task  4
#SBATCH --time           00:10:00
#SBATCH --output         slurmout.%j.out

module purge
module load Dorado/0.4.3

dorado download --model dna_r10.4.1_e8.2_400bps_hac@v4.1.0

dorado basecaller  --device 'cuda:all' dna_r10.4.1_e8.2_400bps_hac@v4.1.0 pod5s/ > calls.bam