Skip to content

Releases: nanoporetech/dorado

v0.5.3

06 Feb 15:10
Compare
Choose a tag to compare

[0.5.3] (06 Feb 2024)

This release of Dorado fixes a bug causing low Poly(A) estimation rates in RNA.

  • 59a083c - Fix RNA Poly(A) tail estimated in the absence of adapter trimming.
  • f0f9883 - Clarify ns tag in Dorado SAM spec.

v0.5.2

19 Jan 10:00
Compare
Choose a tag to compare

[0.5.2] (18 Jan 2024)

This release of Dorado fixes a bug causing malformed CIGAR strings, prevents crashing when calling modifications with duplex, and improves adapter and primer trimming support.

  • 062e5e3 - Fix malformed CIGAR string for non-primary alignment
  • 0a057bb - Fix duplex modifications crash
  • d453db2 - Add missing support for RAD adapter detection and trimming
  • 8c2d004 - Correctly trim modbase tags for reverse strand alignments
  • 76f24b2 - Update custom barcode documentation
  • 9959654 - Only require standardisation parameters if standardisation is active

v0.5.1

21 Dec 21:49
Compare
Choose a tag to compare

[0.5.1] (21 Dec 2023)

This release of Dorado fixes bugs with adapter trimming and custom barcodes, introduces a more accurate 6mA model, and adds several quality of life improvements.

  • 9a46392 - Replace use of constant with a parameter from custom barcode file.
  • 1893d69 - Decouple basecall library from models library.
  • e42761c - Allow RNA adapter trimming to be skipped.
  • a510d53 - Prevent simultaneous usage of multiple modbase models affecting the same canonical base.
  • 371a252 - Fix incorrect sample count in the ns tag with sequence trimming.
  • 9f532ff - Remove modbase tags for non-primary alignments except when soft clipping is enabled.
  • 52431e6 - Update 6mA model.
  • 7109c1c - Remove superfluous clamp from Metal model implementation.
  • 5fa4de7 - Refactor decoder interfaces.
  • a3dfc94 - Improve README for adapter trimming.
  • 3bfb1f0 - Fix bug with out-of-order primer trimming positions.
  • b1302ae - Allow alignment to be skipped for disconnected clients.
  • 55d09f9 - Update HDF5 pre-built library location.
  • 2048ad5 - Decrease httplib connection timeout.
  • aae47b1 - Refactor codebase to unify interfaces and reduce dependencies.
  • 6ed81c5 - Run separate modbase models in different CUDA streams.
  • decb9e7 - Update build settings to simplify integration into basecall server.
  • e8b07e2 - Report warning and skip FAST5 files when datasets contain FAST5 and POD5 files.
  • 6c984a0 - Enable Xcode builds.
  • 6d31793 - Split Metal LSTM kernel into multiple command buffers.
  • 364d15d - Fix bug with passing custom barcode file into basecaller command.
  • 951e3c3 - Allow read to override adapter/primer trimming defaults.
  • d6e2a80 - Clean up model auto download directories.
  • c552351 - Improve error handling during model auto download.
  • 936d408 - Report incorrect results warning for CPU basecalling on TX2.

v0.5.0

05 Dec 19:30
Compare
Choose a tag to compare

[0.5.0] (5 Dec 2023)

This release of Dorado introduces new, more accurate, and faster v4.3 basecalling models. It also enables hemi-methylation basecalling of duplex reads. Dorado now supports DNA primer and adapter trimming, custom barcode arrangements and sequences, and can automatically select the correct model for your data. Furthermore, this release introduces speed and memory enhancements for basecalling on Apple silicon and various stability improvements.

  • 1415969 - Add v4.3 basecalling models
  • b7d4b38 - Support for modified bases with duplex basecalling (hemi-methylation)
  • 30e639c - Primer and adapter trimming
  • fb85a70 - Enable automatic model selection
  • 16e5b6a - Support for custom barcode arrangements and sequences
  • 46bbfdd - Add barcode column to summary file
  • e9f060c - Improve the precision of read splitting
  • 4102ffc - Increase speed of v4.3 model execution
  • 0a07110 - Prevent progress bar from --resume-from logging excessive dots
  • 20b5637 - Ensure that aligner outputs SAM when not piped to a file
  • 942a35a - Add MN tag to ouput BAM to help downstream tools interpret modified base tags
  • f0ac935 - Added modbase model name to BAM files in RG header section.
  • a7fa371 - Improve performance of HAC and SUP on Apple silicon
  • 152d5fd - Improvements to auto batch sizing on Apple silicon
  • b0767a6 - Fix bug causing segfault with summary command on Windows
  • 1c2c6a9 - Make AVX reverse_complement implementation preserve nucleotide case
  • 4a4dd1c - Use updated Koi functions for small LSTM layers, final convolutional layer in LSTM models, and final linear layer

v0.4.3

14 Nov 19:41
Compare
Choose a tag to compare

[0.4.3] (14 Nov 2023)

This release of Dorado introduces a new RNA m6A modified base model and initial support for poly(A)/poly(T) tail length estimation. It also introduces duplex performance enhancements and bug fixes to improve the stability of Dorado.

  • 803e3a7 - Add RNA m6A DRACH-context model
  • 0f282cd - Add poly(A)/poly(T) tail length estimation support for RNA and cDNA
  • 54e14ca - Add RNA read splitting
  • 2dc1f03 - Enable RNA adapter trimming
  • 80114c0 - Correctly update CIGAR and POS entries when trimming barcodes
  • 4b2025c - Add documentation for sample sheet support
  • 641cb08 - Reduce host memory footprint for duplex basecalling
  • 7c1c0f0 - Reduce working reads size, in particular for duplex.
  • 831f0a9 - Fix pairing check for split reads in duplex basecalling
  • b630567 - Account for split reads during progress tracking
  • 383fe02 - Update to Koi v0.4.1
  • 873c6b1 - Fix warnings about ONLY_C_LOCAL mismatches in PCH builds
  • 52cbabf - Encapsulate date dependency
  • 8fb8a4d - Disable Cutlass LSTM codepath for 128-wide LSTM layers because this kernel does not work
  • 6a9dad9 - Enable warnings as errors at build time
  • 5aaef31 - Address auto batchsize issues on unified memory Linux systems
  • 92b5a67 - Reduce compilation times
  • 062e3fd - Minor speed improvements to CPU beam search

v0.4.2

30 Oct 09:27
Compare
Choose a tag to compare

[0.4.2] (30 Oct 2023)

This release of Dorado fixes a bug with the CpG-context 5mC/5hmC model calling all contexts and adds beta support for using a barcode alias from a sample sheet.

v0.4.1

13 Oct 00:23
Compare
Choose a tag to compare

[0.4.1] (12 Oct 2023)

This release of Dorado fixes a bug with duplex tags and adds functionality to dorado demux.

  • 7fefd5d - Fix missing dx:i:-1 tag for simplex reads with duplex offsprings
  • d532ef1 - Enable dorado debug build in Visual Studio 22
  • eeef757 - Add dorado demux option to demux pre-classified barcoded data

v0.4.0

09 Oct 17:25
Compare
Choose a tag to compare

[0.4.0] (9 Oct 2023)

This release of Dorado introduces barcode demultiplexing, barcode trimming, simplex read splitting, and updated models for calling 6mA and 5mC/5hmC modified bases. Furthermore, it contains improvements to duplex pairing for increased yield, basecalling speed enhancements and reductions to memory consumption, bug fixes, and additional tests for enhanced stability.

  • e836fa4 - New all-context and updated CpG-context 5mC/5hmC models
  • e4aca76 - Handle custom batch sizes that exceed maximum safe batch size
  • ad463ea - Updated all-context 6mA model
  • 21d25a3 - SSL host verification enabled and dorado download HTTPS proxy support improved
  • 8ae95e7 - Enable simplex read splitting
  • 1210607 - Fix mean-qscore calculation with trimming
  • d27666e - Beam search optimisation
  • 46e68c1 - Report the error we get back from httplib if a download fails
  • 89db1e1 - Support ChEBI codes for modified bases
  • 92097ca - Add options to dorado aligner
  • 4cb9862 - Fix Linux ccache usage
  • 692ecd3 - Fixed issues with internal representation of signal during duplex calling
  • 80efe8c - Fix std::logic_error (issue #205)
  • d92547a - Make CUDA kernel profiling to stderr available via --devopts
  • 69bf701 - Refactoring to support further duplex developments
  • 510e2e9 - Verify models when downloaded using dorado download
  • 12476ee - Improve memory usage diagnostics in Metal
  • 3172413 - Fix non-determinism in selecting matrix multiplication sub-routine during basecalling
  • f14b418 - Provide NVIDIA driver version in server API
  • 207871e - Use DORADO_GPU_BUILD rather than !defined(__x86_64__)
  • d16ccbe - Get tests that use CUDA working when ASAN is enabled
  • 41bfb99 - Update duplex pairing heuristics to improve duplex yield
  • 87c2c6e - Change ReadPair to take full ownership of its data, and drop ReadPtr
  • 88aa9f3 - Only use pre-compiled headers if ccache isn't available
  • c04c145 - Bump Koi version to 0.3.9
  • 6b8064d - Separate out data shared by both Duplex and Simplex reads into ReadCommon class
  • 34e9b55 - Flip RNA signal for modified basecalling
  • 636ac83 - CRFModel: Update to use KoiActivation and corresponding Koi host functions
  • cd50a01 - Add parent ID tag (pi:Z:parentid) to split reads so that original read can be inferred
  • 2956bf6 - Add support for barcode trimming
  • 92dee85 - Fix segfault in modified basecalling
  • fe22d21 - Add support for read groups with barcoding
  • 2f69da6 - Allow basecalling from single files rather than a directory
  • c22e46f - Support ambiguous motifs for modified basecalling
  • 882da60 - Refactor to create separate utils lib target
  • b792bba - Enforce const-west style via clang-format
  • 611a4ea - Skip code signing on x64 to prevent crashes
  • a6a1902 - Fix various deficiencies of the iOS htslib build
  • 5ff8034 - Add barcode check for both ends
  • e42b8c8 - Remove mux from pairing cache key to reduce memory consumption by up to 4x
  • 93d052b - Setup signing of executables on macOS
  • a45b97c - Bump RNA004 models to V3.0.1 with corrected scaling
  • 7e152fd - Reinstate ReadFilterNode tests
  • c3e412e - Add duplex commandline test
  • 3ba422b - Don't perform adapter trimming on RNA signals
  • 066b815 - Add barcoding support to Dorado
  • 09cc44f - Add unit test coverage report generation
  • b2e54b8 - Fix typo for 5fC modification
  • 829dba3 - Enforce that values passed to a sink are mutable rvalues
  • 8aa7722 - Remove dependence of dorado_lib on dorado_models_lib
  • cd6d2bf - Extend the lifetime of the NNTasks in the metal backend
  • abed8ee - Bug fixes for iOS build
  • 64b3ae3 - Clarify --min-qscore option in help
  • 225a153 - Further restrict Metal kernel run times to improve stability of basecalling on Apple silicon
  • 3ae95e4 - Refactor modified basecalling code
  • 8ec58f0 - Option for --guard-gpus no longer used in duplex
  • 1f3cade - Add CUDAGuard before cache clear to reduce CUDA memory consumption
  • dae5e30 - Move ModBaseCallerNode to using an unordered_set for working reads, like BasecallerNode
  • b43adfa - Improvements to pipeline API

v0.3.4

14 Aug 20:18
Compare
Choose a tag to compare

[0.3.4] (14 Aug 2023)

This release of Dorado contains a few bug fixes and a hotfix for CUDA out of memory issues encountered during duplex runs with v0.3.3.

  • c5c0ea0 - Introduce pipeline API functions to simplify setup for simplex/duplex basecalling.
  • 9614eba - Fix potential hang in modbase calling node.
  • 67f84a6 - Set the --max-reads default to unlimited for dorado aligner
  • f6a0422 - Fix CUDA OOM in duplex by removing tensor caching in decode and updating memory fraction for stereo model.
  • 107ebba - Account for filtered duplex reads when tagging simplex parents and calculating duplex yield.

v0.3.3

09 Aug 00:26
Compare
Choose a tag to compare

[0.3.3] (8 Aug 2023)

This release of Dorado introduces improvements to Duplex pair identification for improved duplex yields, faster basecalling on A100 and H100, improvements to modified base calling speed on Apple silicon, and major enhancements to the portability of Dorado binaries.

  • 7307146 - Major reduction to required GPU memory, especially for A100/H100. Allows greater batch size and consequently improved basecalling speed
  • 8073364 - Improvements to Duplex pairing algorithm for increased Duplex yield. Situations where complement read is truncated are now handled.
  • 39ffb35 - Report the duplex rate percentage
  • 65b8b8a - Major speed upgrade to modified base calling on Apple silicon devices
  • 481438a - Improve performance of basecalling of Fast model on M silicon by 6% by inlining of function used during decode
  • fe8dbf2 - Improve basecalling stability on Apple silicon by limiting run time of LSTM kernel to avoid CB submission errors
  • 0abea5f - Upgrade dna_r9.4.1_e8 5mCG models to version 0.1 for improved accuracy
  • 752e094 - Upgrade to OpenSSL3
  • 447b559 - Switch to target-based includes CMake builds - Prevents dependent projects from copying all nclude paths and link libs
  • b0d10a9 - Fix edge case where it is possible to spawn no basecaller worker threads
  • 1650b83 - Fix issue with inability to find CUPTI library during compilation on some systems
  • e4ba3e5 - Add missing SA:Z tag to alignments generated by Dorado
  • c5a4cfc - Fix various linker errors
  • bf72fdd - Dorado will error out gracefully if no POD5s are provided
  • bcdeb8f - Improvements to portability of Dorado binaries via use of static linking on macOS and Linux
  • e14a7e6 - Improvements to error handling on Apple Silicon devices (Metal command buffer error handling)
  • ca1d191 - Improvements to read ids for non-split reads
  • 5d9238a - Revamp AsyncQueue interface
  • 6a3ccb6 - Removed RNA003 model which is obsolesced by RNA004
  • 1a94fac - Add summary stats and progress bar for basecalling from Fast5
  • 072ed96 - Add pipeline restart capability and pairing cache retention option to termination
  • 01acbd5 - Stop progress bar cutting out near the end of the run
  • 211968c - Add suppression for false-positive vptr issues in older Xcodes
  • 1e14d2a - Reduce unnecessary startup cost by setting an upper limit on the maximum batchsize to 10,240
  • 09c5b28 - Speed up the fixed cost of auto batchsize detection on mGPU systems by running in parallel