Skip to content
/ canu Public
forked from marbl/canu

A single molecule sequence assembler for genomes large and small.

Notifications You must be signed in to change notification settings

gaworj/canu

This branch is 277 commits behind marbl/canu:master.

Folders and files

NameName
Last commit message
Last commit date
May 17, 2017
Mar 26, 2021
Feb 25, 2021
Mar 31, 2021
Mar 27, 2020
Mar 15, 2020
Aug 10, 2015
Dec 13, 2016
Aug 20, 2020

Repository files navigation

Canu

Canu is a fork of the Celera Assembler, designed for high-noise single-molecule sequencing (such as the PacBio RS II/Sequel or Oxford Nanopore MinION).

Canu is a hierarchical assembly pipeline which runs in four steps:

  • Detect overlaps in high-noise sequences using MHAP
  • Generate corrected sequence consensus
  • Trim corrected sequences
  • Assemble trimmed corrected sequences

Install:

  • Do NOT download the .zip source code. It is missing files and will not compile. This is a known flaw with git itself.

  • The easiest way to get started is to download a binary release.

  • Installing with a 'package manager' is not encouraged, but if you have no other choice:

    • Conda: conda install -c conda-forge -c bioconda -c defaults canu
    • Homebrew: brew install brewsci/bio/canu
  • Alternatively, you can use the latest unreleased version from the source code. This version has not undergone the same testing as a release and so may have unknown bugs or issues generating sub-optimal assemblies. We recommend the release version for most users.

      git clone https://github.com/marbl/canu.git
      cd canu/src
      make -j <number of threads>
    
  • An unsupported Docker image made by Frank Förster is at https://hub.docker.com/r/greatfireball/canu/.

Learn:

The quick start will get you assembling quickly, while the tutorial explains things in more detail.

Run:

Brief command line help:

../<architecture>/bin/canu

Full list of parameters:

../<architecture>/bin/canu -options

Citation:

About

A single molecule sequence assembler for genomes large and small.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C++ 63.9%
  • C 25.6%
  • Perl 9.1%
  • Other 1.4%