-
Notifications
You must be signed in to change notification settings - Fork 68
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'slee/release-v0.8.0' into 'master'
v0.8.0 documentation updates See merge request machine-learning/dorado!1199
- Loading branch information
Showing
3 changed files
with
57 additions
and
20 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -22,10 +22,10 @@ If you encounter any problems building or running Dorado, please [report an issu | |
|
||
First, download the relevant installer for your platform: | ||
|
||
- [dorado-0.7.3-linux-x64](https://cdn.oxfordnanoportal.com/software/analysis/dorado-0.7.3-linux-x64.tar.gz) | ||
- [dorado-0.7.3-linux-arm64](https://cdn.oxfordnanoportal.com/software/analysis/dorado-0.7.3-linux-arm64.tar.gz) | ||
- [dorado-0.7.3-osx-arm64](https://cdn.oxfordnanoportal.com/software/analysis/dorado-0.7.3-osx-arm64.zip) | ||
- [dorado-0.7.3-win64](https://cdn.oxfordnanoportal.com/software/analysis/dorado-0.7.3-win64.zip) | ||
- [dorado-0.8.0-linux-x64](https://cdn.oxfordnanoportal.com/software/analysis/dorado-0.8.0-linux-x64.tar.gz) | ||
- [dorado-0.8.0-linux-arm64](https://cdn.oxfordnanoportal.com/software/analysis/dorado-0.8.0-linux-arm64.tar.gz) | ||
- [dorado-0.8.0-osx-arm64](https://cdn.oxfordnanoportal.com/software/analysis/dorado-0.8.0-osx-arm64.zip) | ||
- [dorado-0.8.0-win64](https://cdn.oxfordnanoportal.com/software/analysis/dorado-0.8.0-win64.zip) | ||
|
||
Once the relevant `.tar.gz` or `.zip` archive is downloaded, extract the archive to your desired location. | ||
|
||
|
@@ -363,7 +363,7 @@ $ dorado download --model all | |
The names of Dorado models are systematically structured, each segment corresponding to a different aspect of the model, which include both chemistry and run settings. Below is a sample model name explained: | ||
`dna_r10.4.1_e8.2_400bps_hac@v4.3.0` | ||
`dna_r10.4.1_e8.2_400bps_hac@v5.0.0` | ||
- **Analyte Type (`dna`)**: This denotes the type of analyte being sequenced. For DNA sequencing, it is represented as `dna`. If you are using a Direct RNA Sequencing Kit, this will be `rna002` or `rna004`, depending on the kit. | ||
|
@@ -375,7 +375,7 @@ The names of Dorado models are systematically structured, each segment correspon | |
- **Model Type (`hac`)**: This represents the size of the model, where larger models yield more accurate basecalls but take more time. The three types of models are `fast`, `hac`, and `sup`. The `fast` model is the quickest, `sup` is the most accurate, and `hac` provides a balance between speed and accuracy. For most users, the `hac` model is recommended. | ||
- **Model Version Number (`v4.3.0`)**: This denotes the version of the model. Model updates are regularly released, and higher version numbers typically signify greater accuracy. | ||
- **Model Version Number (`v5.0.0`)**: This denotes the version of the model. Model updates are regularly released, and higher version numbers typically signify greater accuracy. | ||
### **DNA models:** | ||
|
@@ -387,8 +387,8 @@ The versioning of modification models is bound to the basecalling model. This me | |
| Basecalling Models | Compatible<br />Modifications | Modifications<br />Model<br />Version | Data<br />Sampling<br />Frequency | | ||
| :-------- | :------- | :--- | :--- | | ||
| **[email protected]** | | | 5 kHz | | ||
| **[email protected]** | 4mC_5mC<br />5mCG_5hmCG<br />5mC_5hmC<br />6mA<br /> | v1<br />v1<br />v1<br />v1 | 5 kHz | | ||
| **[email protected]** | 4mC_5mC<br />5mCG_5hmCG<br />5mC_5hmC<br />6mA<br /> | v1<br />v1<br />v1<br />v1 | 5 kHz | | ||
| **[email protected]** | 4mC_5mC<br />5mCG_5hmCG<br />5mC_5hmC<br />6mA<br /> | v2<br />v2<br />v2<br />v2 | 5 kHz | | ||
| **[email protected]** | 4mC_5mC<br />5mCG_5hmCG<br />5mC_5hmC<br />6mA<br /> | v2<br />v2<br />v2<br />v2 | 5 kHz | | ||
| [email protected] | | | 5 kHz | | ||
| [email protected] | 5mCG_5hmCG<br />5mC_5hmC<br />6mA<br /> | v1<br />v1<br />v2 | 5 kHz | | ||
| [email protected] | 5mCG_5hmCG<br />5mC_5hmC<br />6mA<br /> | v1<br />v1<br />v2 | 5 kHz | | ||
|
@@ -420,18 +420,21 @@ The versioning of modification models is bound to the basecalling model. This me | |
### **RNA models:** | ||
**Note:** The BAM format does not support `U` bases. Therefore, when Dorado is performing RNA basecalling, the resulting output files will include `T` instead of `U`. This is consistent across output file types. The same applies to parsing inputs. Any input HTS file (e.g. FASTQ generated by `guppy`/`basecall_server`) with `U` bases is not handled by `dorado`. | ||
**Note:** The BAM format does not support `U` bases. Therefore, when Dorado is performing RNA basecalling, the resulting output files will include `T` instead of `U`. This is consistent across output file types. | ||
| Basecalling Models | Compatible<br />Modifications | Modifications<br />Model<br />Version | Data<br />Sampling<br />Frequency | | ||
| :-------- | :------- | :--- | :--- | | ||
| **[email protected]** | N/A | N/A | 4 kHz | | ||
| **[email protected]** | m6A<br />pseU | v1<br />v1<br />v1 | 4 kHz | | ||
| **[email protected]** | m6A<br />pseU | v1<br />v1<br />v1 | 4 kHz | | ||
| [email protected] | N/A | N/A | 4 kHz | | ||
| [email protected] | N/A | N/A | 4 kHz | | ||
| **[email protected]** | | | 4 kHz | | ||
| **[email protected]** | m5C<br />m6A_DRACH<br />inosine_m6A<br />pseU | v1<br />v1<br />v1<br />v1 | 4 kHz | | ||
| **[email protected]** | m5C<br />m6A_DRACH<br />inosine_m6A<br />pseU | v1<br />v1<br />v1<br />v1 | 4 kHz | | ||
| [email protected] | | | 4 kHz | | ||
| [email protected] | m6A<br />m6A_DRACH<br />pseU | v1<br />v1<br />v1 | 4 kHz | | ||
| [email protected] | m6A<br />m6A_DRACH<br />pseU | v1<br />v1<br />v1 | 4 kHz | | ||
| [email protected] | | | 4 kHz | | ||
| [email protected] | | | 4 kHz | | ||
| [email protected] | m6A_DRACH | v1 | 4 kHz | | ||
| rna002_70bps_fast@v3 | N/A | N/A | 3 kHz | | ||
| rna002_70bps_hac@v3 | N/A | N/A | 3 kHz | | ||
| rna002_70bps_fast@v3 | | | 3 kHz | | ||
| rna002_70bps_hac@v3 | | | 3 kHz | | ||
## Automatic model selection complex | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters