You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Clair3-RNA is a small variant caller for RNA long-read data. Clair3-RNA supports ONT R9 chemistry with complementary DNA sequencing (cDNA) and direct RNA sequencing (dRNA). It also supports PacBio Sequel and PacBio MAS-Seq RNA sequencing data.
17
-
18
-
At the current stage, Clair3-RNA is a side project of the lab. It runs, but its design is not very different from Clair3, thus not optimized for the characteristics of transcriptome data, such as uneven and low coverage. Later, we will put more effort into this project, and it is likely that we will give up most of the current designs.
16
+
Clair3-RNA is a small variant caller for RNA long-read data. Clair3-RNA supports ONT complementary DNA sequencing (cDNA) and direct RNA sequencing (dRNA). dRNA sequencing support the ONT latest [SQK-RNA004 kit](https://community.nanoporetech.com/docs/prepare/library_prep_protocols/direct-rna-sequencing-sqk-rna004/v/drs_9195_v4_revd_20sep2023) data for variant calling. Clair3-RNA also supports PacBio Sequel and PacBio MAS-Seq RNA sequencing data.
19
17
20
18
For germline small variant calling, please use [Clair3](https://github.com/HKU-BAL/Clair3).
21
19
@@ -39,6 +37,13 @@ For somatic small variant calling using tumor sample only, please try [ClairS-TO
39
37
40
38
----
41
39
40
+
## Latest Updates
41
+
*v0.1.0 (Aug 15, 2024)* : 1. Added a new ONT dRNA004 direct RNA sequencing model (`ont_dorado_drna004`) for SQK-RNA004 kit. 2. Added new PacBio Sequel (`hifi_sequel2_minimap2`) and Revio (`hifi_mas_minimap2`) model to support minimap2 alignment. 3. Enhance model training techniques to boost performance by incorporating strategies such as managing low-coverage sites, verifying variant zygosity, filtering RNA editing sites, etc. 4. Renamed all ONT and PacBio model names, check [here](https://github.com/HKU-BAL/Clair3-RNA?tab=readme-ov-file#pre-trained-models) for more details.
42
+
43
+
*v0.0.1 (Nov 27, 2023)*: Initial release for early access.
44
+
45
+
---
46
+
42
47
## Quick Demo
43
48
44
49
- Oxford Nanopore (ONT) data as input, see [ONT Quick Demo](docs/ont_quick_demo.md).
@@ -49,7 +54,7 @@ For somatic small variant calling using tumor sample only, please try [ClairS-TO
49
54
After following [installation](#installation), you can run Clair3-RNA with one command:
--tag_variant_using_readiportal ## optional, tag variants uisng REDIportal dataset
191
200
--output_dir ${OUTPUT_DIR}## output path prefix
192
201
193
202
## Final output file: ${OUTPUT_DIR}/output.vcf.gz
@@ -207,7 +216,7 @@ docker run -it hkubal/clair3-rna:latest /opt/bin/clair3_rna --help
207
216
-t THREADS, --threads THREADS
208
217
Max #threads to be used.
209
218
-p PLATFORM, --platform PLATFORM
210
-
Select the sequencing platform of the input. Possible options: {ont_r9_guppy_cdna, ont_r9_guppy_drna, hifi_sequel2, hifi_mas}.
219
+
Select the sequencing platform of the input. Possible options: {ont_dorado_drna004, ont_guppy_drna002, ont_guppy_cdna, hifi_sequel2_pbmm2, hifi_sequel2_minimap2, hifi_mas_pbmm2, hifi_sequel2_minimap2}.
211
220
```
212
221
213
222
**Miscellaneous parameters:**
@@ -257,27 +266,27 @@ docker run -it hkubal/clair3-rna:latest /opt/bin/clair3_rna --help
257
266
#### Call variants in one or mutiple chromosomes using the `-C/--ctg_name` parameter
@@ -286,3 +295,5 @@ We highly recommended using BED file to define multiple regions of interest like
286
295
287
296
RNA undergoes editing by ADAR (adenosine deaminases acting on RNA), resulting in Adenosine-to-inosine (A-to-I) changes. These A-to-I changes can be observed in RNA-seq datasets as A-to-G and T-to-C changes, which do not represent genuine RNA variants. To address this, we provide users with the option to utilize external datasets such as [REDIportal](http://srv00.recas.ba.infn.it/atlas/) to annotate RNA editing sites. In Clair3-RNA's VCF output, variants that are also RNA editing sites reported in REDIportal can be tagged. These sites will be marked as `RNAEditing` instead of `PASS` in the `FILTER` column when the `--tag_variant_using_readiportal` option is enabled.
288
297
298
+
**Caution**: `--tag_variant_using_readiportal` option currently works for GRCh38 and GRCh37 reference genome only, use can specify the reference genome version by using option `--readiportal_reference_genome_version={grch38, grch37}`.
0 commit comments