Kodoja: a workflow for virus detection in plants using k-mer analysis of RNA-sequencing data
RNA-sequencing of plant material allows for hypothesis-free detection of multiple viruses simultaneously. This methodology relies on bioinformatics workflows for virus identification. Most workflows are designed for human clinical data, and few go beyond sequence mapping for virus identification. We present a new workflow (Kodoja) for the detection of plant virus sequences in RNA-sequence data. Kodoja uses k-mer profiling at the nucleotide level and sequence mapping at the protein level by integrating two existing tools, Kraken and Kaiju. Kodoja was tested on three existing RNA-seq datasets from grapevine, and two new RNA-seq datasets from raspberry. For grapevine, Kodoja was shown to be more sensitive than a method based on contig building and BLAST alignments (27 viruses detected compared to 19). The application of Kodoja to raspberry showed that field-grown raspberries were infected by multiple viruses, and that RNA-seq can identify lower amounts of virus material than reverse transcriptase PCR. This work enabled the design of new PCR primers for detection of Raspberry yellow net virus and Beet ringspot virus. Kodoja is a sensitive method for plant virus discovery in field samples and enables the design of more accurate primers for detection. Kodoja is available to install through Bioconda and as a tool within Galaxy.
Baizan-Edge, A, Cock, P, MacFarlane, S, McGavin, W, Torrance, L & Jones, S 2019, 'Kodoja: a workflow for virus detection in plants using k-mer analysis of RNA-sequencing data', Journal of General Virology, vol. 100, pp. 533-542. https://doi.org/10.1099/jgv.0.001210
Journal of General Virology
© 2019 The Authors | Published by the Microbiology Society. This work has been made available online in accordance with the publisher’s policies. This is the author created accepted version manuscript following peer review and as such may differ slightly from the final published version. The final published version of this work is available at: https://doi.org/10.1099/jgv.0.001210
Description: This work was supported by the Biotechnology and Biological Sciences Research Council [BB/N023293/1]. The work of L.T., S.J., S.M. and P.C. was additionally supported by the Scottish Government’s Rural and Environment Science and Analytical Services division (RESAS).