FOCUS: an alignment-free model to identify organisms in metagenomes using non-negative least squares

Publication date

2014

Authors

Silva, Genivaldo Gueiros Z
Cuevas, Daniel A
Dutilh, Bas (ISNI 0000000389464735)
Edwards, Robert A

Document Type

Article
Open Access

Abstract

One of the major goals in metagenomics is to identify the organisms present in a microbial community from unannotated shotgun sequencing reads. Taxonomic profiling has valuable applications in biological and medical research, including disease diagnostics. Most currently available approaches do not scale well with increasing data volumes, which matters because both the number and the length of the reads produced by sequencing platforms keep increasing. Here we introduce FOCUS, an agile composition-based approach that uses non-negative least squares (NNLS) to report the organisms present in metagenomic samples and profile their abundances. FOCUS was tested with simulated and real metagenomes, and the results show that our approach accurately predicts the organisms present in microbial communities. FOCUS was implemented in Python. The source code and web server are freely available at http://edwards.sdsu.edu/FOCUS.
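
The composition-based NNLS idea described in the abstract can be illustrated with a minimal sketch. This is not the FOCUS source code: the reference sequences, the organism names, the k-mer size of 2, and the helper names kmer_profile and nnls_projected_gradient are all hypothetical, and the solver is a plain projected gradient descent substituted here so that the example needs only the Python standard library. The sample profile is built as an exact 70/30 mixture of two reference profiles, and the non-negative fit is then asked to recover those weights.

```python
from collections import Counter
from itertools import product

K = 2  # toy k-mer size, an assumption made for this demo only
KMERS = ["".join(p) for p in product("ACGT", repeat=K)]


def kmer_profile(seq):
    """Return the relative frequency of every K-mer in seq, in KMERS order."""
    counts = Counter(seq[i:i + K] for i in range(len(seq) - K + 1))
    total = sum(counts[m] for m in KMERS) or 1
    return [counts[m] / total for m in KMERS]


def nnls_projected_gradient(columns, target, steps=5000):
    """Minimise ||A x - b||^2 subject to x >= 0.

    columns holds the columns of A (one reference profile each) and
    target is b. Projected gradient descent stands in for a dedicated
    NNLS routine; it is simple, not fast.
    """
    n = len(columns)
    # trace(A^T A) bounds the largest eigenvalue of A^T A, so this step
    # size is small enough for the iteration to converge.
    step = 1.0 / (2.0 * sum(v * v for col in columns for v in col))
    x = [0.0] * n
    for _ in range(steps):
        residual = [sum(x[j] * columns[j][i] for j in range(n)) - t
                    for i, t in enumerate(target)]
        grad = [2.0 * sum(c * r for c, r in zip(col, residual))
                for col in columns]
        # Gradient step followed by projection onto the non-negative orthant.
        x = [max(0.0, xj - step * g) for xj, g in zip(x, grad)]
    return x


# Hypothetical reference "genomes" (made-up strings, not real organisms).
references = {
    "organism_a": "ATATATATTTAAATAT",
    "organism_b": "GCGCGGCCGCGCCGGC",
    "organism_c": "ACGTACGTTGCATGCA",
}
names = list(references)
profiles = [kmer_profile(references[name]) for name in names]

# Simulated sample: an exact mixture of the reference profiles.
true_mix = [0.7, 0.3, 0.0]
sample = [sum(w * p[i] for w, p in zip(true_mix, profiles))
          for i in range(len(KMERS))]

weights = nnls_projected_gradient(profiles, sample)
total = sum(weights)
abundances = {name: w / total for name, w in zip(names, weights)}

for name in names:
    print(f"{name}: {abundances[name]:.3f}")
```

Because the fitted weights are constrained to be non-negative, they can be normalised and read directly as relative abundances, which is what makes this formulation a natural fit for profiling. Real reads would give a noisy sample profile rather than an exact mixture, so the recovered weights would only approximate the true composition.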

Citation

Silva, G G Z, Cuevas, D A, Dutilh, B E & Edwards, R A 2014, 'FOCUS: an alignment-free model to identify organisms in metagenomes using non-negative least squares', PeerJ [E], vol. 2, e425. https://doi.org/10.7717/peerj.425