A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes

Publication date

2014

Authors

Dutilh, Bas E.ISNI 0000000389464735
Cassman, Noriko
McNair, Katelyn
Sanchez, Savannah E
Silva, Genivaldo G Z
Boling, Lance
Barr, Jeremy J
Speth, Daan R
Seguritan, Victor
Aziz, Ramy K

Editors

Advisors

Supervisors

Document Type

Article
Open Access logo

License

Abstract

Metagenomics, or sequencing of the genetic material from a complete microbial community, is a promising tool to discover novel microbes and viruses. Viral metagenomes typically contain many unknown sequences. Here we describe the discovery of a previously unidentified bacteriophage present in the majority of published human faecal metagenomes, which we refer to as crAssphage. Its ~97 kbp genome is six times more abundant in publicly available metagenomes than all other known phages together; it comprises up to 90% and 22% of all reads in virus-like particle (VLP)-derived metagenomes and total community metagenomes, respectively; and it totals 1.68% of all human faecal metagenomic sequencing reads in the public databases. The majority of crAssphage-encoded proteins match no known sequences in the database, which is why it was not detected before. Using a new co-occurrence profiling approach, we predict a Bacteroides host for this phage, consistent with Bacteroides-related protein homologues and a unique carbohydrate-binding domain encoded in the phage genome.

Keywords

Citation

Dutilh, B E, Cassman, N, McNair, K, Sanchez, S E, Silva, G G Z, Boling, L, Barr, J J, Speth, D R, Seguritan, V, Aziz, R K, Felts, B, Dinsdale, E A, Mokili, J L & Edwards, R A 2014, 'A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes', Nature Communications [E], vol. 5, 4498. https://doi.org/10.1038/ncomms5498