Genomic landscape of rat strain and substrain variation

Publication date

2015-05-06

Authors

Hermsen, Roel
de Ligt, Joep
Spee, Wim
Blokzijl, Francis
Schäfer, Sebastian
Adami, Eleonora
Boymans, Sander
Flink, Stephen
van Boxtel, Ruben
van der Weide, Robin H.

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

Abstract

Background: Since the completion of the rat reference genome in 2003, whole-genome sequencing data from more than 40 rat strains have become available. These data represent the broad range of strains that are used in rat research including commonly used substrains. Currently, this wealth of information cannot be used to its full extent, because the variety of different variant calling algorithms employed by different groups impairs comparison between strains. In addition, all rat whole genome sequencing studies to date used an outdated reference genome for analysis (RGSC3.4 released in 2004). Results: Here we present a comprehensive, multi-sample and uniformly called set of genetic variants in 40 rat strains, including 19 substrains. We reanalyzed all primary data using a recent version of the rat reference assembly (RGSC5.0 released in 2012) and identified over 12 million genomic variants (SNVs, indels and structural variants) among the 40 strains. 28,318 SNVs are specific to individual substrains, which may be explained by introgression from other unsequenced strains and ongoing evolution by genetic drift. Substrain SNVs may have a larger predicted functional impact compared to older shared SNVs. Conclusions: In summary we present a comprehensive catalog of uniformly analyzed genetic variants among 40 widely used rat inbred strains based on the RGSC5.0 assembly. This represents a valuable resource, which will facilitate rat functional genomic research. In line with previous observations, our genome-wide analyses do not show evidence for contribution of multiple ancestral founder rat subspecies to the currently used rat inbred strains, as is the case for mouse. In addition, we find that the degree of substrain variation is highly variable between strains, which is of importance for the correct interpretation of experimental data from different labs.

Keywords

Rat, Inbred strain, Substrain, Genomic variation, RGSC5.0, rn5, rnor5.0, SPONTANEOUSLY HYPERTENSIVE-RAT, BLOOD-PRESSURE, LABORATORY RAT, SEQUENCE, SNP, DIVERSITY, DISCOVERY, CANCER, MODELS, MOUSE, Biotechnology, Genetics

Citation

Hermsen, R, de Ligt, J, Spee, W, Blokzijl, F, Schäfer, S, Adami, E, Boymans, S, Flink, S, van Boxtel, R, van der Weide, R H, Aitman, T, Hübner, N, Simonis, M, Tabakoff, B, Guryev, V & Cuppen, E 2015, 'Genomic landscape of rat strain and substrain variation', BMC Genomics, vol. 16, 16:357. https://doi.org/10.1186/s12864-015-1594-1