Quantitative Trait Mapping: Glossary

Key Points

Introduction
Input File Format
  • QTL mapping data consists of a set of tables of data: marker genotypes, phenotypes, marker maps, etc.

  • These different tables are in separate comma-delimited (CSV) files.

  • In each file, the first column is a set of IDs for the rows, and the first row is a set of IDs for the columns.

  • In addition to primary data, a separate file with control parameters (or metadata) in either YAML or JSON format is required.

  • Published and public data already formatted for QTL mapping are available on the web.

  • These data can be used as a model for formatting your own QTL data.

Calculating Genotype Probabilities
  • The first step in QTL analysis is to calculate genotype probabilities.

  • Insert pseudomarkers to calculate QTL at positions between genotyped markers.

  • Calculate genotype probabilities between genotyped markers with calc_genoprob().

Special covariates for the X chromosome
  • The X chromosome requires special treatment in QTL mapping.

  • Special covariates such as sex should be included to avoid spurious evidence of linkage.

Performing a genome scan
  • A qtl2 genome scan requires genotype probabilities and a phenotype matrix.

  • The output from a genome scan contains a LOD score matrix, map positions, and phenotypes.

  • LOD curve plots for a genome scan can be viewed with plot_scan1().

Performing a permutation test
  • A permutation test establishes the statistical significance of a genome scan.

Finding LOD peaks
  • LOD peaks and support intervals can be identified with find_peaks().

Calculating A Kinship Matrix
  • Kinship matrices account for relationships among individuals.

  • Kinship is calculated as the proportion of shared alleles between individuals.

  • Kinship calculation is a precursor to a genome scan via a linear mixed model.

Performing a genome scan with a linear mixed model
  • To perform a genome scan with a linear mixed model, supply a kinship matrix.

  • Different mapping and kinship calculation methods give different results.

Performing a genome scan with binary traits
  • A genome scan for binary traits (0 and 1) requires special handling; scans for non-binary traits assume normal variation of the residuals.

  • A genome scan for binary traits is performed with logistic regression.

Estimated QTL effects
  • Estimated founder allele effects can be plotted from the mapping model coefficients.

  • Additive and dominance effects can be plotted using contrasts.

SNP association mapping
  • SNP association analysis with DO mice requires SNPs in the QTL regions.

  • .

QTL analysis in Diversity Outbred Mice
  • .

  • .

Glossary

External references

Quantitative Trait Mapping

Broman KW, Wu H, Sen Ś, Churchill GA. R/qtl: QTL mapping in experimental crosses. Bioinformatics. 2003 May 1;19(7):889-90.

Broman KW, Sen S. A Guide to QTL Mapping with R/qtl. New York: Springer; 2009 Apr 23.

Flint J, Eskin E. Genome-wide association studies in mice. Nature Reviews Genetics. 2012 Nov;13(11):807.

Gonzales NM, Palmer AA. Fine-mapping QTLs in advanced intercross lines and other outbred populations. Mammalian Genome. 2014 Aug 1;25(7-8):271-92.

Mott R, Flint J. Dissecting quantitative traits in mice. Annual Review of Genomics and Human Genetics. 2013 Aug 31;14:421-39.

Hidden Markov Models

Durbin R, Eddy SR, Krogh A, Mitchison G. Biological sequence analysis: probabilistic models of proteins and nucleic acids. Cambridge University Press; 1998 Apr 23.

Eddy SR. What is a hidden Markov model?. Nature Biotechnology. 2004 Oct;22(10):1315.

Rabiner LR. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE. 1989 Feb;77(2):257-86.

Mapping in Multiparent Crosses

Aylor DL, Valdar W, Foulds-Mathes W, Buus RJ, Verdugo RA, Baric RS, Ferris MT, Frelinger JA, Heise M, Frieman MB, Gralinski LE. Genetic analysis of complex traits in the emerging Collaborative Cross. Genome Research. 2011 Aug 1;21(8):1213-22.

Churchill GA, Airey DC, Allayee H, Angel JM, Attie AD, Beatty J, Beavis WD, Belknap JK, Bennett B, Berrettini W, Bleich A. The Collaborative Cross, a community resource for the genetic analysis of complex traits. Nature Genetics. 2004 Nov 1;36(11):1133.

Churchill GA, Gatti DM, Munger SC, Svenson KL. The diversity outbred mouse population. Mammalian Genome. 2012 Oct 1;23(9-10):713-8.

Collaborative Cross Consortium. The genome architecture of the Collaborative Cross mouse genetic reference population. Genetics. 2012 Feb 1;190(2):389-401.

French JE, Gatti DM, Morgan DL, Kissling GE, Shockley KR, Knudsen GA, Shepard KG, Price HC, King D, Witt KL, Pedersen LC. Diversity outbred mice identify population-based exposure thresholds and genetic factors that influence benzene-induced genotoxicity. Environmental Health Perspectives. 2014 Nov 6;123(3):237-45.

Gatti DM, Svenson KL, Shabalin A, Wu LY, Valdar W, Simecek P, Goodwin N, Cheng R, Pomp D, Palmer A, Chesler EJ. Quantitative trait locus mapping methods for diversity outbred mice. G3: Genes, Genomes, Genetics. 2014 Sep 1;4(9):1623-33.

Huang X, Paulo MJ, Boer M, Effgen S, Keizer P, Koornneef M, van Eeuwijk FA. Analysis of natural allelic variation in Arabidopsis using a multiparent recombinant inbred line population. Proceedings of the National Academy of Sciences. 2011 Mar 15;108(11):4488-93.

Kover PX, Valdar W, Trakalo J, Scarcelli N, Ehrenreich IM, Purugganan MD, Durrant C, Mott R. A multiparent advanced generation inter-cross to fine-map quantitative traits in Arabidopsis thaliana. PLoS Genetics. 2009 Jul 10;5(7):e1000551.

Review: Population Structure in Genetic Studies: Confounding Factors and Mixed Models Lana S. Martin, Eleazar Eskin bioRxiv 092106; doi: https://doi.org/10.1101/092106

Svenson KL, Gatti DM, Valdar W, Welsh CE, Cheng R, Chesler EJ, Palmer AA, McMillan L, Churchill GA. High-resolution genetic mapping using the Mouse Diversity Outbred population. Genetics. 2012 Feb 1;190(2):437-47.