Population Genomics with R presents a multidisciplinary approach to the analysis of population genomics. The methods covered range from traditional population genetics to large-scale genomics with high-throughput sequencing data. Several dozen R packages are examined and integrated to provide a coherent software environment with a wide range of computational, statistical, and graphical tools. Small examples illustrate the basics, and published data serve as case studies. Readers are expected to have a basic knowledge of biology, genetics, and statistical inference methods. Graduate students and postdoctoral researchers will find resources to analyze their population genetic and genomic data and to help them design new studies.
The first four chapters review the basics of population genomics, data acquisition, and the use of R to store and manipulate genomic data. Chapter 5 treats the exploration of genomic data, an important issue when analysing large data sets. The other five chapters cover linkage disequilibrium, population genomic structure, geographical structure, past demographic events, and natural selection. These chapters include supervised and unsupervised methods, admixture analysis, an in-depth treatment of multivariate methods, and advice on how to handle GIS data. The analysis of natural selection, a traditional issue in evolutionary biology, has seen a revival with modern population genomic data. All chapters include exercises. Supplemental materials are available online.
2. Data Acquisition
3. Genomic Data in R
4. Data Manipulation
5. Data Exploration and Summaries
6. Linkage Disequilibrium and Haplotype Structure
7. Population Genetic Structure
8. Geographical Structure
9. Past Demographic Events
10. Natural Selection
A Installing R Packages
B Compressing Large Sequence Files
C Sampling of Alleles in a Population
Emmanuel Paradis is a senior researcher at the French Institute of Research for Development (IRD). His research focuses on evolutionary models and their applications. Developing and publishing software associated with his research has been an important part of his work for more than twenty years. He adopted R as his main software for data analysis in 2000 and has since published and maintained several packages, including ape since 2002 and pegas since 2009. He regularly gives workshops and training courses in several countries.
"The author has taken good care of including several important as well as emerging topics (data acquisition, next generation sequencing) that would be extremely useful for the readers [...] suggest that this book be targeted to graduate students and researchers who have some background in basic genetics or are taking a graduate level population genetics course [...] The data acquisition chapter, descriptions of DNA sample quality, and file formats are the strengths. Case studies are very valuable and would provide more "hands-on" training on working on specific population genetics problems."
– Santhosh Girirajan, Pennsylvania State University
"The strength of those chapters is to provide a global coverage of the field of population genetics based on a broad spectrum of statistical methods. The author proposes to deal with population genetic analyses in a unified programming framework that uses specific classes of the R packages ape/pegas and adegenet, and I was impressed by the work done."
– Olivier François, University Grenoble Alpes
"This book could serve as both a reference book and a textbook. Population genetics, applied bioinformatics, genomics, molecular ecology, and conservation genetic classes with a lab component at both undergraduate and graduate levels could teach from this text. Graduate students and possible postdocs in evolutionary biology and applied bioinformatics could use this as a reference. Additionally, government and non-profit organizations that process genetic samples for conservation and management purposes would find this instruction useful. [...] What this text offers is unique in that it is focused on practical steps to analyze data using already available programs that users can install [...] Given the variety of subjects and types of analyses, I think it could be a valuable resource for many students."
– Sarah Hendricks, San Diego Zoo Institute for Conservation Research