To see accurate pricing, please choose your delivery country.
 
 
United States
£ GBP
All Shops

British Wildlife

8 issues per year 84 pages per issue Subscription only

British Wildlife is the leading natural history magazine in the UK, providing essential reading for both enthusiast and professional naturalists and wildlife conservationists. Published eight times a year, British Wildlife bridges the gap between popular writing and scientific literature through a combination of long-form articles, regular columns and reports, book reviews and letters.

Subscriptions from £33 per year

Conservation Land Management

4 issues per year 44 pages per issue Subscription only

Conservation Land Management (CLM) is a quarterly magazine that is widely regarded as essential reading for all who are involved in land management for nature conservation, across the British Isles. CLM includes long-form articles, events listings, publication reviews, new product information and updates, reports of conferences and letters.

Subscriptions from £26 per year
Academic & Professional Books  Reference  Data Analysis & Modelling  Data Analysis & Statistics

Introduction to Data Science Data Analysis and Prediction Algorithms with R

Textbook
By: Rafael A Irizarry(Author)
743 pages, illustrations
Publisher: CRC Press
Introduction to Data Science
Click to have a closer look
  • Introduction to Data Science ISBN: 9780367357986 Hardback Nov 2019 Not in stock: Usually dispatched within 1 week
    £74.99
    #254776
Price: £74.99
About this book Contents Customer reviews Biography Related titles

About this book

Introduction to Data Science: Data Analysis and Prediction Algorithms with R introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling, data visualization, predictive algorithm building, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation.

This book is a textbook for a first course in data science. No previous knowledge of R is necessary, although some experience with programming may be helpful. The book is divided into six parts: R, data visualization, statistics with R, data wrangling, machine learning, and productivity tools. Each part has several chapters meant to be presented as one lecture.

The author uses motivating case studies that realistically mimic a data scientist's experience. He starts by asking specific questions and answers these through data analysis so concepts are learned as a means to answering the questions. Examples of the case studies included are: US murder rates by state, self-reported student heights, trends in world health and economics, the impact of vaccines on infectious disease rates, the financial crisis of 2007-2008, election forecasting, building a baseball team, image processing of hand-written digits, and movie recommendation systems.

The statistical concepts used to answer the case study questions are only briefly introduced, so complementing with a probability and statistics textbook is highly recommended for in-depth understanding of these concepts. If you read and understand the chapters and complete the exercises, you will be prepared to learn the more advanced concepts and skills needed to become an expert.

A complete solutions manual is available to registered instructors who require the text for a course.

Contents

I. R
1. Installing R and RStudio
2. Getting Started with R and RStudio
3. R Basics
4. Programming basics
5. The tidyverse 84
6. Importing data

II. Data Visualization
7. Introduction to data visualization
8. ggplot2
9. Visualizing data distributions
10. Data visualization in practice
11. Data visualization principles
12. Robust summaries

III. Statistics with R
13. Introduction to Statistics with R
14. Probability
15. Random variables
16. Statistical Inference
17. Statistical models
18. Regression
19. Linear Models
20. Association is not causation

IV. Data Wrangling
21. Introduction to Data Wrangling
22. Reshaping data
23. Joining tables
24. Web Scraping
25. String Processing
26. Parsing Dates and Times
27. Text mining

V. Machine Learning
28. Introduction to Machine Learning
29. Smoothing
30. Cross validation
31. The caret package
32. Examples of algorithms
33. Machine learning in practice
34. Large datasets
35. Clustering

VI. Productivity tools
36. Introduction to productivity tools
37. Accessing the terminal and installing Git
38. Organizing with Unix
39. Git and GitHub
40. Reproducible projects with RStudio and R markdown

Customer Reviews

Biography

Rafael A. Irizarry is a professor of data sciences at the Dana-Farber Cancer Institute, a professor of biostatistics at Harvard, and a fellow of the American Statistical Association. Dr Irizarry is an applied statistician and during the last 20 years has worked in diverse areas, including genomics, sound engineering, and public health. He disseminates solutions to data analysis challenges as open-source software, tools that are widely downloaded and used. Dr Irizarry has also developed and taught several data science courses at Harvard as well as popular online courses.

Textbook
By: Rafael A Irizarry(Author)
743 pages, illustrations
Publisher: CRC Press
Media reviews

"I think the book would be perfect for schools looking to make a transition to a model where introduction to data science takes the place of introduction to statistics and maybe introductory computer science."
– Arend Kuyper, Northwestern University

"A great introduction to data science and modern R programing, with tons of examples of application of the R abilities throughout the whole volume. The book suggests multiple links to the internet websites related to the topics under consideration that makes it an incredibly useful source of contemporary data science and programing, helping to students and researchers in their projects."
Technometrics

"Introduction to Data Science will teach you to juggle with your data and get maximum results from it using R. I highly recommended this book for students and everybody taking the first steps in data science using R."
– Maria Ivanchuk, ISCB News

Current promotions
New and Forthcoming BooksNHBS Moth TrapBritish Wildlife MagazineBuyers Guides