A Hands-On Introduction to Data Science

Front Cover
Cambridge University Press, Apr 2, 2020 - Business & Economics - 424 pages
This book introduces the field of data science in a practical and accessible manner, using a hands-on approach that assumes no prior knowledge of the subject. The foundational ideas and techniques of data science are provided independently from technology, allowing students to easily develop a firm understanding of the subject without a strong technical background, as well as being presented with material that will have continual relevance even after tools and technologies change. Using popular data science tools such as Python and R, the book offers many examples of real-life applications, with practice ranging from small to big data. A suite of online material for both instructors and students provides a strong supplement to the book, including datasets, chapter slides, solutions, sample exams and curriculum suggestions. This entry-level textbook is ideally suited to readers from a range of disciplines wishing to build a practical, working knowledge of data science.
 

Contents

Data
37
Techniques
66
UNIX
99
Python
125
R
161
MySQL
187
Machine Learning for Data Science
207
Supervised Learning
235
Applications Evaluations and Methods
319
Data Collection Experimentation and Evaluation
354
Appendices
379
Installing and Configuring Tools
385
Using Cloud Services
393
Data Science Jobs
407
Data Science and Ethics
412
Index
418

Unsupervised Learning
290

Other editions - View all

Common terms and phrases

About the author (2020)

Chirag Shah is an Associate Professor of Information and Computer Science at Rutgers University, New Jersey. He investigates issues of search and recommendations using data mining and machine learning. Dr Shah received his M.S. in Computer Science from the University of Massachusetts, Amherst, and his Ph.D. in Information Science from the University of North Carolina, Chapel Hill. He directs the InfoSeeking Lab, supported by awards from the National Science Foundation, the National Institute of Health, the Institute of Museum and Library Services, as well as Amazon, Google, and Yahoo. He was a Visiting Research Scientist at Spotify and has served as a consultant to the United Nations Data Analytics on various data science projects. He is currently working on large-scale e-commerce data and machine learning problems as Amazon Scholar.