Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying Principles of Data Science
  • Table Of Contents Toc
Principles of Data Science

Principles of Data Science - Third Edition

By : Sinan Ozdemir
4.8 (4)
close
close
Principles of Data Science

Principles of Data Science

4.8 (4)
By: Sinan Ozdemir

Overview of this book

Principles of Data Science bridges mathematics, programming, and business analysis, empowering you to confidently pose and address complex data questions and construct effective machine learning pipelines. This book will equip you with the tools to transform abstract concepts and raw statistics into actionable insights. Starting with cleaning and preparation, you’ll explore effective data mining strategies and techniques before moving on to building a holistic picture of how every piece of the data science puzzle fits together. Throughout the book, you’ll discover statistical models with which you can control and navigate even the densest or the sparsest of datasets and learn how to create powerful visualizations that communicate the stories hidden in your data. With a focus on application, this edition covers advanced transfer learning and pre-trained models for NLP and vision tasks. You’ll get to grips with advanced techniques for mitigating algorithmic bias in data as well as models and addressing model and data drift. Finally, you’ll explore medium-level data governance, including data provenance, privacy, and deletion request handling. By the end of this data science book, you'll have learned the fundamentals of computational mathematics and statistics, all while navigating the intricacies of modern ML and large pre-trained models like GPT and BERT.
Table of Contents (18 chapters)
close
close

Exploring the data

The process of exploring data is not always straightforward and can involve a variety of approaches and techniques. Some common tasks that are involved in data exploration include recognizing different types of data, transforming data types, and using code to systematically improve the quality of the entire dataset. These tasks can be accomplished using tools such as the pandas Python package, which is commonly used for data manipulation and analysis.

There are a few basic questions that you should consider when exploring a new dataset. These questions can help you to get a sense of the data and guide your analysis. The three basic questions are presented here:

  • What are the types of data that are present in the dataset?
  • What are the characteristics and patterns of the data?
  • How is the data organized, and what transformations might be necessary to make it more usable?

By answering these questions and exploring your data thoroughly, you can...

CONTINUE READING
83
Tech Concepts
36
Programming languages
73
Tech Tools
Icon Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.
Icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Icon 50+ new titles added per month and exclusive early access to books as they are being written.
Principles of Data Science
notes
bookmark Notes and Bookmarks search Search in title playlist Add to playlist download Download options font-size Font size

Change the font size

margin-width Margin width

Change margin width

day-mode Day/Sepia/Night Modes

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY

Submit Your Feedback

Modal Close icon
Modal Close icon
Modal Close icon