Book Image

Clean Data: Tips, Tricks, and Techniques [Video]

By : Tomasz Lelek
Book Image

Clean Data: Tips, Tricks, and Techniques [Video]

By: Tomasz Lelek

Overview of this book

<p>"Give me six hours to chop down a tree and I will spend the first four sharpening the axe"? Do you apply the same principle when doing Data Science?</p> <p>Effective data cleaning is one of the most important aspects of good Data Science and involves acquiring raw data and preparing it for analysis, which, if not done effectively, will not give you the accuracy or results that you're looking to achieve, no matter how good your algorithm is.<br />Data Cleaning is the hardest part of big data and ML. To address this matter, this course will equip you with all the skills you need to clean your data in Python, using tried and tested techniques. You'll find a plethora of tips and tricks that will help you get the job done, in a smart, easy, and efficient way.</p> <p>All the code and supporting files for this course are available on Github at&nbsp;<a href="https://github.com/PacktPublishing/Clean-Data-Tips-Tricks-and-Techniques" target="_blank">https://github.com/PacktPublishing/Clean-Data-Tips-Tricks-and-Techniques</a></p> <h1>Style and Approach</h1> <p>Each section teaches one particular aspect of the overall topic and its section title reflects that. Each video teaches a subtopic in a hands-on way with a practical demonstration, along with explanation and a discussion of how it works and how to use it.</p>
Table of Contents (5 chapters)
Chapter 3
Dealing with Unstructured Data (Text)
Content Locked
Section 1
Analyzing Unstructured Text Input Data
In this video, we will learn how to deal with text data. - Understand how to prepare data to extract features