Book Image

Mastering Social Media Mining with Python

By : Marco Bonzanini
Book Image

Mastering Social Media Mining with Python

By: Marco Bonzanini

Overview of this book

Your social media is filled with a wealth of hidden data – unlock it with the power of Python. Transform your understanding of your clients and customers when you use Python to solve the problems of understanding consumer behavior and turning raw data into actionable customer insights. This book will help you acquire and analyze data from leading social media sites. It will show you how to employ scientific Python tools to mine popular social websites such as Facebook, Twitter, Quora, and more. Explore the Python libraries used for social media mining, and get the tips, tricks, and insider insight you need to make the most of them. Discover how to develop data mining tools that use a social media API, and how to create your own data analysis projects using Python for clear insight from your social data.
Table of Contents (10 chapters)

NLP Basics


This section tries to scratch the surface of the complex field of NLP. The previous chapters have mentioned some of the basics that are necessary for dealing with textual data (for example, tokenization) without going too much into the details. Here, we'll try to go one step further into the basic understanding of this discipline. Due to its complexity and many aspects, we're taking a pragmatic approach and only scratching the surface of the theoretical foundations in favor of practical examples.

Text preprocessing

An essential part of any NLP system is the preprocessing pipeline. Before we can perform any interesting task on a piece of text, we must first convert it in a useful representation.

In the previous chapters, we already performed some analysis on the textual data without digging into the details of text preprocessing, but instead using the common tools with a pragmatic approach. In this section, we'll highlight some of the common preprocessing steps and discuss their role...