Mastering Social Media Mining with Python

By: Marco Bonzanini

Overview of this book

Your social media is filled with a wealth of hidden data – unlock it with the power of Python. Transform your understanding of your clients and customers when you use Python to solve the problems of understanding consumer behavior and turning raw data into actionable customer insights. This book will help you acquire and analyze data from leading social media sites. It will show you how to employ scientific Python tools to mine popular social websites such as Facebook, Twitter, Quora, and more. Explore the Python libraries used for social media mining, and get the tips, tricks, and insider insight you need to make the most of them. Discover how to develop data mining tools that use a social media API, and how to create your own data analysis projects using Python for clear insight from your social data.
Table of Contents (15 chapters)
Mastering Social Media Mining with Python
Credits
About the Author
About the Reviewer
www.PacktPub.com
Preface

Summary


This chapter introduced some data mining applications using Twitter data. We discussed how to register an application with the Twitter platform in order to obtain credentials and interact with the Twitter APIs. We considered different ways to download tweets, in particular using the REST endpoints to search for published tweets, and using the Streaming API to keep a connection open and collect incoming tweets.

When observing the anatomy of a tweet, we found that a tweet is much more than its 140 characters of text. In fact, it is a complex object that carries a lot of additional information.
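As a reminder of what this looks like in practice, the following is a minimal sketch that reads a tweet as JSON and pulls out a few attributes. The sample tweet is hand-written for illustration and trimmed to a handful of fields (a real tweet carries many more), and the helper name `summarize_tweet` is our own, not part of any library.

```python
import json

# Hand-written sample for illustration only: a real tweet from the
# REST or Streaming API contains many more fields than these.
SAMPLE_TWEET = '''
{
  "created_at": "Mon Jan 04 10:00:00 +0000 2016",
  "id_str": "1",
  "text": "Mining tweets with #Python and #NLP",
  "lang": "en",
  "retweet_count": 3,
  "user": {"screen_name": "example_user", "followers_count": 42},
  "entities": {
    "hashtags": [{"text": "Python", "indices": [19, 26]},
                 {"text": "NLP", "indices": [31, 35]}],
    "user_mentions": [],
    "urls": []
  }
}
'''


def summarize_tweet(tweet):
    """Return a small dict with a few attributes of a tweet.

    `tweet` is the dict obtained by parsing the tweet's JSON.
    Missing fields fall back to empty defaults instead of raising.
    """
    entities = tweet.get('entities', {})
    return {
        'author': tweet.get('user', {}).get('screen_name'),
        'text': tweet.get('text', ''),
        'language': tweet.get('lang'),
        'retweets': tweet.get('retweet_count', 0),
        'hashtags': [h['text'] for h in entities.get('hashtags', [])],
    }


if __name__ == '__main__':
    tweet = json.loads(SAMPLE_TWEET)
    summary = summarize_tweet(tweet)
    print(summary['author'], summary['hashtags'])
```

The text itself is only one attribute among several: the author, the language, the retweet count, and the entities are all available without parsing the message.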

Our analysis started with frequency analysis based on entities. We focused on hashtags, one of Twitter's peculiarities, widely adopted by users to track specific topics. We also discussed aspects of natural language processing (NLP) such as tokenization and normalization of tokens. As we have seen, the language on Twitter doesn't follow the conventions of standard English...
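The ideas of tokenization, normalization, and hashtag frequency can be sketched in a few lines. This is a simplified illustration that uses only the standard library: the regular expression is a stand-in of our own for a Twitter-aware tokenizer, normalization is reduced to lowercasing, and the function names and sample texts are made up for the example.

```python
import re
from collections import Counter

# Simplified pattern (an assumption for this sketch): it keeps mentions,
# hashtags, and URLs as single tokens, then falls back to plain words.
TOKEN_RE = re.compile(r"(?:@\w+)|(?:#\w+)|(?:https?://\S+)|(?:\w+(?:'\w+)?)")


def tokenize(text):
    """Split a tweet's text into tokens, keeping Twitter entities whole."""
    return TOKEN_RE.findall(text)


def normalize(tokens):
    """Lowercase tokens so that #Python and #python are counted together."""
    return [token.lower() for token in tokens]


def hashtag_counts(texts):
    """Count normalized hashtags across an iterable of tweet texts."""
    counts = Counter()
    for text in texts:
        tokens = normalize(tokenize(text))
        counts.update(t for t in tokens if t.startswith('#'))
    return counts


if __name__ == '__main__':
    texts = [
        "Loving #Python for #NLP",
        "#python rocks!",
        "No tags here @user http://example.com",
    ]
    print(hashtag_counts(texts).most_common(1))  # [('#python', 2)]
```

Without the normalization step, `#Python` and `#python` would be counted as two different hashtags, which is one reason why normalization matters for frequency analysis on informal text.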