Mastering Social Media Mining with Python

Mastering Social Media Mining with Python

By : Marco Bonzanini

Buy this Book

Mastering Social Media Mining with Python

By: Marco Bonzanini

Buy this Book

Overview of this book

Your social media is filled with a wealth of hidden data – unlock it with the power of Python. Transform your understanding of your clients and customers when you use Python to solve the problems of understanding consumer behavior and turning raw data into actionable customer insights. This book will help you acquire and analyze data from leading social media sites. It will show you how to employ scientific Python tools to mine popular social websites such as Facebook, Twitter, Quora, and more. Explore the Python libraries used for social media mining, and get the tips, tricks, and insider insight you need to make the most of them. Discover how to develop data mining tools that use a social media API, and how to create your own data analysis projects using Python for clear insight from your social data.

Mastering Social Media Mining with Python

Credits

About the Author

About the Reviewer

www.PacktPub.com

Preface

Free Chapter

Social Media, Social Data, and Python

Getting started

Social media - challenges and opportunities

Python tools for data science

Processing data in Python

Building complex data pipelines

Summary

#MiningTwitter – Hashtags, Topics, and Time Series

Getting started

The Twitter API

Collecting data from Twitter

Analyzing tweets - entity analysis

Analyzing tweets - text analysis

Analyzing tweets - time series analysis

Summary

Users, Followers, and Communities on Twitter

Users, friends, and followers

Mining your followers

Mining the conversation

Plotting tweets on a map

Summary

Posts, Pages, and User Interactions on Facebook

The Facebook Graph API

Mining your posts

Mining Facebook Pages

Summary

Topic Analysis on Google+

Getting started with the Google+ API

Embedding the search results in a web GUI

Notes and activities from a Google+ page

Text analysis and TF-IDF on notes

Summary

Questions and Answers on Stack Exchange

Questions and answers

Getting started with the Stack Exchange API

Working with Stack Exchange data dumps

Text classification for question tags

Summary

Blogs, RSS, Wikipedia, and Natural Language Processing

Blogs and NLP

Getting data from blogs and websites

NLP Basics

Summary

Mining All the Data!

Many social APIs

Mining videos on YouTube

Mining open source software on GitHub

Mining local businesses on Yelp

Building a custom Python client

Summary

Linked Data and the Semantic Web

A Web of Data

Mining relations from DBpedia

Mining geo coordinates

Summary

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Analyzing tweets - text analysis

The previous section analyzed the entity field of a tweet. This provides useful knowledge on the tweet, because these entities are explicitly curated by the author of the tweet. This section will focus on unstructured data instead, that is, the raw text of the tweet. We'll discuss aspects of text analytics such as text preprocessing and normalization and we'll perform some statistical analysis on the tweets. Before digging the details, we'll introduce some terminology.

Tokenization is one of the important steps in the preprocessing phase. Given a stream of text (such as a tweet status), tokenization is the process of breaking this text down into individual units called tokens. In the simplest form, these units are words, but we could also work on a more complex tokenization that deals with phrases, symbols, and so on.

Tokenization sounds like a trivial task, and it's been widely studied by the natural language processing community. Chapter 1, Social Media...

Mastering Social Media Mining with Python

By : Marco Bonzanini

Mastering Social Media Mining with Python

By: Marco Bonzanini

Overview of this book

Related Content you might be interested in

Current Title:

Mastering Social Media Mining with Python

Analyzing tweets - text analysis