Mastering Social Media Mining with Python

Mastering Social Media Mining with Python

By : Marco Bonzanini

Buy this Book

Mastering Social Media Mining with Python

By: Marco Bonzanini

Buy this Book

Overview of this book

Your social media is filled with a wealth of hidden data – unlock it with the power of Python. Transform your understanding of your clients and customers when you use Python to solve the problems of understanding consumer behavior and turning raw data into actionable customer insights. This book will help you acquire and analyze data from leading social media sites. It will show you how to employ scientific Python tools to mine popular social websites such as Facebook, Twitter, Quora, and more. Explore the Python libraries used for social media mining, and get the tips, tricks, and insider insight you need to make the most of them. Discover how to develop data mining tools that use a social media API, and how to create your own data analysis projects using Python for clear insight from your social data.

Mastering Social Media Mining with Python

Credits

About the Author

About the Reviewer

www.PacktPub.com

Preface

Free Chapter

Social Media, Social Data, and Python

Getting started

Social media - challenges and opportunities

Python tools for data science

Processing data in Python

Building complex data pipelines

Summary

#MiningTwitter – Hashtags, Topics, and Time Series

Getting started

The Twitter API

Collecting data from Twitter

Analyzing tweets - entity analysis

Analyzing tweets - text analysis

Analyzing tweets - time series analysis

Summary

Users, Followers, and Communities on Twitter

Users, friends, and followers

Mining your followers

Mining the conversation

Plotting tweets on a map

Summary

Posts, Pages, and User Interactions on Facebook

The Facebook Graph API

Mining your posts

Mining Facebook Pages

Summary

Topic Analysis on Google+

Getting started with the Google+ API

Embedding the search results in a web GUI

Notes and activities from a Google+ page

Text analysis and TF-IDF on notes

Summary

Questions and Answers on Stack Exchange

Questions and answers

Getting started with the Stack Exchange API

Working with Stack Exchange data dumps

Text classification for question tags

Summary

Blogs, RSS, Wikipedia, and Natural Language Processing

Blogs and NLP

Getting data from blogs and websites

NLP Basics

Summary

Mining All the Data!

Many social APIs

Mining videos on YouTube

Mining open source software on GitHub

Mining local businesses on Yelp

Building a custom Python client

Summary

Linked Data and the Semantic Web

A Web of Data

Mining relations from DBpedia

Mining geo coordinates

Summary

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Text analysis and TF-IDF on notes

After discussing how to download a list of notes and activities for a given page or user, we will shift our focus to the textual analysis of the content.

For each post published by a given user, we want to extract the most interesting keywords, which could be used to summarize the post itself.

While this is intuitively a simple exercise, there are a few subtleties to consider. On the practical side, we can easily observe that the content of each post is not always a clean piece of text, in fact, HTML tags can be included in the content. Before we can carry out our computation, we need to extract the clean text. While the JSON object returned by the Google+ API has a clear structure, the content itself is not necessarily a well-formed structured document. Fortunately, there's a nice Python package that comes to the rescue. Beautiful Soup is, in fact, able to parse HTML and XML documents, including malformed markup. It is compatible with Python 3 and can be...

Mastering Social Media Mining with Python

By : Marco Bonzanini

Mastering Social Media Mining with Python

By: Marco Bonzanini

Overview of this book

Related Content you might be interested in

Current Title:

Mastering Social Media Mining with Python

Text analysis and TF-IDF on notes