Sign In Start Free Trial

Book Overview & Buying
Table Of Contents

Natural Language Processing and Computational Linguistics

By : Bhargav Srinivasa-Desikan

3.6 (7)

Natural Language Processing and Computational Linguistics

3.6 (7)

By: Bhargav Srinivasa-Desikan

Overview of this book

Modern text analysis is now very accessible using Python and open source tools, so discover how you can now perform modern text analysis in this era of textual data. This book shows you how to use natural language processing, and computational linguistics algorithms, to make inferences and gain insights about data you have. These algorithms are based on statistical machine learning and artificial intelligence techniques. The tools to work with these algorithms are available to you right now - with Python, and tools like Gensim and spaCy. You'll start by learning about data cleaning, and then how to perform computational linguistics from first concepts. You're then ready to explore the more sophisticated areas of statistical NLP and deep learning using Python, with realistic language and text samples. You'll learn to tag, parse, and model text using the best tools. You'll gain hands-on knowledge of the best frameworks to use, and you'll know when to choose a tool like Gensim for topic models, and when to work with Keras for deep learning. This book balances theory and practical hands-on examples, so you can learn about and conduct your own natural language processing projects and computational linguistics. You'll discover the rich ecosystem of Python tools you have available to conduct NLP - and enter the interesting world of modern text analysis.

Preface

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

What is Text Analysis?

What is Text Analysis?

What is text analysis?

Where's the data at?

Garbage in, garbage out

Why should you do text analysis?

Summary

References

Python Tips for Text Analysis

Python Tips for Text Analysis

Why Python?

Text manipulation in Python

Summary

References

spaCy's Language Models

spaCy's Language Models

spaCy

Installation

Tokenizing text

Summary

References

Gensim – Vectorizing Text and Transformations and n-grams

Gensim – Vectorizing Text and Transformations and n-grams

Introducing Gensim

Vectors and why we need them

Vector transformations in Gensim

n-grams and some more preprocessing

Summary

References

POS-Tagging and Its Applications

POS-Tagging and Its Applications

What is POS-tagging?

POS-tagging in Python

Training our own POS-taggers

POS-tagging code examples

Summary

References

NER-Tagging and Its Applications

NER-Tagging and Its Applications

What is NER-tagging?

NER-tagging in Python

Training our own NER-taggers

NER-tagging examples and visualization

Summary

References

Dependency Parsing

Dependency Parsing

Dependency parsing

Dependency parsing in Python

Dependency parsing with spaCy

Training our dependency parsers

Summary

References

Topic Models

Topic Models

What are topic models?

Topic models in Gensim

Topic models in scikit-learn

Summary

References

Advanced Topic Modeling

Advanced Topic Modeling

Advanced training tips

Exploring documents

Topic coherence and evaluating topic models

Visualizing topic models

Summary

References

Clustering and Classifying Text

Clustering and Classifying Text

Clustering text

Starting clustering

K-means

Hierarchical clustering

Classifying text

Summary

References

Similarity Queries and Summarization

Similarity Queries and Summarization

Similarity metrics

Similarity queries

Summarizing text

Summary

References

Word2Vec, Doc2Vec, and Gensim

Word2Vec, Doc2Vec, and Gensim

Word2Vec

Doc2Vec

Other word embeddings

Summary

References

Deep Learning for Text

Deep Learning for Text

Deep learning

Deep learning for text (and more)

Generating text

Summary

References

Keras and spaCy for Deep Learning

Keras and spaCy for Deep Learning

Keras and spaCy

Classification with Keras

Classification with spaCy

Summary

References

Sentiment Analysis and ChatBots

Sentiment Analysis and ChatBots

Sentiment analysis

ChatBots

Summary

References

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

What is Text Analysis?

There is no time like now to do text analysis – we have an abundance of easily available data, powerful and free open source tools to conduct our analysis, and research on machine learning, computational linguistics and computing with text is progressing at a pace we have not seen before.

In this chapter, we will go into details about what exactly text analysis is and look at the motivations for studying and understanding text analysis. Following are the topics we will cover in this chapter:

What is text analysis?
Where's the data at?
Garbage in, garbage out
Why should YOU be interested?
References

A note about the references: they will appear throughout the PDF version of the book as links, and if it is an academic reference it will link to the PDF of the reference or the journal page. All of these links and references are then displayed as the final section of the chapter, so offline readers can also visit the websites or research papers.

CONTINUE READING

83

Tech Concepts

36

Programming languages

73

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

Natural Language Processing and Computational Linguistics

Search

Your notes and bookmarks