Python 3 Text Processing with NLTK 3 Cookbook

By: Jacob Perkins

Chapter 3. Creating Custom Corpora

In this chapter, we will cover the following recipes:

  • Setting up a custom corpus
  • Creating a wordlist corpus
  • Creating a part-of-speech tagged word corpus
  • Creating a chunked phrase corpus
  • Creating a categorized text corpus
  • Creating a categorized chunk corpus reader
  • Lazy corpus loading
  • Creating a custom corpus view
  • Creating a MongoDB-backed corpus reader
  • Corpus editing with file locking