Natural Language Processing with Java and LingPipe Cookbook

Chapter 5. Finding Spans in Text – Chunking

This chapter covers the following recipes:

  • Sentence detection

  • Evaluation of sentence detection

  • Tuning sentence detection

  • Marking embedded chunks in a string – sentence chunk example

  • Paragraph detection

  • Simple noun phrases and verb phrases

  • Regular expression-based chunking for NER

  • Dictionary-based chunking for NER

  • Translating between word tagging and chunks – BIO codec

  • HMM-based NER

  • Mixing the NER sources

  • CRFs for chunking

  • NER using CRFs with better features