Book Image

fastText Quick Start Guide

By : Joydeep Bhattacharjee
Book Image

fastText Quick Start Guide

By: Joydeep Bhattacharjee

Overview of this book

Facebook's fastText library handles text representation and classification, used for Natural Language Processing (NLP). Most organizations have to deal with enormous amounts of text data on a daily basis, and gaining efficient data insights requires powerful NLP tools such as fastText.  This book is your ideal introduction to fastText. You will learn how to create fastText models from the command line, without the need for complicated code. You will explore the algorithms that fastText is built on and how to use them for word representation and text classification.  Next, you will use fastText in conjunction with other popular libraries and frameworks such as Keras, TensorFlow, and PyTorch.  Finally, you will deploy fastText models to mobile devices. By the end of this book, you will have all the required knowledge to use fastText in your own applications at work or in projects.
Table of Contents (14 chapters)
Free Chapter
1
First Steps
4
The FastText Model
7
Using FastText in Your Own Models

FastText in Python

The use of fastText is specifically to transform words and sentences into efficient vector representations. Although fastText is written in C++, there are community-written Python bindings to train and use the models. Along with that, Python is one of the most popular languages used for NLP, and hence there are many other popular libraries in Python that support fastText models and the training of fastText models. Gensim and Spacy are two popular libraries that make it easy to load these vectors, transform, lemmatize, and perform other NLP tasks efficiently. This chapter will focus on how to use fastText with Python and its popular libraries. This chapter will also focus on showing you some common tasks that the two libraries can do to work with fastText models.

The topics that are covered in this chapter are as follows:

  • FastText official bindings
  • PyBind
  • Preprocessed...