Book Image

Transformers for Natural Language Processing - Second Edition

By : Denis Rothman
5 (1)
Book Image

Transformers for Natural Language Processing - Second Edition

5 (1)
By: Denis Rothman

Overview of this book

Transformers are...well...transforming the world of AI. There are many platforms and models out there, but which ones best suit your needs? Transformers for Natural Language Processing, 2nd Edition, guides you through the world of transformers, highlighting the strengths of different models and platforms, while teaching you the problem-solving skills you need to tackle model weaknesses. You'll use Hugging Face to pretrain a RoBERTa model from scratch, from building the dataset to defining the data collator to training the model. If you're looking to fine-tune a pretrained model, including GPT-3, then Transformers for Natural Language Processing, 2nd Edition, shows you how with step-by-step guides. The book investigates machine translations, speech-to-text, text-to-speech, question-answering, and many more NLP tasks. It provides techniques to solve hard language problems and may even help with fake news anxiety (read chapter 13 for more details). You'll see how cutting-edge platforms, such as OpenAI, have taken transformers beyond language into computer vision tasks and code creation using DALL-E 2, ChatGPT, and GPT-4. By the end of this book, you'll know how transformers work and how to implement them and resolve issues like an AI detective.
Table of Contents (25 chapters)
18
Other Books You May Enjoy
19
Index
Appendix I — Terminology of Transformer Models

Chapter 2, Getting Started with the Architecture of the Transformer Model

  1. NLP transduction can encode and decode text representations. (True/False)

    True. NLP is transduction that converts sequences (written or oral) into numerical representations, processes them, and decodes the results back into text.

  1. Natural Language Understanding (NLU) is a subset of Natural Language Processing (NLP). (True/False)

    True.

  1. Language modeling algorithms generate probable sequences of words based on input sequences. (True/False)

    True.

  1. A transformer is a customized LSTM with a CNN layer. (True/False)

    False. A transformer does not contain an LSTM or a CNN at all.

  1. A transformer does not contain LSTM or CNN layers. (True/False)

    True.

  1. Attention examines all the tokens in a sequence, not just the last one. (True/False)

    True.

  1. A transformer does not use positional encoding...