-
Book Overview & Buying
-
Table Of Contents
Kubernetes for Generative AI Solutions
By :
This chapter will start building on the basic concepts of GenAI that were introduced in Part 1 and cover options for optimizing general-purpose foundational models (FMs) for domain-specific use cases, such as chatbots and personalized recommendations. We will explore specific techniques, including retrieval-augmented generation (RAG) and fine-tuning methods, offering an in-depth understanding of how these approaches can be used to enhance model performance for targeted applications. Additionally, we will focus on the key concepts and functionalities of the Transformer architecture and the LangChain framework so that we can implement these techniques in K8s in the next chapter.
In this chapter, we’re going to cover the following main topics: