Large Language Model-Based Solutions Large Language Model-Based Solutions
Tech Today

Large Language Model-Based Solutions

How to Deliver Value with Cost-Effective Generative AI Applications

    • 38,99 €
    • 38,99 €

Beschreibung des Verlags

Learn to build cost-effective apps using Large Language Models

In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine tuning.

The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find:
Effective strategies to address the challenge of the high computational cost associated with LLMs Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models
Perfect for developers and data scientists interested in deploying foundational models, or business leaders planning to scale out their use of GenAI, Large Language Model-Based Solutions will also benefit project leaders and managers, technical support staff, and administrators with an interest or stake in the subject.

GENRE
Computer und Internet
ERSCHIENEN
2024
2. April
SPRACHE
EN
Englisch
UMFANG
224
Seiten
VERLAG
Wiley
GRÖSSE
268,5
 MB

Andere Bücher in dieser Reihe

Getting Great Results with Excel Pivot Tables, PowerQuery and PowerPivot Getting Great Results with Excel Pivot Tables, PowerQuery and PowerPivot
2024
Microsoft SharePoint Premium in the Real World Microsoft SharePoint Premium in the Real World
2024
x86 Software Reverse-Engineering, Cracking, and Counter-Measures x86 Software Reverse-Engineering, Cracking, and Counter-Measures
2024
Enterprise AI in the Cloud Enterprise AI in the Cloud
2023
x64 Assembly Language Step-by-Step x64 Assembly Language Step-by-Step
2023
Beginning Power BI for Business Users Beginning Power BI for Business Users
2023