KIT | KIT-Bibliothek | Impressum | Datenschutz

Leveraging MLflow for Efficient Evaluation and Deployment of Large Language Models

Berberi, Lisana ORCID iD icon 1; Kozlov, Valentin ORCID iD icon 1; Alibabaei, Khadijeh Fahimeh ORCID iD icon 1; Esteban Sanchis, Borja ORCID iD icon 1
1 Scientific Computing Center (SCC), Karlsruher Institut für Technologie (KIT)

Abstract:

In recent years, Large Language Models (LLMs) have become powerful tools in the machine learning (ML) field, including features of natural language processing (NLP) and code generation. The employment of these tools often faces complex processes, starting from interacting with a variety of providers to fine-tuning models of a certain degree of appropriateness to meet the project’s needs.
This work explores in detail using MLflow [1] in deploying and evaluating two notable LLMs: Mixtral[2] from MistralAI and Meta-Llama (llama) [3] from Meta, both available as open-source models in the HuggingFace portal. The focus lies on enhancing inference efficiency, specifically emphasising the fact that DBRX has better throughput than traditional models of similar scale.
Hence, MLflow offers a unified interface for interacting with various LLM providers through the Deployments Server (previously known as “MLflow AI Gateway”) [4], which streamlines the deployment process. Further, with standardised evaluation metrics, we present a comparative analysis between Mixtral and Llama.
MLflow's LLM Evaluation tools are designed to address the unique challenges of evaluating LLMs. ... mehr


Volltext §
DOI: 10.5445/IR/1000174847
Veröffentlicht am 08.10.2024
Cover der Publikation
Zugehörige Institution(en) am KIT Scientific Computing Center (SCC)
Publikationstyp Vortrag
Publikationsdatum 03.10.2024
Sprache Englisch
Identifikator KITopen-ID: 1000174847
HGF-Programm 46.21.02 (POF IV, LK 01) Cross-Domain ATMLs and Research Groups
Veranstaltung EGI Conference (2024), Lecce, Italien, 30.09.2024 – 04.10.2024
Projektinformation AI4EOSC (EU, EU 9. RP, 101058593)
Skills4EOSC (EU, EU 9. RP, 101058527)
Schlagwörter LLM, MLflow, evaluation
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page