Session
Beyond AI Accuracy: Building Trustworthy and Responsible AI Application Through Mosaic AI Framework
Overview
Experience | In Person |
---|---|
Type | Breakout |
Track | Artificial Intelligence |
Industry | Enterprise Technology, Health and Life Sciences, Financial Services |
Technologies | MLflow, Llama, Mosaic AI |
Skill Level | Advanced |
Duration | 40 min |
Generic LLM metrics are useless until they meet your business needs. In this session, we will dive deep into creating bespoke, state-of-the-art AI metrics that matter to you. We will also discuss best practices for LLM evaluation strategies, including when to use an LLM judge versus statistical metrics, and more.
Through a live demo using the Mosaic AI Framework, we will showcase:
- How to build a custom AI metric tailored to the needs of your GenAI application
- How to implement an autonomous AI evaluation suite for complex, multi-agent systems
- How to generate ground-truth data at scale and apply production monitoring strategies
- Actionable insights on building a robust AI evaluation framework, drawn from extensive experience working with customers on real-world use cases
By the end of this session, you'll be equipped to create AI solutions that are not only powerful but also relevant to your organization's needs. Join us to transform your AI strategy and make a tangible impact on your business!
Session Speakers
Ananya Roy
Specialist Solution Architect
Databricks