Skip to main content
megaphone icon

AI Dev Days Hackathon

Build production‑ready AI during our global hack using Microsoft’s latest AI, agent, and dev tools to solve real‑world problems and compete for prizes.

LEARN, CONNECT, BUILD

Microsoft Reactor

Join Microsoft Reactor and engage with developers live

Ready to get started with AI and the latest technologies? Microsoft Reactor provides events, training, and community resources to help developers, entrepreneurs and startups build on AI technology and more. Join us!

LEARN, CONNECT, BUILD

Microsoft Reactor

Join Microsoft Reactor and engage with developers live

Ready to get started with AI and the latest technologies? Microsoft Reactor provides events, training, and community resources to help developers, entrepreneurs and startups build on AI technology and more. Join us!

Go back

Building a LLM Judge with Weights & Biases

29 October, 2024 | 5:00 PM - 6:00 PM (UTC) Coordinated Universal Time

  • Format:
  • alt##LivestreamLivestream

Topic: AI Applications

Language: English

Evaluating LLM outputs accurately is critical to being able to iterate quickly on a LLM system. Human annotations can be slow and expensive and using LLMs instead promises to solve this. However, aligning a LLM Judge with human judgements is often hard with many implementation details to consider. In this workshop we will explore:

  • Evaluating specialized LLMs using Weave
  • Productionizing the latest LLM-as-a-judge research
  • Improving on your existing judge
  • Building annotation UIs
  • LLM

Speakers

Related Events

The events below may be of interest to you as well. Be sure to visit our Reactor homepage to see all available events.