JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Paper
•
2310.17631
•
Published
•
34
Curated resources that support the use of LLMs to serve as automatic evaluators of other LLM outputs.
Compare AI models by voting on responses