
A Collection of LLM Benchmark Sites

codens 2024. 9. 27. 03:16

A collection of sites that measure LLM (Large Language Model) performance

* Chatbot Arena LLM Leaderboard: Community-driven Evaluation for Best LLM and AI chatbots
https://lmarena.ai/?leaderboard=

* Comparison of Models: Quality, Performance & Price Analysis
https://artificialanalysis.ai/models/

//-------------------------------------
* SEAL - LLM Leaderboards, Expert-Driven Private Evaluations
https://scale.com/leaderboard

* KLU - LLM Leaderboard
https://klu.ai/llm-leaderboard

//-------------------------------------
* LiveBench: A Challenging, Contamination-Free LLM Benchmark
https://livebench.ai/

//-------------------------------------
* LLM Benchmarks: Overview, Limits and Model Comparison
https://www.vellum.ai/blog/llm-benchmarks-overview-limits-and-model-comparison

 
