A Collection of LLM Benchmark Sites
codens
2024. 9. 27. 03:16
A collection of sites that measure LLM (Large Language Model) performance
* Chatbot Arena LLM Leaderboard: Community-driven Evaluation for Best LLM and AI chatbots
https://lmarena.ai/?leaderboard=
* Comparison of Models: Quality, Performance & Price Analysis
https://artificialanalysis.ai/models/
//-------------------------------------
* SEAL - LLM Leaderboards, Expert-Driven Private Evaluations
https://scale.com/leaderboard
* KLU - LLM Leaderboard
https://klu.ai/llm-leaderboard
//-------------------------------------
* LiveBench, A Challenging, Contamination-Free LLM Benchmark
https://livebench.ai/
//-------------------------------------
* LLM Benchmarks: Overview, Limits and Model Comparison
https://www.vellum.ai/blog/llm-benchmarks-overview-limits-and-model-comparison