AI
                
              LLM Benchmarks 사이트 모음
                codens
                 2024. 9. 27. 03:16
              
              
            
            LLM(Large Language Model) 성능 측정  사이트 모음 
* Chatbot Arena LLM Leaderboard: Community-driven Evaluation for Best LLM and AI chatbots 
https://lmarena.ai/?leaderboard=
* Comparison of Models: Quality, Performance & Price Analysis 
https://artificialanalysis.ai/models/
//------------------------------------- 
* SEAL - LLM Leaderboards, Expert-Driven Private Evaluations 
https://scale.com/leaderboard
* KLU - LLM Leaderboard 
https://klu.ai/llm-leaderboard
//------------------------------------- 
* LiveBench , A Challenging, Contamination-Free LLM Benchmark 
https://livebench.ai/
//------------------------------------- 
* LLM Benchmarks: Overview, Limits and Model Comparison 
https://www.vellum.ai/blog/llm-benchmarks-overview-limits-and-model-comparison
반응형