Change the repository type filter
All
Repositories list
29 repositories
VLMEvalKit
PublicOpen-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarksGTA
Publicopencompass
PublicOpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.GPassK
PublicGAOKAO-Eval
PublicANAH
PublicCriticEval
PublicCompassJudger
PublicProSA
Publiclagent-cibench
PublicMMBench
Publichinode
Publicstorage
PublicCompassBench
PublicCIBench
PublicMathBench
Public.github
PublicDevEval
PublicCodeBench
PublicAda-LEval
PublicT-Eval
Publichuman-eval
PublicOpenFinData
Publiccode-evaluator
Publicevalplus
PublicMixtralKit
PublicLawBench
PublicBotChat
Publicpytorch_sphinx_theme
Public