MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images
Python 31
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
Python 33 4
[AAAI 2024] SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research
Python 27
Open-source dialogue foundation model for chemistry and molecular science
Python 63 4
[COLING 2025] A curated paper list about LLMs for chemistry
🎮 Manipulates mobile phones just as you would. Official code for "MobA: A Two-Level Agent System for Efficient Mobile Task Automation".
[ACL 2024] Official code for "IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation" (TheatreMaker)
Experiment code for the Mobile-Env paper.