🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context

"It's not like finding a needle in a haystack, it is like creating new needles."

🏆 Leaderboard: http://liveideabench.com 💡

Dataset

Paper

🧠✨🎉 News (2025/1/27): Latest Dataset Update on Hugging Face!

We are excited to announce that the latest dataset, including supplementary tests for models like deepseek-R1, deepseek-V3, minimax-01, phi-4, and Opus, has been uploaded to Hugging Face! 🚀

Check it out here: https://huggingface.co/datasets/6cf/liveideabench-DLC-250127

LiveIdeaBench Evaluation Framework

Bibtex

@article{ruan2024liveideabench,
title={LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context},
author={Kai Ruan and Xuan Wang and Jixiang Hong and Peng Wang and Yang Liu and Hao Sun},
journal={arXiv preprint arXiv:2412.17596},
year={2024}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context

Dataset

Paper

🧠✨🎉 News (2025/1/27): Latest Dataset Update on Hugging Face!

LiveIdeaBench Evaluation Framework

Bibtex

Files

README.md

Latest commit

History

README.md

File metadata and controls

🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context

Dataset

Paper

🧠✨🎉 News (2025/1/27): Latest Dataset Update on Hugging Face!

LiveIdeaBench Evaluation Framework

Bibtex