EgoNormia: Benchmarking Physical Social Norm Understanding

EgoNormia is a comprehensive benchmark evaluating agentic VLM capabilities in grounded reasoning scenarios.

Features

Comprehensive evaluation of grounded agentic abilities
Support for onboarding and evaluation on custom dataset
Support for both reasoning and vision-language models
Integration with popular AI APIs
Easy integration for custom agents

Installation

Use conda or venv for installation. (Or install it all locally if you're brave.)

conda create -n egonormia python=3.10 -y
conda activate egonormia

git clone https://github.com/Open-Social-World/EgoNormia
cd EgoNormia
pip install -e .

Evaluate using custom VLM (if openai API compatible)

To run using a custom VLM, replace self.modelname in eval/custom_eval_api.py with the model name of your openai API-compatible VLM, and fill in any remaining fields as necessary.

Then, simply evaluate using API the evaluation API with --modelname custom.

## Evaluate using API

To run only eval scripts, you can provide either an OpenAI API key or a Gemini API key (depending on the model you intend to run)

(To run all scripts related to EgoNormia, you need to populate *both* an OpenAI key and a Gemini API key)

This can be directly exported:

```bash
export OPENAI_API_KEY=<KEY>
export ANTHROPIC_API_KEY=<KEY>

Or you can modify the SECRETS.env file, adding your api keys.

You can then run the evaluation from the egonormia/src directory with the following command:

python3 evaluate.py --modelname gemini-1.5-flash-002 --jsonfile final_data.json (--blind) (--description)

Include the --blind flag to run the evaluation without the ground truth, and the --description flag to include the description in the evaluation. --blind and --description flags are mutually exclusive.

License

This project is licensed under the Apache License - see the LICENSE file for details.

Citation

If you use EgoNormia in any of your work, please cite:

@misc{rezaei2025egonormiabenchmarkingphysicalsocial,
      title={EgoNormia: Benchmarking Physical Social Norm Understanding},
      author={MohammadHossein Rezaei and Yicheng Fu and Phil Cuvin and Caleb Ziems and Yanzhe Zhang and Hao Zhu and Diyi Yang},
      year={2025},
      eprint={2502.20490},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2502.20490},
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
egonormia-docs		egonormia-docs
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SECRETS.env		SECRETS.env
manifest.in		manifest.in
narration_verb_taxonomy.csv		narration_verb_taxonomy.csv
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EgoNormia: Benchmarking Physical Social Norm Understanding

Features

Installation

Evaluate using custom VLM (if openai API compatible)

License

Citation

About

Releases

Packages

Contributors 2

Languages

License

Open-Social-World/EgoNormia

Folders and files

Latest commit

History

Repository files navigation

EgoNormia: Benchmarking Physical Social Norm Understanding

Features

Installation

Evaluate using custom VLM (if openai API compatible)

License

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages