OSCAR is an open-source agent system developed at Université de Montréal and Mila that achieves strong performance using screenshot-based interaction.
- Screenshot-based interaction
- GPT-4o integration
- Academic research focus
- Developed by the Mila team
- OSCAR with GPT-4o: 24.5% (best screenshot-based model)
- Base Model: GPT-4o
- Input: Screenshot-based
- Focus: Visual understanding and interaction
- Paper: https://arxiv.org/pdf/2410.18963
- Citation: [Wang et al., '24]