MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

2 badmonster 1 5/30/2025, 2:43:05 AM arxiv.org ↗

Comments (1)

badmonster · 1d ago
The stark performance gap between current models and humans—especially the fact that even the top proprietary model only hits 40%—highlights how underexplored and underdeveloped multi-image spatial reasoning still is.