MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
2 badmonster 1 5/30/2025, 2:43:05 AM arxiv.org ↗
Comments (1)
badmonster · 1d ago
The stark performance gap between current models and humans—especially the fact that even the top proprietary model only hits 40%—highlights how underexplored and underdeveloped multi-image spatial reasoning still is.