How strong is your perception model? Can it track objects and points even through strong occlusions? Can it localise actions and sounds? Can it answer questions that require memory and understanding of physics, abstraction, and semantics? Can it reason about descriptive, explanatory, predictive, and counterfactual situations? Can it reason over hour-long videos?
Put your model to the test and win prizes totalling 50K EUR across 5 tracks!
NEW this year: VQA is a unified track containing not only regular video QA, but also questions on point tracking, object tracking, and action localisation, all posed in a video QA format.
NEW this year: We have 2 guest tracks: KiVA (an image evaluation probing visual analogy skills), and Physics-IQ (assessing whether generative models produce physics-aware videos).