Geo Ahn
M.S. in Computer Science & Engineering, Kyung Hee University
I am Geo Ahn, a recent M.S. graduate in Computer Science & Engineering at Kyung Hee University, South Korea, advised by Prof. Jinwoo Choi. I am currently looking for Ph.D. positions.
My research interests lie in video representation learning, video understanding, and debiasing. Specifically, I aim to build models that generalize beyond spurious shortcuts toward holistic reasoning over actions in videos. I am also broadly interested in addressing the current bottlenecks in video understanding, such as limited temporal and compositional reasoning, and in exploring how video models can serve as a foundation for VLA and world models.
Most recently, I was a research intern at NAVER Cloud (Video Team) in 2025, where I worked on compositional generalization with vision-language models.
Publications
-
EVIDENT: Routing MLLM Adaptation through Entity-Grounded Visual Evidence for Cross-Domain Video Temporal GroundingarXiv preprint, 2026 -
Why Can’t I Open My Drawer? Mitigating Object-Driven Shortcuts in Zero-Shot Compositional Action RecognitionIn ECCV, 2026 -
SlotVTG: Object-Centric Adapter for Generalizable Video Temporal GroundingIn GRAIL-V Workshop at CVPR, 2026 -
DEVIAS: Learning Disentangled Video Representations of Action and Scene for Holistic Video UnderstandingIn ECCV (Oral, 2.3% acceptance rate), 2024