1st Workshop on Human–Scene Interaction (HSI) @ ECCV 2026

Call for Papers: 1st Workshop on Human-Scene Interaction (HSI)

Submission deadline: July 7, 2026
Location: ECCV 2026, Malmö, Sweden
Website: hsi-workshop.com


Overview

We invite submissions to the First Workshop on Human-Scene Interaction (HSI) at ECCV 2026. This workshop focuses on modelling and generating human motion and behaviour grounded in the surrounding scene. Our goal is to bring together research from computer vision, graphics, robotics, and multimodal learning to advance scene-aware embodied agents, including (humanoid) robots and virtual avatars.

We welcome both archival and non-archival submissions.

Invited Speakers

  • Umar Iqbal – NVIDIA DAIR Lab
  • Taku Komura – University of Hong Kong
  • Zhengyi Luo – NVIDIA GEAR Lab
  • Gerard Pons-Moll – University of Tübingen

Topics of interest

Topics include, but are not limited to:

  • Scene-conditioned human motion generation
  • Human-scene and human-object interaction modelling
  • Referring expression understanding and grounding in 3D scenes
  • Language understanding and grounded communication for embodied agents
  • Vision-language-motion alignment and grounding
  • Vision-language-action (VLA) models for embodied agents
  • Multimodal learning for motion and interaction
  • Datasets, benchmarks, and evaluation for interaction
  • Affordance learning and scene understanding
  • Physically-based simulation of interaction
  • Applications in robotics, animation, AR/VR, and embodied communication
  • Technical reports accompanying challenge submissions

Submission guidelines

Archival submissions

  • Must present original, unpublished work
  • Will undergo peer review
  • Accepted papers will be published in the ECCV 2026 Workshop Proceedings
  • Papers must follow the ECCV formatting guidelines

Non-archival submissions

  • May include previously published work, work under review, or ongoing research
  • Intended for presentation only (poster or oral), and will not be included in the proceedings
  • Ideal for sharing recent results, demos, or position papers

Submission links: To be announced

Important dates

  • July 7 – Submission deadline
  • July 31 – Notification
  • September 2026 – Workshop

Challenge

The workshop also hosts a challenge on scene-aware referential gesture generation. Given speech, a 3D target coordinate, and a virtual scene, the goal is to generate full-body referential gestures that correctly indicate the target object among distractors. For details on the task, data, evaluation protocol, and baselines, see: 

hsi-workshop.com/challenge

Contact
hsi-workshop@googlegroups.com

Organizers

  • Jonas Beskow – KTH Royal Institute of Technology, Sweden
  • Rishabh Dabral – Max Planck Institute for Informatics, Germany
  • Anna Deichler – KTH Royal Institute of Technology, Sweden
  • Fethiye Irmak Doğan – University of Cambridge, United Kingdom
  • Anindita Ghosh – Max Planck Institute for Informatics, Germany
You can leave a response, or trackback from your own site.

Leave a Reply

Design by 2b Consult