Human-AI Co-Creation ECCV 2026 Workshop Submit Paper
ECCV Workshop

Human-AI Co-Creation

Submit Paper

Human-AI Co-Creation explores how generative AI can become a collaborative partner that inspires, supports, and amplifies human creativity while preserving human agency and authorship.

Sep 8th, 2026 ECCV 2026, Malmö, Sweden
Watercolor illustration of human-AI co-creation for ECCV 2026
Invited Speakers Four perspectives on creative Human-AI collaboration.
View all speakers →
About

About the Workshop

Human creativity rarely begins with precision.

At the earliest stages of visual creation, ideas are often incomplete, intuitive, and in motion. People sketch, improvise, borrow metaphors, follow sensations, and revise their intentions as they make. Yet many multimodal GenAI systems are still designed around explicit prompts and semantic correctness. This workshop asks how such systems can better support open-ended creative processes: not by replacing human imagination, but by helping people explore possibilities, refine emerging intent, and remain in control of meaning and authorship.

Bringing together generative modeling, human–AI interaction, design, and evaluation, we rethink GenAI as a medium for creative exploration rather than automated content production.

Call for Papers

Call for Papers

We invite submissions on multimodal generative AI and human-AI collaborative creation. The workshop focuses on GenAI as a collaborator, inspiration source, and assistant in open-ended, ambiguous, subjective, and iterative creative processes—while preserving human agency and authorship.

Submission Deadline July 17, 2026 23:59, Anywhere on Earth
Submit via OpenReview →

Topics of Interest

  • Human-centered GenAI models
  • Collaborative multimodal AI
  • GenAI for creative processes
  • Multimodal representation and alignment
  • Creativity-oriented HCI paradigms
  • Cognitive foundations of creativity
  • Human-centric benchmarks and evaluation
  • Novel applications for creativity
  • Ethical GenAI in creative domains

Important Dates

  1. Paper submission due
  2. Review starts
  3. Review due
  4. Meta-review starts
  5. Meta-review due
  6. Notification
  7. Camera ready

Submission Format

  • Full papers 14 pages excl. references · archival · ECCV 2026 template
  • Extended abstracts 4 pages excl. references · non-archival · WIP or prior work

All submissions are reviewed double-blind by at least two reviewers, based on relevance, significance, novelty, technical quality, and clarity.

Submission Platform

All papers should be submitted through the OpenReview submission website. Submissions Now Open

Important Note for Authors:

All co-authors must have an active OpenReview account before the deadline.

Keynote Speakers

Schedule Preview

Workshop Program

  1. Introduction
  2. Talk 1: Chen Change Loy
    Beyond Prompting: Creative Control in Visual Generation
    30 min + 5 min Q&A
  3. Talk 2: Mira Dontcheva
    30 min + 5 min Q&A
  4. Best workshop papers award & presentation (10 mins)
  5. Poster session & Coffee break
  6. Talk 3: Fabio Pellacini
    30 min + 5 min Q&A
  7. Talk 4: Yuhui Yuan
    30 min + 5 min Q&A
  8. Closing

Organizers

Loris Bazzani

Adjunct Professor · University of Verona

Personal Page

Zanxi Ruan

PhD Candidate · University of Verona

Personal Page

Umberto Michieli

Research Scientist · Canva Research

Personal Page

Fabian Caba Heilbron

Research Scientist · Adobe Research

Personal Page

Nicu Sebe

Full Professor · University of Trento

Personal Page

Marco Cristani

Full Professor · University of Verona · Reykjavik University

Personal Page

Sponsor

← Back to main page
Invited Speakers

Speaker Biographies

Invited speaker #1

Chen Change Loy

Personal Page

Nanyang Technological University, Singapore

Talk title and abstract

Beyond Prompting: Creative Control in Visual Generation

Human-AI co-creation requires generative models that do more than produce visually plausible outputs from one-shot prompts. Creative users often need precise, iterative control over who appears, what should be changed or preserved, where new content should be placed, and how the final image should look. In this talk, I will present four recent works that explore visual generation through this lens of controllability. We begin with video-reference talking avatar generation, where identity and expression cues are drawn from rich temporal references. We then discuss precise object and effect removal, highlighting the importance of preserving user-specified scene content. Next, we examine 3D-aware object insertion as a form of spatial and design-oriented co-creation. Finally, we present defocus blur control in text-to-image diffusion, showing how photographic parameters can become intuitive creative handles. Together, these works suggest a path from passive generation toward controllable visual systems that better support human intent, authorship, and refinement.

Chen Change (Cavan) Loy is Tan Lip-Bu Professor in Artificial Intelligence at the College of Computing and Data Science, Nanyang Technological University, Singapore. He is the Director of MMLab@NTU and Co-Associate Director of S-Lab. His research interests include large multimodal models, generative AI, spatial intelligence, representation learning, and visual content creation. Cavan currently serves or has served as an Associate Editor for leading journals including IJCV, TPAMI, and CVIU. He has also served as an Area Chair or Senior Area Chair for major conferences such as CVPR, ICCV, ECCV, ICLR, and NeurIPS. He was Program Co-Chair for CVPR 2026 and will serve as General Co-Chair for ACCV 2028.

Invited speaker #2

Mira Dontcheva

Personal Page

Adobe Research

Talk title and abstract

Title coming soon

Abstract coming soon.

Mira is senior principal scientist at Adobe leading research in Human Computer Interaction (HCI) at the intersection of video interaction and AI agents. Mira leads the STORIE Lab, which focuses on research on storytelling and interactive experiences with generative AI. Mira and her team build new tools that make video and audio creation easier, more fun, and more accessible to a wider audience. She is passionate about multimodal interaction, AI agents, and experiences at the intersection of the physical and digital world. She finished her Ph.D. in computer science at the University of Washington with David Salesin, Michael Cohen and Steven Drucker. Her thesis focused on novel interaction techniques for collecting and organizing web content. She was an undergraduate at the University of Michigan in Ann Arbor and completed her B.S.E. in Computer Engineering.

Mira helped write and edit: No Code Required: Giving Users Tools to Transform the Web, was featured on Forbes.com and helped lead Project Blink. She has presented her research on stage at Adobe Summit and Adobe MAX.

Invited speaker #3

Fabio Pellacini

Personal Page

University of Modena and Reggio Emilia, Italy

Talk title and abstract

Title coming soon

Abstract coming soon.

Fabio Pellacini is a Full Professor of Computer Science at the University of Modena and Reggio Emilia, Italy. His research focuses on the use of computer graphics methods to address design problems, with particular emphasis on creative, design, and entertainment industries. His work combines algorithms, efficient systems, numerical methods, and machine learning to support intuitive and interactive creation of complex 3D scenes, enabling both professional designers and novices to work with significantly less effort.

Before joining the University of Modena and Reggio Emilia, Italy, he was an Associate and Full Professor at Sapienza University of Rome, an Assistant and Associate Professor at Dartmouth College, a Visiting Assistant Professor at Cornell University, and a researcher in the R&D division of Pixar Animation Studios. He received his M.S. and Ph.D. in Computer Science from Cornell University and a Laurea degree in Physics from the University of Parma.

Pellacini has made significant contributions to appearance design, material and lighting editing, appearance fabrication, visualization of design workflows, and cloud-based collaborative design. He has received a National Science Foundation CAREER Award and an Alfred P. Sloan Fellowship, and regularly publishes in leading computer graphics venues.

Invited speaker #4

Yuhui Yuan

Personal Page

Canva

Talk title and abstract

Title coming soon

Abstract coming soon.

Yuhui (Ryan) Yuan is a Research Director at Canva CORE and the founder of Canva Research Lab in China, where he leads a research team focused on building next-generation graphic design foundation models. His current work centers on frontier multimodal generation, multi-layer visual content generation, and graphic design editing models, with the goal of advancing AI systems that can transform how people create and edit visual content.

Before joining Canva, he spent eight years at Microsoft as a Senior Researcher, working on cutting-edge computer vision problems including semantic segmentation, object detection, scene understanding, document intelligence, and generative AI applications. Several of his research contributions have been integrated into Microsoft products, including Azure Form Recognizer and Microsoft Designer.

Ryan received his Ph.D. in Computer Vision from the University of Chinese Academy of Sciences, where he worked on semantic segmentation, and his M.S. in Computer Science from Peking University. His representative research includes works such as OCNet, OCRNet, HDETR, Glyph-ByT5, SPO, and ART, as well as publications in top venues including ICCV, ECCV, IJCV, and ICLR.