October 19, 2025
Call for PapersMultimodal systems are transforming AI by enabling models to understand and act across language, vision, and other modalities, powering advances in robotics, autonomous driving, and scientific discovery. Yet, these capabilities raise serious safety and trustworthiness concerns, especially as traditional safeguards fall short in multimodal contexts. The Workshop on Safe and Trustworthy Multimodal AI Systems (SaFeMM-AI) at ICCV 2025 brings together the computer vision community to address challenges, including and beyond hallucinations, privacy leakage, and jailbreak vulnerabilities, and advance the development of safer, more robust, and more reliable multimodal models.
Archival track - will appear in ICCV proceedings
Full-Paper Submission
Full-Paper Notifications
Non-archival track - will NOT appear in ICCV proceedings
Short-Paper Submission
Camera-Ready Full-Paper (Archival Track)
Short-Paper Notifications
Camera-Ready Short-Paper (Non-Archival Track)
Workshop Day
Our workshop focuses on advancing the development of multimodal AI systems that can robustly handle unsafe or adversarial inputs and consistently generate safe, reliable, and trustworthy outputs. Topics of interest include but are not limited to:
Paper Submission Information for Short Paper (Active):
Submitted papers must be formatted using the ICCV 2025 Author Kit and are limited to four pages, including figures and tables. Additional pages are allowed only for references. We strongly encourage authors to carefully follow the ICCV Author Guidelines, since our workshop will adhere to the same formatting and submission policies as the main conference. Please check our Call for Papers document which also applies to short paper submission except the page limit.
Accepted full paper submissions will be included in the official ICCV 2025 workshop proceedings. Both accepted full papers and short papers will be presented in the workshop poster session, so at least one author should register and present the poster.
Note: Submission of Full-paper (Archival track) has ended, submission platform is currently open for short paper submissions only.
Time | Event |
---|---|
9:00 | Opening Remarks |
9:10 | Prof. Yoshua Bengio: Avoiding Catastrophic Risks from Uncontrolled AI Agency |
9:45 | Prof. Florian Tramèr: Memorization and Robustness in Multimodal Models |
10:20 | Networking/Coffee Break |
10:55 | Prof. Yarin Gal: TBD |
11:30 | Spotlight 1 |
11:45 | Voxel51 - Sponsored Demo |
12:00 | Lunch Break |
13:30 | Spotlight 2 |
13:45 | Spotlight 3 |
14:00 | Spotlight 4 |
14:15 | Prof. Yao Qin: TBD |
14:50 | Closing Remarks |
15:00 | Poster Session & Networking/Coffee Break |
Yoshua Bengio is a world-leading expert in artificial intelligence, renowned for his pioneering work in deep learning, which earned him the 2018 A.M. Turing Award alongside Geoffrey Hinton and Yann LeCun. He is a Full Professor at Université de Montréal, Founder and Scientific Advisor of Mila - Quebec AI Institute. He received numerous awards, including the prestigious Killam Prize and Herzberg Gold medal in Canada, CIFAR's AI Chair, Spain's Princess of Asturias Award, the VinFuture Prize and he is a Fellow of both the Royal Society of London and Canada, Knight of the Legion of Honor of France, Officer of the Order of Canada, Member of the UN's Scientific Advisory Board for Independent Advice on Breakthroughs in Science and Technology. Yoshua Bengio was named in 2024 as one of TIME magazine's 100 most influential people in the world. Concerned about the social impact of AI, he actively contributed to the Montreal Declaration for the Responsible Development of Artificial Intelligence and currently chairs the International Scientific Report on the Safety of Advanced AI. In June 2025, he launches a new non-profit AI safety research organization called LawZero, to prioritize safety over commercial imperatives.
Title: Avoiding Catastrophic Risks from Uncontrolled AI Agency
Abstract: AI agentic capabilities are rising exponentially, driven by scientific advances incorporating system 2 cognition into deep networks as well as by the commercial value of automating numerous human tasks. Besides bodily control, this may be the most significant gap that remains to human-level intelligence. Unfortunately, a series of recent scientific observations raise a major red flag: as AIs become better at reasoning and planning, more occurrences of deceptive and self-preservation behaviors are observed. We have not solved the problem of making sure that advanced AIs will follow our instructions, and in some circumstances they are found to cheat, lie, hack computers and try to escape our control, against their alignment training and instructions. Is it wise to design AIs that will soon be smarter than us across many cognitive abilities and could compete with us and try to avoid our control? We propose a safer path going forward: the design of non-agentic but fully trustworthy AIs modeled after a selfless platonic scientist trying to understand the world rather than trying to imitate or please us. For example, such non-agentic Scientist AIs could be used as monitors that reject potentially dangerous inputs or outputs of untrusted AI agents.
Yarin leads the Oxford Applied and Theoretical Machine Learning (OATML) group. He is an Associate Professor of Machine Learning at the Computer Science department, University of Oxford. He is also the Tutorial Fellow in Computer Science at Christ Church, Oxford, a Turing AI Fellow at the Turing Institute, and Director of Research at the UK Government's AI Security Institute (AISI, formerly the Frontier AI Taskforce). Prior to his move to Oxford he was a Research Fellow in Computer Science at St Catharine's College at the University of Cambridge. He obtained his PhD from the Cambridge machine learning group, working with Prof Zoubin Ghahramani and funded by the Google Europe Doctoral Fellowship.
TBD
Florian Tramèr is an Assistant Professor of Computer Science at ETH Zürich, where he leads the Secure and Private AI (SPY) Lab. He is a member of the Information Security Institute and ZISC, and an associated faculty member of the ETHZ AI Center. His research lies at the intersection of Computer Security, Machine Learning, and Cryptography. His work studies the worst-case behavior of deep learning systems from an adversarial perspective, aiming to understand and mitigate long-term threats to user safety and privacy. Under his leadership, the SPY Lab investigates the robustness and trustworthiness of machine learning systems, often using adversarial attacks to probe and improve their security.
Title: Memorization and Robustness in Multimodal Models
Abstract: In this talk, I'll cover some recent work on memorization and robustness in vision-language models. First, we'll discuss a curious phenomenon of "divergent memories" where VLMs can memorize data in one modality while having no concept of it in another. Second, we'll explore how techniques from the adversarial examples literature can be adapted to design efficient black-box attacks against VLMs.
Qin Yao is an Assistant Professor at the Department of Electrical and Computer Engineering, affiliated with the Department of Computer Science at UC Santa Barbara, where she is also co-leading the REAL AI initiative. Meanwhile, she is a senior Research Scientist at Google DeepMind, working on Gemini Multimodal. She obtained her PhD degree at UC San Diego in Computer Science, advised by Prof. Garrison W. Cottrell. During her PhD, she also interned under the supervision of Geoffrey Hinton, Ian Goodfellow and many others.
TBD
Stay up to date by following us on @SaFeMMAI.
For any inquiries, feel free to reach out to us via email at: safemm.ai.workshop@gmail.com. or You may also contact the organizers directly: Carlos Hinojosa, and Yinpeng Dong.