Vicky Kalogeiton

vicky_2026_small.jpeg
photo by Laurent Ardhuin for CNRS
· · · · · CV (updated 2026)

I am a Professor (HDR, 2024 from Polytechnique) in AI at the Computer Science Laboratory (LIX) of École Polytechnique, Paris, France. I am the head of the VISTA team and an Ellis member attached to the Paris Unit, I am also a contributing researcher at Archimedes and affiliated with Hi!Paris. In 2026, I received the Bronze Medal from CNRS for my research in AI.

My research goal is to develop generalizable methods applicable to various domains and my current focus is on multimodal generative AI, from the angle of efficiency, structured or multiple outputs, and medical applications! At Polytechnique, I am the main genAI researcher and I publish papers in the most prestigious computer vision conferences (CVPR, ICCV, ECCV) and top journals (T-PAMI, IJCV). I support Slow Science and Open Science.

Previously, I was a research fellow at VGG, University of Oxford, where I worked with Andrew Zisserman. I completed my PhD at the CALVIN group, University of Edinburgh and the THOTH team, INRIA Grenoble (previous name LEAR) advised by Vittorio Ferrari and Cordelia Schmid.

I am always looking for new collaborations and students! Email me, if you want to discuss or work with me!

News

  • May 2026 I am truly honored and happy and humbled to have received the
    CNRS Bronze Medal in AI in 2026, the highest award for junior researchers in France!!!
    Thank you to all who believed in me!

    Also, I am happy to have received the Outstanding Area Chair Award at CVPR 2026!
    I am co-organizing this year’s CVPR@Paris 2026 (thanks to ELLIS, Hi!Paris, SCAI, DIM and CNRS for the support) and two ECCV 2026 workshops: on “Story-Level Movie Understanding and Audio Description” and the “3rd Workshop on Audio-Visual Generation & Learning”.
  • Mar 2026 Big thank you to Google DeepMind and Hi!Paris for the Google Academic gift, Google Gemini 2026 award and the Hi!Paris chaire!!
  • Jun 2025 So many things!! I will serve as Program Chair for CVPR 2027.
    Also, I am serving as Diversity Chair & Area Chair for ICCV 2025 and I have just received a Hi!Paris chaire!

    Together with the great David and Matthieu, we are organising CVPR 2025 in Paris.
    Finally, Greeks in AI is almost there and is gaining serious momentum!
  • Jan 2025 Organising the Computer Vision Workshop at IP Paris. Details here
  • Nov 2024 I spent a wonderful week visiting Ivan Laptev and the whole CV team at MBZUAI!
  • Oct 2024 Our E.T. work is featured in the CV magazine, congrats to Robin!
    Happy to receive a grant from AMIAD. I will be serving as an AC for CVPR 2025.
  • Sep 2024 Wonderful experience talking to the Deep Learning Indaba (thanks Raoul and Benji!) and to the Paris GenAI autumn school.
  • Mar 2024 Happy to receive a Hi!Paris grant, a CIEDS grant and a Microsoft academic gift!
  • Jan 2024 I will be serving as Area Chair for WACV'24, ACCV'24 and ECCV'24.
  • Mar 2023 I received a Hi!Paris grant!
  • Dec 2022 Happy to be outstanding Area Chair for ACCV 2022!
  • Dec 2022 Happy that our paper received the best student honorable mention award at ACCV 2022!
  • Nov 2022 I will be serving as Area Chair for ICCV 2023.
  • Sep 2022 Happy to receive a Microsoft Academic gift for Azure Education Hub!
  • Jul 2022 Happy to receive funding for two of my projects: the Young Researchers in France (JCJC WhyBehindScenes) and for the ANR APATE!
  • Mar 2022 I will be serving as Area Chair for ACCV 2022.
  • Oct 2021 Happy and humbled to receive the best paper award at the CVEU ICCV-W!
  • Oct 2021 Happy to receive a DIM RFSI 2021 grant!
  • Sep 2021 I am outstanding reviewer for ICCV 2021!
  • Aug 2021 Alongside with ICCV 2021, we are organizing the Real-World Computer Vision from Inputs with Limited Quality Workshop (RLQ)!
  • Jul 2021 co-organizing the Doctoral Symposium of AIML Systems 2021.
  • Mar 2021 Alongside with CVPR 2021, we are organizing the 1st Workshop on Future Video Conferencing (FVC)!
  • Sep 2020 I have joined the GeoViC team at École Polytechnique.
  • Jul 2020 Rated as one of the top 215 reviewers in ECCV’20 and received the free registration.
  • May 2020 I will be serving as Area Chair for CVPR 2021.

People

Current Alumni

Selected publications

  1. MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency
    Nicolas Dufour, Lucas Degeorge, Arijit Ghosh, and 2 more authors
    In International Conference on Machine Learning (ICML) 2026
  2. IDES T: Assessing Self-Supervised Learning Representations via Intrinsic Dimension
    Julie Mordacq, Vicky Kalogeiton, and Steve Oudot
    In International Conference on Machine Learning (ICML) 2026
  3. Pulp Motion: framing-aware multimodal camera and human motion generation
    Robin Courant, Xi Wang, David Loiseaux, and 2 more authors
    In International Conference on Learning Representations (ICLR) 2026
  4. Soft-Di[M]O: Improving One-Step Discrete Image Generation with Soft Embeddings
    Yuanzhi Zhu, Xi Wang, Stéphane Lathuilière, and 1 more author
    In International Conference on Learning Representations (ICLR) 2026
  5. ProxiVideoFriends: Revisiting Proxemics through Temporal and Social Reasoning
    Isabel Jiménez-Velasco, Rafael Muñoz-Salinas, Vicky Kalogeiton, and 1 more author
    In Proceedings of the International Conference on Computer Vision Theory and Applications (VISAPP) 2026
  6. When Surgery Meets the Unknown: Uncertainty-Aware Open-Set Recognition for Surgery Phase Classification
    Stefan Geyer, Vicky Kalogeiton, and Alina Roitberg
    In Proceedings of the International Conference on Computer Vision Theory and Applications (VISAPP) 2026
  7. T-REGS: Minimum Spanning Tree Regularization for Self-Supervised Learning
    Julie Mordacq, David Loiseaux, Vicky Kalogeiton, and 1 more author
    In The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS) 2025
  8. Di[M]o: Distilling masked diffusion models into one-step generator
    Yuanzhi Zhu, Xi Wang, Stéphane Lathuilière, and 1 more author
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025
  9. Around the world in 80 timesteps: A generative approach to global visual geolocation
    Nicolas Dufour, Vicky* Kalogeiton, David* Picard, and 1 more author
    In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) 2025
  10. Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
    Luc Boudier, Loris Manganell, Eleftherios Tsonis, and 2 more authors
    In The British Machine Vision Conference (BMVC) 2025
  11. AKiRa: Augmentation Kit on Rays for optical video generation
    Xi Wang, Robin Courant, Marc Christie, and 1 more author
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025
  12. Lead: Latent realignment for human motion diffusion
    Nefeli Andreou, Xi Wang, Victoria Fernández Abrevaya, and 3 more authors
    In Computer Graphics Forum (CFG) 2025
  13. Story-level multimodal generativeAI: from understanding to generating visual data using multiple modalities
    Vicky Kalogeiton
    In Habilitation to direct research (HDR) 2024
  14. Analysis of Classifier-Free Guidance Weight Schedulers
    Xi Wang, Nicolas Dufour, Nefeli Andreou, and 4 more authors
    In Transactions on Machine Learning Research (TMLR) 2024
  15. ET the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness
    Robin Courant, Nicolas Dufour, Xi Wang, and 2 more authors
    In European Conference on Computer Vision (ECCV) 2024
  16. Your diffusion model is an implicit synthetic image detector
    Xi Wang, and Vicky Kalogeiton
    In European Conference on Computer Vision Workshop (ECCV-W) 2024
  17. Bridging Text and Image for Artist Style Transfer via Contrastive Learning
    Zhi-Song Liu, Li-Wen Wang, Jun Xiao, and 1 more author
    In European Conference on Computer Vision Workshop (ECCV-W) 2024
  18. Conditional Gradient-based Textual Inversion
    Xi Wang, and Vicky Kalogeiton
    In European Conference on Computer Vision Workshop (ECCV-W) 2024
  19. Don’t drop your samples! Coherence-aware training benefits Conditional diffusion
    Nicolas Dufour, Victor Besnier, Vicky Kalogeiton, and 1 more author
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  20. ADAPT: Multimodal Learning for Detecting Physiological Changes under Missing Modalities
    Julie Mordacq, Leo Milecki, Maria Vakalopoulou, and 2 more authors
    In Medical Imaging with Deep Learning (MIDL) 2024
  21. Collaborating Foundation models for Domain Generalized Semantic Segmentation
    Yasser Benigmim, Subhankar Roy, Slim Essid, and 2 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024
  22. FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild
    Zhi-Song Liu, Robin Courant, and Vicky Kalogeiton
    In International Journal of Computer Vision (IJCV) 2024
  23. Learning the What and How of Annotation in Video Object Segmentation
    Thanos Delatolas, Vicky Kalogeiton, and Dim P Papadopoulos
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024
  24. BluNF: Blueprint Neural Field
    Robin Courant, Xi Wang, Marc Christie, and 1 more author
    In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshop (ICCV-W) 2023
  25. MEDIMP: 3D Medical Images with clinical Prompts from limited tabular data for renal transplantation
    Leo Milecki, Vicky Kalogeiton, Sylvain Bodard, and 4 more authors
    In Medical Imaging with Deep Learning 2023
  26. One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models
    Yasser Benigmim, Subhankar Roy, Slim Essid, and 2 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPR-W) 2023
  27. Reward Function Design for Crowd Simulation via Reinforcement Learning
    Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettré, and 1 more author
    In Proceedings of the 16th ACM SIGGRAPH Conference on Motion, Interaction and Games (MIG) 2023
  28. Machine Learning for Brain Disorders: Transformers and Visual Transformers
    Robin Courant, Maika Edberg, Nicolas Dufour, and 1 more author
    Book Chapter Machine Learning for Brain Disorders, Springer 2023
  29. Name Your Style: Text-Guided Artistic Style Transfer
    Zhi-Song Liu, Li-Wen Wang, Wan-Chi Siu, and 1 more author
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPR-W) 2023
  30. FunnyNet: Audiovisual Learning of Funny Moments in Videos
    Zhi-Song Liu, Robin Courant, and Vicky Kalogeiton
    In Asian Conference on Computer Vision (ACCV) 2022
  31. SCAM! Transferring humans between images with Semantic Cross Attention Modulation
    Nicolas Dufour, David Picard, and Vicky Kalogeiton
    In European Conference on Computer Vision (ECCV) 2022
  32. Understanding reinforcement learned crowds
    Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettré, and 1 more author
    In Motion, Interaction and Games (MIG) 2022
  33. Contrastive Masked Transformers for Forecasting Renal Transplant Function
    Leo Milecki, Vicky Kalogeiton, Sylvain Bodard, and 4 more authors
    In International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2022
  34. Constrative Learning for Kidney Transplant Analysis using MRI data and Deep Convolutional Networks
    Leo Milecki, Vicky Kalogeiton, Sylvain Bodard, and 4 more authors
    In Medical Imaging with Deep Learning (MIDL) 2022
  35. A survey on reinforcement learning methods in character animation
    Ariel Kwiatkowski, Eduardo Alvarado, Vicky Kalogeiton, and 4 more authors
    In Computer Graphics Forum (Eurographics State-of-the-Art Report); arXiv:2203.04735 2022
  36. Name Your Style: An Arbitrary Artist-aware Image Style Transfer
    Zhi-Song Liu, Li-Wen Wang, Wan-Chi Siu, and 1 more author
    arXiv preprint arXiv:2202.13562 2022
  37. UGaitNet: Multimodal Gait Recognition With Missing Input Modalities
    Manuel J Marı́n-Jiménez, Francisco M Castro, Rubén Delgado-Escano, and 2 more authors
    IEEE Transactions on Information Forensics and Security (TIFS) 2021
  38. Face, body, voice: Video person-clustering with multiple modalities
    Andrew Brown, Vicky Kalogeiton, and Andrew Zisserman
    2021
  39. High-Level Features for Movie Style Understanding
    Robin Courant, Christophe Lino, Marc Christie, and 1 more author
    In ICCV 2021 Workshop on AI for Creative Video Editing and Understanding (ICCV-W) 2021
  40. Multimodal Gait Recognition Under Missing Modalities
    Rubén Delgado-Escano, Francisco M Castro, Nicolás Guil, and 2 more authors
    In 2021 IEEE International Conference on Image Processing (ICIP) 2021
  41. Me-NDT: Neural-backed Decision Tree for visual Explainability of deep Medical models
    Guanghui Fu, Ruiqian Wang, Jianqiang Li, and 2 more authors
    In Medical Imaging with Deep Learning (MIDL) 2021
  42. Multiple Style Transfer Via Variational Autoencoder
    Zhi-Song Liu, Vicky Kalogeiton, and Marie-Paule Cani
    In 2021 IEEE International Conference on Image Processing (ICIP) 2021
  43. LAEO-Net++: revisiting people Looking At Each Other in videos
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2021
  44. Constrained video face clustering using 1NN relations
    V Kalogeiton, and A Zisserman
    In The British Machine Vision Conference (BMVC) 2020
  45. Smooth-AP: Smoothing the path towards large-scale image retrieval
    Andrew Brown, Weidi Xie, Vicky Kalogeiton, and 1 more author
    In European Conference on Computer Vision (ECCV); arXiv:2007.12163 2020
  46. Real-time active SLAM and obstacle avoidance for an autonomous robot based on stereo vision
    Vicky Kalogeiton, Konstantinos Ioannidis, G Ch Sirakoulis, and 1 more author
    Cybernetics and Systems 2019
  47. LAEO-Net: revisiting people Looking At Each Other in videos
    Manuel J Marin-Jimenez, Vicky Kalogeiton, Pablo Medina-Suarez, and 1 more author
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2019
  48. Localizing spatially and temporally objects and actions in videos
    Vicky Kalogeiton
    PhD Thesis, University of Edinburgh, UK, Inria Grenoble 2017
  49. Joint learning of object and action detectors
    Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, and 1 more author
    In Proceedings of the IEEE International Conference on Computer Vision (ICCV) 2017
  50. Action tubelet detector for spatio-temporal action localization
    Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, and 1 more author
    In Proceedings of the IEEE International Conference on Computer Vision 2017
  51. Programmable crossbar quantum-dot cellular automata circuits
    Vicky Kalogeiton, Dim P Papadopoulos, Orestis Liolis, and 3 more authors
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2017
  52. Analysing domain shift factors between videos and images for object detection
    Vicky Kalogeiton, Vittorio Ferrari, and Cordelia Schmid
    IEEE transactions on pattern analysis and machine intelligence (TPAMI) 2016
  53. Bio-inspired electronic systems for crowd evacuation with bio-robot guidance
    Vicky Kalogeiton
    2015
  54. Cellular automaton model of crowd evacuation inspired by slime mould
    Vicky Kalogeiton, Dim P Papadopoulos, IP Georgilas, and 2 more authors
    International Journal of General Systems (IJGS) 2015
  55. Biomimicry of crowd evacuation with a slime mould cellular automaton model
    Vicky Kalogeiton, Dim P Papadopoulos, Ioannis P Georgilas, and 2 more authors
    In Computational Intelligence, Medicine and Biology 2015
  56. Hey Physarum! Can you Perform SLAM?
    Vicky Kalogeiton, Dim P Papadopoulos, and Georgios Ch Sirakoulis
    International Journal of Unconventional Computing 2014
  57. Morphological Edge Detector Implemented in Quantum Cellular Automata
    Orestis Liolis, Vicky Kalogeiton, Dim P Papadopoulos, and 3 more authors
    In Imaging Systems and Techniques (IST), 2013 IEEE International Conference on 2013
  58. Automatic summarization and annotation of videos with lack of metadata information
    Dim P Papadopoulos, Vicky Kalogeiton, Savvas A Chatzichristofis, and 1 more author
    Expert Systems with Applications 2013
  59. DUTH does Probabilities of Relevance at the Legal Track
    Dim P Papadopoulos, Vicky Kalogeiton, and Avi Arampatzis
    In The Nineteenth Text REtrieval Conference Proceedings (TREC 2010). National Institute of Standards and Technology (NIST) the Defense Advanced Research Projects Agency (DARPA) and the Advanced Research and Development Activity (ARDA) 2011
  60. A novel video summarization method based on the compact composite descriptors and fuzzy classifier
    Vicky Kalogeiton,  Papadopoulos, Dim P, and 2 more authors
    In 4th international conference for undergraduate and postgraduate students in computer engineering, informatics, related technologies and applications 2010

Teaching

2023-2024 2022-2023 2021-2022 2020-2021 Before 2020
  • Introduction to Computer Vision. Guest Lectures Lady Margaret Hall, St Hugh’s College, University of Oxford, 2019
  • Introduction to Computer Vision. Term Lectures, Oxford Royale Academy, 2018-2021

Misc

Service
  • Area Chair. ICCV 2025, CVPR 2025, ECCV 2024, ACCV 2024, WACV 2024, ICCV 2023, ACCV 2022, CVPR 2021
  • Associate Editor. CVIU 2024--, CMBBE: Imaging & Visualization 2017-2022
  • Conferences Program committee.
    • 2023: CVPR
    • 2022: CVPR
    • 2021: ICCV, TCSVT, AIMLSystems, AAAI, WiCV ICCV, WiCV CVPR
    • 2020: ECCV, CVPR, ACCV, BMVC, TCSVT, WiCV ECCV
    • 2019: CVPR, ICCV, BMVC, TCSVT, ICCV-W 'Neural Architects'
    • 2018: ECCV, CVPR, ACCV, NeurIPS, IVC, IMAVIS, ECCV-W 'Optical flow', WiCV CVPR, WiCV ECCV
    • 2017: TCSVT
  • Journal Reviewer.
    • 2024: TPAMI
    • 2023: TPAMI
    • 2022: TPAMI, CVIU, TIP
    • 2021: TPAMI, IJCN
    • 2020: TPAMI, IJCV, CVIU
    • 2019: TPAMI, IJCV, CVIU, TOM, TIP
    • 2018: TPAMI, IJCV, CVIU, TOM, TIP
Awards Miscellaneous