Vicky Kalogeiton

· · · ·

I am a Professor (HDR, 2024 from Polytechnique) in AI at the Computer Science Laboratory (LIX) of École Polytechnique, Paris, France. I am the head of the VISTA team and an Ellis member attached to the Paris Unit.

My research goal is to develop generalizable methods applicable to various domains and my current focus is on multimodal generative AI, from the angle of efficiency, structured or multiple outputs, and medical applications! At Polytechnique, I am the main genAI researcher and I publish papers in the most prestigious computer vision conferences (CVPR, ICCV, ECCV) and top journals (T-PAMI, IJCV). I support Slow Science and Open Science.

Previously, I was a research fellow at VGG, University of Oxford, where I worked with Andrew Zisserman. I completed my PhD at the CALVIN group, University of Edinburgh and the THOTH team, INRIA Grenoble (previous name LEAR) advised by Vittorio Ferrari and Cordelia Schmid.

I am always looking for new collaborations and students! Email me, if you want to discuss or work with me!

Open positions

I have a post-doc opening in multimodal generative AI. Apply if interested!
I am always looking for motivated students and researchers to join my group. Please consider applying if you are interested in generativeAI with a focus on multimodality!

News

Jun 2025 So many things!! I will serve as Program Chair for CVPR 2027.
Also, I am serving as Diversity Chair & Area Chair for ICCV 2025 and I have just received a Hi!Paris chaire!

Together with the great David and Matthieu, we are organising CVPR 2025 in Paris.
Finally, Greeks in AI is almost there and is gaining serious momentum!
Jan 2025 Organising the Computer Vision Workshop at IP Paris. Details here
Nov 2024 I spent a wonderful week visiting Ivan Laptev and the whole CV team at MBZUAI!
Oct 2024 Our E.T. work is featured in the CV magazine, congrats to Robin!
Happy to receive a grant from AMIAD. I will be serving as an AC for CVPR 2025.
Sep 2024 Wonderful experience talking to the Deep Learning Indaba (thanks Raoul and Benji!) and to the Paris GenAI autumn school.
Mar 2024 Happy to receive a Hi!Paris grant, a CIEDS grant and a Microsoft academic gift!
Jan 2024 I will be serving as Area Chair for WACV'24, ACCV'24 and ECCV'24.
Mar 2023 I received a Hi!Paris grant!
Dec 2022 Happy to be outstanding Area Chair for ACCV 2022!
Dec 2022 Happy that our paper received the best student honorable mention award at ACCV 2022!
Nov 2022 I will be serving as Area Chair for ICCV 2023.
Sep 2022 Happy to receive a Microsoft Academic gift for Azure Education Hub!
Jul 2022 Happy to receive funding for two of my projects: the Young Researchers in France (JCJC WhyBehindScenes) and for the ANR APATE!
Mar 2022 I will be serving as Area Chair for ACCV 2022.
Oct 2021 Happy and humbled to receive the best paper award at the CVEU ICCV-W!
Oct 2021 Happy to receive a DIM RFSI 2021 grant!
Sep 2021 I am outstanding reviewer for ICCV 2021!
Aug 2021 Alongside with ICCV 2021, we are organizing the Real-World Computer Vision from Inputs with Limited Quality Workshop (RLQ)!
Jul 2021 co-organizing the Doctoral Symposium of AIML Systems 2021.
Mar 2021 Alongside with CVPR 2021, we are organizing the 1st Workshop on Future Video Conferencing (FVC)!
Sep 2020 I have joined the GeoViC team at École Polytechnique.
Jul 2020 Rated as one of the top 215 reviewers in ECCV’20 and received the free registration.
May 2020 I will be serving as Area Chair for CVPR 2021.

People

Current

2025 - Luc Boudier: intern
2025 - Yuanzhi Zhu: PhD candidate
2025 - Lefteris Tsonis: PhD candidate
2024 - Xianjin Gong: PhD with Damien Rohmer at Polytechnique
2024 - Lucas Degeorge: PhD with David Picard at ENPC
2023 - Ridouane Ghermi: PhD with Ivan Laptev at MBZUAI
2022 - Robin Courant: PhD with Marc Christie at Inria Rennes
2022 - Julie Mordacq: PhD with Steve Oudot at Inria Saclay
2022 - Thanos Delatolas: PhD with Dim Papadopoulos at DTU
2021 - Nicolas Dufour: PhD with David Picard at ENPC
2021 - Yasser Benigmim: PhD with Slim Essid and Stéphane Lathuilière at Telecom Paris

Alumni

2023-2025Xi Wang: Post-doct
2022-2023 Nefeli Andreou: PhD visitor with ‪Victoria Fernández Abrevaya at MPI (now researcher at Amazon)
2021-2023 Léo Milecki: PhD collaborator with Maria Vakalopoulou at CentraleSupelec (now post-doc at Weill Cornell)
2020-2023 Ariel Kwiatkowski: PhD with Marie-Paule Cani and Julien Pettre at Inria Rennes (now researcher at AI Redefined)
2021-2022 - Dr. ZhiSong Liu (Post-doc), Polytechnique and Hong Kong Polytechnic (after at Dell, now Assis. Prof. at LUT)
2020-2021 - M. Edberg, L. Walewski, L. Milikic (BSc), École Polytechnique
2021-2021 - Isabel Jimenez Velasco with Manuel J Marin-Jimenez at University of Cordoba
2019-2021 - Andrew Brown, VGG, University of Oxford (now at Meta)
2019-2020 - Mohita Chowdhury (MSc), VGG, University of Oxford (now at Ufonia)

Selected publications

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Nicolas Dufour, Vicky Kalogeiton, David Picard, and 1 more author

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025

PDF Website Code arXiv
AKiRa: Augmentation Kit on Rays for optical video generation

Xi Wang, Robin Courant, Marc Christie, and 1 more author

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025

PDF Website Code arXiv
Story-level multimodal generativeAI: from understanding to generating visual data using multiple modalities

Vicky Kalogeiton

In Habilitation to direct research (HDR) 2024

Hal
Analysis of Classifier-Free Guidance Weight Schedulers

Xi Wang, Nicolas Dufour, Nefeli Andreou, and 4 more authors

In Transactions on Machine Learning Research (TMLR) 2024

PDF Hal arXiv
ET the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness

Robin Courant, Nicolas Dufour, Xi Wang, and 2 more authors

In European Conference on Computer Vision (ECCV) 2024

Website Code arXiv
Your diffusion model is an implicit synthetic image detector

Xi Wang, and Vicky Kalogeiton

In European Conference on Computer Vision Workshop (ECCV-W) 2024

Website Hal
Bridging Text and Image for Artist Style Transfer via Contrastive Learning

Zhi-Song Liu, Li-Wen Wang, Jun Xiao, and 1 more author

In European Conference on Computer Vision Workshop (ECCV-W) 2024

arXiv
Conditional Gradient-based Textual Inversion

Xi Wang, and Vicky Kalogeiton

In European Conference on Computer Vision Workshop (ECCV-W) 2024

Hal
Don’t drop your samples! Coherence-aware training benefits Conditional diffusion

Nicolas Dufour, Victor Besnier, Vicky Kalogeiton, and 1 more author

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

PDF Website Code arXiv
ADAPT: Multimodal Learning for Detecting Physiological Changes under Missing Modalities

Julie Mordacq, Leo Milecki, Maria Vakalopoulou, and 2 more authors

In Medical Imaging with Deep Learning (MIDL) 2024

PDF Code
Collaborating Foundation models for Domain Generalized Semantic Segmentation

Yasser Benigmim, Subhankar Roy, Slim Essid, and 2 more authors

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

PDF Code arXiv
FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild

Zhi-Song Liu, Robin Courant, and Vicky Kalogeiton

In International Journal of Computer Vision (IJCV) 2024

PDF Website Code Dataset arXiv
Learning the What and How of Annotation in Video Object Segmentation

Thanos Delatolas, Vicky Kalogeiton, and Dim P Papadopoulos

In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

PDF Website Code
BluNF: Blueprint Neural Field

Robin Courant, Xi Wang, Marc Christie, and 1 more author

In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshop (ICCV-W) 2023

PDF Code
MEDIMP: 3D Medical Images with clinical Prompts from limited tabular data for renal transplantation

Leo Milecki, Vicky Kalogeiton, Sylvain Bodard, and 4 more authors

In Medical Imaging with Deep Learning 2023

PDF Code
One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models

Yasser Benigmim, Subhankar Roy, Slim Essid, and 2 more authors

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPR-W) 2023

PDF Code arXiv
Reward Function Design for Crowd Simulation via Reinforcement Learning

Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettré, and 1 more author

In Proceedings of the 16th ACM SIGGRAPH Conference on Motion, Interaction and Games (MIG) 2023

PDF
Machine Learning for Brain Disorders: Transformers and Visual Transformers

Robin Courant, Maika Edberg, Nicolas Dufour, and 1 more author

Book Chapter Machine Learning for Brain Disorders, Springer 2023

PDF arXiv
Name Your Style: Text-Guided Artistic Style Transfer

Zhi-Song Liu, Li-Wen Wang, Wan-Chi Siu, and 1 more author

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPR-W) 2023

PDF
FunnyNet: Audiovisual Learning of Funny Moments in Videos

Zhi-Song Liu, Robin Courant, and Vicky Kalogeiton

In Asian Conference on Computer Vision (ACCV) 2022

[Oral] [Student Honorable mention Award] PDF Website Code Dataset arXiv
SCAM! Transferring humans between images with Semantic Cross Attention Modulation

Nicolas Dufour, David Picard, and Vicky Kalogeiton

In European Conference on Computer Vision (ECCV) 2022

Website Code Video Poster arXiv
Understanding reinforcement learned crowds

Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettré, and 1 more author

In Motion, Interaction and Games (MIG) 2022

Hal arXiv
Contrastive Masked Transformers for Forecasting Renal Transplant Function

Leo Milecki, Vicky Kalogeiton, Sylvain Bodard, and 4 more authors

In International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2022

Website Code Hal
Constrative Learning for Kidney Transplant Analysis using MRI data and Deep Convolutional Networks

Leo Milecki, Vicky Kalogeiton, Sylvain Bodard, and 4 more authors

In Medical Imaging with Deep Learning (MIDL) 2022

PDF Website Code
A survey on reinforcement learning methods in character animation

Ariel Kwiatkowski, Eduardo Alvarado, Vicky Kalogeiton, and 4 more authors

In Computer Graphics Forum (Eurographics State-of-the-Art Report); arXiv:2203.04735 2022

Hal arXiv
Name Your Style: An Arbitrary Artist-aware Image Style Transfer

Zhi-Song Liu, Li-Wen Wang, Wan-Chi Siu, and 1 more author

arXiv preprint arXiv:2202.13562 2022

arXiv
UGaitNet: Multimodal Gait Recognition With Missing Input Modalities

Manuel J Marı́n-Jiménez, Francisco M Castro, Rubén Delgado-Escano, and 2 more authors

IEEE Transactions on Information Forensics and Security (TIFS) 2021

PDF Website Code
Face, body, voice: Video person-clustering with multiple modalities

Andrew Brown, Vicky Kalogeiton, and Andrew Zisserman

2021

[Spotlight] Website Code Dataset Hal arXiv
High-Level Features for Movie Style Understanding

Robin Courant, Christophe Lino, Marc Christie, and 1 more author

In ICCV 2021 Workshop on AI for Creative Video Editing and Understanding (ICCV-W) 2021

[Best Paper Award] PDF Video Hal
Multimodal Gait Recognition Under Missing Modalities

Rubén Delgado-Escano, Francisco M Castro, Nicolás Guil, and 2 more authors

In 2021 IEEE International Conference on Image Processing (ICIP) 2021

Website Code Colab Hal
Me-NDT: Neural-backed Decision Tree for visual Explainability of deep Medical models

Guanghui Fu, Ruiqian Wang, Jianqiang Li, and 2 more authors

In Medical Imaging with Deep Learning (MIDL) 2021

PDF Hal
Multiple Style Transfer Via Variational Autoencoder

Zhi-Song Liu, Vicky Kalogeiton, and Marie-Paule Cani

In 2021 IEEE International Conference on Image Processing (ICIP) 2021

PDF Website Poster Hal
LAEO-Net++: revisiting people Looking At Each Other in videos

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2021

PDF Website Code Dataset Hal arXiv
Constrained video face clustering using 1NN relations

V Kalogeiton, and A Zisserman

In The British Machine Vision Conference (BMVC) 2020

PDF Website Code Dataset
Smooth-AP: Smoothing the path towards large-scale image retrieval

Andrew Brown, Weidi Xie, Vicky Kalogeiton, and 1 more author

In European Conference on Computer Vision (ECCV); arXiv:2007.12163 2020

PDF Website Code arXiv
Real-time active SLAM and obstacle avoidance for an autonomous robot based on stereo vision

Vicky Kalogeiton, Konstantinos Ioannidis, G Ch Sirakoulis, and 1 more author

Cybernetics and Systems 2019
LAEO-Net: revisiting people Looking At Each Other in videos

Manuel J Marin-Jimenez, Vicky Kalogeiton, Pablo Medina-Suarez, and 1 more author

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2019

PDF Website Code Dataset arXiv
Localizing spatially and temporally objects and actions in videos

Vicky Kalogeiton

PhD Thesis, University of Edinburgh, UK, Inria Grenoble 2017

PDF Hal
Joint learning of object and action detectors

Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, and 1 more author

In Proceedings of the IEEE International Conference on Computer Vision (ICCV) 2017

PDF Website Code Hal
Action tubelet detector for spatio-temporal action localization

Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, and 1 more author

In Proceedings of the IEEE International Conference on Computer Vision 2017

PDF Website Code Hal arXiv
Programmable crossbar quantum-dot cellular automata circuits

Vicky Kalogeiton, Dim P Papadopoulos, Orestis Liolis, and 3 more authors

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2017

arXiv
Analysing domain shift factors between videos and images for object detection

Vicky Kalogeiton, Vittorio Ferrari, and Cordelia Schmid

IEEE transactions on pattern analysis and machine intelligence (TPAMI) 2016

PDF Website Dataset arXiv
Bio-inspired electronic systems for crowd evacuation with bio-robot guidance

Vicky Kalogeiton

2015

PDF
Cellular automaton model of crowd evacuation inspired by slime mould

Vicky Kalogeiton, Dim P Papadopoulos, IP Georgilas, and 2 more authors

International Journal of General Systems (IJGS) 2015

PDF
Biomimicry of crowd evacuation with a slime mould cellular automaton model

Vicky Kalogeiton, Dim P Papadopoulos, Ioannis P Georgilas, and 2 more authors

In Computational Intelligence, Medicine and Biology 2015

PDF
Hey Physarum! Can you Perform SLAM?

Vicky Kalogeiton, Dim P Papadopoulos, and Georgios Ch Sirakoulis

International Journal of Unconventional Computing 2014
Morphological Edge Detector Implemented in Quantum Cellular Automata

Orestis Liolis, Vicky Kalogeiton, Dim P Papadopoulos, and 3 more authors

In Imaging Systems and Techniques (IST), 2013 IEEE International Conference on 2013

PDF
Automatic summarization and annotation of videos with lack of metadata information

Dim P Papadopoulos, Vicky Kalogeiton, Savvas A Chatzichristofis, and 1 more author

Expert Systems with Applications 2013

PDF
DUTH does Probabilities of Relevance at the Legal Track

Dim P Papadopoulos, Vicky Kalogeiton, and Avi Arampatzis

In The Nineteenth Text REtrieval Conference Proceedings (TREC 2010). National Institute of Standards and Technology (NIST) the Defense Advanced Research Projects Agency (DARPA) and the Advanced Research and Development Activity (ARDA) 2011

PDF
A novel video summarization method based on the compact composite descriptors and fuzzy classifier

Vicky Kalogeiton, Papadopoulos, Dim P, and 2 more authors

In 4th international conference for undergraduate and postgraduate students in computer engineering, informatics, related technologies and applications 2010

[Best Paper Award]

Teaching

2023-2024

co-responsible MScT Artificial Intelligence & Visual Computing École Polytechnique (20 hours)
INF581A: Advanced Deep Learning École Polytechnique (10 hours)
INF649: Advanced Computer Vision École Polytechnique (30 hours)
INF473V: Computer Vision with Deep Learning, École Polytechnique (60 hours)

2022-2023

INF634: Advanced Computer Vision École Polytechnique (30 hours)
Deep Learning, MVA. Guest Lecture (2 hours)
INF473V: Computer Vision with Deep Learning, École Polytechnique (60 hours)
Generative AI, Hi!Paris Summer School

2021-2022

INF634: Advanced Computer Vision École Polytechnique (30 hours)
VIC: Vision par Ordinateur. Guest Lecture, CentraleSupelec (2 hours)
INF573: Image Analysis and Computer Vision, École Polytechnique (25 hours)
INF473V: Computer Vision with Deep Learning, École Polytechnique (1.5 hours)

2020-2021

INF634: Advanced Computer Vision École Polytechnique (30 hours)
INF573: Image Analysis and Computer Vision, École Polytechnique (25 hours)
INF473V: Computer Vision with Deep Learning, École Polytechnique (50 hours)
Introduction to Computer Vision. Term Lecture, Oxford Royale Academy (2 hours)

Before 2020

Introduction to Computer Vision. Guest Lectures Lady Margaret Hall, St Hugh’s College, University of Oxford, 2019
Introduction to Computer Vision. Term Lectures, Oxford Royale Academy, 2018-2021

Misc

Service

Area Chair. ICCV 2025, CVPR 2025, ECCV 2024, ACCV 2024, WACV 2024, ICCV 2023, ACCV 2022, CVPR 2021
Associate Editor. CVIU 2024--, CMBBE: Imaging & Visualization 2017-2022
Conferences Program committee.
- 2023: CVPR
- 2022: CVPR
- 2021: ICCV, TCSVT, AIMLSystems, AAAI, WiCV ICCV, WiCV CVPR
- 2020: ECCV, CVPR, ACCV, BMVC, TCSVT, WiCV ECCV
- 2019: CVPR, ICCV, BMVC, TCSVT, ICCV-W 'Neural Architects'
- 2018: ECCV, CVPR, ACCV, NeurIPS, IVC, IMAVIS, ECCV-W 'Optical flow', WiCV CVPR, WiCV ECCV
- 2017: TCSVT
Journal Reviewer.
- 2024: TPAMI
- 2023: TPAMI
- 2022: TPAMI, CVIU, TIP
- 2021: TPAMI, IJCN
- 2020: TPAMI, IJCV, CVIU
- 2019: TPAMI, IJCV, CVIU, TOM, TIP
- 2018: TPAMI, IJCV, CVIU, TOM, TIP

Awards

Highlight paper (11% of papers) at CVPR 2024
Student honorable mention award at ACCV 2022
Outstanding AC award at ACCV 2022
Best paper award at ICCV-W Creative Video Editing and Understanding, 2021.
Outstanding Reviewer Award. ICCV 2021, ECCV 2020, ICCV 2019, CVPR 2018, "Neural Architects" ICCV-W 2019
Emergency Reviewer Award. ICCV 2021, ECCV 2020, ICCV 2019, ECCV 2018
Best Poster Award. Université Grenoble Alpes, France, 2016
Best Master Thesis Award, Valedictorian. DUTh Greece, 2013
Scholarship from the DUTh Research Committee, 2013
Best Paper Award at EUREKA! 2010
Scholarship for Erasmus at the University of Deusto, Spain, 2009
Scholarship (IKY) for performance during undergraduate studies, DUTh, 2008-2009

Miscellaneous

Press: Interview in ICCV 2017 Daily, Best of ICCV 2017, Women in computer vision, Our CVPR2019 paper in CVPR 2019 Daily, Science and Video Games Conference: AI and Cooperation 2021
Organizing Committees: Real-World Computer Vision from Inputs with Limited Quality Workshop (RLQ) at ICCV 2021, Doctoral Symposium at AIMLSystems 2021, 1st Workshop on Future Video Conferencing (FVC) at CVPR 2021