Vicky Kalogeiton
I am the Assistant Professor (MCF) in Computer Vision at École Polytechnique, Paris, France. I am affiated with the VISTA team (previously GeoViC) at the Computer Science Laboratory (LIX). My research interests focus on multimodal learning split into three axes: generativeAI (text-to-image generation), video understanding using text and audio, and multimodal medical applications! At Polytechnique, I am the main genAI researcher and I publish papers in the most prestigious computer vision conferences (CVPR, ICCV, ECCV) and top journals (T-PAMI, IJCV).
Previously, I was a research fellow at VGG, University of Oxford, where I worked with Andrew Zisserman. I completed my PhD at the CALVIN group, University of Edinburgh and the THOTH team, INRIA Grenoble (previous name LEAR) advised by Vittorio Ferrari and Cordelia Schmid.
I am always looking for new collaborations and students! Email me, if you want to discuss or work with me!
Open positions
- I have a post-doc opening in multimodal generative AI. Apply if interested!
- I am always looking for motivated students and researchers to join my group. Please consider applying if you are interested in generativeAI with a focus on multimodality!
News
- Nov 2024 I spent a wonderful week visiting Ivan Laptev and the whole CV team at MBZUAI!
- Oct 2024 Our E.T. work is featured in the CV magazine, congrats to Robin!
Happy to receive a grant from AMIAD. I will be serving as an AC for CVPR 2025. - Sep 2024 Wonderful experience talking to the Deep Learning Indaba (thanks Raoul and Benji!) and to the Paris GenAI autumn school.
- Mar 2024 Happy to receive a Hi!Paris grant, a CIEDS grant and a Microsoft academic gift!
- Jan 2024 I will be serving as Area Chair for WACV'24, ACCV'24 and ECCV'24.
- Mar 2023 I received a Hi!Paris grant!
- Dec 2022 Happy to be outstanding Area Chair for ACCV 2022!
- Dec 2022 Happy that our paper received the best student honorable mention award at ACCV 2022!
- Nov 2022 I will be serving as Area Chair for ICCV 2023.
- Sep 2022 Happy to receive a Microsoft Academic gift for Azure Education Hub!
- Jul 2022 Happy to receive funding for two of my projects: the Young Researchers in France (JCJC WhyBehindScenes) and for the ANR APATE!
- Mar 2022 I will be serving as Area Chair for ACCV 2022.
- Oct 2021 Happy and humbled to receive the best paper award at the CVEU ICCV-W!
- Oct 2021 Happy to receive a DIM RFSI 2021 grant!
- Sep 2021 I am outstanding reviewer for ICCV 2021!
- Aug 2021 Alongside with ICCV 2021, we are organizing the Real-World Computer Vision from Inputs with Limited Quality Workshop (RLQ)!
- Jul 2021 co-organizing the Doctoral Symposium of AIML Systems 2021.
- Mar 2021 Alongside with CVPR 2021, we are organizing the 1st Workshop on Future Video Conferencing (FVC)!
- Sep 2020 I have joined the GeoViC team at École Polytechnique.
- Jul 2020 Rated as one of the top 215 reviewers in ECCV’20 and received the free registration.
- May 2020 I will be serving as Area Chair for CVPR 2021.
People
Current- 2024 - Yuanzhi Zhu: pre-doc position
- 2024 - Lefteris Tsonis: pre-doc position
- 2024 - Xianjin Gong: PhD with Damien Rohmer at Polytechnique
- 2024 - Lucas Degeorge: PhD with David Picard at ENPC
- 2023 - Ridouane Ghermi: PhD with Ivan Laptev at MBZUAI
- 2023 - Xi Wang: Post-doct
- 2022 - Robin Courant: PhD with Marc Christie at Inria Rennes
- 2022 - Julie Mordacq: PhD with Steve Oudot at Inria Saclay
- 2022 - Thanos Delatolas: PhD with Dim Papadopoulos at DTU
- 2021 - Nicolas Dufour: PhD with David Picard at ENPC
- 2021 - Yasser Benigmim: PhD with Slim Essid and Stéphane Lathuilière at Telecom Paris
- 2022-2023 Nefeli Andreou: PhD visitor with Victoria Fernández Abrevaya at MPI (now researcher at Amazon)
- 2021-2023 Léo Milecki: PhD collaborator with Maria Vakalopoulou at CentraleSupelec (now post-doc at Weill Cornell)
- 2020-2023 Ariel Kwiatkowski: PhD with Marie-Paule Cani and Julien Pettre at Inria Rennes (now researcher at AI Redefined)
- 2021-2022 - Dr. ZhiSong Liu (Post-doc), Polytechnique and Hong Kong Polytechnic (after at Dell, now Assis. Prof. at LUT)
- 2020-2021 - M. Edberg, L. Walewski, L. Milikic (BSc), École Polytechnique
- 2021-2021 - Isabel Jimenez Velasco with Manuel J Marin-Jimenez at University of Cordoba
- 2019-2021 - Andrew Brown, VGG, University of Oxford (now at Meta)
- 2019-2020 - Mohita Chowdhury (MSc), VGG, University of Oxford (now at Ufonia)
Selected publications
- Story-level multimodal generativeAI: from understanding to generating visual data using multiple modalitiesIn Habilitation to direct research (HDR) 2024
- Bridging Text and Image for Artist Style Transfer via Contrastive LearningIn European Conference on Computer Vision Workshop (ECCV-W) 2024
- Conditional Gradient-based Textual InversionIn European Conference on Computer Vision Workshop (ECCV-W) 2024
- Reward Function Design for Crowd Simulation via Reinforcement LearningIn Proceedings of the 16th ACM SIGGRAPH Conference on Motion, Interaction and Games (MIG) 2023
- Name Your Style: Text-Guided Artistic Style TransferIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPR-W) 2023
- Name Your Style: An Arbitrary Artist-aware Image Style TransferarXiv preprint arXiv:2202.13562 2022
- High-Level Features for Movie Style UnderstandingIn ICCV 2021 Workshop on AI for Creative Video Editing and Understanding (ICCV-W) 2021
- Real-time active SLAM and obstacle avoidance for an autonomous robot based on stereo visionCybernetics and Systems 2019
- Programmable crossbar quantum-dot cellular automata circuitsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2017
-
- Cellular automaton model of crowd evacuation inspired by slime mouldInternational Journal of General Systems (IJGS) 2015
- Biomimicry of crowd evacuation with a slime mould cellular automaton modelIn Computational Intelligence, Medicine and Biology 2015
- Hey Physarum! Can you Perform SLAM?International Journal of Unconventional Computing 2014
- Morphological Edge Detector Implemented in Quantum Cellular AutomataIn Imaging Systems and Techniques (IST), 2013 IEEE International Conference on 2013
- Automatic summarization and annotation of videos with lack of metadata informationExpert Systems with Applications 2013
- DUTH does Probabilities of Relevance at the Legal TrackIn The Nineteenth Text REtrieval Conference Proceedings (TREC 2010). National Institute of Standards and Technology (NIST) the Defense Advanced Research Projects Agency (DARPA) and the Advanced Research and Development Activity (ARDA) 2011
- A novel video summarization method based on the compact composite descriptors and fuzzy classifierIn 4th international conference for undergraduate and postgraduate students in computer engineering, informatics, related technologies and applications 2010
Teaching
2023-2024- co-responsible MScT Artificial Intelligence & Visual Computing École Polytechnique (20 hours)
- INF581A: Advanced Deep Learning École Polytechnique (10 hours)
- INF649: Advanced Computer Vision École Polytechnique (30 hours)
- INF473V: Computer Vision with Deep Learning, École Polytechnique (60 hours)
- INF634: Advanced Computer Vision École Polytechnique (30 hours)
- Deep Learning, MVA. Guest Lecture (2 hours)
- INF473V: Computer Vision with Deep Learning, École Polytechnique (60 hours)
- Generative AI, Hi!Paris Summer School
- INF634: Advanced Computer Vision École Polytechnique (30 hours)
- VIC: Vision par Ordinateur. Guest Lecture, CentraleSupelec (2 hours)
- INF573: Image Analysis and Computer Vision, École Polytechnique (25 hours)
- INF473V: Computer Vision with Deep Learning, École Polytechnique (1.5 hours)
- INF634: Advanced Computer Vision École Polytechnique (30 hours)
- INF573: Image Analysis and Computer Vision, École Polytechnique (25 hours)
- INF473V: Computer Vision with Deep Learning, École Polytechnique (50 hours)
- Introduction to Computer Vision. Term Lecture, Oxford Royale Academy (2 hours)
- Introduction to Computer Vision. Guest Lectures Lady Margaret Hall, St Hugh’s College, University of Oxford, 2019
- Introduction to Computer Vision. Term Lectures, Oxford Royale Academy, 2018-2021
Misc
Service- Area Chair. ECCV 2024, ACCV 2024, WACV 2024, ICCV 2023, ACCV 2022, CVPR 2021
- Associate Editor. CMBBE: Imaging & Visualization 2017-2022
- Conferences Program committee.
- 2023: CVPR
- 2022: CVPR
- 2021: ICCV, TCSVT, AIMLSystems, AAAI, WiCV ICCV, WiCV CVPR
- 2020: ECCV, CVPR, ACCV, BMVC, TCSVT, WiCV ECCV
- 2019: CVPR, ICCV, BMVC, TCSVT, ICCV-W 'Neural Architects'
- 2018: ECCV, CVPR, ACCV, NeurIPS, IVC, IMAVIS, ECCV-W 'Optical flow', WiCV CVPR, WiCV ECCV
- 2017: TCSVT
- Journal Reviewer.
- 2023: TPAMI
- 2022: TPAMI, CVIU, TIP
- 2021: TPAMI, IJCN
- 2020: TPAMI, IJCV, CVIU
- 2019: TPAMI, IJCV, CVIU, TOM, TIP
- 2018: TPAMI, IJCV, CVIU, TOM, TIP
- Student honorable mention award at ACCV 2022
- Best paper award at ICCV-W Creative Video Editing and Understanding, 2021.
- Outstanding Reviewer Award. ICCV 2021, ECCV 2020, ICCV 2019, CVPR 2018, "Neural Architects" ICCV-W 2019
- Emergency Reviewer Award. ICCV 2021, ECCV 2020, ICCV 2019, ECCV 2018
- Best Poster Award. Université Grenoble Alpes, France, 2016
- Best Master Thesis Award, Valedictorian. DUTh Greece, 2013
- Scholarship from the DUTh Research Committee, 2013
- Best Paper Award at EUREKA! 2010
- Scholarship for Erasmus at the University of Deusto, Spain, 2009
- Scholarship (IKY) for performance during undergraduate studies, DUTh, 2008-2009
- Press: Interview in ICCV 2017 Daily, Best of ICCV 2017, Women in computer vision, Our CVPR2019 paper in CVPR 2019 Daily, Science and Video Games Conference: AI and Cooperation 2021
- Organizing Committees: Real-World Computer Vision from Inputs with Limited Quality Workshop (RLQ) at ICCV 2021, Doctoral Symposium at AIMLSystems 2021, 1st Workshop on Future Video Conferencing (FVC) at CVPR 2021