Nicolas Dufour personal website

About me

I’m currently working as a Post doc at Kyutai, working with Patrick Perez. I’ve completed my PhD at IMAGINE (ENPC) and VISTA (Ecole Polytechnique) labs. My PhD thesis focused on efficient conditional generative models, under the supervision of David Picard and Vicky Kalogeiton. Before my PhD, I graduated from the MVA master (Mathematiques, Vision et Apprentissage) at ENS Paris Saclay and I got my engineering degree from Telecom SudParis following the MSA specialization (Mathematics and Statistical Modeling). I’ve also done an internship at Meta, where i worked on Model Based Reinforcement Learning. During my PhD, I have worked on GANs and diffusion models. I also have a high interest in the current development of the field of generative models around Large Language Models.

News

May, 2026

I will be visiting UC Berkeley this summer to work with Alyosha Efros.

Apr, 2026

Our work MIRO got accepted at ICML 2026!

Mar, 2026

Our work PoM got accepted as a Findings paper at CVPR 2026! See you in Denver.

Jan, 2026

Started working at Kyutai as a PostDoc!

Publications

2026

MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

Nicolas Dufour, Lucas Degeorge*, Arijit Ghosh *, David Picard^†, Vicky Kalogeiton^†

ICML 2026

Website Paper Code Abstract Bibtex

@inproceedings{dufour2026miro, 
     title           ={MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency}, 
     author          ={Nicolas Dufour and Lucas Degeorge and Arijit Ghosh and David Picard and Vicky Kalogeiton}, 
     year            ={2026}, 
     booktitle       ={International Conference on Machine Learning (ICML)}, 
 }

PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer

David Picard, Nicolas Dufour, Lucas Degeorge, Arijit Ghosh, Davide Allegro, Tom Ravaud, Yohann Perron, Corentin Sautier, Zeynep Sonat Baltaci, Fei Meng, Syrine Kalleli, Marta López-Rauhut, Thibaut Loiseau, Ségolène Albouy, Raphael Baena, Elliot Vincent, Loic Landrieu

CVPR Findings

Paper Code Abstract Bibtex

@article{picard2026pom,
    author    = {David Picard and Nicolas Dufour and Lucas Degeorge and Arijit Ghosh and Davide Allegro and Tom Ravaud and Yohann Perron and Corentin Sautier and Zeynep Sonat Baltaci and Fei Meng and Syrine Kalleli and Marta López-Rauhut and Thibaut Loiseau and Ségolène Albouy and Raphael Baena and Elliot Vincent and Loic Landrieu},
    title     = {PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer},
    journal   = {arXiv},
    year      = {2026},
 }

One View Is Enough! Monocular Training for In-the-Wild Novel View Generation

Adrien Ramanana Rahary, Nicolas Dufour, Patrick Perez, David Picard

Arxiv Preprint

Paper Code Abstract Bibtex

@article{rahary2026one_view, 
     title           ={One View Is Enough! Monocular Training for In-the-Wild Novel View Generation}, 
     author          ={Adrien Ramanana Rahary and Nicolas Dufour and Patrick Perez and David Picard}, 
     year            ={2026}, 
     journal         ={arXiv}, 
 }

2025

DIPSY: Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance

Luc Boudier, Loris Manganelli, Eleftherios Tsonis, Nicolas Dufour, Vicky Kalogeiton

BMVC 2025

Website Paper Code Abstract Bibtex

@inproceedings{boudier2025dipsy,
    author    = {Luc Boudier and Loris Manganelli and Eleftherios Tsonis and Nicolas Dufour and Vicky Kalogeiton},
    title     = {DIPSY: Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance},
    booktitle = {BMVC},
    year      = {2025},
 }

PhD Thesis

Controllability and Efficiency in Generative Models

Nicolas Dufour

PhD Thesis, École des Ponts ParisTech

Website Paper Abstract Bibtex

@phdthesis{dufour2025thesis, 
    title  = {Controllability and Efficiency in Generative Models}, 
    author = {Nicolas Dufour}, 
    year   = {2025}, 
    school = {École des Ponts ParisTech}, 
    type   = {PhD Thesis} 
 }

How far can we go with ImageNet for Text-to-Image generation?

Lucas Degeorge*, Arijit Ghosh *, Nicolas Dufour, David Picard^†, Vicky Kalogeiton^†

Arxiv Preprint

Website Paper Code Abstract Bibtex

@article{dufour2024world80timestepsgenerative, 
     title           ={How far can we go with ImageNet for Text-to-Image generation?}, 
     author          ={Lucas Degeorge and Arijit Ghosh and Nicolas Dufour and David Picard and Vicky Kalogeiton}, 
     year            ={2025}, 
     journal         ={arXiv}, 
 }

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Nicolas Dufour, David Picard, Vicky Kalogeiton, Loic Landrieu

CVPR 2025

Website Paper Code Abstract Bibtex

@article{dufour2024world80timestepsgenerative, 
     title           ={Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation}, 
     author          ={Nicolas Dufour and David Picard and Vicky Kalogeiton and Loic Landrieu}, 
     year            ={2025}, 
     journal         ={CVPR}, 
 }

2024

Analysis of Classifier-Free Guidance Weight Schedulers

Xi Wang, Nicolas Dufour, Nefeli Andreou, Victoria Fernandez Abrevaya, Marie-Paule Cani, David Picard*, Vicky Kalogeiton*

TMLR

Paper Abstract Bibtex

@article{wang2024analysis, 
    title={Analysis of Classifier-Free Guidance Weight Schedulers}, 
    author={Xi Wang and Nicolas Dufour and Nefeli Andreou and Marie-Paule Cani
    and Victoria Fernandez Abrevaya and David Picard and Vicky Kalogeiton}, 
    journal={TMLR}, 
    year={2024} 
 }

E.T. the Exceptional Trajectories: Text-to-Camera-Trajectory Generation with Character Awareness.

Robin Courant, Nicolas Dufour, Xi Wang, Marc Christie, Vicky Kalogeiton

ECCV 2024

Website Paper Code Abstract Bibtex

@article{courant2024et,
    author    = {Robin Courant and Nicolas Dufour and Xi Wang and Marc Christie and Vicky Kalogeiton},
    title     = {E.T. the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness},
    journal   = {arXiv},
    year      = {2024},
 }

Don't drop your samples! Coherence-aware training benefits Conditional diffusion

Nicolas Dufour, Victor Besnier, Vicky Kalogeiton, David Picard

CVPR 2024 (Highlight)

Website Paper Code Abstract Bibtex

@article{dufour2024dont, 
    title={Don't drop your samples! Coherence-aware training benefits Conditional diffusion}, 
    author={Dufour, Nicolas and Besnier, Victor and Kalogeiton, Vicky and Picard, David}, 
    booktitle={CVPR}, 
    year={2024}, 
 }

OpenStreetView-5M: The Many Roads to Global Visual Geolocation

Guillaume Astruc*, Nicolas Dufour*, Ioannis Siglidis*, Constantin Aronssohn, Nacim Bouia, Stephanie Fu, Romain Loiseau, Van Nguyen Nguyen, Charles Raude, Elliot Vincent, Lintao XU, Hongyu Zhou, Loic Landrieu

CVPR 2024

Website Paper Code Abstract Bibtex

@article{astruc2024openstreetview5m, 
    title={OpenStreetView-5M: The Many Roads to Global Visual Geolocation}, 
    author={Guillaume Astruc and Nicolas Dufour and Ioannis Siglidis 
    and Constantin Aronssohn and Nacim Bouia and Stephanie Fu and Romain Loiseau 
    and Van Nguyen Nguyen and Charles Raude and Elliot Vincent and Lintao XU 
    and Hongyu Zhou and Loic Landrieu}, 
    journal={CVPR}, 
    year={2024} 
 }

2023

Machine Learning for Brain Disorders: Transformers and Visual Transformers

Robin Courant, Maika Edberg, Nicolas Dufour, Vicky Kalogeiton

Springer, Machine Learning for Brain Disorders

Paper Abstract Bibtex

@incollection{courant2012transformers, 
    title={Transformers and Visual Transformers}, 
    author={Courant, Robin and Edberg, Maika and Dufour, Nicolas and Kalogeiton, Vicky}, 
    booktitle={Machine Learning for Brain Disorders}, 
    pages={193--229}, 
    year={2012}, 
    publisher={Springer} 
 }

2022

SCAM! Transferring humans between images with Semantic Cross Attention Modulation

Nicolas Dufour, Vicky Kalogeiton, David Picard

ECCV 2022

Website Paper Code Abstract Bibtex

@article{dufour2022scam, 
    title={Scam! transferring humans between images with semantic cross attention modulation}, 
    author={Dufour, Nicolas and Picard, David and Kalogeiton, Vicky}, 
    booktitle={European Conference on Computer Vision}, 
    pages={713--729}, 
    year={2022}, 
    organization={Springer} 
 }

Teaching

Apr 2024 - Jun 2024

Teaching Assistant for INF473V - Modal d'informatique - Deep Learning in Computer Vision at Ecole Polytechnique

Helped supervise the practical sessions of the course.
Created a Kaggle competition for the students end of the course project. The goal was to create a classifier but we only provided them with the val and test data. They needed to create their own training data with pretrained diffusion models. The goal was to classify among 37 different cheeses.

Feb 2023 - Jun 2023

Teaching Assistant for INF473V - Modal d'informatique - Deep Learning in Computer Vision at Ecole Polytechnique

Helped supervise the practical sessions of the course.
Built a new practical session on Transformers.
Created a Kaggle competition for the students end of the course project. The goal was to classify synthetic images in a weakly supervised setting.

Nov 2022

Teaching Assistant for INF573 - Image Analysis and Computer Vision at Ecole Polytechnique

Helped supervise students projects.

Open Source

Contributions

pytorch / rl

Lightning-AI / metrics

huggingface / diffusers

Projects

nicolas-dufour / SCAM

gastruc / osv5m

nicolas-dufour / cad

robincourant / DIRECTOR

nicolas-dufour / plonk

Miscellaneous

Talks

2025

Talk at MIT in Philip Isola's group: Controllable and Efficient Generative Models (December 18, 2025)

Talk at Cornell Tech in Andrew Owens's group: Controllable and Efficient Generative Models (December 17, 2025)

Talk at NYU in Saining Xie's group: Controllable and Efficient Generative Models (December 16, 2025)

Talk at BAIR in Alyosha Efros's group: Controllable and Efficient Generative Models (December 10, 2025)

Talk at ValeoAI: Controllable and Efficient Generative Models (October 30, 2025)

Talk at Kyutai: Controllable and Efficient Generative Models (October 23, 2025)

Podcast Underscore_: Around the World in 80 Timesteps (June 25, 2025)

2024

Talk at TUM in Zeynep Akata's group: Conditional Generative Models (September 19, 2024)

2023

IMAGINE Seminar: LLM advances in 2023 (April 12, 2023)

IMAGINE Seminar: Recent advances in diffusion models (April 12, 2023)

2022

IMAGINE Seminar: Presentation of the Perceiver papers (April 20, 2022)

GeoVic Seminar: SCAM! Transferring humans between images with Semantic Cross Attention Modulation (February 10, 2022)

IMAGINE Seminar: Tutorial on GANs (January 05, 2022)

2021

IMAGINE Seminar: SCAM! Transferring humans between images with Semantic Cross Attention Modulation (October 06, 2021)

Reviewer

ICML 2026 (July, 2026)
CVPR 2026 (June, 2026)
ICLR 2026 (April, 2026)
NeurIPS 2025 (December, 2025)
ICCV 2025 (August, 2025)
CVPR 2025 (June, 2025)
ECCV 2024 (October, 2024)
CVPR 2024 (June, 2024)
WACV 2024 (January, 2024)
ICCV 2023 (October, 2023)
ICCV 2023 (October, 2023)
ACCV 2022 (December, 2022)

Awards

Outstanding reviewer award at Neurips 2025 (Top 8%) (December 08, 2025)
Outstanding reviewer award at (ACCV 2022) (December 04, 2022)

Others

I have co-organized the first edition of IMAGINE Hackaton. The topic was geolocation (July 26, 2023)
I'm co-organizing the IMAGINE reading group for 2023-2024 (June 01, 2023)

Nicolas DUFOUR

PostDoc at Kyutai

About me

News

Publications

2026

2025

2024

2023

2022

Teaching

Open Source

Contributions

Projects

Miscellaneous

Talks

2025

2024

2023

2022

2021

Reviewer

Awards

Others