Nicolas DUFOUR profile picture

Nicolas DUFOUR

PhD student in Computer Vision working on conditional generative models

About me

I’m a third year Computer Vision PhD student at IMAGINE (ENPC) and VISTA (Ecole Polytechnique) labs. My research is focused on conditional generative models. I’m supervised by David Picard and Vicky Kalogeiton. Before my PhD, I graduated from the MVA master (Mathematiques, Vision et Apprentissage) at ENS Paris Saclay and I got my engineering degree from Telecom SudParis following the MSA specialization (Mathematics and Statistical Modeling). I’ve also done an internship at Meta, where i worked on Model Based Reinforcement Learning.

During my PhD, I have worked on GANs and diffusion models. I also have a high interest in the current development of the field of generative models around Large Language Models.

News

Sep, 2024

Attended ECCV 2024 to present our work E.T.

Jul, 2024

Attended the ICVSS 2024 Summer School!

Jun, 2024

Attended CVPR 2024 to present our work CAD and OSV-5M.

Apr, 2024

Our work CAD got an Highlight (top 11%) at CVPR 2024!

Publications

2024

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation thumbnail
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation
Arxiv Preprint
PoM: Efficient Image and Video Generation with the Polynomial Mixer thumbnail
PoM: Efficient Image and Video Generation with the Polynomial Mixer
David Picard, Nicolas Dufour
Arxiv Preprint
Analysis of Classifier-Free Guidance Weight Schedulers thumbnail
Analysis of Classifier-Free Guidance Weight Schedulers
TMLR
E.T. the Exceptional Trajectories: Text-to-Camera-Trajectory Generation with Character Awareness. thumbnail
E.T. the Exceptional Trajectories: Text-to-Camera-Trajectory Generation with Character Awareness.
ECCV 2024
Don’t drop your samples! Coherence-aware training benefits Conditional diffusion thumbnail
Don’t drop your samples! Coherence-aware training benefits Conditional diffusion
CVPR 2024 (Highlight)
OpenStreetView-5M: The Many Roads to Global Visual Geolocation thumbnail
OpenStreetView-5M: The Many Roads to Global Visual Geolocation
Guillaume Astruc*, Nicolas Dufour*, Ioannis Siglidis*, Constantin Aronssohn, Nacim Bouia, Stephanie Fu, Romain Loiseau, Van Nguyen Nguyen, Charles Raude, Elliot Vincent, Lintao XU, Hongyu Zhou, Loic Landrieu
CVPR 2024

2023

Machine Learning for Brain Disorders: Transformers and Visual Transformers thumbnail
Machine Learning for Brain Disorders: Transformers and Visual Transformers
Robin Courant, Maika Edberg, Nicolas Dufour, Vicky Kalogeiton
Springer, Machine Learning for Brain Disorders

2022

SCAM! Transferring humans between images with Semantic Cross Attention Modulation thumbnail
SCAM! Transferring humans between images with Semantic Cross Attention Modulation
ECCV 2022

Teaching

Apr 2024 - Jun 2024

Teaching Assistant for INF473V - Modal d'informatique - Deep Learning in Computer Vision at Ecole Polytechnique

  • Helped supervise the practical sessions of the course.
  • Created a Kaggle competition for the students end of the course project. The goal was to create a classifier but we only provided them with the val and test data. They needed to create their own training data with pretrained diffusion models. The goal was to classify among 37 different cheeses.

Feb 2023 - Jun 2023

Teaching Assistant for INF473V - Modal d'informatique - Deep Learning in Computer Vision at Ecole Polytechnique

  • Helped supervise the practical sessions of the course.
  • Built a new practical session on Transformers.
  • Created a Kaggle competition for the students end of the course project. The goal was to classify synthetic images in a weakly supervised setting.

Nov 2022

Teaching Assistant for INF573 - Image Analysis and Computer Vision at Ecole Polytechnique

  • Helped supervise students projects.

Open Source

Contributions

pytorch / rl
GitHub Repo stars
Lightning-AI / metrics
GitHub Repo stars
huggingface / diffusers
GitHub Repo stars

Projects

nicolas-dufour / SCAM
GitHub Repo stars
gastruc / osv5m
GitHub Repo stars
nicolas-dufour / cad
GitHub Repo stars
robincourant / DIRECTOR
GitHub Repo stars

Miscellaneous

Talks

2024

  • Talk at TUM: Conditional Generative Models (September 19, 2024)

2023

  • IMAGINE Seminar: LLM advances in 2023 (April 12, 2023)
  • IMAGINE Seminar: Recent advances in diffusion models (April 12, 2023)

2022

  • IMAGINE Seminar: Presentation of the Perceiver papers (April 20, 2022)
  • GeoVic Seminar: SCAM! Transferring humans between images with Semantic Cross Attention Modulation (February 10, 2022)
  • IMAGINE Seminar: Tutorial on GANs (January 05, 2022)

2021

  • IMAGINE Seminar: SCAM! Transferring humans between images with Semantic Cross Attention Modulation (October 06, 2021)

Reviewer

Awards

Others

  • I have co-organized the first edition of IMAGINE Hackaton. The topic was geolocation (July 26, 2023)
  • I'm co-organizing the IMAGINE reading group for 2023-2024 (June 01, 2023)