Louis Faury

Louis Faury

Machine Learning Researcher



I am a researcher at Criteo, focusing on bandit algorithms and reinforcement learning.

I received my PhD in 2021; I was lucky to be supervised by Olivier Fercoq and Marc Abeille. Recently, I mainly worked on understanding how to learn from structured bandit feedback in non-linear (and non-stationary) environments.

Download my resumé.

  • Bandit Algorithms
  • Reinforcement Learning
  • Machine Learning
  • PhD in Machine Learning, 2021

    Télécom Paris

  • MSc in Artificial Intelligence, 2016-2018

    Ecole Polytechnique Fédérale de Lausanne (EPFL)

  • MSc in Applied Mathematics, 2013-2018

    Ecole Polytechnique


Machine Learning Researcher
Mar 2018 – Present Paris
  • Research on bandit algorithms and reinforcement learning. Focus on the design of new algorithms with strong theoretical guarantees.
  • Internal consulting for engineering teams on production projects.
Research Intern
Sep 2017 – Mar 2018 Paris

Development of a deep reinforcement learning approach for learning hyper-parameter free optimizers for ML tasks.

🌟 Received maximal mark from EPFL.

Research Intern & AI Consultant
Mar 2016 – Sep 2017 Paris
  • Development of algorithms for the synchronization of mobile robot fleets in warehouses.
  • Design, development and implementation of robust and embedded control algorithms for wheeled robots.

🌟 Rewarded as best internship for industrial use by the Prix de la Fondation de l’École Polytechnique.


Logistic Bandits
Experimenting around Logistic Bandits.
Improving Evolutionary Strategies with Generative Neural Networks
Generative Neural Networks for Differentiable ES