I'm a postdoc at MILA supervised by Yoshua Bengio.

My interests in machine learning include alignment, LLM honesty, health applications, and data selection for large-scale deep learning. Before joining MILA, I was a PhD student at the University of Oxford under Yarin Gal. I have also worked on learning human preferences and game-theoretic machine learning with David Duvenaud and Roger Grosse at Toronto’s Vector Institute, with the Center for Human-Compatible AI at UC Berkeley, and with the Governance of AI group at Oxford.

I studied machine learning (UCL), maths (Amsterdam) and Future Planet Studies (Amsterdam). My PhD was co-funded by Oxford and DeepMind.

Contact me

soeren.mindermann ατ gmail.com

Publications as first author

*equal contribution to first authorship
JM Brauner*, S Mindermann*, M Sharma*, D Johnston, J Salvatier, ...
Science, 2021
Sören Mindermann*, Muhammed Razzak*, Winnie Xu*, Andreas Kirsch, Mrinank Sharma, Aidan Gomez, Sebastian Farquhar, Jan Brauner, Yarin Gal
ICML, 2022
M Sharma*, S Mindermann*, C Rogers-Smith, G Leech, B Snodin, J Ahuja, ...
Nature Communications, 2021
S Mindermann*, R Shah*, A Gleave, D Hadfield-Menell
ICML workshop Goals in Reinforcement Learning, 2018
A Jesson*, S Mindermann*, U Shalit, Y Gal
NeurIPS, 2020
S Mishra*, S Mindermann*, M Sharma*, C Whittaker*, T Mellan, T Wilton, ...
The Lancet: EClinicalMedicine, 2021
M Sharma*, S Mindermann*, J Brauner*, G Leech, A Stephenson, ...
NeurIPS (Spotlight talk), 2020


Publications as senior author

*equal contribution to senior authorship
Yoshua Bengio, Geoffrey Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atılım Güneş Baydin, Sheila McIlraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca Dragan, Philip Torr, Stuart Russell, Daniel Kahneman, Jan Brauner*, Sören Mindermann*
(I'm not as 'senior' as the other authors here ;) but this is based on leading the project)
In review, 2024
G Leech, C Rogers-Smith, J Sandbrink, B Snodin, R Zinkov, B Rader, J Brownstein, Y Gal, S Bhatt*, M Sharma*, S Mindermann*, J Brauner*, L Aitchison*
Proceedings of the National Academy of Sciences (PNAS), 2022
G Altman, J Ahuja, JT Monrad, G Dhaliwal, C Rogers-Smith, G Leech, B Snodin, JB Sandbrink, L Finnveden, AJ Norman, SB Oehm, JF Sandkühler, J Kulveit, S Flaxman, Y Gal, S Mishra, S Bhatt, M Sharma*, S Mindermann*, J Brauner*
Nature Scientific Data, 2022


Publications as co-author

R Ngo, L Chan, S Mindermann
International Conference on Learning Representations, 2024
Evan Hubinger, Carson Denison, (many others) ... Sören Mindermann, Ryan Greenblatt, Buck Shlegeris, Nicholas Schiefer, Ethan Perez
arXiv, 2024
L Pacchiardi, AJ Chan, S Mindermann, I Moscovitz, AY Pan, Y Gal, O Evans, J Brauner
International Conference on Learning Representations, 2024
S Kundu, Y Bai, (many others) ... S Mindermann, N Joseph, S McCandlish, J Kaplan
arXiv, 2024
A Jesson, S Mindermann, Y Gal, U Shalit
International Conference on Machine Learning, 2021
G Meyerowitz-Katz, S Bhatt, O Ratmann, JM Brauner, S Flaxman, S Mishra, M Sharma, S Mindermann, V Bradley, M Vollmer, L Merone, G Yamey
BMJ Global Health, 2021
Tomáš Gavenčiak, Joshua Teperowski Monrad, Gavin Leech, Mrinank Sharma, Sören Mindermann, Jan Markus Brauner, Samir Bhatt, Jan Kulveit
PLOS Computational Biology, 2022

Policy impact

TV and newspaper interviews

Invited talks