Thiago D. Simão

PostDoc researcher at Radboud University Nijmegen

The Netherlands

I am Thiago. Currently, I am a PostDoc researcher with the Department of Software Science (SWS) at Radboud University Nijmegen, advised by Dr. Nils Jansen. Previously, I was a Ph.D. candidate in the Algorithmics Group at Delft University of Technology, advised by Dr. Matthijs Spaan. For more details, check out my biography or my CV.

Research Interests: The motivation for my research revolves around making AI techniques more reliable, to enable their deployment in real-world applications. I focus on developing AI algorithms for scenarios with constrained interactions with an unknown environment. I am currently interested in safe reinforcement learning, a research topic concerned with problems where a minimum performance must be guaranteed and catastrophic events must be avoided.

Academic Service:

  • Organizing committee of the BeNeRL Workshop 2018.
  • Local organizing committee of the 28th ICAPS.
  • PC member for NeurIPS22, ICML22, ICAPS22, and AAAI21.
  • Reviewer for JAAMAS, ICRA, AAAI and BRACIS.

Besides my professional activities, I like to run, play board games, listen to music, and read.

news

2022

March

  • Talk at the ADML meetup about Ensuring Safety for Reinforcement Learning.

2020

September

  • I am serving as a PC member for AAAI-21.

2019

March

  • In Hilversum, presenting our work on reinforcement learning at the ICT.Open-19.

selected publications

  1. Safety-constrained reinforcement learning with a distributional safety critic
    Machine Learning 2022
  2. AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training
    Simão, Thiago D., Jansen, Nils, and Spaan, Matthijs T. J.
    In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS) 2021
  3. Safe Policy Improvement with an Estimated Baseline Policy
    Simão, Thiago D., Laroche, Romain, and Tachet des Combes, Rémi
    In Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS) 2020
  4. Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
    Simão, Thiago D., and Spaan, Matthijs T. J.
    In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence 2019