Thiago D. Simão

PostDoc researcher at Radboud University Nijmegen


The Netherlands

I am Thiago. Currently, I am a PostDoc researcher with the Department of Software Science (SWS) at Radboud University Nijmegen, advised by Dr. Nils Jansen. Previously, I was a Ph.D. candidate in the Algorithmics Group at Delft University of Technology, advised by Dr. Matthijs Spaan. For more details, check out my biography or my CV.

Research Interests: My research aims to make AI techniques more reliable, to enable their deployment in real-world applications. I focus on developing AI algorithms for scenarios with constrained interactions with an unknown environment. I am currently interested in safe reinforcement learning, a research topic concerned with problems where a minimum performance must be guaranteed and catastrophic events must be avoided.

Academic Service:

  • Organizing committee of the BeNeRL Workshop 2018.
  • Local organizing committee of the 28th ICAPS.
  • PC member for NeurIPS-22, ICML-22, ICAPS-22, AAAI-21.
  • Reviewer for JAAMAS, ICRA, AAAI and BRACIS.

Besides my professional activities, I like to run, play board games, listen to music, and read.

news

2022

March

  • Talk at the ADML meetup about Ensuring Safety for Reinforcement Learning.

2020

September

  • I am serving as a PC member for AAAI-21.

2019

March

  • In Hilversum, presenting our work on reinforcement learning at ICT.Open-19.

selected publications

  1. Safe Policy Improvement for POMDPs via Finite-State Controllers
    Simão, Thiago D., Suilen, Marnix, and Jansen, Nils
    In Proceedings of the AAAI Conference on Artificial Intelligence 2023
  2. AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training
    Simão, Thiago D., Jansen, Nils, and Spaan, Matthijs T. J.
    In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS) 2021
  3. Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
    Simão, Thiago D., and Spaan, Matthijs T. J.
    In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence 2019