publications

Peer-reviewed publications.

2024

  1. Factored Online Planning in Many-Agent POMDPs
    Galesloot, Maris, Simão, Thiago D., Junges, Sebastian, and Jansen, Nils
    In AAAI 2024
  2. Robust Active Measuring under Model Uncertainty
    Krale, Merlijn, Simão, Thiago D., Tumova, Jana, and Jansen, Nils
    In AAAI 2024

2023

  1. Reinforcement Learning by Guided Safe Exploration
    In ECAI 2023
  2. Risk-aware Curriculum Generation for Heavy-tailed Task Distributions
    Koprulu, Cevahir, Simão, Thiago D., Jansen, Nils, and Topcu, Ufuk
    In UAI 2023
  3. Scalable Safe Policy Improvement via Monte Carlo Tree Search
    In ICML 2023
  4. More for Less: Safe Policy Improvement with Stronger Performance Guarantees
    In IJCAI 2023
  5. Recursive Small-Step Multi-Agent A* for Dec-POMDPs
    Koops, Wietze, Jansen, Nils, Junges, Sebastian, and Simão, Thiago D.
    In IJCAI 2023
  6. Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring
    Krale, Merlijn, Simão, Thiago D., and Jansen, Nils
    In ICAPS 2023
  7. Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
    Hogewind, Yannick, Simão, Thiago D., Kachman, Tal, and Jansen, Nils
    In ICLR 2023
  8. Safe Policy Improvement for POMDPs via Finite-State Controllers
    Simão, Thiago D., Suilen, Marnix, and Jansen, Nils
    In Proceedings of the AAAI Conference on Artificial Intelligence 2023
  9. Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking
    Gross, Dennis, Simão, Thiago D., Jansen, Nils, and Pérez, Guillermo A.
    In ICAART 2023
  10. Decision-making under uncertainty: beyond probabilities. Challenges and Perspectives
    Badings, Thom, Simão, Thiago D., Suilen, Marnix, and Jansen, Nils
    STTT 2023
  11. Safe Online and Offline Reinforcement Learning
    Simão, Thiago D.
    Ph.D. thesis, Delft University of Technology 2023

2022

  1. Robust Anytime Learning of Markov Decision Processes
    Suilen, Marnix, Simão, Thiago D., Parker, David, and Jansen, Nils
    In Advances in Neural Information Processing Systems 2022
  2. Safety-constrained reinforcement learning with a distributional safety critic
    Machine Learning 2022
  3. A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning
    In 25th IEEE International Conference on Intelligent Transportation Systems (ITSC) 2022

2021

  1. AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training
    Simão, Thiago D., Jansen, Nils, and Spaan, Matthijs T. J.
    In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS) 2021
  2. WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
    In Proceedings of the AAAI Conference on Artificial Intelligence 2021

2020

  1. Safe Policy Improvement with an Estimated Baseline Policy
    Simão, Thiago D., Laroche, Romain, and Tachet des Combes, Rémi
    In Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS) 2020

2019

  1. Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments
    Simão, Thiago D.
    In Proceedings of the 28th International Joint Conference on Artificial Intelligence, IJCAI-19 2019
  2. Structure Learning for Safe Policy Improvement
    Simão, Thiago D., and Spaan, Matthijs T. J.
    In Proceedings of the 28th International Joint Conference on Artificial Intelligence, IJCAI-19 2019
  3. Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
    Simão, Thiago D., and Spaan, Matthijs T. J.
    In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence 2019

2018

  1. An Empirical Evaluation of Safe Policy Improvement in Factored Environments
    Simão, Thiago D., and Spaan, Matthijs T. J.
    In ICML workshop 2018
  2. When a Robot Reaches Out for Human Help
    In Advances in Artificial Intelligence - IBERAMIA 2018

2016

  1. Heuristics for Dead-Ends Detection in Probabilistic Planning
    In ENIAC 2016

2015

  1. Probabilistic Planning with Dead-Ends
    Simão, Thiago D., Nunes de Barros, Leliane, and Silva, Felipe L.
    In ENIAC 2015

2011

  1. Development of 3D Games for Distance Education
    Leitão, Ulisses A., Simão, Thiago D., and Neves, Jefferson A.
    In ESUD 2011