Markus Peschl
Markus Peschl
Home
Blog
Light
Dark
Automatic
Posts
Constrained versus Multi-Objective Reinforcement Learning. A Fundamental Difference?
Introduction If you have ever taken a class in mathematical optimization, you are probably familiar with the notion of a constrained optimization problem. If not, then don’t worry, because the idea is really simple.
Apr 17, 2021
12 min read
ADRQN-PyTorch: A Torch implementation of the action-specific deep recurrent Q network.
Deep Q networks have proven to be an easy to implement method for solving control problems in both continuous or large discrete state spaces. The action-specific deep recurrent Q network (ADRQN) introduces an intermediate LSTM layer for remembering action-observation pairs when dealing with partial observability. This post explores a compact PyTorch implementation of the ADRQN including small scale experiments on classical control tasks.
Nov 6, 2020
10 min read
Making Sense of Particle Filtering for POMDPs
Particle Filters are a useful technique and can be found in many newly published papers about (model-based) POMDP algorithms. In this post, we explore the mathematical framework of filtering and discuss its implications for POMDPs with a simple example.
Oct 20, 2020
8 min read
Deep Image Prior: Independently Reproduced in a few lines of PyTorch.
Joint work with Cecilia Casolo and Mats van Tongeren. Published to
https://reproducedpapers.org
, a TU Delft repository for independent paper reproductions in Computer Vision and Deep Learning.
Oct 17, 2020
11 min read
Cite
×