Detecting partial observability in decision processes and improving value with memory
Published:
We explore a method aimed at reliably detecting aliasing in POMDPs and using this signal to search for memory functions that allow for finding higher performing policies–all without previous knowledge of the state-space.