POND-HINDSIGHT: APPLYING HINDSIGHT OPTIMIZATION TO PARTIALLY-OBSERVABLE MARKOV DECISION PROCESSES