POMDPs and Policy Gradients