Leveraging theories of long-term memory to improve recurrent networks