Learning Nonregular Languages: A Comparison of Simple Recurrent Networks and LSTM