Natural-Language Video Description With Deep Recurrent Neural Networks