Efficient deep learning models for video abstraction