I am curious about the motivation for [this step](https://github.com/atulkum/pointer_summarizer/blob/5eb298a66b4d3b53adb854ba9b5c82580cf2fa1e/training_ptr_gen/model.py#L151), as I couldn't anything about it in See et al., though I might have just missed it.