This part of the course is based on this research paper.
What is a Language Model (LM)?
An LM models the generative likelihood of word sequences, so that it can predict the probabilities of future (or missing) tokens.
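To make the definition concrete, here is a minimal sketch of one of the simplest possible LMs, a bigram model that estimates the probability of the next token from counts of adjacent token pairs. The toy corpus, the `<s>` and `</s>` boundary markers, and all function names are assumptions chosen for illustration; they do not come from the paper.

```python
from collections import Counter, defaultdict

def train_bigram(corpus):
    """Count how often each token follows each previous token."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        # Hypothetical boundary markers so the model can start and end a sequence.
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts

def next_token_probs(counts, prev):
    """Estimate P(next | prev) as a relative frequency (no smoothing)."""
    followers = counts.get(prev, Counter())
    total = sum(followers.values())
    if total == 0:
        return {}
    return {tok: c / total for tok, c in followers.items()}

def sequence_prob(counts, sentence):
    """Likelihood of a whole sequence as a product of bigram probabilities."""
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    prob = 1.0
    for prev, nxt in zip(tokens, tokens[1:]):
        prob *= next_token_probs(counts, prev).get(nxt, 0.0)
    return prob

# Toy corpus, assumed for illustration only.
toy_corpus = ["the cat sat", "the cat ran", "the dog sat"]
counts = train_bigram(toy_corpus)
probs = next_token_probs(counts, "the")
```

Here `probs` gives the predicted distribution over the token that follows "the", and `sequence_prob` scores a full sentence. Because the sketch uses no smoothing, any sentence containing an unseen pair of tokens gets probability zero, which is one limitation that more capable LMs are designed to avoid.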