2.4 Word Embeddings

As we have seen in the previous section, while BOW and TF-IDF can create numeric vectors for documents, the length of the vectors is usually quite large as it depends on the size of the vocabulary (number of n-grams).

Could we assign a vector in “low dimension”, encoding the “semantic” of a word (or document)?