In this course, we will equip you with the necessary knowledge to work with textual data!
What is textual data? It is only sometimes a sequence of words in English!
When working with textual data, it is essential to ask ourselves the following (non-exhaustive) set of questions:
- What is the alphabet and the language used in the dataset?
- Does it include any artifacts, structure, or emojis?
- What type of text is it? A paragraph, a sentence, a document?
- Does it include unknown words such as slang?
- What is the objective, what do we want to do with the text?
- Do I have the right to collect/manipulate this data?
- How do we know the model is doing the right thing?
What can we do with text data?
Where can I find some textual data?
Some NLP Terminology
Previous Section
Home
Next Section