DataSci 210: Natural Language Processing for Data Science
End: 3 weeks, 6 hrs. per week
Description:. In this 3 week course, students will learn about the text preprocessing, text analysis, and gain a thorough understanding of the field through the use of Python’s nltk, Spacy, and gensim modules.
Prerequisites: Python 101, DataSci 200, and DataSci 201, or proficiency in Python and data acquisition and visualization techniques. If these courses were not completed, students are expected to complete Introductory Python on ByteDev, Mathematics for Data Science, as well as the additional assessment available on ByteDev.
- Word Tagging
- Textual Data Cleaning
- Sentiment Analysis
- Information Extraction
- Entity and Relation Extraction
- Topic Modeling and Summarization
- Tools & Technologies: