Skip to content

Latest commit

 

History

History
4 lines (4 loc) · 205 Bytes

README.md

File metadata and controls

4 lines (4 loc) · 205 Bytes

Text-preprocessing

A tutorial for text preprocessing with NLTK

  • basic text processing with python
  • preprocessing text with NLTK: tokenization, stop words removal, stemming (porter stemmer), Word2Vec