NATURAL LANGUAGE PROCESSING TASKS

SIDDHARTHA VARMA, 18BCE0865

Task 1

  • learnt basic string processing with python
  • learnt about nltk

Task 2

  • learnt about COCA
  • saw mentions of a celebrity
  • analysed the chart of mentions of the celebrity

Task 3

  • learnt about corpora
  • downloaded nltk
  • explored brown corpus
  • explored inaugural corpus

Task 4

  • downloaded a text from ANC
  • analysed the text
  • explored project gutenberg
  • basic tokenisation

Task 5

  • what are stopwords
  • difference between word and sentence tokenizaton
  • tweet tokenization

Task 6

  • explored the differences between stemming and lemmatization
  • porter stemmer
  • lancaster stemmer
  • regexp stemmer
  • wnowball stemmer
  • wordnet lemmatizer

Task 7

  • tokenization
  • pos tagging
  • punkt
  • parse tree

Task 8

  • gender features
  • classification
  • naive bayes classification

Extra

  • NLP with tensorflow