Documente Academic
Documente Profesional
Documente Cultură
1. Process the Human Languages Such as English and other Indian Languages.
2. Develop Computational Methods for Real World Applications like Machine
Translation.
3. Implement language technology experiment step by step
4. For a given language technological problem consider which computer programs will be
suitable, install them and apply them to linguistic data.
5. Explain the interaction between rule based and probabilistic methods in language
technology.
SLOs 2,6,7,9
Module Topics L Hrs SLO
INTRODUCTION TO NLP
Introduction to various levels of natural language processing, Ambiguities and
1 computational challenges in processing various natural languages. Introduction to Real 3 2
life applications of NLP such as spell and grammar checkers, information extraction,
question answering, and machine translation.
TEXT PROCESSING
2 Character Encoding, Word Segmentation, Sentence Segmentation, Introduction to 6 7
Corpora, Corpora Analysis.
MORPHOLOGY
3 Inflectional and Derivation Morphology, Morphological Analysis and Generation 6 7
using finite state transducers.
LEXICAL SYNTAX
4 Introduction to word types, POS Tagging, Maximum Entropy Models for POS tagging, 6 7
Multi-word Expressions.
LANGUAGE MODELING
5 The role of language models. Simple N-gram models. Estimating parameters and 6 7
smoothing. Evaluating language models.
SYNTAX & SEMANTICS
Introduction to phrases, clauses and sentence structure, Shallow Parsing and Chunking,
6 Shallow Parsing with Conditional Random Fields (CRF), Lexical Semantics, Word 10 7
Sense Disambiguation, WordNet, Thematic Roles, Semantic Role Labelling with
CRFs.
7 APPLICATIONS OF NLP 6 6,7,9
NL Interfaces, Text Summarization, Sentiment Analysis, Machine Translation,
Question answering.
RECENT TRENDS
8 Recent Trends in NLP 2 7,17
1. Machine Translation.
2. Sentiment Analysis on Twitter Data.
3. Article Recommendations for News Feed.
4. Distinguishing Opinion from News.
5. Predicting Sentiment from Rotten Tomatoes Movie Reviews.
6. Information Extraction from Collection of Resumes.
7. Text Summarization.
8. Question Answering System.
9. Part of Speech Tagging using a Hidden Markov Model.
10. Semantics-based Text Mining of Biomedical Concepts in Scientific Publications
Text Books
1. Daniel Jurafsky and James H. Martin “Speech and Language Processing”, 3rd edition, Prentice
Hall, 2009.
Reference Books
1. Chris Manning and HinrichSchütze, “Foundations of Statistical Natural Language Processing”,
2nd edition, MITPress Cambridge, MA, 2003.
2. NitinIndurkhya, Fred J. Damerau “Handbook of Natural Language Processing”, Second Edition,
CRC Press, 2010.
3. James Allen “Natural Language Understanding”, Pearson Publication 8th Edition. 2012.
This course is a
Elective Course.
Suitable from 6th semester onwards.
Knowledge on Theory of Computation is essential.
This Course is designed with 165 minutes of in-classroom sessions per week, 60 minutes of
video/reading instructional material per week, as well as 200 minutes of non-contact time spent
on implementing course related project. Generally this course should have the combination of
lectures, in-class discussion, case studies, guest-lectures, mandatory off-class reading material,
quizzes.
How are students assessed?
[What type, and number, of assignments are students are expected to do? (papers, problem sets,
programming projects, etc.). How long do you expect students to spend on completing assessed
work?]
Additional weightage will be given based on their rank in crowd sourced projects/ Kaggle
like competitions.
Students can earn additional weightage based on certificate of completion of a related
MOOC course.
Additional topics
[List notable topics covered in the course that you do not find in the CS2013 Body of
Knowledge]