

GitHub - yandexdataschool/nlp_course: YSDA course in Natural Language Processing
source link: https://github.com/yandexdataschool/nlp_course
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

README.md
YSDA Natural Language Processing course
- Lecture and seminar materials for each week are in ./week* folders
- Create cloud jupyter session from this repo -
- Telegram chat room (russian).
- YSDA homework deadlines are listed in Anytask course page.
- Any technical issues, ideas, bugs in course materials, contribution ideas - add an issue
- Installing libraries and troubleshooting: this thread.
Syllabus
-
week01 Embeddings
- Lecture: Word embeddings. Distributional semantics, LSA, Word2Vec, GloVe. Why and when we need them.
- Seminar: Playing with word and sentence embeddings.
-
week02 Text classification
- Lecture: Text classification. Classical approaches for text representation: BOW, TF-IDF. Neural approaches: embeddings, convolutions, RNNs
- Seminar: Salary prediction with convolutional neural networks; explaining network predictions.
-
week03 Language Models
- Lecture: Language models: N-gram and neural approaches; visualizing trained models
- Seminar: Generating ArXiv papers with language models
-
week04 Seq2seq/Attention
- Lecture: Seq2seq: encoder-decoder framework. Attention: Bahdanau model. Self-attention, Transformer. Pointer networks. Attention for analysis.
- Seminar: Machine translation of hotel and hostel descriptions
-
week05 Structured Learning
- Lecture: Structured Learning: structured perceptron, structured prediction, dynamic oracles, RL basics.
- Seminar: POS tagging
-
week06 Expectation-Maximization
- Lecture: Expectation-Maximization and Word Alignment Models
- Seminar: Implementing expectation maximizaiton
-
week07 Machine translation
- Lecture: Machine Translation: a review of the key ideas from PBMT, the application specific ideas that have developed in NMT over the past 3 years and some of the open problems in this area.
- Seminar: presentations by students
Contributors & course staff
Course materials and teaching performed by
- Elena Voita - course admin, lectures, seminars, homeworks
- Boris Kovarsky - lectures, seminars, homeworks
- David Talbot - lectures, seminars, homeworks
- Sergey Gubanov - lectures, seminars, homeworks
- Just Heuristic - lectures, seminars, homeworks
Recommend
-
160
spaCy: Industrial-strength NLP spaCy is a library for advanced Natural Language Processing in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products. sp...
-
58
SpaceX Falcon Heavy Launch — Credit SpaceX
-
44
Natural Language Processing - NLP bot builder - NEXT
-
42
Natural Language Processing is Fun! How computers understand Human Language Computers are great at working with structured data like spreadsheets and database tabl...
-
47
A new R package called crfsuite supported by BNOSAC landed safely on CRAN last week. The crfsuite package (
-
68
README.md Urduhack: NLP library for ( ?? ) Urdu language
-
36
Natural language processing (NLP), the technology that powers all the chatbots, voice assistants, predictive text, and other speech/text applications that permeate our lives, has evolved significantly in the last...
-
47
Logistics Lectures: are on Tuesday/Thursday 4:30-5:50pm PST in NVIDIA Auditorium . Lecture videos for enrolled students:
-
32
Roadmap to Natural Language Processing (NLP) An introduction to some of the most common techniques and models used in Natural Language Processing (NLP) ...
-
8
winkNLP Developer friendly Natural Language Processing winkNLP is a JavaScript library fo...
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK