NLP
Resources
CMU LTI Low Resource NLP Bootcamp 2020: This is a page for a low-resource natural language and speech processing bootcamp held by the Carnegie Mellon University Language Technologies Institute in May 2020.
This pandect (πανδέκτης is Ancient Greek for encyclopedia) was created to help you find almost anything related to Natural Language Processing that is available online.
Website (developed by@seb_ruder) provides useful information and resources for tracking the progress of many different kinds of common NLP tasks. http://nlpprogress.com
This website provides resources, like paper and code links, about state-of-the-art NLP methods. It also includes methods used in general in ML. You can also find information about tasks where the different methods are being used. https://paperswithcode.com
book recommendations for getting started with NLP. A few books shared in the list provide real-world use cases of different NLP methods and applications.
Getting an initial high-level understanding of different NLP tasks and applications is key. Survey papers help a lot. This repo contains a list of NLP survey papers for getting a bit more exposure to a wide range of NLP tasks. https://github.com/NiuTrans/ABigS
This website provides information about different NLP datasets and the tasks for which they are used. https://datasets.quantumstat.com
Transformers
Natural Language Processing: the age of Transformers https://blog.scaleway.com/2019/building-a-machine-reading-comprehension-system-using-the-latest-advances-in-deep-learning-for-nlp/
Transformers from scratch http://www.peterbloem.nl/blog/transformers
The illustrated transformer, Jay Allamar. http://jalammar.github.io/illustrated-transformer/
Non-English resources and compendiums
NLP Resources for Bahasa Indonesian [GitHub ~100 stars]
Indic NLP Catalog [GitHub ~150 stars]
Pre-trained language models for Vietnamese [GitHub ~200 stars]
Natural Language Toolkit for Indic Languages (iNLTK) [GitHub ~600 stars]
Indic NLP Library [GitHub ~300 stars]
Last updated