====== Software ======

===== General NLP: Parsing, NER, etc =====
  * [[https://stanfordnlp.github.io/CoreNLP/|Stanford Core NLP]]
    * [[https://corenlp.run]]
  * Stanza
    * [[https://stanfordnlp.github.io/stanza/|Main site]]
    * [[https://github.com/stanfordnlp/stanza|Github]]
    * Tutorial: [[https://colab.research.google.com/github/stanfordnlp/stanza/blob/master/demo/Stanza_CoreNLP_Interface.ipynb|Stanza: A Tutorial on the Python CoreNLP Interface]]
    * Paper: [[https://arxiv.org/pdf/2003.07082.pdf|Stanza: A Python Natural Language Processing Toolkit for Many Human Languages]]
  * [[https://www.nltk.org/|NLTK]]
  * [[https://spacy.io/|spaCy]]

===== Annotation Tools =====
  * [[https://prodi.gy|Prodigy]] (From the creators of spaCy)
  * [[http://brat.nlplab.org|brat rapid annotation tool]] [[https://github.com/nlplab/brat|github]]
  * [[https://doccano.herokuapp.com/|doccano]] [[https://github.com/doccano/doccano|github]] Nilay has used this

===== Scraping, etc =====
  * Web scraping
    * [[https://pypi.org/project/beautifulsoup4/|Beautiful Soup]]
  * Pdf processing
    * [[https://arxiv.org/pdf/2310.01206.pdf|appjsonify: An Academic Paper PDF-to-JSON Conversion Toolkit]]

===== Related Pages =====
  * **[[ml:Software|ML - Software]]** (Deep learning libraries, etc)
  * [[Transformers#Software|Transformers - Software]]
  * [[ml:PyTorch]]