
Patent Domain NLP

NLP in the patent domain. Overlaps with Legal Domain NLP and Scientific Text Processing.

Papers

Datasets

  • EuroPat: Sentence-aligned European patent corpus (website, 2011 paper)
  • ParaPat: Large parallel corpus of patent abstracts, 68M sentences in total (paper)
  • CMUmine: Patent application dataset (paper, dataset, backup copy). Contains the claims sections of patents and is used for the automatic construction of patent claims. Warning: the authors seem to be unaware of the EuroPat dataset and of the large body of prior work on NLP for patents.

Workshops

nlp/patent_domain_nlp.txt · Last modified: 2023/06/15 07:36 by 127.0.0.1
