User Tools

Site Tools


nlp:patent_domain_nlp

This is an old revision of the document!


Patent Domain NLP

NLP in the patent domain. Overlaps with Legal Domain NLP and Scientific Text Processing.

Papers

Datasets

  • EuroPat: Sentence-Aligned European Patent Corpus: website 2011 paper
  • ParaPat: paper Large parallel corpus of patent abstract (68M sentences total)
  • CMUmine: paper dataset backup copy Patent application dataset. Contains patent claims section, used for automatic construction of patent claims. Warning: They seem to be unaware of the EuroPat dataset, as well as the large amount of prior work on NLP for patents.

Workshops

nlp/patent_domain_nlp.1654072292.txt.gz · Last modified: 2023/06/15 07:36 (external edit)

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki