Lexicon or Corpus name | (Main) Developer(s) | Description | License | Repository | Associated Paper(s) |
---|---|---|---|---|---|
Annotating negations | van Miltenburg, Morante, Elliott | Image descriptions annotated with negation and negation type | Descriptions are CC-licensed, taken from the Flickr30K corpus, which is based on Flickr data. | GitHub | Pragmatic Factors in Image Description: The Case of Negations |
BiographyNet Enriched Biographies Corpus | Fokkens | 130,000 biographies that are enriched automatically, represented in NAF and RDF… | RDF: Creative Commons. Text and manual annotations: variety of licenses… | to appear | to appear |
The Circumstantial Event Ontology (CEO) | Segers | The Circumstantial Event Ontology is a manually constructed OWL ontology that models… | CC-BY-SA | to appear | The Circumstantial Event Ontology (CEO) |
Cornetto | Maks | | Free for academic use | Cornetto, demo | |
Dutch FrameNet | Maks | ||||
DutchSemCor | Vossen | A one-million word Dutch corpus that is fully sense-tagged with senses and domain tags… | | DutchSemCor | DutchSemCor: building a semantically annotated corpus for Dutch |
The ECB+ Corpus | Cybulska, Vossen | The ECB+ corpus is an extension to the EventCorefBank (ECB, Bejan and Harabagiu, 2010)… | | NewsReader ▷ Results ▷ Data | Using a sledgehammer to crack a nut? |
ECB+-CEO | Segers | ECB-CEO is an extension of the ECB+ corpus where the logical relation between… | | to appear | The Circumstantial Event Ontology (CEO) |
The Event and Implied Situation Ontology (ESO) | Segers | ESO is a manually constructed OWL-2 ontology which formalizes the pre-, … | CC-BY-SA | GitHub NewsReader ▷ ESO | The Event and Implied Situation Ontology |
ESO-FN-WN Mappings | Segers | ESO-FN-WN Mappings is a mapping file between ESO classes and FrameNet frames… | CC-BY-SA | GitHub NewsReader ▷ ESO | The Event and Implied Situation Ontology |
The Gun Violence Corpus (GVC) | Vossen, Postma, Ilievski, Segers | GVC contains event coreference annotation for 510 documents from the gun violence domain. It was created following our data-to-text method, mostly as part of the development of… | CC-BY-SA | GitHub | Don’t Annotate, but Validate: a Data-to-Text Method for Capturing Event Data |
MEANTIME-ESO Corpus | Segers | The MEANTIME-ESO Corpus is developed for the evaluation of the ESO Ontology. For this, 120 articles… | CC-BY-SA | GitHub NewsReader ▷ ESO | The Event and Implied Situation Ontology |
Event StoryLine Corpus (ESC) | Caselli | The Event StoryLine Corpus (ESC) is a manually annotated corpus of documents extracted from the ECB+… | CC-BY-SA | v0.9 GitHub CLTL ▷ EventStoryLine | The Event StoryLine Corpus |
Open Dutch WordNet | Postma | Open Dutch WordNet is a Dutch lexical semantic database. It was created by … | CC BY-SA 4.0 | Open Dutch WordNet | Open Dutch WordNet |
Referentiebestand Nederland (RBN) | Maks, Martin, van der Vliet | 50,000 frequent Dutch words annotated with linguistic information | Incorporated in Cornetto | INL/Lexica. RBN Online | |
SemEval Long Tail QA Task | Ilievski, Postma | We propose a ‘referential quantification’ task that requires systems to establish the meaning… | | to appear | |
Stereotypes | Fokkens | Collection of small descriptions automatically extracted from text (in CSV). It comes with… | Texts are under copyright. Annotations… | to appear | |
The Vaccination Corpus | Morante, van Son | This dataset contains online documents around the topic of vaccinations. The set contains news articles… | | to appear | |
The VU Sound Corpus | van Miltenburg, Timmermans, Aroyo | Collection of crowd-sourced annotations for the Freesound database | Sounds are CC-licensed | GitHub | The VU Sound Corpus: Adding… |