1 | | Links: |
| 1 | This page provides a subset of the OCAS 2008 corpus. |
| 2 | The data can be downloaded [http://www.dfki.uni-kl.de/~adrian/2008/08/ocas/public/ocas.zip here] as zip file. |
| 3 | |
| 4 | The archive is structured as follows: |
| 5 | |
| 6 | * '''annotations''': RDF annotations about instances and facts of the ontology that were manually annotated in text. |
| 7 | * '''ontology''': The RDFS scheme and RDF instance base. It also contains a Protege 3.2 project file. |
| 8 | * '''rdf''': RDF annotations about instances and facts of the ontology that were automatically inferred by taking the manual annotations and the ontology as base. |
| 9 | * '''txt''': The text documents. Originally, these document were published by BBC and ABC. Please consider the copyright at the end of each text file. |
| 10 | |
| 11 | Please refer to this publication when using this data set. |
| 12 | |
| 13 | Grothkast, Alexander; [http://www.dfki.de/web/research/km/publications?author=beho01 Adrian, Benjamin]; [http://www.dfki.de/web/research/km/publications?author=kisc01 Schumacher, Kinga]; [http://www.dfki.de/web/research/km/publications?author=ande00 Dengel, Andreas]; Sebastian Blohm (Hrsg.); Ulf Brefeld (Hrsg.); Felix Jungermann (Hrsg.); Roman Yangarber (Hrsg.) [http://www.dfki.de/web/research/km/publications/base_view?pubid=3871 OCAS: Ontology-Based Corpus and Annotation Scheme;] Proceedings of the High-level Information Extraction Workshop 2008; |
| 14 | This paper presents strategies and lessons learned from the creation of a corpus. It suggests a gold standard for evaluating ontology-based information extraction (OBIE) systems. This OBIE gold stan... |
| 15 | |
| 16 | == Other Links: == |