Context Navigation

Changes between Version 10 and Version 11 of WikiStart

Timestamp:: 03/18/09 09:15:32 (17 years ago)
Author:: horak
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

WikiStart

-                      v10
+                      v11
+''' This site is under heavy construction! '''
+= Intelligent Document Information Extraction =
+= iDocument: Intelligent Document Information Extraction =
 [[Image(WikiStart:logo.png, width=100px, right)]]
+iDocument is an [http://en.wikipedia.org/wiki/Information_extraction| information extraction] system. It uses existing background knowledge such as business databases, concept maps, or other information models for improving extraction results.
+iDocument is a  generic  ontology-based information  extraction  (OBIE)  system  that  uses  ontological  background  knowledge  in
+terms  of  existing  vocabularies  and  instance  knowledge.  iDocument  uses  existing knowledge  from personal or business domains (e.g. relational databases, concept maps, taxonomies,  etc.).  Following  Semantic  Web,  iDocument  exchanges  and  extracts
+knowledge based on  the W3C  standard RDF. Existing knowledge  is used as  input  in a serial  IE  pipeline  of  extraction  tasks  for  extracting  possible  answers  concerning  user specified  ad  hoc  queries  on  a  given  text  collection.
+It extracts entities (eg.,email adresses, names, URLs), instances, and semantic relations between instances with respect to used background knowledge. iDocument uses ontological knowledge representation techniques for interpreting background knowledge.
+= Unique Feature =
+[[Image(WikiStart:scenario.png, center)]]
+ * Domain ontologies are exchangeable as long as they are written in RDFS.
+ * The MOBIE mapping vocabulary allows to define relevant classes, attributes and relations for extraction purpose .
+ * Existing instance knowledge is reused for information extraction purpose
+ * Extracted results are formalized in the same RDF scheme as the input domain ontology.
+ * SPARQL queries are used for defining extraction templates.
+ * All intermediate and final extraction results are weighted hypothesis according to Dempster –Shafer’s belief function.
+= Table of Contents =
+ * [http://idocument.opendfki.de/ System Summary]
+ * [http://idocument.opendfki.de/wiki/Evaluation/Corpus/OlympicGames2004 Olympic Corpus and Annotation Scheme (OCAS)]
+ * [http://idocument.opendfki.de/ Publications]
+ * [http://idocument.opendfki.de/ Reference Projects]
 For further information please contact [mailto:benjamin.adrian@dfki.de].