DFG-Research Group Text Technological Modelling of Information

Homepage Research Group

Deutsche Forschungsgemeinschaft

HyTex-Resources

1. Software

GermaNet Viewer

GermaNet Pathfinder

GermaNet Pathfinder API

Three tools for

Download and detailed documentation: http://www.hytex.info/030_ergebnisse

GLexi

Lexical chainer with extensive preprocessing capabilities and various semantic relatedness measures included. See our publication Cramer/Finthammer, 2008, for GLexi’s performance. Please, contact Irene Cramer to get the pretty stable beta-version (currently only for windows, linux tool in preparation).

2. Data

HyTex-Core-Corpus (so called: Kernkorpus)

HyTex-Additional-Corpus (so called: Ergänzungskorpus)

HyTex-Stats-Corpus (so called: Ausbaukorpus)

Download of the raw and annotated corpora (of domain specific text documents), extensive documentation, and multiple annotation schemes: http://www.hytex.info/030_ergebnisse/020_korpus

InDeftigator-Corpus

IE corpus annotated for the automatic extraction of definitions.

Please, contact Irene Cramer to get a pre-version of this corpus with currently about 1 Mio. tokens annotated. There are also annotation guidelines (evaluated wrt. inter-annotator agreement) available.

SemRel evaluation data

Word pairs incl. human relatedness judgement for the evaluation semantic relatedness measures for German. Download: http://www.hytex.info/030_ergebnisse

3. Lexical Semantic Resources

TermNet

Lexical semantic net of domain specific terminology in various representation formats. Download, detailed documentation, and GermaNet-interface: http://www.hytex.info/040_werkstatt/030_owlmodellierung

This research is part of the cooperation with Harald Lüngen, SemDok, and Claudia Kunze/Lothar Lemnitzer, GermaNet group, Tübingen.


[close]