Types of resources:
- Empirical Lexical Resources
- Cognitive and Visual Resources
- Semantic Relation and Relatedness Datasets
- Semantic Change Datasets
- Datasets of Compositionality and Figurative Language Usage
- Plausibility Datasets
- Terminology Datasets
- Information and Networking Resources
- Software
Empirical Lexical Resources
- Empirical Lexical Information from Lexicalised PCFG Parses:
Based on manually developed grammars and statistical parsers, we acquired various types of lexical information from German and English corpora.
- Resources for German: lexical acquisition from the Huge German Corpus (HGC)
- Resources for English: lexical acquisition from the British National Corpus (BNC)
- Resources for German: lexical acquisition from the Huge German Corpus (HGC)
- Subcategorisation Database for German MATE Parses
- Analogies at the morpho-syntactic and syntactico-semantic interfaces:
- Deep Semantic Analogies: evaluation benchmarks regarding semantic analogies for English and for German
- Derivation Analogies for German Particle Verbs
- Deep Semantic Analogies: evaluation benchmarks regarding semantic analogies for English and for German
Cognitive and Visual Resources
- Association Norms for German, Italian and French verbs, and German noun compounds and their constituents
- Feature Norms for German noun compounds and their constituents
- Affective Norms: abstractness, arousal, imageability and valence ratings for German and Estonian
- Bildernetle: a dataset of German noun-to-ImageNet mappings
Semantic Relation and Relatedness Datasets
- Paradigmatic Semantic Relation Database compiled via human judgement experiments
- GermaNet-based Semantic Relation Networks
- Semantic Relatedness Datasets for Vietnamese
- DURel - Annotation of Diachronic Usage Relatedness
- SURel - Annotation of Synchronic Domain-specific Meaning Shifts
Diachronic Semantic Change Datasets
- German Diachronic Metaphor Annotation Dataset
- DURel - Annotation of Diachronic Usage Relatedness
- DiaWUG: Diatopic Word Usage Graphs for Spanish
- Dataset of Grammaticalisation for German Prepositions
- Clean Corpus of Historical American English (CCOHA)
Compositionality and Figurative Language Usage
- Compositionality Ratings for German noun compounds and German particle verbs
- (Non-)Literalness Ratings for German, French and Estonian complex verbs
- Datasets for meaning components and meaning shifts of German particle verbs:
- Sentence Generation for German Particle Verb Neologisms
- Directionality in German Particle Verbs
- Source-Target Domains, Directionalities and Mappings
- Analogies in Meaning Shifts
- German Diachronic Metaphor Annotation Dataset
- Synonymous Literal and Metaphorical Expressions in Discourse
Plausibility Datasets
Terminology Datasets
- Laypeople Study on Term Annotation
- SURel - Annotation of Synchronic Meaning Shifts
- Domain-Specific Cooking Compound Termhood Annotation
- Domain-Specific Compound Difficulty Ratings
Information and Networking Resources for Computational Linguistics (mainly in German)
- Studienbibliographie Computerlinguistik und Sprachtechnologie:
Referenz-Sammlung zu Grundlagen, Methoden/Ressourcen und Anwendungen
- Portal Computerlinguistik:
Informationen zur Computerlinguistik im deutschsprachigen Raum und zur deutschen Sprache
Software
- German Noun Chunker
- Predicate Argument Clustering (PAC)
- SubCat-Extractor: induction of verb subcategorisation from MATE dependency parses