Ebook: Linked Data in Linguistics: Representing and Connecting Language Data and Language Metadata
- Tags: Language Translation and Linguistics, Computational Linguistics, Document Preparation and Text Processing, Linguistics (general), Data Mining and Knowledge Discovery, Data Structures
- Year: 2012
- Publisher: Springer-Verlag Berlin Heidelberg
- Edition: 1
- Language: English
- pdf
The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects.
The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., the DBpedia, a machine-readable edition of the Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby allows to integrate information from such a diverse set of resources.
The contributions assembled in this volume illustrate the band-width of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases as well as terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) over applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology).
This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions of the workshop participants and, beyond this, it summarizes initial steps in the formation of a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).
The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects.
The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., the DBpedia, a machine-readable edition of the Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby allows to integrate information from such a diverse set of resources.
The contributions assembled in this volume illustrate the band-width of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases as well as terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) over applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology).
This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions of the workshop participants and, beyond this, it summarizes initial steps in the formation of a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).
The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects.
The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., the DBpedia, a machine-readable edition of the Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby allows to integrate information from such a diverse set of resources.
The contributions assembled in this volume illustrate the band-width of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases as well as terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) over applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology).
This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions of the workshop participants and, beyond this, it summarizes initial steps in the formation of a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).
Content:
Front Matter....Pages I-XIV
Introduction and Overview....Pages 1-12
Front Matter....Pages 13-13
Treating Dictionaries as a Linked-Data Corpus....Pages 15-23
Integrating WordNet and Wiktionary with lemon ....Pages 25-34
Integrating Lexical Resources Through an Aligned Lemma List....Pages 35-44
Linking Localisation and Language Resources....Pages 45-54
Front Matter....Pages 55-55
Reusing Linguistic Resources: Tasks and Goals for a Linked Data Approach....Pages 57-64
A Discourse Information Radio News Database for Linguistic Analysis....Pages 65-76
Integrating Treebank Annotation and User Activity in Translation Research....Pages 77-84
Creating Linked Data for the Interdisciplinary International Collaborative Study of Language Acquisition and Use: Achievements and Challenges of a New Virtual Linguistics Lab....Pages 85-96
Front Matter....Pages 97-97
Linking to Linguistic Data Categories in ISOcat....Pages 99-107
Towards Linked Language Data for Digital Humanities....Pages 109-116
OntoLingAnnot’s Ontologies: Facilitating Interoperable Linguistic Annotations (Up to the Pragmatic Level)....Pages 117-127
Using Linked Data to Create a Typological Knowledge Base....Pages 129-138
TYTO – A Collaborative Research Tool for Linked Linguistic Data....Pages 139-149
Front Matter....Pages 151-151
The Open Linguistics Working Group of the Open Knowledge Foundation....Pages 153-160
Interoperability of Corpora and Annotations....Pages 161-179
The German DBpedia: A Sense Repository for Linking Entities....Pages 181-190
Linked Data for Linguistic Diversity Research: Glottolog/Langdoc and ASJP Online....Pages 191-200
Linking Linguistic Resources: Examples from the Open Linguistics Working Group....Pages 201-216
The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects.
The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., the DBpedia, a machine-readable edition of the Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby allows to integrate information from such a diverse set of resources.
The contributions assembled in this volume illustrate the band-width of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases as well as terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) over applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology).
This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions of the workshop participants and, beyond this, it summarizes initial steps in the formation of a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).
Content:
Front Matter....Pages I-XIV
Introduction and Overview....Pages 1-12
Front Matter....Pages 13-13
Treating Dictionaries as a Linked-Data Corpus....Pages 15-23
Integrating WordNet and Wiktionary with lemon ....Pages 25-34
Integrating Lexical Resources Through an Aligned Lemma List....Pages 35-44
Linking Localisation and Language Resources....Pages 45-54
Front Matter....Pages 55-55
Reusing Linguistic Resources: Tasks and Goals for a Linked Data Approach....Pages 57-64
A Discourse Information Radio News Database for Linguistic Analysis....Pages 65-76
Integrating Treebank Annotation and User Activity in Translation Research....Pages 77-84
Creating Linked Data for the Interdisciplinary International Collaborative Study of Language Acquisition and Use: Achievements and Challenges of a New Virtual Linguistics Lab....Pages 85-96
Front Matter....Pages 97-97
Linking to Linguistic Data Categories in ISOcat....Pages 99-107
Towards Linked Language Data for Digital Humanities....Pages 109-116
OntoLingAnnot’s Ontologies: Facilitating Interoperable Linguistic Annotations (Up to the Pragmatic Level)....Pages 117-127
Using Linked Data to Create a Typological Knowledge Base....Pages 129-138
TYTO – A Collaborative Research Tool for Linked Linguistic Data....Pages 139-149
Front Matter....Pages 151-151
The Open Linguistics Working Group of the Open Knowledge Foundation....Pages 153-160
Interoperability of Corpora and Annotations....Pages 161-179
The German DBpedia: A Sense Repository for Linking Entities....Pages 181-190
Linked Data for Linguistic Diversity Research: Glottolog/Langdoc and ASJP Online....Pages 191-200
Linking Linguistic Resources: Examples from the Open Linguistics Working Group....Pages 201-216
....