Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-4974
Publication type: Conference paper
Type of review: Peer review (publication)
Title: German compound splitting using the compound productivity of morphemes
Authors: Sugisaki, Kyoko
Tuggener, Don
DOI: 10.21256/zhaw-4974
Proceedings: 14th Conference on Natural Language Processing - KONVENS 2018
Editors of the parent work: Barbaresi, Adrien
Biber, Hanno
Neubarth, Friedrich
Osswald, Rainer
Page(s): 141
Pages to: 147
Conference details: 14th Conference on Natural Language Processing (KONVENS 2018), Vienna, Austria, 19-21 September 2018
Issue Date: 2018
Publisher / Ed. Institution: Austrian Academy of Sciences Press
Other identifiers: 0xc1aa5576 0x003a2438
Language: English
Subjects: Compound splitting
Subject (DDC): 410.285: Computational linguistics
Abstract: In this work, we present a novel compound splitting method for German by capturing the compound productivity of morphemes. We use a giga web corpus to create a lexicon and decompose noun compounds by computing the probabilities of compound elements as bound and free morphemes. Furthermore, we provide a uniformed evaluation of several unsupervised approaches and morphological analysers for the task. Our method achieved a high F1 score of 0.92, which was a comparable result to state-of-the-art methods.
URI: https://digitalcollection.zhaw.ch/handle/11475/14372
Fulltext version: Published version
License (according to publishing contract): Licence according to publishing contract
Departement: School of Engineering
Organisational Unit: Institute of Computer Science (InIT)
Appears in collections:Publikationen School of Engineering

Files in This Item:
File Description SizeFormat 
2018_Sugisaki_German_compound_splitting_using_the_compound.pdf177.39 kBAdobe PDFThumbnail
View/Open
Show full item record
Sugisaki, K., & Tuggener, D. (2018). German compound splitting using the compound productivity of morphemes [Conference paper]. In A. Barbaresi, H. Biber, F. Neubarth, & R. Osswald (Eds.), 14th Conference on Natural Language Processing - KONVENS 2018 (pp. 141–147). Austrian Academy of Sciences Press. https://doi.org/10.21256/zhaw-4974
Sugisaki, K. and Tuggener, D. (2018) ‘German compound splitting using the compound productivity of morphemes’, in A. Barbaresi et al. (eds) 14th Conference on Natural Language Processing - KONVENS 2018. Austrian Academy of Sciences Press, pp. 141–147. Available at: https://doi.org/10.21256/zhaw-4974.
K. Sugisaki and D. Tuggener, “German compound splitting using the compound productivity of morphemes,” in 14th Conference on Natural Language Processing - KONVENS 2018, 2018, pp. 141–147. doi: 10.21256/zhaw-4974.
SUGISAKI, Kyoko und Don TUGGENER, 2018. German compound splitting using the compound productivity of morphemes. In: Adrien BARBARESI, Hanno BIBER, Friedrich NEUBARTH und Rainer OSSWALD (Hrsg.), 14th Conference on Natural Language Processing - KONVENS 2018. Conference paper. Austrian Academy of Sciences Press. 2018. S. 141–147
Sugisaki, Kyoko, and Don Tuggener. 2018. “German Compound Splitting Using the Compound Productivity of Morphemes.” Conference paper. In 14th Conference on Natural Language Processing - KONVENS 2018, edited by Adrien Barbaresi, Hanno Biber, Friedrich Neubarth, and Rainer Osswald, 141–47. Austrian Academy of Sciences Press. https://doi.org/10.21256/zhaw-4974.
Sugisaki, Kyoko, and Don Tuggener. “German Compound Splitting Using the Compound Productivity of Morphemes.” 14th Conference on Natural Language Processing - KONVENS 2018, edited by Adrien Barbaresi et al., Austrian Academy of Sciences Press, 2018, pp. 141–47, https://doi.org/10.21256/zhaw-4974.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.