Please use this identifier to cite or link to this item:
https://doi.org/10.21256/zhaw-30250
Publication type: | Conference paper |
Type of review: | Peer review (publication) |
Title: | Text-to-speech pipeline for Swiss German : a comparison |
Authors: | Bollinger, Tobias Deriu, Jan Milan Vogel, Manfred |
et. al: | No |
DOI: | 10.48550/arXiv.2305.19750 10.21256/zhaw-30250 |
Conference details: | 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023 |
Issue Date: | Jun-2023 |
Publisher / Ed. Institution: | arXiv |
Language: | English |
Subjects: | Speech synthesis; Text to speech |
Subject (DDC): | 410.285: Computational linguistics 430: German |
Abstract: | In this work, we studied the synthesis of Swiss German speech using different Text-to-Speech (TTS) models. We evaluated the TTS models on three corpora, and we found, that VITS models performed best, hence, using them for further testing. We also introduce a new method to evaluate TTS models by letting the discriminator of a trained vocoder GAN model predict whether a given waveform is human or synthesized. In summary, our best model delivers speech synthesis for different Swiss German dialects with previously unachieved quality. |
URI: | https://digitalcollection.zhaw.ch/handle/11475/30250 |
Fulltext version: | Published version |
License (according to publishing contract): | CC BY 4.0: Attribution 4.0 International |
Departement: | School of Engineering |
Organisational Unit: | Centre for Artificial Intelligence (CAI) |
Published as part of the ZHAW project: | End-to-End Low-Resource Speech Translation for Swiss German Dialects |
Appears in collections: | Publikationen School of Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
2023_Bollinger-etal_Text-to-speech-pipeline-for-Swiss-German.pdf | 1.03 MB | Adobe PDF | View/Open |
Show full item record
Bollinger, T., Deriu, J. M., & Vogel, M. (2023, June). Text-to-speech pipeline for Swiss German : a comparison. 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. https://doi.org/10.48550/arXiv.2305.19750
Bollinger, T., Deriu, J.M. and Vogel, M. (2023) ‘Text-to-speech pipeline for Swiss German : a comparison’, in 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. arXiv. Available at: https://doi.org/10.48550/arXiv.2305.19750.
T. Bollinger, J. M. Deriu, and M. Vogel, “Text-to-speech pipeline for Swiss German : a comparison,” in 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023, Jun. 2023. doi: 10.48550/arXiv.2305.19750.
BOLLINGER, Tobias, Jan Milan DERIU und Manfred VOGEL, 2023. Text-to-speech pipeline for Swiss German : a comparison. In: 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. Conference paper. arXiv. Juni 2023
Bollinger, Tobias, Jan Milan Deriu, and Manfred Vogel. 2023. “Text-to-Speech Pipeline for Swiss German : A Comparison.” Conference paper. In 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. arXiv. https://doi.org/10.48550/arXiv.2305.19750.
Bollinger, Tobias, et al. “Text-to-Speech Pipeline for Swiss German : A Comparison.” 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023, arXiv, 2023, https://doi.org/10.48550/arXiv.2305.19750.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.