A Framework to Analyze Quality of Service (QoS) for Text-To-Speech (TTS) Services

Md Fudzee, Mohd Farhan; Hassan, Mohamud; Mahdin, Hairulnizam; Kasim, Shahreen; Abawajy, Jemal

doi:10.1007/978-3-319-51281-5_59

Mohd Farhan Md Fudzee¹⁸,
Mohamud Hassan¹⁸,
Hairulnizam Mahdin¹⁸,
Shahreen Kasim¹⁸ &
…
Jemal Abawajy¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 549))

Included in the following conference series:

International Conference on Soft Computing and Data Mining

1169 Accesses

Abstract

Quality of service (QoS) evaluation is vital for text-to-speech (TTS) web service applications. Most of the current solutions focus on either evaluating functional or nonfunctional attributes of the TTS. In this paper, we propose a QoS framework to evaluate and analyze the perceived QoS that combines general and specific mechanisms for measuring both functional and nonfunctional requirements of speech quality. General mechanism measures the response time of TTS services while specific mechanism measures intelligibility and naturalness through subjective quality measurements, which are mapped onto mean opinion score (MOS). The result shows the workability of the framework, tested by predetermined users to three services: service1 (Fromtexttospeech) resulting 47.84%; service2 and service3 (NaturalReader and Yakitome) are 31.62 and 21.53% respectively. The TTS services evaluation can be to enhance the user experience.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

References

Patil, M., Kawitkar, R.S.: “Syllable” concatenation for text to speech synthesis for Devnagari script. Int. J. Adv. Res. Eng. Comput. Sci. Softw. 2(9), 180–184 (2012)
Google Scholar
Md Fudzee, M.F., Abawajy, J.: A protocol for discovering content adaptation services. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds.) ICA3PP 2011. LNCS, vol. 7017, pp. 235–244. Springer, Heidelberg (2011). doi:10.1007/978-3-642-24669-2
Chapter Google Scholar
Wang, L., et al.: Evaluating text-to-speech intelligibility using template constrained generalized posterior probability. U.S. Patent Application (2012)
Google Scholar
Remes, U., Reima, K., Mikko, K.: Objective evaluation measures for speaker adaptive HMM-TTS systems. In: Proceedings of 8th ISCA Speech Synthesis Workshop (2013)
Google Scholar
Möller, S., Wai, Y.C., Cote, N., Falk, T., Raake, A., Waltermann, A.: Speech quality estimation: models and trends. IEEE Sign. Process. Mag. 28, 18–28 (2011)
Article Google Scholar
Egger, S., et al.: Waiting times in quality of experience for web based services. In: 2012 Fourth International Workshop on Quality of Multimedia Experience (QoMEX). IEEE (2012)
Google Scholar
Streijl, C.R., Winkler, S., Hands, D.S.: Mean Opinion Score (MOS) revisited: methods and applications, limitations and alternatives. Multimedia Syst. 22, 213–227 (2014)
Article Google Scholar
Md Fudzee, M.F., Abawajy, J.: Request-driven cross-media content adaptation technique. In: Ragab, K., Helmy, T., Hassanien, A.E. (eds.) Developing Advanced Web Services Through P2P Computing and Autonomous Agents: Trends and Innovations, chap. 6, pp. 91–113. IGI Global (2010)
Google Scholar
Eyben, F., et al.: Unsupervised clustering of emotion and voice styles for expressive TTS. In: International Conference on IEEE Acoustics, Speech and Signal Processing (ICASSP) (2012)
Google Scholar
Md Fudzee, M.F., Abawajy, J.: Management of Service level agreement for service-oriented content adaptation platform. In: Network and Traffic Engineering in Emerging Distributed Computing Applications, pp. 21–42 (2012)
Google Scholar

Download references

Acknowledgments

The authors would like to acknowledge the Malaysian Ministry of Higher Education for the Fundamental Research Grant Scheme vot 1238. This research also supported by GATES IT Solution Sdn. Bhd under its publication scheme.

Author information

Authors and Affiliations

Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Parit Raja, Malaysia
Mohd Farhan Md Fudzee, Mohamud Hassan, Hairulnizam Mahdin & Shahreen Kasim
School of Information Technology, Deakin University, Burwood, Australia
Jemal Abawajy

Authors

Mohd Farhan Md Fudzee
View author publications
You can also search for this author in PubMed Google Scholar
Mohamud Hassan
View author publications
You can also search for this author in PubMed Google Scholar
Hairulnizam Mahdin
View author publications
You can also search for this author in PubMed Google Scholar
Shahreen Kasim
View author publications
You can also search for this author in PubMed Google Scholar
Jemal Abawajy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hairulnizam Mahdin .

Editor information

Editors and Affiliations

Department of Information System, University of Malaya, Kuala Lumpur, Malaysia
Tutut Herawan
Universiti Tun Hussein Onn Malaysia, Batu Pahat, Malaysia
Rozaida Ghazali
Universiti Tun Hussein Onn Malaysia, Batu Pahat, Malaysia
Nazri Mohd Nawi
Universiti Tun Hussein Onn Malaysia, Batu Pahat, Malaysia
Mustafa Mat Deris

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Md Fudzee, M.F., Hassan, M., Mahdin, H., Kasim, S., Abawajy, J. (2017). A Framework to Analyze Quality of Service (QoS) for Text-To-Speech (TTS) Services. In: Herawan, T., Ghazali, R., Nawi, N.M., Deris, M.M. (eds) Recent Advances on Soft Computing and Data Mining. SCDM 2016. Advances in Intelligent Systems and Computing, vol 549. Springer, Cham. https://doi.org/10.1007/978-3-319-51281-5_59

Download citation

DOI: https://doi.org/10.1007/978-3-319-51281-5_59
Published: 29 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51279-2
Online ISBN: 978-3-319-51281-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics