O., Solomennik A. I. Hybrid Russian speech synthesis technology based on hidden Markov models and the Unit Selection algorithm // Izvestiya vuzov. Priborostroenie. — 2013. — No. 2. — P. 33–38. (In Russian)
48. Chistikov P. G., Talanov A. O., Zakharov D. S., Solomennik A. I. Natural speech synthesis technology using a small database // Scientific and Technical Journal of Information Technologies, Mechanics and Optics. — No. 4 (91). — 2014. — P. 83–97. (In Russian)
49. Shirokova A. M. Letter-to-sound conversion in automatic speech processing systems // Structural and Applied Linguistics. — St. Petersburg, 2014. — Issue 10. — P. 279–302. (In Russian)
50. Bachan J., Kuczmarski T., Francuzik P. Evaluation of synthetic speech using automatic speech recognition // Proc. XIV International PhD Workshop OWD. — 2012. — P. 500–505.
51. Benoit C., Grice M., Hazan V. The SUS test: a method for the assessment of text-to-speech synthesis intelligibility using Semantically Unpredictable Sentences // Speech Communication. — 1996. — Vol. 18. — P. 381–392.
52. Black A. W., Taylor P. CHATR: A Generic Speech Synthesis System // COLING 94. — Japan, 1994. — P. 983–986.
53. Black A. W. Perfect Synthesis for all of the people all of the time // Keynote, IEEE TTS Workshop. — Santa Monica, CA, 2002. — P. 167–170.
54. Black A. W., Zen H., Tokuda K. Statistical parametric speech synthesis // Proc. ICASSP 2007. — 2007. — P. 1229–1232.
55. Blizzard Challenge 2015 [Electronic resource]. — 2015. — Available at: http://www.synsig.org/index.php/Blizzard_Challenge_2015.
56. Campbell N. Evaluation of Speech Synthesis // Dybkjær L., Hemsen H. and Minker W. (Eds.) Evaluation of text and speech systems. — Springer: The Netherlands, 2007. — P. 29–64.
57. Chevelu J., Barbot N., Boeffard O., Delhay A. Comparing set-covering strategies for optimal corpus design // Proceedings of the 6th International Language Resources and Evaluation. — 2008. — P. 969–974.
58. Chistikov P., Korolkov E. Data-driven speech parameter generation for Russian text-to-speech system // Proceedings of the Dialogue-2012 International Conference. — Bekasovo, 2012. — No. 11 (18). — P. 103–111.
59. Chistikov P. G., Korolkov E. A., Talanov A. O. Combining HMM and Unit Selection technologies to increase naturalness of synthesized speech // Computational Linguistics and Intellectual Technologies: Proceedings of the Annual International Conference "Dialogue". — Moscow: RGGU Publ., 2013. — Issue 12 (19). — Vol. 2. — P. 2–10.
60. Clark R. A. G., Richmond K., King S. Multisyn: Open-domain unit selection for the Festival speech synthesis system // Speech Communication. — 2007. — Vol. 49. — Issue 4. — P. 317–330.
61. Dutoit T. An Introduction to Text-to-Speech Synthesis. — Dordrecht–Boston–London, 1997. — 286 p.
62. EVALDA – EVASY. Evaluation des Synthétiseurs de parole en français [Electronic resource]. — 2006. — Available at: http://www.technolangue.net/article.php3?id_article=202.
63. Falk T. H., Möller S. Towards Signal-Based Instrumental Quality Diagnosis for Text-to-Speech Systems // IEEE Signal Processing Letters. — 2008. — Vol. 15. — P. 781–784.
64. Fujisaki H. Dynamic characteristics of voice fundamental frequency in speech and singing // Production of Speech. — N. Y., 1983. — P. 39–55.
65. Hao W., Soong F. K., Meng H. A spectral space warping approach to cross-lingual voice transformation in HMM-based TTS // ICASSP. — 2015. — P. 4874–4878.
66. House A., Williams C., Hecker M., Kryter K. Articulation-testing methods: Consonantal differentiation with a closed-response set // The Journal of the Acoustical Society of America 37. — 1965. — P. 158–166.
67. Hunt A., Black A. Unit Selection in a Concatenative Speech Synthesis System Using a Large Speech Database // Proceedings of ICASSP 96. — 1996. — P. 373–376.
68. Jekosch U. The Cluster-Identification Test // Proceedings of ICSLP 92 (1). — 1992. — P. 205–208.
69. Khomitsevich O. G., Chistikov P. G. Using statistical methods for prosodic boundary detection and break duration prediction in a Russian TTS system // Computational Linguistics and Intellectual Technologies: Proceedings of the Annual International Conference "Dialogue". — Moscow: RGGU Publ., 2013. — Issue 12 (19). — Vol. 2. — P. 11–19.
70. Klatt D. Review of Text-to-Speech Conversion for English // JASA. — 1987. — Vol. 82 (3). — P. 737–793.
71. Klatt D. H. Software for a cascade/parallel formant synthesizer // JASA. — 1980. — Vol. 67. — P. 971–995.
72. Latorre J. A Study on Speaker-Adaptable Multilingual Synthesis. — Tokyo Institute of Technology, 2006. — 121 p.
73. Lemmetty S. Review of Speech Synthesis Technology. Master’s Thesis, Helsinki University of Technology. — 1999. — 104 p.
74. Lobanov B. The phonemophon text-to-speech system // International Congress of Phonetic Sciences: proc. of the 11-th section ICPhS’87, Tallinn, USSR, 6–10 August 1987. — Tallinn, 1987. — Vol. 1. — P. 120–124.
75. Mattingly I. G. Speech Synthesis for Phonetic and Phonological Models // Current Trends in Linguistics, edited by T. S. Sebeok. — Mouton, the Netherlands, 1974. — Vol. 12. — P. 2451–2487.
76. Méndez F. et al. The Albayzín 2010 Text-to-Speech Evaluation // Proc. Fala-2010. — 2010. — P. 317–340.
77. Method for Subjective Performance Assessment of the Quality of Speech Voice Output Devices, ITU-T Rec. P.85. — Int. Telecom. Union, 1994. — 13 p.
78. Moore R. K., Nicolao M. Reactive Speech Synthesis: Actively Managing Phonetic Contrast along an H&H Continuum // ICPhS 2011. — Hong Kong, China, 2011. — P. 1422–1425.
79. Moulines E., Verhelst W. Time-domain and frequency-domain techniques for prosodic modification of speech in Speech Coding and Synthesis // IEEE. — Netherlands, 1995. — P. 519–555.
80. Norrenbrock C. R., Hinterleitner F., Heute U., Möller S. Towards Perceptual Quality Modeling of Synthesized Audiobooks – Blizzard Challenge 2012 [Electronic resource]. — 2012(b). — Available at: http://festvox.org/blizzard/bc2012/Norrenbrock_etal_Blizzard_workshop_2012_final.pdf
81. Norrenbrock C. R., Hinterleitner F., Heute U., Möller S. Instrumental Assessment of Prosodic Quality for Text-to-Speech Signals // Signal Processing Letters, IEEE. — 2012(a). — P. 255–258.
82. Oparin I., Talanov A. Outline of a New Hybrid Russian TTS System // Proceedings of the 12th International conference on Speech and Computer, SPECOM 2007. — Moscow, 2007. — P. 603–608.
83. Sagisaka Y. et al. ATR – n-Talk speech synthesis system // Proceedings of ICSLP 92. — Banff, Canada, 1992. — P. 483–486.
84. Silverman K. et al. ToBI: a standard for labelling English prosody // Spoken Language Processing: proceedings of 2nd International conference ICSLP’92. Alberta, Canada, 13–16 October 1992. — Alberta, 1992. — P. 867–870.
85. Solomennik A. I., Cherentsova A. E. A Method for Auditory Evaluation of Synthesized Speech Intonation // Miloš Železný et al. (Eds.): SPECOM 2013, Lecture Notes in Artificial Intelligence 8113. — Springer, 2013. — P. 9–16.
86. Solomennik A. I., Chistikov P. G. Evaluation of naturalness of synthesized speech with different prosodic models // Computational Linguistics and Intellectual Technologies: Proceedings of the Annual International Conference "Dialogue". — Moscow: RGGU Publ., 2013. — Issue 12 (19). — Vol. 2. — P. 31–38.
87. Solomennik A., Chistikov P. Automatic generation of text corpora for creating voice databases in a Russian text-to-speech system // Proceedings of the Dialogue-2012 International Conference. — Bekasovo, 2012. — No. 11 (18). — P. 607–615.
88. Sproat R., Black A. W., Chen S., Kumar S., Ostendorf M., Richards C. Normalization of non-standard words // Computer Speech and Language. — 2001. — Vol. 15. — P. 287–333.
89. Stylianou Y. Applying the Harmonic Plus Noise Model in Concatenative Speech Synthesis // IEEE transactions on speech and audio processing. — 2001. — Vol. 9. — No. 1. — P. 21–29.
90. Stylianou Y., Syrdal A. Perceptual and objective detection of discontinuities in concatenative speech synthesis // Proc. Int. Conf. Acoustics, Speech, and Signal Processing. — 2001. — P. 837–840.
91. Sydeserff H. A., Caley R. J., Isard S. D. et al. Evaluation of speech synthesis techniques in a comprehension task // Speech Communication 11, 2–3. — 1992. — P. 189–194.
92. Syrdal A. K., Conkie A. D. Perceptually-based data-driven join costs: Comparing join types // Proceedings of Eurospeech, Interspeech. — 2005. — P. 2813–2816.
93. 't Hart J., Collier R., Cohen A. A Perceptual Study of Intonation: an Experimental-Phonetic Approach to Speech Melody. — Cambridge, 1991. — 212 p.
94. Tatham M., Morton K. Developments in Speech Synthesis. — John Wiley & Sons Ltd., 2005. — 342 p.
95. Taylor P. Analysis and synthesis of intonation using the tilt model // J. Acoust. Soc. America. — 2000. — Vol. 107. — No. 3. — P. 1697–1714.
96. Taylor P. Text-to-Speech Synthesis. — Cambridge University Press, 2009. — 474 p.
97. Tokuda K., Masuko T., Yamada T. An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features // Proceedings of Eurospeech-1995. — 1995. — P. 757–760.
98. Tokuda K., Nankaku Y., Toda T., Zen H., Yamagishi J., Oura K. Speech Synthesis Based on Hidden Markov Models // Proceedings of the IEEE.