Phonetic transcription of spontaneous children's speech with the aid of software: a systematic review

Debora Tomazi Moreira Caumo,
Márcio Pezzini França,
Clécio Homrich da Silva

Abstract

The aim of the study was to identify, synthesize and classify the software currently available that can help in the task of phonetic transcription of the spontaneous speech of pre-school children to evaluate the development of children's language. A systematic review was performed for articles published, for the 10-year period (June 2010 to June 2020), without restrictions as to location and language, using the Cochrane, Pubmed and Web of Science databases. The terms used in the search strategies were "phonological", "phonetic", "transcription", "computer" and "software". The studies were selected by two independent reviewers using pre-defined search strategies. In the initial search, after the exclusion of duplicates, 534 articles were found. By reading their titles and abstracts, 46 articles related to the theme were left, which were then read in full. After reading, 24 articles were included in the study. The results revealed a total of seven software available for the phonetic transcription of spontaneous speech from preschoolers used for different analyses: LENA and Timestamper (for babbling and pre-linguistic vocalizations), ELAN (for gestural communication, extralinguistic elements and the situational context), Phon (for phonetic and phonological analyses), CLAN and SALT (for morphosyntactic, grammatical and semantic aspects) and Praat (for acoustic measurements). Through this systematic review, it can be concluded that there are advantages to using software for phonetic transcription, sample storage, and child language analysis, especially concerning standardization and reliability for spontaneous speech samples. Phonetic transcription still relies on the ability and subjectivity of a human transcriber. The tools found in the software provide support to facilitate using phonetic symbols, audio segmentation and pairing to writing, and analysis of speech data.

References

ALVAREZ, A. et al. Automating live and batch subtitling of multimedia contents for several European lan-guages. Multimedia Tools and Applications, 75, p. 10823-10853, 2015. DOI https://doi.org/10.1007/s11042-015-2794-z

BALCIUNIENE, I. Lithuanian narrative language at preschool age. Eesti Rakenduslingvistika Uhingu Aastaraamat, n. 8, p. 21–36, 2012. DOI: http://dx.doi.org/10.5128/ERYa8.02

BREDIN-OJA, S. L. et al. “Clinician vs. Machine: Estimating Vocalizations Rates in Young Children With Develop-mental disorders”. American Journal of Speech-Language Pathology, v. 27, n.3, p. 1066–1072, August 2018. DOI https://doi.org/10.1044/2018_AJSLP-17-0016

BURGESS, S.; AUDET, L.; HARJUSOLA-WEBB, S. Quantitative and qualitative characteristics of the school and home language environments of preschool-aged children with ASD. Journal of Communication Disorders, v. 46, n. 5–6, 428–439, 2013. DOI https://doi.org/10.1016/j.jcomdis.2013.09.003

BUSCH, T. et al. Correlation and agreement between Language ENvironment Analysis (LENA™) and manual tran-scription for Dutch natural language recordings. Behavior Research Methods, v. 50, n.5, p. 1921–1932, 2018. DOI https://doi.org/10.3758/s13428-017-0960-0

BYUN, T. M.; ROSE, Y. Analyzing clinical phonological data using phon. Seminars in Speech and Language, v. 37, n. 2, p. 85–105, 2016. DOI https://doi.org/10.1055/s-0036-1580741

CANAULT, M. et al. Reliability of the Language ENvironment Analysis system (LENA™) in European French. Behavior research methods, v. 48, n.3, p. 1109–1124, 2016. DOI https://doi.org/10.3758/s13428-015-0634-8

CANAULT, M.; LE NORMAND M-T.; THAI VAN, H. LENATM (Language ENvironment Analysis System): An automated speech recognition system and child language environment. Enfance, v. 2, n. 2, p. 199-216, 2017. DOI https://doi.org/10.4074/S0013754517002038

CHAPARRO-MORENO, L. J.; REALI, F.; MALDONADO-CARREÑO, C. Wordless picture books boost preschoolers’ lan-guage production during shared reading. Early Childhood Research Quarterly, v. 40, p. 52–62, 2017. DOI https://doi.org/10.1016/j.ecresq.2017.03.001

COLLETTA, J-M.; COLLETTA, C.; GUIDETTI, M. Age-related changes in co-speech gesture and narrative: Evidence from French children and adults. Speech Communication, v. 52, n. 6, p. 565-576, June 2010. DOI https://doi.org/10.1016/j.specom.2010.02.009

D'APICE, K.; LATHAM, R. M.; VON STUMM, S. A naturalistic home observational approach to children's language, cog-nition, and behavior. Developmental Psychology, v. 55, n. 7, p. 1414–1427, 2019. https://doi.org/10.1037/dev0000733

DEL RE, A.; HILARIO, R. N.; MOGNO, A. S. Programa CLAN da base CHILDES: normas de transcrição (CHAT) e coman-dos básicos In: G. FERREIRA-GONÇALVES, M. R., BRUM-DE-PAULA & M. KESKE-SOARES (Eds.), Estudos em aquisi-ção fonológica. Pelotas: Editora e Gráfica Universitária PREC-UFPel, v. 4, p. 11-30, 2011.

FLIPSEN, P. JR.; KANGAS, K. Mean length of utterance (MLU) in children with cochlear implants. Volta Review, v. 114, n. 2, p. 135–55, 2014.

FREITAS, L. B. L.; SHELTON T. L. Atenção à Primeira Infância nos EUA e no Brasil. Psicologia: Teoria e Pesquisa, V. 21 n. 2, p. 197-205, Mai-Ago 2005. DOI https://doi.org/10.1590/S0102-37722005000200010

GENEST, C. S.; MASSON, C. The contribution of corpus linguistics to the study of clinical situations: using ecological resources. Studii de lingvistica, 7, 89–112, 2017.

GOMES DE MELO BEZERRA, J. T. et al. Softwares de transcrição como auxílio para as pesquisas com enfoque multi-modal no processo de aquisição da linguagem. Texto livre-linguagem e tecnologia, v. 9, n. 1, p. 77–93, 2016. DOI https://doi.org/10.17851/1983-3652.9.1.77-93

GREENWOOD, C. R. et al. Assessing Children’s Home Language Environments Using Automatic Speech Recognition Technology. Communication disorders quarterly, v. 32, n. 2, p. 83–92, 2011. DOI https://doi.org/10.1177/1525740110367826

IPA. International Phonetic Association. [cited 2021 Jun29]. Available from: https://www.internationalphoneticassociation.org/content/links.

JULIEN, C. et al. Morphosyntactic development and severe parental neglect in 4-year-old french-speaking children: ELLAN study. Child Maltreatment, v. 24, n. 3, p. 254–264, 2019. DOI https://doi.org/10.1177/1077559519829249

MOHER D. et al. The PRISMA Group (2009). Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med v. 6, n. 7, e1000097. DOI https://doi.org/10.1371/journal.pmed1000097

MOELLER, M. P. et al. Longitudinal development of phonology and morphology in children with late-identified mild-moderate sensorineural hearing loss. Ear and hearing, v. 3, n. 5, p. 625–635, 2010. DOI https://doi.org/10.1097/AUD.0b013e3181df5cc2

OLLER, D. K. et al. Automated vocal analysis of naturalistic recordings from children with autism, language delay, and typical development. Proceedings of the National Academy of Sciences, v. 107, n. 30, p. 13354–13359, 2010. DOI https://doi.org/10.1073/pnas.1003882107

OVERTON, S.; WREN, Y. Outcome measurement using naturalistic language samples: A feasibility pilot study using language transcription software and speech and language therapy assistants. Child language teaching & therapy, v. 30. n. 2 , p. 221–29, 2014. DOI: https://doi.org/10.1177/0265659013519251

PEZOLD, M. J.; IMGRUND, C. M.; STORKEL, H. L. Using Computer Programs for Language Sample Analysis. Language, Speech, and Hearing Services in Schools, v. 51, n.1, p. 103–114, 2020. DOI https://doi.org/10.1044/2019_LSHSS-18-0148

RÄSÄNEN, O. et. al. Automatic word count estimation from daylong child-centered recordings in various language environments using language-independent syllabification of speech. Speech Communication, n. 113, p. 63–80, 2019. DOI https://doi.org/10.1016/j.specom.2019.08.005

RATNER, N. B.; MACWHINNEY, B. Your Laptop to the Rescue: Using the Child Language Data Exchange System Ar-chive and CLAN Utilities to Improve Child Language Sample Analysis. Seminars in Speech and Language, v. 37, n. 2, 74–84, 2016. DOI https://doi.org/10.1055/s-0036-1580742

ROSE, Y.; STOEL-GAMMON, C. Using PhonBank and Phon in studies of phonological development and disorders. Clin-ical Linguistics & Phonetics, v. 29, n. 8–10, 686–700, 2015. DOI https://doi.org/10.3109/02699206.2015.1041609

SABRI, M.; FABIANO-SMITH, L. Phonological Development in a Bilingual Arabic-English-Speaking Child With Bilateral Cochlear Implants: A Longitudinal Case Study. American Journal Of Speech-Language Pathology, v. 27, n.4, 1506–1522, 2018. DOI https://doi.org/10.1044/2018_AJSLP-17-0162

SARAIVA, D. et al. Paediatric Automatic Phonological Analysis Tools (APAT). Logopedics, phoniatrics, vocology, 42, 153–159, 2017. DOI https://doi.org/10.1080/14015439.2016.1237544

SCLIAR-CABRAL, L.; VASILÉVSKI, V. Descrição do português com auxílio de programa computacional de interface. In: Anais da II Jornada de Descrição do Português, Cuiaba, MT, Brasil, 2011, p.39-46.

VENTURINI, C. et al. Confiabilidade intra e interexaminadores de dois métodos de medida da amplitude ativa de dor-siflexão do tornozelo em indivíduos saudáveis. Revista Brasileira de Fisioterapia, v.10, n. 4, 407-411, 2006. DOI https://doi.org/10.1590/S1413-35552006000400008

WILLADSEN, E.; PERSSON, C.; APPELBE, D. A software program to assist coding of prelinguistic vocalizations in real time. Clinical linguistics & phonetics, v. 32, n.10, 972–978, 2018. https://doi.org/10.1080/02699206.2018.1463396

XU, D.; RICHARDS, J. A.; GILKERSON, J. Automated Analysis of Child Phonetic Production Using Naturalistic Record-ings. Journal Of Speech Language and Hearing Research, v. 57, n.5, p. 1638–1650, 2014. DOI https://doi.org/10.1044/2014_JSLHR-S-13-0037