Identifying acoustic cues for dialect profiling: Policing in multilingual communities of India

Ravina Toppo, Sweta Sinha

Abstract


A multilingual country such as India with numerous languages and dialects provides fertile grounds for evasive language crimes. From threat letters to ransom demands, the scope of crime is huge. The cases of illegal immigrants have only added to the fragility of international boundaries especially, during political upheavals. This leads to further vulnerability of society and also creates challenges for the police and law enforcement agencies towards timely intervention. The purpose of the study is to exhibit dialectal variation in Indian English by comparing two varieties. The current paper is based on the acoustic analysis of Indian English spoken by two distinct groups with different mother tongues. Ten native speakers of Hindi and Bangla were recorded in an anechoic chamber. A phonetically balanced passage was selected to be read. The analysis is based on Native Language Influence Detection (Perkins & Grant, 2018) to derive acoustic phonetic correlates that can be used as significant identifying markers to distinguish Indian English speakers of Bangla and Hindi speech communities. The paper highlights that dialect profiling in the Indian context can be efficiently correlated with formant frequencies and Voice Onset Time for speech data. Acoustic analysis was done on PRAAT. PRAAT was used in this study because it has often been used by other similar studies to measure desired acoustic parameters simultaneously. Formant frequencies were measured at the midpoint of the vowels in the PRAAT using the LPC formant measurement algorithm. The normalization procedure was applied to the measured formant frequencies of vowels. The research affirms that acoustic analysis can provide verifiable cues for NLID. The framework can be used in the detection of native language influence in speech-centric criminal cases. The acoustic analysis shows that Indian English has subvarieties that could help in dialect profiling. The variation in Indian English vowel patterns could be due to the influence of the native language of the speakers.


Keywords


Bark difference method; dialect profiling; formant frequency; Indian English; VOT

Full Text:

PDF

References


Adank, P. (2003). Vowel normalization: a perceptual-acoustic study of Dutch vowels [Doctoral thesis, Catholic University of Nijmegen). Radbound Repository. https://hdl.handle.net/2066/19286

Adank, P., Van Hout, R., & Velde, H. V. D. (2007). An acoustic description of the vowels of northern and southern standard Dutch II: Regional varieties. The Journal of the Acoustical Society of America, 121(2), 1130-1141. https://doi.org/10.1121/1.2409492

Ainsworth, P. (2001). Offender profiling and crime analysis (1st ed.). Willan. https://doi.org/10.4324/9781843924630

Awan, S. N., & Stine, C. L. (2011). Voice onset time in Indian English-accented speech. Clinical Linguistics & Phonetics, 25(11-12), 998-1003. https://doi.org/10.3109/02699206.2011.619296

Barman, B. (2009). A contrastive analysis of English and Bangla phonemics. Dhaka University Journal of Linguistics, 2(4), 19-42. https://doi.org/10.3329/dujl.v2i4.6898

Blomgren, M., Robb, M., & Chen, Y. (1998). A note on vowel centralization in stuttering and nonstuttering individuals. Journal of Speech, Language, and Hearing Research, 41(5), 1042-1051. https://doi.org/10.1044/jslhr.4105.1042

Boersma, P., & Weenink, D. (2008). Praat: Doing phonetics by computer (Version 5.0.25). Praat. http://www.praat.org/

Borden, G., Harris, K., & Raphael, L. (2007). Speech science primer: Physiology, acoustics, and perception of speech (5th ed.). Lippincott Williams & Wilkins.

Chatterji, S. K. (2002). The origin and development of the Bengali language. Rupa.

Chaudhury, S.B.R., & Samaddar, R. (Eds.). (2018). The Rohingya in South Asia: People without a state. Routledge. https://doi.org/10.4324/9780429467677

Chifflet, P. (2015). Questioning the validity of criminal profiling: An evidence-based approach. Australian & New Zealand Journal of Criminology, 48(2), 238-255. https://doi.org/10.1177/0004865814530732

Clopper, C. G., & Paolillo, J. C. (2006). North American English vowels: A factor-analytic perspective. Literary and linguistic computing, 21(4), 445-462. https://doi.org/10.1093/llc/fql039

Cooley, C. M. (2012). Criminal profiling on trial: The admissibility of criminal profiling evidence. In B. E. Turvey (Eds.), Criminal profiling: An introduction to behavioral evidence analysis (pp. 628-652). Elsevier.

Cruttenden, A. (2001). Gimson’s pronunciation of English (6th ed.). Oxford University Press.

Das, S., & Hansen, J. H. (2004). Detection of voice onset time (VOT) for unvoiced stops (/p/, /t/, /k/) using Teager Energy Operator (TEO) for automatic detection of accented English. In J. Tanskanen (Eds.), Proceedings of the 6th Nordic Signal Processing Symposium, 2004. (pp. 344–347). IEEE.

Dasgupta, P. (2003). Bangla. In G. Cardona & D. Jains (Eds.), The Indo-Aryan languages, (pp. 351-390). Routledge.

Datta, A. K. (2018). Acoustics of Bangla speech sounds. Springer. https://doi.org/10.1007/978-981-10-4262-1

Deterding, D. (2003). An instrumental study of the monophthong vowels of Singapore English. English World-Wide, 24(1), 1-16. https://doi.org/10.1075/eww.24.1.02det

Foulkes, P., & French, P. (2001). Forensic phonetics and sociolinguistics. In R. Mesthrie (Ed.), The concise encyclopedia of sociolinguistics. Elsevier.

Foulkes, P., French, P., & Wilson, K. (2019). LADO as forensic speaker profiling. In P. Patrick, M. Schmid, & K. Wilson (Eds.), Language analysis for the determination of origin, (pp. 91-116). Springer.

French, P., & Harrison, P. (2006). Investigative and evidential applications of forensic speech science. In A. Heaton- Armstrong, E. Shepherd, G. Gudjonsson & D. Wolchover (Eds.), Witness testimony: Psychological, investigative and evidential perspectives, (pp. 247-262). Oxford University Press.

French, P., & Stevens, L. (2013). Forensic speech science. In M. Jones & R. Knight (Eds.), Bloomsbury companion to phonetics, (pp. 183-197). Bloomsbury Academic. http://doi.org/10.5040/9781472541895.ch-012

Geberth, V. J. (2015). Practical homicide investigation: Tactics, procedures, and forensic techniques. CRC Press. https://doi.org/10.4324/9781003095835

Grant, T. (2008). Approaching questions in forensic authorship analysis. In J. Gibbons & M. Turell (Eds.), Dimensions of forensic linguistics (pp. 215-229). John Benjamins. https://doi.org/10.1075/aals.5.15gra

Jessen, M. (2007). Speaker classification in forensic phonetics and acoustics. In C. Muller (Ed.), Speaker classification I (pp. 180-204). Springer. https://doi.org/10.1007/978-3-540-74200-5_10

Jessen, M. (2008). Forensic phonetics. Language and linguistics compass, 2(4), 671-711. https://doi.org/10.1111/j.1749-818X.2008.00066.x

Kalashnik, O. & Fletcher, J. (2007). An acoustic study of vowel contrasts in North Indian English. In J. Trouvain & W.J. Barry (Eds.), Proceedings of the 16th international congress of phonetic sciences (pp. 953–956).

Kent, R. D., & Read, C. (2002). The acoustic analysis of speech (2nd ed.). Singular.

Kulshreshtha, M., Singh, C. P., & Sharma R. M. (2012). Speaker profiling: The study of acoustic characteristics based on phonetic features of Hindi dialects for forensic speaker identification. In H. Patil, & A. Neustein (Eds.), Forensic speaker recognition (pp. 71-100). Springer. https://doi.org/10.1007/978-1-4614-0263-3_4

Labov, W., Ash, S., & Boberg, C. (2005). Atlas of North American English: Phonetics, phonology and sound change. De Gruyter Mouton. https://doi.org/10.1515/9783110167467

Ladefoged, P., & Johnson, K. (2011). A course in phonetics (6th ed.). Cengage Learning.

Liu, H. M., Tsao, F. M., & Kuhl, P. K. (2005). The effect of reduced vowel working space on speech intelligibility in Mandarin-speaking young adults with cerebral palsy. The Journal of the Acoustical Society of America, 117(6), 3879-3889. https://doi.org/10.1121/1.1898623

Malhotra, H. K., & Vogelaar, R. (2004). Accent reduction for Asian Indians. Advance for Speech-Language Pathologists and Audiologists, 14(49), 10-23.

Malmasi, S., & Dras, M. (2017). Multilingual native language identification. Natural Language Engineering, 23(2), 163-215. https://doi.org/10.1017/S1351324915000406

Maxwell, O. & Fletcher, J. (2009) Acoustic and durational properties of Indian English vowels. World Englishes, 28(1), 52–69. https://doi.org/10.1111/j.1467-971X.2008.01569.x

Mostafa, T. (2010). Problems bangladeshi learners face in pronouncing certain English phonemes. BRAC University Journal, Special Issue 01, 130–138. http://hdl.handle.net/10361/5165

National Crime Records Bureau. (2021). Crimes in India. https://ncrb.gov.in/sites/default/files/CII-2021/CII_2021Volume%201.pdf

Olagbaju, Y., Barkana, B. D., & Gupta, N. (2010, April). English vowel production by native Mandarin and Hindi speakers. In S. Latifi (Ed.), Proceedings of the Seventh International Conference on Information Technology: New Generations (pp. 343-347). Conference Publishing Service.

Oyama, S. (1976). A sensitive period for the acquisition of a nonnative phonological system. Journal of Psycholinguistic Research, 5(3), 261-283. https://doi.org/10.1007/BF01067377

Perkins, R., & Grant, T. (2018). Native language influence detection for forensic authorship analysis: Identifying L1 Persian bloggers. International Journal of Speech Language and the Law, 25(1), 1-20. https://doi.org/10.1558/ijsll.30844

Phull, D. K. & Kumar, G. B. (2016). Vowel analysis for Indian English. Procedia Computer Science 93, 533-538. https://doi.org/10.1016/j.procs.2016.07.264

Rahman, A. M. (2008). A comparison between English and Bangla vowel systems. Khulna University Studies, 9(1), 9-16. https://doi.org/10.53808/KUS.2008.9.1.0829-A

Rose, P. (2002). Forensic speaker identification. Taylor & Francis. https://doi.org/10.1201/9780203166369

Schneider, E. W. (2007). Postcolonial English: Varieties around the world. Cambridge University Press.

Schuller, B., Steidl, S., Batliner, A., Hirschberg, J., Burgoon, J.K., Baird, A., Elkins, A.C., Zhang, Y., Coutinho, E., & Evanini, K. (2016). The INTERSPEECH 2016 computational paralinguistics challenge: Deception, sincerity & native language. Proceedings of the Annual Conference of the International Speech Communication Association (pp. 2001-2005). https://doi.org/10.21437/Interspeech.2016-129

Shamshad, R. (2017). Bangladeshi migrants in India: Foreigners, refugees, or infiltrators? Oxford University Press.

Sharf, G., & Masur, H. (2002). Voice onset time in normal speakers of a German dialect: Effects of age, gender, and verbal material. In F. Windsor, M. L. Kelly, & N. Hewlett (Eds.), Investigations in clinical phonetics and linguistics. Psychology Press.

Sharma, D. (2017). English in India. In A. Bergs & L. Brinton (Eds.), Volume 5 Varieties of English (pp. 311-329). De Gruyter Mouton. https://doi.org/10.1515/9783110525045-016

Sinha, S., Jain, A., & Agrawal, S. S. (2019). Empirical analysis of linguistic and paralinguistic information for automatic dialect classification. Artificial Intelligence Review, 51(4), 647-672. https://doi.org/10.1007/s10462-017-9573-3

Tiwari, B. (1966). Hindi bhasha. Kitab Mahal.

Thomas, E. R. (2002). Instrumental phonetics. In J. K. Chambers, Peter Trudgill, and Natalie Schilling-Estes (Eds.), The handbook of language variation and change (pp. 168-200). Oxford.

Thomas, E. R. (2011). Sociophonetics: An introduction. Palgrave Macmillan.

Thomason, S. G. (2001). Language contact. Edinburgh University Press.

Watt, D., & Tillotson, J. (2001). A spectrographic analysis of vowel fronting in Bradford English. English World-Wide, 22(2), 269–303. https://doi.org/10.1075/eww.22.2.05wat

Wells, J. C. (1962). A study of the formants of the pure vowels of British English [Unpublished Doctoral dissertation]. University of London.

Wells, J. C. (1982). Accents of English: Volume 1. Cambridge University Press.

Wiltshire, C. R., & Harnsberger, J. D. (2006). The influence of Gujarati and Tamil L1s on Indian English: A preliminary study. World Englishes, 25(1), 91-104. http://doi.org/10.1111/j.0083-2919.2006.00448.x

Winerman, L. (2004, July). Psychological sleuths--criminal profiling: The reality behind the myth. Monitor on Psychology, 35(7). https://www.apa.org/monitor/julaug04/criminal

Yan, Q., & Vaseghi, S. (2003). Analysis, modeling, and synthesis of formants of British, American, and Australian accents. Proceedings of the IEEE international conference on acoustics, speech, and signal processing (pp. 712-715).

Yavas, M. (2002). Voice onset time patterns in bilingual phonological development. In F. Windsor, M. L. Kelly, & N. Hewlett (Eds.), Investigations in clinical phonetics and linguistics (pp. 327–339). Mahwah, Erlbaum. https://doi.org/10.4324/9781410613158

Zampieri, M., Malmasi, S., Ljubešić, N., Nakov, P., Ali, A., Tiedemann, J., Scherrer, Y., & Aepli, N. (2017). Findings of the VarDial evaluation campaign 2017. Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties, and Dialects, (pp. 1-15). Association for Computational Linguistics.




DOI: https://doi.org/10.17509/ijal.v12i2.43179

Refbacks

  • There are currently no refbacks.


View My Stats

Creative Commons License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.