Baking Verbs in Action: A Corpus Study of International Bakery and Pastry Recipe Discourse

Authors

  • Woravit Kitjaroenpaiboon Suan Dusit University

Keywords:

lexical verbs, international bakery and pastry recipes, corpus linguistics

Abstract

This study applies corpus linguistics methodologies from both Thai and international scholars to analyze and compile frequently occurring vocabulary in the Suan Dusit University International Pastry and Bakery Corpus (SDUIBPC). Using AntConc 4.3.2 software in conjunction with statistical measures such as Log Likelihood (LL), Bayes Factor, Bayesian Information Criterion (BIC), and Effect Size for Log Likelihood (ELL), as well as expert viewpoints in the field, the study identifies a total of 761 frequently used words in the corpus. A paired-sample hypothesis test comparing SDUIBPC with nine other reference corpora reveals statistically significant differences, confirming that the vocabulary found in SDUIBPC is highly representative of English-language international bakery and pastry recipe books. The investigation of frequently occurring verbs in SDUIBPC results in 103 distinct baking lexical verbs. The findings of this study have practical implications for culinary professionals and students, as they provide empirical linguistic insights that can enhance English proficiency in the field of baking. Furthermore, this study serves as a foundation for language learning, encouraging students to engage in corpus-based learning methods and draw independent conclusions from authentic linguistic data.

References

Ambele, E. A. (2022). Thai English? Non-Thai English lecturers’ perceptions of Thai English and world Englishes. LEARN Journal: Language Education and Acquisition Research Network, 15(2), 724-750. Retrieved from https://so04.tci-thaijo.org/index.php/LEARN/article/view/259949.

Anward, J. (2000). A dynamic model of part-of-speech differentiation. In Vogel & Comrie (Eds.), Proceedings of the Tenth International Congress of Linguistics. Copenhagen, Denmark.

Ari, O. (2006). Review of three sofware programs designed to identify lexical bundle. Language Learning & Technology, 10(1), 30-37.

Azanedo, L., Garcia-Garcia, G., Stone, J., & Rahimifard, S. (2020). Bread and its fortification: Nutrition and health benefits. Journal of Food Science, 85(4), 683-694.

Bais, J. (2022). Framework for categorizing recipe types. Journal of Culinary Studies, 12(1), 45-60.

Baoya, Z. (2015). Moves and inter-move linguistic variation in education research articles. Unpublished doctoral dissertation, Suranaree University of Technology.

Bartosz, J., & Igor, S. (2022). Global trends in the bakery and confectionery market: A review of the impact of COVID-19. Food Reviews International, 1-22.

Biber, D., Conrad, S., & Cortes, V. (2004). If you look at...: Lexical bundles in university teaching and textbooks. Applied Linguistics, 25(3), 371-405.

Biber, D., Conrad, S., & Leech, G. (2002). Longman student grammar of spoken and written English. Longman.

Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman Grammar of Spoken and Written English. Pearson Education.

Bisang, W. (2011). Word classes. In J. Song (Ed.), The Oxford handbook of linguistic typology (pp. 280-302). Oxford University Press.

Black, N., Van Rooyen, S., Godlee, F., Smith, R., & Evans, S. (1998). What makes a good reviewer and a good review for a general medical journal? JAMA, 280(1), 231-233.

Cargill, M., & O’Connor, P. (2009). Writing scientific research articles: Strategies and steps. Australia: Wiley-Blackwell.

Charles, M. (2006a). Phraseological patterns in reporting clauses used in citation: A corpus-based study of theses in two disciplines. English for Specific Purposes, 25(3), 310-331.

Chatterjee, U., Kumar, V., & Madalli, D. (2016). Formalizing food ingredients for data analysis and knowledge organization. Journal of Scientometrics and Information Management, 10(2), 289–309. https://doi.org/10.1080/09737766.2016.1213970.

Cheevanon, N. (2022). Factors influencing consumers to select bakery goods in Bangkok. A thematic paper submitted to the Degree of Master of Management, College of Management, Mahidol University.

Chiaro, D. (2008). Language and culture at work: The case of culinary discourse. Multilingua, 27(4), 301–320.

Conrad, S. (1999). The importance of corpus-based research for language teachers. System, 27(1), 1–18. https://doi.org/10.1016/S0346-251X(98)00047-1

Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34(2), 213–238. https://doi.org/10.2307/3587951

Crystal, D. (2003). The Cambridge Encyclopedia of the English Language (2nd ed.). Cambridge University Press.

Culy, C. (1996). "On the nature of recipe discourse and its implications for syntactic structure." Language Variation and Change, 8(1), 1-29.

Dang, T. N. Y., Coxhead, A., & Webb, S. (2017). The academic spoken word list. Language Learning, 67(4), 959–997. https://doi.org/10.1111/lang.12261

Davies, M. (2008). The Corpus of Contemporary American English: 425 million words, 1990–present. Available online at https://www.english-corpora.org/coca/

Davies, M. (2013). Corpus of Global Web-Based English: 1.9 billion words from speakers in 20 countries. Available online at https://www.english-corpora.org/glowbe/

Delormier, T., Frohlich, K. L., & Potvin, L. (2009). Food and eating as social practice–understanding eating patterns as social phenomena and implications for public health. Sociology of Health & Illness, 31(2), 215-228.

Dhanalakshmi, R., Chandrasekaran, M., & Govindarajan, M. (2009). Automatic rule-based part of speech tagging for Tamil. International Journal of Recent Trends in Engineering, 1(1), 589-592.

Donald J. B., Michal, B., Eve, L., & Charles, A. (2008). Context Variation and Definitions in Learning the Meanings of Words: An Instance-Based Learning Approach Discourse Processes, 45(2),122–159. DOI: 10.1080/01638530701792826.

Doro, M. (2013). The impact of lexical simplification on the readability and comprehensibility of texts. Journal of Applied Linguistics and Professional Practice, 10(2), 189-205.

Dunning, T. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1), 61-74.

Evans, N. (2000). Word classes in the world’s languages. Cambridge University Press.

Evans, T. D., Matthews, G. A., & Bateman, R. P. (1993). Pesticide Application Technology. CAB International.

Fahad, S., Adnan, M., Noor, M., Shah, A. N., Ullah, H., & Wahid, F. (2020). Influence of climate change on the wheat productivity in Pakistan: A GIS-based approach. Environmental Monitoring and Assessment, 192(7), 1-14.

Fazilatfar, A. M., & Naseri, M. (2014). The effect of lexical bundles instruction on intermediate EFL learners’ writing. International Journal of Research Studies in Language Learning, 3(3), 53-67.

Flowerdew, J. (2011). Discourse in English Language Education. Routledge.

Gardner, D., & Davies, M. (2014). A new academic vocabulary list. Applied Linguistics, 35(3), 305–327.

Gerhardt, C. (2013). Culinary Linguistics. New York: John Benjamins Publishing Company.

Getkham, K. (2010). Politeness Strategies in Thai Graduate Research Paper Discussions: Implications for Second / Foreign Language Academic Writing. English Language Teaching, 7(11):159-159

Gil, D. (2000). The complexity of language: Developmental and evolutionary perspectives. Journal of Linguistics, 36(3), 527-570.

Gilner, L. (2011). A critical review of Thorndike’s semantic count technique. International Journal of Lexicography, 24(3), 302–319.

Gilner, L., & Morales, F. (2008). Revisiting the General Service List. Vocabulary Learning and Instruction, 1(1), 1–8. https://doi.org/10.7820/vli.v01.01

Görlach, M. (2004). Text Types and the History of English. Walter de Gruyter.

Gries, S. T. (2009). What is Corpus Linguistics. Language and Linguistics Compass, 3(5), 1225-1241.

Gwet, K. L. (2014). Handbook of inter-rater reliability: The definitive guide to measuring the extent of agreement among raters. Advanced Analytics.

Halliday, M. A. K. (1992). Language as system and language as instance: The corpus as a theoretical construct. Directions in Corpus Linguistics, 61, 61–77. https://doi.org/10.1515/9783110863492-006

Halliday, M. A. K., & Matthiessen, C. M. (2014). Halliday's Introduction to Functional Grammar. Routledge.

Haspelmath, M. (2012). Understanding morphology (2nd ed.). Oxford University Press.

Heine, B., & Kuteva, T. (2012). The evolution of grammar: Tense, aspect, and modality in the languages of the world. Oxford University Press.

Hsu, W. (2013). A medical academic word list. English for Specific Purposes, 32(2), 97–109. https://doi.org/10.1016/j.esp.2012.11.001

Hu, R., Li, H., & Li, Z. (2022). The effects of different baking conditions on the nutritional properties of bread. Food Science & Nutrition, 10(3), 1200-1210.

Huddleston, R., & Pullum, G. K. (2002). The Cambridge Grammar of the English Language. Cambridge: Cambridge University Press.

Huddleston, R., & Pullum, G. K. (2002). The Cambridge grammar of the English language. Cambridge University Press.

Hussin, A., & Roslim, N. (2021). Relevance of The New General Service List in Selecting Reading Passages for ESL Students. European Journal of English Language Teaching, 6(6), 223-241.

Hyland, K. (2008). Academic discourse: English in a global context. Continuum.

IBM. (2023). SPSS Statistics for Windows, Version 28.0 [Computer software]. IBM Corporation.

Javier, R., Gonzalez, L., & Lopez, M. (2023). The impact of baking technology on global markets. Food Science Press.

Jones, R., & Tschirner, E. (2015). Frequency-based vocabulary lists in language teaching. Language Teaching Research, 19(6), 645–659. https://doi.org/10.1177/1362168814540302

Kilgarriff, A., Reddy, S., Pomikálek, J., & Avinesh, P. V. S. (2010). A corpus factory for many languages. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10). European Language Resources Association (ELRA).

Kitjaroenpaiboon, W., & Getkham, K. (2015). An Analysis of Interactional Metadiscourse Devices in Communication Arts Research Articles. International Journal of Management and Applied Science (IJMAS) 1(9), 125-131.

Kitjaroenpaiboon, W., & Getkham, K. (2016). Stylistic Patterns in Language Teaching Research Articles: A Multidimensional Analysis. PASAA: Journal of Language Teaching and Learning in Thailand 52, 169-208

Kitjaroenpaiboon, W., & Getkham, K. (2017). Patterns of Linguistic Feature and their Commuincative Functions in Nursing Research Articles. International Journal of Management and Applied Science, 3(3), 98-103.

Kitjaroenpaiboon, W., Khamsakul, B., Kesprathum, S., Fahkrajang, S., & Fongsrun, P., (2021a). Rhetorical Move and Multidimensional Analyses of Applied Linguistics Research Abstracts. Journal of Language and Culture 40(2), 137-165.

Kitjaroenpaiboon, W., Pholpuntin, S., Fahkrajang, S., Fongsrun. P., Najpinij. N., & Sriarun. J. (2024). An Analysis of International Food Processing Lexis and Formulaic Languages: An application of a corpus linguistic-based approach. A research report submitted to Suan Dusit University.

Kitjaroenpaiboon, W., Wongwiseskul, S., Puksa, T., & Khamsakul, B. (2021b). A Multidisciplinary Corpus-based Comparative Analysis: Lexical bundles in language teaching, health sciences, and business management research articles. NIDA Journal of Language and Communication 26(40), 41-68.

Kumar, S., & Singh, R. (2010). The role of food texture in sensory perception. International Journal of Food Science, 45(6), 1125-1133.

Kweon, S.-O., Slade, R., & Levine, M. (2016). The importance of collocations in culinary communication. Culinary Science & Technology, 8(2), 145-163.

Kwik, J. (2008). Memory and learning techniques for culinary students. Food Learning Press.

Leben, W. R. (2016). Phonology: The structure of sound systems. Wiley-Blackwell.

Lee, C., & Chen, J. (2009). Baking science: The chemistry behind the dough. Baking Research Institute.

Leech, G., Rayson, P., & Wilson, A. (2001). Word Frequencies in Written and Spoken English: Based on the British National Corpus. London: Longman.

Lehman, W. P. (2002). Historical linguistics: An introduction. Routledge.

Lock, A. (1985). Communication in the classroom: The importance of discourse. Oxford University Press.

Mallikamas, P. & Arunmanakun, W. (2006). The effect of using a specialized English corpus on English-Thai translation. Journal of Liberal Arts, 32(2), 35-62.

Matsuoka, W. (2012). Development of English vocabulary lists: A historical perspective. Vocabulary Learning and Instruction, 1(1), 1–10. https://doi.org/10.7820/vli.v01.01

Mattiello, E. (2017). Analysing English slang and colloquialisms. Palgrave Macmillan.

Ministry of Public Health. (2021). Guidelines on food safety and hygiene in bakeries. Bangkok, Thailand.

Moussu, L., & Llurda, E. (2008). Non-native English-speaking English language teachers: History and research. Language Teaching, 41(3), 315-348.

Nasrabady, M., Shirvan, M. E., & Golparvar, S. E. (2020). Corpus-based analysis of academic English vocabulary. Journal of Applied Linguistics, 15(2), 102-120.

Nation, I. S. (2001). Learning vocabulary in another language. Cambridge University Press.

Nation, I. S. P. (2016). Making and using word lists for language learning and testing. John Benjamins Publishing Company.

Nation, I. S. P., & Kyongho, H. (1995). Where would general service vocabulary stop and special purposes vocabulary begin? System, 23(1), 35–41. https://doi.org/10.1016/0346-251X(94)00020-T

Neely, Y., & Cortes, V. (2009). Lexical bundles in university student writing. Journal of English for Academic Purposes, 8(3), 183-196.

Nesi, H., & Basturkmen, H. (2006). Academic discourse and genre analysis. English for Specific Purposes, 25(1), 35-52.

Neuendorf, K. A. (2002). The content analysis guidebook. SAGE Publications.

Nini, A. (2020). Multidimensional Analysis Tagger 1.3.3 [Computer software]. Retrieved from https://sites.google.com/view/matagger.

Nordin, M. N. R., Stapa, S. H., & Darus, S. (2013). Developing a specialized vocabulary word list in a composition culinary course through lecture notes. Advances in Language and Literary Studies, 4(1), 78–88.

Nouraldeen, A. S. (2015). Meaning and Context-Three Different Perspectives. British Journal of English Linguistics, 3(2), 13-17.

Ozcakmak, S., & Ebrahim, N. (2022). Innovations in bakery production and food sustainability. Food Technology and Development Journal, 17(4), 98-115.

Paquot, M. (2007). Towards a productively-oriented academic word list. In P. Bouillon, B. Daille, K. Morita, & E. S. M. Tseng (Eds.), Proceedings of the 4th International Workshop on Computational Terminology (pp. 49–56). https://doi.org/10.1007/978-3-540-71887-0_7

Peris, A., Garcia, J., & Toral, A. (2019). Deep learning for machine translation: A review. Computational Linguistics, 45(1), 1-32.

Qahramonjon, K. (2021). Advances in technology and their impact on vocabulary compilation methods. Journal of Language and Linguistic Studies, 17(2), 845–859. https://doi.org/10.52462/jlls.217

Quero, B. (2015). Towards a medical English corpus: Compilation process and validation. Procedia - Social and Behavioral Sciences, 198, 377–382. https://doi.org/10.1016/j.sbspro.2015.07.057

Ranney, S. (2012). Defining and Teaching Academic Language: Developments in K-12 ESL. Language and Linguistics Compass, 6(9), 560-574.

Rayson, P. (2003). Matrix: A statistical method and software tool for linguistic analysis through corpus comparison. Ph.D. thesis, Lancaster University.

Rayson, P. (2008). From key words to key semantic domains. International Journal of Corpus Linguistics, 13(4), 519–549. https://doi.org/10.1075/ijcl.13.4.06ray​:contentReference[oaicite:2]{index=2}.

Rayson, P., & Garside, R. (2000). Comparing corpora using frequency profiling. In Proceedings of the Workshop on Comparing Corpora, held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics (ACL 2000), 1–8 October 2000, Hong Kong, 1–6.

Rayson, P., Berridge, D., & Francis, B. (2004). Extending the Cochran rule for the comparison of word frequencies between corpora. Proceedings of the 7th International Conference on Statistical Analysis of Textual Data (JADT 2004), Louvain-la-Neuve, Belgium, March 10-12, 2004, 926-936.

Rennie, D. (1998). Freedom and responsibility in medical publication. Journal of the American Medical Association, 280(2), 300–302.

Rokach, A. (2020). Belonging, togetherness and food rituals. Open Journal of Depression, 9(4), 77–85. https://doi.org/10.4236/ojd.2020.94007​:contentReference[oaicite:5]{index=5}.

Rosch, E. (1978). Principles of categorization. In E. Rosch & B. B. Lloyd (Eds.), Cognition and categorization (pp. 27–48). Lawrence Erlbaum Associates.

Rungrueang, M., Chumworatayee, T., & Wongsothorn, A. (2022). Development of a food service vocabulary list for Thai learners of English. LEARN Journal: Language Education and Acquisition Research Network, 15(1), 1–20. https://doi.org/10.57149/learn.2022.1.02

Rungrueang, T., Boonprasert, P., Poempongsajaroen, S., & Laosrirattanachai, P. (2022). Corpus-based approach to generate a word list for food service. Thai TESOL Journal, 35(1), 57–76.

Sedan, C., Erkan, K., & Jingjing, Q. (2016). Structure of Moves in Research Article Abstracts in Applied Linguistics. Publications, 4(23), 1–16.

Sicanore, J. M., Connell, K. J., Olthoff, A. J., Friedman, M. H., & Geght, M. R. (1999). A method for measuring interrater agreement on checklists. Evaluation and the Health Professions, 22(2), 221–234.

Sinclair, J. (1997). Corpus evidence in language description. In A. Wichmann, S. Fligelstone, T. McEnery, & G. Knowles (Eds.), Teaching and language corpora (pp. 27–39). Longman.

Sinclair, J. M., & Mauranen, A. (2006). Linear unit grammar: Integrating speech and writing. John Benjamins Publishing.

Singh, A., Joshi, B., & Mathur, R. (2013). Effects of heavy metals on soil, plants, human health, and aquatic life. International Journal of Research in Chemistry and Environment, 3(1), 15–21.

Swales, J. M. (1990). Genre analysis: English in academic and research settings. Cambridge University Press.

Tognini-Bonelli, E. (2001). Corpus Linguistics at Work. John Benjamins Publishing.

Tseng, M. (2011). Cognitive genre structures in Methods sections of research articles: A corpus study. Journal of English for Academic Purposes, 10(3), 152–166.

Vogel, P. (2000). Word class distinctions in typological perspective. Linguistic Typology, 4(1), 231-272.

Wang, J., Liang, S., & Ge, G. (2008). Establishment of a medical academic word list. English for Specific Purposes, 27(4), 442–458. https://doi.org/10.1016/j.esp.2008.02.003

Ward, J. (2009). A basic engineering English word list for less proficient foundation engineering undergraduates. English for Specific Purposes, 28(3), 170–182. https://doi.org/10.1016/j.esp.2009.02.003

Wesser, G. (2021). Thuringian festive cakes: Women’s labour of love and the taste of Heimat. In F. Edwards, R. Gerritsen, & G. Wesser (Eds.), Food, senses and the city (pp. 108–121). Routledge.

Xu, X. (2014). A study on the engineering English vocabulary. English for Specific Purposes, 35, 60–69. https://doi.org/10.1016/j.esp.2013.11.002

Xue, G., & Nation, I. S. P. (1984). A university word list. Language Learning and Communication, 3(2), 215–229. https://doi.org/10.1016/0149-0420(84)90011-7

Yathip, C., & Soranastaporn, S. (2011). A Comparison of Moves and the Sequence of Moves in Research Abstracts in Standard and Predatory Journals. Journal of Studies in the English Language, 11(1), 35-48.

Zhu, W. (2004). Faculty views on the importance of writing, the nature of academic writing, and teaching and responding to writing in the disciplines. Journal of Second Language Writing, 13(2), 29–48.

Published

05/11/2026

How to Cite

Kitjaroenpaiboon, W. (2026). Baking Verbs in Action: A Corpus Study of International Bakery and Pastry Recipe Discourse. PASAA, 71. retrieved from https://so11.tci-thaijo.org/index.php/PASAA/article/view/1779

Issue

Section

Research articles

Categories