Vocabulary Analyzer Based on CEFR-J Wordlist for Self-Reflection (VACSR) Version 2

  • Received : 2023.10.01
  • Accepted : 2023.12.10
  • Published : 2023.12.31

Abstract

This paper presents a revised version of the Vocabulary Analyzer Based on CEFR-J Wordlist for Self-Reflection (VACSR), called VACSR v.2.0. The initial version of VACSR automatically analyzes the occurrences and levels of vocabulary items in transcribed texts, reporting their frequency, the vocabulary items left unused, and the items not belonging to either scale. However, it could not distinguish words with multiple parts of speech, because such words share an identical headword representation. Its result tables for different corpora also needed to be more explanatory. VACSR v.2.0 overcomes both limitations. First, unlike VACSR v.1, it distinguishes words by part of speech through syntactic parsing with Stanza, an open-source Python library, so that the same lexical item can be categorized separately for each of its parts of speech. Second, it addresses the limited clarity of VACSR v.1 by providing more precise result output tables. The updated software compares the occurrences of vocabulary items in classroom corpora against each level of the Common European Framework of Reference-Japan (CEFR-J) wordlist. In a pilot study, two English classes taught by a preservice English teacher were converted into corpora and analyzed with VACSR v.2.0; most of the headwords used corresponded to CEFR-J level A1. In practice, VACSR v.2.0 will promote users' reflection on their vocabulary usage and can be applied to teacher training.
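The part-of-speech distinction described above can be illustrated with a minimal sketch. It is not the VACSR implementation: the function name `level_profile`, the toy wordlist, and the level assignments in it are all hypothetical and are not taken from the actual CEFR-J wordlist. The sketch assumes the transcript has already been reduced to (lemma, part-of-speech) pairs, as a tagger such as Stanza could supply, and it keys the wordlist on the pair rather than on the headword alone.

```python
from collections import Counter

# Hypothetical toy wordlist keyed by (headword, POS).
# The levels are illustrative only, not actual CEFR-J entries.
TOY_WORDLIST = {
    ("book", "NOUN"): "A1",
    ("book", "VERB"): "A2",
    ("play", "VERB"): "A1",
    ("play", "NOUN"): "A2",
    ("teacher", "NOUN"): "A1",
}


def level_profile(tagged_tokens, wordlist):
    """Count occurrences per level, plus unlisted and unused items.

    tagged_tokens: iterable of (lemma, pos) pairs, assumed to come
    from an upstream tagger (for example, Stanza lemmas and POS tags).
    """
    per_level = Counter()   # occurrences per wordlist level
    unlisted = Counter()    # (lemma, pos) pairs absent from the wordlist
    used = set()            # wordlist entries that occurred at least once
    for lemma, pos in tagged_tokens:
        key = (lemma.lower(), pos)
        level = wordlist.get(key)
        if level is None:
            unlisted[key] += 1
        else:
            per_level[level] += 1
            used.add(key)
    # Wordlist entries that never occurred in the transcript.
    unused = sorted(key for key in wordlist if key not in used)
    return per_level, unlisted, unused


# "book" appears once as a noun and once as a verb; a headword-only
# lookup would merge the two, while the (headword, POS) key keeps them apart.
tagged = [
    ("book", "NOUN"),
    ("book", "VERB"),
    ("play", "VERB"),
    ("teacher", "NOUN"),
    ("zebra", "NOUN"),
]
per_level, unlisted, unused = level_profile(tagged, TOY_WORDLIST)
print(dict(per_level))
print(dict(unlisted))
print(unused)
```

Keying on the pair is one simple way to keep identical headwords apart; the three return values loosely mirror the frequency, unlisted, and unused categories mentioned in the abstract.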

References

  1. Anthony, L. (2022). AntConc (Version 4.2.0) [Computer Software]. Tokyo, Japan: Waseda University. Available from https://www.laurenceanthony.net/software
  2. Coxhead, A. (1998). The development and evaluation of an academic word list (Master's thesis, Victoria University of Wellington, New Zealand).
  3. Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34(2), 213-238. https://doi.org/10.2307/3587951
  4. Kilgarriff, A., Baisa, V., Busta, J., Jakubicek, M., Kovar, V., Michelfeit, J., Rychly, P., & Suchomel, V. (2014). The sketch engine: Ten years on. Lexicography, 1, 7-36.
  5. Negishi, M., Takada, T., & Tono, Y. (2013). A progress report on the development of the CEFR-J. In E. D. Galaczi & C. J. Weir (Eds.), Exploring language frameworks. Proceedings of the ALTE Krakow Conference, July 2011, 135-163.
  6. Ohashi, Y., & Katagiri, N. (2020a). The Ratios of CEFR-J vocabulary usage compared with GSL and AWL in elementary EFL classrooms and suggestions of vocabulary items to be taught. Asia Pacific Journal of Corpus Research, 1(1), 35-65.
  7. Ohashi, Y., Katagiri, N., & Oshikiri, T. (2022b). Developing classroom corpus tagger for teachers' reflective practice: A spoken language tagger to compile classroom corpora. English Corpus Studies, 29, 41-62.
  8. Ohashi, Y., Katagiri, N., & Oshikiri, T. (2022). Vocabulary analyzer based on CEFR-J wordlist for self-reflection (VACSR): From classroom corpus compilation to self-reflection. International Journal of Language Learning and Applied Linguistics World, 31(1), 1-15.
  9. Qi, P., Zhang, Y., Zhang, Y., Bolton, J., & Manning, C. (2020). Stanza: A Python Natural Language Processing Toolkit for Many Human Languages. https://nlp.stanford.edu/pubs/qi2020stanza.pdf
  10. Schmitt, N. (2010). Researching Vocabulary: A Research Manual. Basingstoke: Palgrave Macmillan.
  11. West, M. (1953). A General Service List of English Words. Longman, London.
  12. Penn Treebank P.O.S. Tags. (n.d.). www.ling.upenn.edu. https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html