Core Urdu Vocabulary for Chines Business Community in Pakistan, A Corpus-based Perspective


  • Abrar Hussain Qureshi Institute of Southern Punjab Multan, Pakistan
  • Shamim Akhter Institute of Southern Punjab Multan, Pakistan
  • Musarat Shaheen Institute of Southern Punjab Multan, Pakistan



Urdu, Language Learning, Vocabulary Building, Corpus, Word List, Pakistan


With the dawn of 21st century, the world has grown into a global village and the need for inter-communal interactions has also increased many times.  Urdu language is said to be one the third biggest language of the world along with Chines and English and its speakers are constantly on the rise. With the emergence of the CPEC (China Pakistan Economic Corridor), Urdu has assumed ever increasing importance due to the geo-political and geo-economic condition of the south Asian region. The undertaken study is a systematic attempt in this regard to work out a list of most frequent words of the Urdu language for the Chinese business community in Pakistan.  Schmitt (2000) asserts that that learning a non-native vocabulary is a continual process as the core vocabulary should encompass the ever changing linguistic needs of the time. The name of the Urdu corpus for the undertaken research is urTenTen that has been compiled from internet data. The corpus belongs to TenTen corpus family that is corpora of the web with more than ten billion words. The corpus has been tagged according to Unified Parts of Speech (POS) Standard in Indian Languages.  In order to process data, “sketch Engine” has been used. List of frequent words for the Chines Business Community has been retrieved from urTenTen corpus with the help of sketch engine.  The retrieved list of core Urdu vocabulary is supposed to be useful for the Chines business who is supposed to interact with the Urdu speakers of the region.


Bauer, L.(1998). Vocabulary. London and New York: Routledge.

Biemiller, A. (2004). Vocabulary instruction (pp. 159-176). New York: Guilford Press.

Cacoullos, R. T., & Walker, J. A. (2009). Grammatical variation and collocations in discourseLanguage

Gleason, H.S. Jr. (1961). An Introduction to Descriptive Linguistics. New Delhi: Oxford and IBH Publishing Company

Granger, S. (1998). The computer learner corpus: London: Longman.

Henriksen, B. (1999). Three dimensions of vocabulary development, OUP.

Hoey, M. (2000) ‘A world beyond collocation London

McCarthy, M. J. (2001). Issues in Applied Linguistics. Cambridge: Cambridge University Press

Mukoroli, J. (2011).Effective Vocabulary Teaching Strategies: Longman

Reppen, R. (2009). English language teaching and corpus linguistics: London: Continuum Press.

Moon, R. (2006). Introducing Metaphor, Routledge

Neuman, S. B., & Dwyer, J. (2009): Vocabulary instruction in pre-K. OUP

Qian, D.D. (2002). Investigating the Relationship between Vocabulary Knowledge and Academic Reading Performance: Continuum press.

Schmitt, N. (2000). Vocabulary Learning Strategies. Cambridge University press




How to Cite

Abrar Hussain Qureshi, Shamim Akhter, & Musarat Shaheen. (2020). Core Urdu Vocabulary for Chines Business Community in Pakistan, A Corpus-based Perspective. Review of Economics and Development Studies, 5(4), 637-646.