RCPCE Profession-specific Corpora


The collection of RCPCE profession-specific corpora is developed by the Research Centre for Professional Communication in English (RCPCE) of The Hong Kong Polytechnic University. It contains real-life texts, discourses and genres collected from different professional communities and contexts in Hong Kong. It also contains two corpora of research/journal articles published in high impact journals of up to 39 disciplines.

Professionals, researchers, teachers, students and other language users can leverage these language resources to enhance their language proficiency and professional communicative competencies.

Click here for the list of publications by users of RCPCE Profession-specific Corpora.

If there are questions, please contact our Project Associate Mr. Amos Yung (amos.yung[at]polyu.edu.hk).


Choose the profession-specific corpus that you wish to search:
 
     
Professional Communication Corpora - English      
  • Hong Kong Corpus of Spoken English
  • ConcGramOnline Part-of-speech
    Speech-act
  • Hong Kong Corpus of Surveying and Construction Engineering
  • ConcGramOnline Part-of-speech  
  • Hong Kong Engineering Corpus
  • ConcGramOnline Part-of-speech  
  • Hong Kong Financial Services Corpus
  • ConcGramOnline Part-of-speech  
  • Hong Kong Corpus of Corporate Governance Reports
  • ConcGramOnline Part-of-speech  
  • Hong Kong Corpus of Corruption Prevention
  • ConcGramOnline Part-of-speech  
           
    Political and Governmental Addresses Corpora in English      
  • Addresses by the Hong Kong Governors and the Hong Kong Special Administrative Region Chief Executives
  • KWIC    
  • Speeches by the Republic of China (Taiwan) Presidents
  • KWIC    
  • Reports on the work of the government by the Premier of the State Council, People's Republic of China
  • KWIC    
  • Hong Kong Budget Corpus
  • ConcGramOnline Part-of-speech  
           
    Political and Governmental Addresses Corpora in other languages      
  • 香港總督及香港特別行政區行政長官施政報告
  • KWIC    
  • 中華民國總統演說
  • KWIC    
  • 國務院總理政府工作報告
  • KWIC    
           
    [ Academic Corpora in English ]      
  • Corpus of Research Articles 2007
  • ConcGramOnline Part-of-speech  
  • Corpus of Journal Articles 2014
  • ConcGramOnline Part-of-speech  
           
    Other English Corpora Resources in ENGL, PolyU      
  • The PolyU Language Bank
  • Link    
  • CQP web for Language Corpora (LAMAL)
  •     - Please email Dr. Xu at egxu[at]polyu.edu.hk for access password
    Link    
           
    Other Asian Languages Corpora Resources in ENGL, PolyU      
  • PolyU Corpus of Spoken Chinese: Cantonese
  • Link    
  • PolyU Corpus of Spoken Chinese: Mandarin
  • Link    
  • PolyU Corpus of Spoken Chinese: Chaozhou
  • Link    
  • PolyU Spoken Corpus of Asian Languages: Indonesian
  • Link    
  • PolyU Spoken Corpus of Asian Languages: Japanese
  • Link    
  • PolyU Spoken Corpus of Asian Languages: Korean
  • Link    
  • PolyU Spoken Corpus of Asian Languages: Hindi
  • Link    
           

    The computer software used to search the corpora is ConcGramOnline© designed and written by Chris Greaves. A newer revision was developed by RCPCE in early 2019 for searching across multiple sub-corpora. You can search for a word, or a phrase, or an additional word or phrase in combination with your search word or phrase, and find examples of contextual use in concordances.

    In addition to the software ConcGramOnline©, the centre developed a part-of-speech tag search software in 2017. Our corpora were tagged with Penn Treebank part-of-speech tag set by Stanford POS Tagger. The search engine can search for any n-grams and skipgrams with a specific part-of-speech in a corpus.

     

    Click here for tutorial on how to use the ConcGramOnline©
    Click here for tutorial on how to use the part-of-speech search

     

    Concgrams, ConcGram© and ConcGramCore


    Please click here to learn more about concgramming, the ConcGram software or to download a free copy of ConcGramCore software that can search concgrams in your corpus automatically.




    Back to RCPCE Home Page



    This website uses Google Analytics to monitor the performance of the website and the usage of our corpora, please read our Google Analytics Statement for details.

    Last updated on 21 January, 2019