PRC White Paper Corpus

Welcome to the PRC White Paper Corpus (PRAWPC). The PRAWPC is a large collection of white papers published by the State Council Information Office of the People's Republic of China. Topics include human rights, national defense, religious issues, population issues, energy, environmental issues, intellectual property rights, food and drug security, the Internet, and Tibet and Xinjiang, among others.

The corpus contains 1,011,812 English words.

The segmented Chinese corpora (in both Simplified Chinese and Traditional Chinese) are also provided for concordance analysis.

  • You can search for a word (e.g. policy, power, people) or a phrase (e.g. human rights, the Belt and Road) and find examples of its use in context.

  • You can also combine your search word with an additional word, e.g. Chinese (search word) and people (additional word), or combine a search phrase with an additional word, e.g. the Belt and Road (search phrase) and initiative (additional word).
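The two search modes above can be illustrated with a minimal keyword-in-context sketch. This is not the site's actual implementation: the function name `concordance`, its parameters, and the sample sentence are all hypothetical, and the additional word is simply assumed to be matched anywhere within the displayed context.

```python
import re

def concordance(text, search, additional=None, span=30):
    """Return (left, keyword, right) tuples for each hit of `search`.

    span: characters of context kept on each side (hypothetical default).
    additional: if given, keep only hits whose context also contains it.
    """
    hits = []
    # Word boundaries stop "people" from matching inside "peoples".
    pattern = re.compile(r"\b" + re.escape(search) + r"\b", re.IGNORECASE)
    for m in pattern.finditer(text):
        left = text[max(0, m.start() - span):m.start()]
        right = text[m.end():m.end() + span]
        # Assumed rule: the additional word must occur in the context.
        if additional and additional.lower() not in (left + right).lower():
            continue
        hits.append((left, m.group(0), right))
    return hits

# Made-up sample text, not taken from the corpus.
sample = ("The Chinese people support the Belt and Road Initiative. "
          "Chinese enterprises invest abroad.")

for left, kw, right in concordance(sample, "Chinese", additional="people"):
    print(f"{left:>30} [{kw}] {right}")
```

Without the `additional` argument the same call returns every occurrence of the search word; with it, only the hits whose surrounding context contains the second word are kept.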

Please select corpus/corpora **

Input search word/phrase

Additional word/phrase

Search span (characters)

Extended span* (characters)

  • You can search for a word or a phrase (e.g. Hong Kong, future).

  • You can adjust the span width on the left and right of the concordances (20-50 characters).

  • * You can also specify the number of characters displayed on the left and right of the Search span (20-100 characters).

  • ** Hold down the Ctrl (MS Windows) / Command (Mac) key to select multiple sub-corpora in the corpus.
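As a rough illustration of how the two span settings might interact, the sketch below clamps each value to its stated range (20-50 and 20-100 characters) and splits the context around a hit into an inner window (the search span) and an outer window (the extended span). The helper names `clamp` and `context_windows` and their defaults are hypothetical, and the reading that the extended span sits outside the search span is an assumption, not a description of the site's code.

```python
def clamp(value, low, high):
    """Force `value` into the inclusive range [low, high]."""
    return max(low, min(high, value))

def context_windows(text, start, end, span=30, extended=50):
    """Split the context around text[start:end] into five pieces.

    Assumed layout: outer_left | inner_left | keyword | inner_right | outer_right
    """
    span = clamp(span, 20, 50)           # Search span: 20-50 characters
    extended = clamp(extended, 20, 100)  # Extended span: 20-100 characters
    inner_start = max(0, start - span)
    outer_start = max(0, inner_start - extended)
    inner_left = text[inner_start:start]
    inner_right = text[end:end + span]
    outer_left = text[outer_start:inner_start]
    outer_right = text[end + span:end + span + extended]
    return outer_left, inner_left, text[start:end], inner_right, outer_right
```

Out-of-range requests are silently pulled back into range here; a real form might instead reject them or offer only the valid values.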
