TransWikia.com

What are the most common Mandarin words for each pinyin syllable?

Chinese Language Asked by Horatio on October 22, 2021

I’ve been practicing my pronunciation by reciting the different pinyin syllables described in my book.

My book lists examples of common words for some syllables. Where could I find a more complete list of words for each syllable?

3 Answers

Blackgreen's answer is an excellent resource, but I would also like to mention that the ABC dictionary is a valuable alternative that gives even more (albeit in an uncompiled format): it also gives the 2nd, 3rd, 4th, etc. most common characters for a given pronunciation (pinyin, including the tone).

You can buy a paper copy (probably - I haven't tried) or add it to the Pleco app (as a paid add-on).

It's very convenient when used from inside Pleco. Here is an example giving its listing of the top characters for jī:

screenshot - search for ji1 in Pleco

The raised digits, to the left of the pinyin (jī here), indicate ABC's assessment of the order of frequency (of usage).

In Pleco you can also leave out the tone and then get a list of all the pronunciations, also sorted by frequency. NB: in the next screenshot, the star "*" next to "几 / jǐ" means that ABC considers "几 / jǐ" the most commonly encountered of all character + reading pairs that are ji (in any tone).

screenshot - search for ji in Pleco
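Searches like these use numbered pinyin (ji1), while the dictionary displays the same reading with a tone mark (jī). As a rough illustration of how the two notations correspond, here is a minimal Python sketch applying the usual tone-mark placement rule (a or e takes the mark; in ou the o takes it; otherwise the last vowel does). The function name `numbered_to_marked` is made up for this example and is not part of Pleco or ABC, and a trailing 0 or 5 is assumed to mean the neutral tone.

```python
# Tone-marked forms of each vowel, indexed by tone 1-4.
TONE_MARKS = {
    "a": "āáǎà", "e": "ēéěè", "i": "īíǐì",
    "o": "ōóǒò", "u": "ūúǔù", "ü": "ǖǘǚǜ",
}


def numbered_to_marked(syllable: str) -> str:
    """Convert numbered pinyin (e.g. 'ji1') to tone-marked pinyin ('jī')."""
    if not syllable or syllable[-1] not in "0123456789":
        return syllable  # no tone digit: leave untouched
    base, tone = syllable[:-1], int(syllable[-1])
    if tone not in (1, 2, 3, 4):
        return base  # assumed neutral tone (0 or 5): written without a mark

    # Placement rule: 'a' or 'e' wins; in 'ou' the 'o' wins; else the last vowel.
    if "a" in base:
        pos = base.index("a")
    elif "e" in base:
        pos = base.index("e")
    elif "ou" in base:
        pos = base.index("ou")
    else:
        vowel_positions = [i for i, ch in enumerate(base) if ch in TONE_MARKS]
        if not vowel_positions:
            return base  # nothing to mark
        pos = vowel_positions[-1]

    marked = TONE_MARKS[base[pos]][tone - 1]
    return base[:pos] + marked + base[pos + 1:]


for s in ["ji1", "ji3", "weng3", "cha4", "ma5"]:
    print(s, "->", numbered_to_marked(s))
```

The sketch handles one syllable at a time; multi-syllable words would need to be split first.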

ABC dictionary also gives frequency indications for large numbers of two-character words.

Here is some further information about it - a direct quote from the ABC dictionary (found online here):

For entries with identical spelling, including tones, arrangement is by order of frequency, indicated by a raised number before the transcription, a device adapted from Western lexicographic practice to distinguish homonyms. In the case of monosyllabic entries, our frequency order is based largely on Xiàndài Hànyǔ Pínlǜ Cídiǎn. In the case of entries of more than one syllable, we have also made use of Zhōngwén Shūmiànyǔ Pínlǜ Cídiǎn. For entries not found in either work, we have made subjective judgments of relative frequency. For entries that are homographic if tones are disregarded, the item of highest frequency is indicated by an asterisk following the transcription. For example:

ba* ¹bā ²bā ³bā . . . ⁶bā ¹bá ²bá ¹bà ²bà . . . ⁶bà

(For the characters corresponding to these transcriptions, see the main body of the dictionary.)

Frequency information, while useful also for students, is provided chiefly as an aid to determine the default items in computer usage. Our unique combination of letters, tone marks, and raised numbers provides a simple and distinctive one-to-one correspondence between transcription and character(s) that is intended to facilitate computerized handling of the entries.
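To make the notation in the quote concrete, here is a small Python sketch that splits an ABC-style label such as ²bā into its frequency rank and transcription, and detects the trailing asterisk. The helper name `parse_abc_label` is hypothetical; this only illustrates the notation described in the quote and says nothing about how ABC or Pleco actually store their data.

```python
SUPERSCRIPT_DIGITS = "⁰¹²³⁴⁵⁶⁷⁸⁹"
# Map raised digits to ordinary digits so int() can read them.
TO_ASCII = str.maketrans(SUPERSCRIPT_DIGITS, "0123456789")


def parse_abc_label(label: str):
    """Split a label like '²bā' or 'ba*' into (rank, transcription, starred).

    rank is None when the label carries no raised number.
    """
    starred = label.endswith("*")
    core = label.rstrip("*")

    # Count the leading raised digits.
    i = 0
    while i < len(core) and core[i] in SUPERSCRIPT_DIGITS:
        i += 1

    rank = int(core[:i].translate(TO_ASCII)) if i else None
    return rank, core[i:], starred


for label in ["ba*", "¹bā", "²bā", "⁶bà"]:
    print(label, "->", parse_abc_label(label))
```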

Finally, it's almost completely off-topic, but here's another plug! I also really like ABC because I find it really useful how it gives parts of speech, particularly for which readings/meanings are what it calls "bound forms" - an exceedingly useful notion for students IMO. Another quote, about bound forms:

Exhibiting a second level of boundness are those characters which do have meaning of their own, and often carry this meaning into many different compound words, but which do not occur independently as free words in standard modern Chinese (though they may be free words in classical Chinese or in very formal written styles of the language). Examples are nǚ 女 'female' in nǚrén 'woman', nǚháizi 'girl', nǚde 'woman, female', and fùnǚ 'woman, women'; and ²shēng 生 'student' in xuéshēng 'student', nánshēng 'male student', nǚshēng 'female student', and zhàoshēng 'recruit students'. Many characters are bound in some meanings but free in others. For example, ²shēng 生, in addition to being bound in the meaning of 'student', is also bound in its meaning of 'life', as in shēnghuó 'life, livelihood' and shēngsǐ 'life and death'. But in the meaning 'to give birth' or 'to be born' it is a free word, a verb. We label such characters B.F., for 'bound form', when they occur only in compound words; and those that are bound in some meanings and free in others are labeled accordingly in the several sub-definitions within their entries.

These categorizations should be valuable to users of the dictionary in at least two ways. Like the other entry labels, N., V., S.V., etc., they enhance the semantic definition of a term, providing grammatical information to improve the user's understanding of how the terms are used. And beyond that, they serve the very practical purpose of a caution sign, indicating that one cannot turn the Chinese-to-English definitional equivalents around and assume that a given English concept is expressed in Chinese by the single character in question. For example, one cannot say that the Chinese word for woman is nǚ or the word for student is shēng, because these are not "words" in Chinese but bound morphemes, or "parts" of words.

Having said that, I'm not convinced ABC is 100% reliable for everything. Other dictionaries also have their merits and are better suited to other purposes. I like to use ABC alongside other dictionaries inside Pleco (such as 现代规范词典, CC-CEDICT, ...).

Answered by goPlayerJuggler on October 22, 2021

I just came across this Chinese Pinyin Table on archchinese.com, which might help you.

Clicking any pinyin combination, for example ma, and selecting [Characters...] takes us to a page where, at the top, we have the characters corresponding to this pinyin syllable (this is the information @blackgreen gave in his answer, though it is probably not ordered by frequency of usage). Clicking Details... additionally shows their meanings.

If you are interested in words containing these characters, you can find them further down the page, along with example sentences using these words.


You could also check the Yabla Chinese dictionary or the Pleco dictionary (for mobile phones). Searching for the pinyin syllable gives results for common characters.

Answered by Puco4 on October 22, 2021

After a bit of searching I could not find a proper resource to answer this question, so I compiled a comprehensive list by hand.

Since you say that the most common character for each initial+final+tone combination is enough for you, the answer is rather easy: we can cross-reference the pinyin and tone combinations with a list of character frequency in Chinese.

Character frequency source: hanzicraft.com
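The cross-referencing step can be sketched in a few lines of Python. This is a minimal illustration, not the procedure actually used for the table (which was compiled by hand): the character order and readings below are toy data standing in for a real frequency list and a real pronunciation dictionary, and the function name is made up for this example.

```python
def most_common_per_reading(freq_ranked, readings):
    """Return {reading: character} for the highest-ranked character per reading.

    freq_ranked: characters ordered from most to least frequent.
    readings:    {character: [numbered pinyin readings]}.
    """
    best = {}
    for char in freq_ranked:
        for reading in readings.get(char, ()):
            # setdefault keeps the first (most frequent) character seen.
            best.setdefault(reading, char)
    return best


# Toy data: the ORDER IS ILLUSTRATIVE ONLY, not real frequency figures.
freq_ranked = ["吗", "马", "差", "妈", "麻", "骂", "码", "插"]
readings = {
    "吗": ["ma5"],
    "马": ["ma3"],
    "差": ["cha1", "cha4"],  # one character, two readings
    "妈": ["ma1"],
    "麻": ["ma2"],
    "骂": ["ma4"],
    "码": ["ma3"],
    "插": ["cha1"],
}

table = most_common_per_reading(freq_ranked, readings)
# One table row: tones 1-4 plus the neutral tone (written 5 here).
print("ma", " ".join(table.get(f"ma{t}", "-") for t in range(1, 6)))
```

Listing several readings for one character, as for 差 here, makes it eligible under each of them. That is the "frequency is absolute" option; disambiguating by hand instead means listing only the reading you judge to be dominant.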

How to read this table:

  • Each row comes in the form <pinyin syllable> <1st tone char> <2nd tone char> <3rd tone char> <4th tone char> <neutral tone char, if applicable>

  • If a particular pinyin syllable and tone combination has no corresponding character, I add a note in place of the character. Other bracketed comments refer to the preceding character.

  • Note that some of these characters are not common relative to the others, but are the most common for a particular pinyin+tone combination. Again, the source is the character frequency table. For example, the most common character for weng3 appears to be one you are unlikely to meet any time soon in your studies. Nevertheless, it's the most common one for that pronunciation, so I still list it for completeness.

  • Some characters have alternate pronunciations, e.g. 差别的「差」and 差评的「差」. The frequency list is based on the total frequency of the character, regardless of the pronunciation. So is the most common character for cha1 差别的「差」, or something else (插、叉、etc.)? I try to disambiguate based on my personal knowledge of the language. Feel free to correct me.

  • an argument can be made that a character's frequency is absolute, therefore a character like「差」should appear both for cha1 and cha4.

  • In the title of your question, you ask for words. A word is a single distinct meaningful element of speech or writing. Since Chinese is an isolating language, words often overlap with characters. Furthermore, a combination of pinyin+tone applies to a character. So this is a list of characters. Some of them may also be words, some may not.
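For anyone who wants to process the table programmatically, the bracketed "(no ...)" notes can be picked out with a regular expression. Here is a minimal Python sketch, assuming each row is plain text in the format described above. The function name is hypothetical, and other notes such as "dubious" or "not in first 9000" are deliberately ignored.

```python
import re

# Matches notes such as "(no ka2)": a syllable followed by a tone digit 1-5.
NO_TONE = re.compile(r"\(no ([a-zü]+)([1-5])\)")


def missing_tones(row: str):
    """Return (syllable, sorted list of tones marked as having no character)."""
    syllable = row.split()[0]
    tones = sorted(
        int(tone)
        for noted_syllable, tone in NO_TONE.findall(row)
        if noted_syllable == syllable  # ignore notes about other syllables
    )
    return syllable, tones


print(missing_tones("ka (no ka2) (no ka4)"))
print(missing_tones("me (no me1) (no me2) (no me3) (no me4)"))
```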

Pinyin  阴平 (1st tone)  阳平 (2nd tone)  上声 (3rd tone)  去声 (4th tone)  轻声 (neutral tone)
a
ba
pa (no pa3)
ma
fa 发 (hair)
da
ta (no ta2)
na 那 (surname)
la
ga
ka (no ka2) (no ka4)
ha 哈 (same as 1) (no ha4)
zha 炸 (to fry) 炸 (to explode)
cha
sha
za (no za4)
ca (no ca2) (no ca4)
sa (no sa2) 撒 (same as 1)
ai
bai
pai (no pai3)
mai (no mai1)
dai (no dai2)
tai (no tai3)
nai (no nai1) (no nai2)
lai (no lai1) (no lai3)
gai (no gai2)
kai (no kai2)
hai
zhai
chai
shai (no shai2)
zai (no zai2) 在 (to stay)
cai
sai (no sai2) (no sai3)
an 俺 (topolect)
ban (no ban2)
pan (no pan3)
man
fan
dan (no dan2)
tan
nan 难 (disaster)
lan (no lan1)
gan 干 (dry) (no gan2) 干 (to do)
kan (no kan2)
han
zhan (no zhan2)
chan
shan (no shan2)
ran (no ran1) (no ran4)
zan
can
san (no san2)
ang (no ang3)
bang (no bang2)
pang 膀 (or 乓)
mang (no mang4)
fang 访
dang (no dang2)
tang
nang
lang
gang (no gang2)
kang (no kang3)
hang (no hang3)
zhang (no zhang2)
chang
shang (no shang2)
rang (no rang1)
zang (no zang2)
cang (no cang3) (no cang4)
sang (no sang2)
ao
bao
pao
mao
dao (no dao2)
tao
nao
lao
gao (no gao2)
kao (no kao2)
hao
zhao
chao (no chao4)
shao 少 (few) 少 (young)
rao (no rao1)
zao
cao
sao (no sao2) 扫 (to sweep) 扫 (broom)
e 饿
me (no me1) (no me2) (no me3) (no me4)
de (no de3) (no de4)
te (no te1) (no te2) (no te3)
ne (no ne1) (no ne2) (no ne3)
le (no le1) (no le2) (no le3) 了 (particle)
ge
ke
he (no he3)
zhe
che (no che2)
she
re (no re1) (no re2)
ze (no ze1) (no ze3)
ce (no ce1) (no ce2) (no ce3)
se (no se1) (no se2) (no se3)
ei
bei (no bei2)
pei (no pei3)
mei (no mei1)
fei
dei (no dei1) (no dei2) (no dei4)
nei (no nei1) (no nei2) (nei3 dubious)
lei
gei (gei1 dubious) (no gei2) (no gei4)
kei (no kei2) (no kei3) (no kei4)
hei (no hei2) (no hei3) (no hei4)
shei (no shei1) (no shei3) (no shei4)
zei (no zei1) (no zei3) (no zei4)
en (no en2) (no en3)
ben (no ben2)
pen (no pen3)
men 闷 (stuffy) (no men3) 闷 (depressed)
fen 分 (divide) 分 (part)
den (no den1) (no den2) (no den3)
nen (no nen1) (no nen2) (no nen3)
gen
ken (no ken1) (no ken2)
hen (no hen1)
zhen (no zhen2)
chen
shen
ren (no ren1)
zen (no zen1) (no zen2)
cen (no cen3) (no cen4)
sen (no sen2) (no sen3) (no sen4)
eng (no eng2) (no eng3) (no eng4)
beng (not in the first 9000)
peng
meng
feng
deng (no deng2)
teng (no teng1) (no teng3) (no teng4)
neng (no neng1) (no neng3) (no neng4)
leng (no leng1)
geng 更 (to change) (no geng2) 更 (more)
keng (no keng2) (no keng3) (no keng4)
heng 横 (across) (no heng3) 横 (perverse)
zheng (no zheng2)
cheng
sheng
reng (no reng3)
zeng (no zeng3)
ceng (no ceng3)
seng (no seng2) (no seng3) (no seng4)
er (no er1)
yi 一 (one)
bi
pi
mi
di
ti
ni
li (no li1)
ji
qi
xi 西
zhi
chi
shi 使 是 (to be)
ri (no ri1) (no ri2) (no ri3)
zi (zi2 not in first 9000)
ci
si (no si2)
ya
bia 吧 (dubious) (no bia2) (no bia3) (no bia4)
dia (no dia1) (no dia2) (no dia4)
lia (no lia1) (no lia2) (no lia4)
jia
qia (no qia2)
xia (no xia3)
yan
bian (no bian2)
pian 片 (sheet) 便 (cheap) 片 (slice)
mian (no mian1)
dian (no dian2)
tian
nian
lian (no lian1)
jian (no jian2)
qian
xian
yang
niang (no niang1) (no niang3)
liang (no liang1)
jiang (no jiang2)
qiang
xiang
yao 要 (require) 要 (want)
biao
piao
miao
diao (no diao2) (diao3 dubious)
tiao 挑 (choose) 挑 (provoke)
niao (no niao1) (no niao2) 尿
liao
jiao 矫 (jiao2)
qiao
xiao
ye
bie 别 (other) 别 (as in 别扭)
pie (no pie2) (no pie4)
mie (no mie2) (no mie3)
die (no die3) (no die4)
tie (no tie2)
nie (no nie3)
lie (no lie2)
jie
qie
xie
yin
bin (no bin2) (no bin3)
pin
min (no min1) (no min4)
nin (no nin1) (no nin3) (no nin4)
lin
jin (no jin2)
qin
xin
ying
bing (no bing2)
ping (no ping3) (no ping4)
ming (no ming1)
ding (no ding2)
ting 听(tw.)
ning (no ning1)
ling
jing (no jing2)
qing
xing
yo (no yo2) (no yo3) (no yo4)
yong
jiong (no jiong2) (no jiong4)
qiong (no qiong3) (no qiong4)
xiong (no xiong3)
you
miu (no miu1) (no miu2) (no miu3)
diu (no diu2) (no diu3) (no diu4)
niu
liu
jiu (no jiu2)
qiu (no qiu4)
xiu (no xiu2)
o 哦 (same as 2)
bo (bo4 dubious)
po
mo
fo (no fo1) (no fo3) (no fo4)
lo (no lo1) (no lo2) (no lo3) (no lo4)
weng (no weng2)
dong (no dong2)
tong
nong (no nong1) (no nong3)
long 隆 (sound of drums) 弄 (alley)
gong (no gong2)
kong (no kong2)
hong
zhong (no zhong2)
chong
rong (no rong1) (no rong4)
zong (no zong2)
cong (no cong3) (no cong4)
song 怂 (colloq.)
ou (no ou2)
pou (no pou4)
mou (no mou4)
fou (no fou1) (no fou2) (no fou4)
dou (no dou2)
tou
nou (no nou1) (no nou2) (no nou3)
lou
gou (no gou2)
kou (no kou2)
hou
zhou
chou
shou
rou (no rou1) (no rou3)
zou (no zou2)
cou (no cou1) (no cou2) (no cou3)
sou (no sou2)
wu
bu 不 (no)
pu
mu (no mu1)
fu
du
tu
nu (no nu1)
lu
gu (no gu2)
ku (no ku2)
hu
zhu
chu
shu
ru (no ru1)
zu (no zu4)
cu
su (no su3)
wa
gua (no gua2)
kua (no kua2)
hua (no hua3)
zhua (no zhua2) (no zhua4)
chua (no chua2) (no chua3) (no chua4)
shua (no shua2)
wai (no wai2)
guai (no guai2)
kuai (no kuai1) (no kuai2)
huai (no huai1) 怀 (no huai3)
zhuai (no zhuai2)
chuai 揣 (same as 1)
shuai (no shuai2)
wan
duan (no duan2)
tuan
nuan (no nuan1) (no nuan2) (no nuan4)
luan (no luan1)
guan (no guan2)
kuan (no kuan2) (no kuan4)
huan
zhuan (no zhuan2)
chuan 穿
shuan (no shuan2) (no shuan3)
ruan (no ruan1) (not in first 9000) (no ruan4)
zuan (no zuan2)
cuan 蹿 (no cuan3)
suan (no suan2) (not in first 9000)
wang
guang (no guang2) 广
kuang
huang
zhuang (no zhuang2)
chuang
shuang (no shuang2) (no shuang4)
yue (no yue2)
nüe (no nüe1) (no nüe2) (no nüe3)
lüe (no lüe1) (no lüe2) (no lüe3)
jue (no jue4)
que (no que3)
xue
wei
dui (no dui2)
tui 退
gui (no gui2)
kui
hui
zhui (no zhui2) (no zhui3)
chui (no chui3) (no chui4)
shui (no shui1)
rui (no rui1)
zui (not in the first 9000) (no zui2)
cui (no cui2)
sui
wen
dun (no dun2)
tun
nun (no nun1) (no nun3) (no nun4)
lun (no lun3)
gun (no gun1) (no gun2)
kun (no kun2)
hun (no hun3)
zhun (no zhun2) (no zhun4)
chun (no chun4)
shun (no shun1) (no shun2)
run (no run1) (no run3)
zun (no zun2) (zun4 dubious)
cun
sun (no sun2) (no sun4)
wo (no wo2)
duo
tuo
nuo (no nuo1)
luo
guo
kuo (no kuo1) (no kuo2) (no kuo3)
huo
zhuo (no zhuo3) (no zhuo4)
chuo (no chuo2) (no chuo3)
shuo (no shuo2) (no shuo3)
ruo (no ruo1) (no ruo3)
zuo 作 (troublesome)
cuo
suo (no suo2) (no suo4)
yu
nü (no nü1) (no nü2)
lü (no lü1)
ju
qu
xu
yuan
juan (no juan2)
quan
xuan
yun
jun (no jun2) (no jun3)
qun (no qun3) (no qun4)
xun (no xun3)

Answered by blackgreen on October 22, 2021
