Detailed Explanation of the Shitu Medical Input Method Lexicon

Having many words is good, but don’t overindulge.

If you already have a local large model for daily use, just enable the corresponding lexicon. Loading a large number of lexicons simultaneously will seriously affect the input experience. The maintenance levels of various lexicons differ, so you can refer to the Recommended Level column in the table below to decide which to enable.

Chinese Lexicons

Filename Main Content Number of Entries (Not Updated in Real Time) Recommended Level
cn_tcm_origin.dict.yaml Traditional Chinese Medicine (TCM) vocabulary
Relatively large; it is recommended to mount other smaller lexicons
Source
cn_tcm_clinician.dict.yaml Directory of TCM clinicians’ names, such as Zhang Zhongjing, Hu Xishu
cn_tcm_herb.dict.yaml Chinese herbal medicine lexicon, based on cn_medicine_list.dict.yaml and cn_tcm.dict.yaml simplified + personal additions, only including herb names, such as Sangzhi (Mulberry Twig)
cn_tcm_patent.dict.yaml Chinese patent medicine lexicon, based on cn_medicine_list.dict.yaml simplified + personal additions, only including patent medicine names, such as Jiawei Xiaoyao San
cn_tcm_formula Chinese formula names, now including all formulas from Shang Han Lun[1]
cn_tcm_acupuncture.dict.yaml Chinese acupuncture points lexicon
Source
cn_anatomy.dict.yaml Chinese anatomy professional lexicon
Source
6k+
cn_pharmacology.dict.yaml Chinese pharmacology professional lexicon, simplified, supplemented, and modified based on cn_clinic_origin.dict.yaml
cn_clinic_origin.dict.yaml Chinese medical vocabulary, such as disease names and drug names, relatively large; it is recommended to mount other smaller lexicons, such as pharmacology lexicon (cn_pharmacology)
Source
90k+ Strongly not recommended due to lag caused by large size
cn_clinic_dedulpicate.dict.yaml Content remaining after splitting cn_clinic_origin.dict.yaml into other smaller lexicons
cn_medicine_list_origin.dict.yaml Chinese Chinese/Western medicine drug names, including Chinese herbs, patent medicines, Western medicines, and Western medicine preparations.
e.g., Sangzhi (Mulberry Twig), Sang Guo Tang lozenges, Azithromycin, Sorbitol injection
Relatively large; it is recommended to mount other smaller lexicons, such as the Chinese Western medicine drug name lexicon (cn_medicine_list.dict_tiny)
Source
4.9k+ 4
cn_medicine_list_dedulplicate.dict.yaml Content remaining after splitting cn_medicine_list_origin.dict.yaml into other smaller lexicons
cn_medicine_tiny.dict.yaml Chinese Western medicine drug names, simplified based on cn_medicine_list.dict.yaml, only retaining drug names and removing preparation names
e.g., Azithromycin is retained, while preparation names such as Azithromycin granules, Azithromycin dispersible tablets, and Azithromycin capsules are removed
4.8k 5
cn_respiratory.dict.yaml Chinese respiratory medicine lexicon, Source 2k+ (statistics as of 2025-02-02)

English Lexicons

Filename Description Number of Entries (Not Updated in Real Time) Recommended Level
en_MAVL.dict.yaml The Medical Academic Vocabulary List (MAVL) was developed in 2015 by Lei Lei & Liu Dilin based on studies of a 2.7 million-word medical academic English corpus and a 3.5 million-word medical English textbook corpus. MAVL coverage is 19.44% and 20.18% in the two corpora respectively.[2] <br> Source 1.5k+ 5
en_disease.dict.yaml English disease names <br> Source 16k+ Not yet optimized, enable as needed
en_anatomy.dict.yaml English anatomy <br> Based on open textbook Anatomy and Physiology by OpenStax <br> Open Source License
en_medication.dict.yaml English medication names <br> Source 3k+ Not yet optimized, enable as needed
en_medical_speciality.dict.yaml English medical specialties - <br> Source
google.dict.yaml Google’s 1/3 million most frequent English words. - <br> Source
en_clinic_origin.dict.yaml English medical dictionary, currently being split into smaller lexicons; mounting directly will cause crashes <br> Source: English-Chinese Chinese-English Medical Dictionary 440k+ Strongly not recommended due to lag caused by large size
en_respiratory.dict.yaml English respiratory medicine lexicon, Source 2.5k+ (statistics as of 2025-02-02)

References


  1. 伤寒论113方集锦(附剂量换算) - 经方派 ↩︎

  2. Lei, L., Liu, D. (2016) ‘A new medical academic word list: A corpus-based study with enhanced methodology’, Journal of English for Academic Purposes, Vol 22, p.42-53. ↩︎