Copilot is your AI companion
Always by your side, ready to support you whenever and wherever you need it.
Microsoft Research Asia Chinese Word-Segmentation Data Set
A set of manually annotated Chinese word-segmentation data and specifications for training and testing a Chinese word-segmentation system for research purposes. Last published: August 16, 2007.
Important! Selecting a language below will dynamically change the complete page content to that language.
Version:
1.0
Date Published:
7/15/2024
File Name:
msra-chinese-word-segmentation-data-v1.zip
File Size:
4.4 MB
A set of manually annotated Chinese word-segmentation data and specifications for training and testing a Chinese word-segmentation system for research purposes. The data was extracted from the People's Daily, which we have licensed for commercial usage, and the annotation was done by the Natural Language Computing group within Microsoft Research Asia.Supported Operating Systems
Windows 10, Windows 7, Windows 8
- Windows 7, Windows 8, or Windows 10
- Click Download and follow the instructions.
Follow Microsoft