
GitHub - wainshine/Company-Names-Corpus: A corpus of company names.

source link: https://github.com/wainshine/Company-Names-Corpus

README.md

Company Names Corpus (Company-Names-Corpus)

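A by-product of the side project "萌名" (a naming tool built on corpus techniques). Updated irregularly. Entries are only removed, never added.
It can be used for Chinese word segmentation and organization name recognition.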
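As a rough illustration of the organization name recognition use case, the sketch below does dictionary lookup with forward maximum matching in plain Python. It is not part of the project: the function names `load_names` and `find_org_names` and the sample entries are hypothetical, and the corpus is assumed to be a text file with one company name per line.

```python
def load_names(lines):
    """Build a set of names from an iterable of lines, skipping blank lines."""
    return {line.strip() for line in lines if line.strip()}


def find_org_names(text, names, max_len=None):
    """Forward maximum matching: at each position take the longest entry found.

    Returns a list of (start, end, name) tuples, with end exclusive.
    """
    if not names:
        return []
    if max_len is None:
        max_len = max(len(n) for n in names)
    matches = []
    i = 0
    while i < len(text):
        hit = None
        # Try the longest candidate first, then shrink the window.
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in names:
                hit = (i, j, text[i:j])
                break
        if hit:
            matches.append(hit)
            i = hit[1]  # continue after the matched name
        else:
            i += 1
    return matches


# Hypothetical sample entries, made up for this example (not taken from the corpus).
sample = ["北京萌名科技有限公司", "萌名科技", ""]
names = load_names(sample)
text = "他在北京萌名科技有限公司工作"
print(find_org_names(text, names))
```

With the real corpus, `load_names` could be passed an open file handle instead of the sample list. Because the corpus still contains bad cases, matches from a lookup like this are best treated as candidates rather than confirmed organization names.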

4.8 million entries. A large number of bad cases remain even after cleaning.


Compiled by @萌名

2018.10.10

