DavidChenLondon/chinese-name-gender-analyse

This branch is up to date with cyy0523xc/chinese-name-gender-analyse:master.

Latest commit: de1677b by alex cai, May 16, 2016

Correlation Analysis of Chinese Names and Gender

Raw data: ./data/

Basic fields:

  • Name: name
  • Gender: gender
  • Province: province
  • Ethnic group: nation

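The province and nation fields are not always present. Where they are, the data can also be used to study how name characteristics vary across provinces and ethnic groups.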
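As an illustration of the field layout, here is a minimal Python sketch that reads rows with optional province and nation columns. It assumes the column order name, gender, province, nation and no header row, neither of which is confirmed here. `NameRecord`, `parse_rows`, and the sample names are hypothetical, introduced only for this example.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple, Optional


class NameRecord(NamedTuple):
    """One row of the data set (hypothetical type, for illustration only)."""
    name: str
    gender: str
    province: Optional[str] = None  # may be missing
    nation: Optional[str] = None    # may be missing


def parse_rows(lines: Iterable[str]) -> Iterator[NameRecord]:
    """Yield records from CSV lines.

    Assumes the column order name,gender,province,nation and no header row.
    """
    for row in csv.reader(lines):
        # Keep at most four columns; turn empty cells into None.
        cells = [cell.strip() or None for cell in row[:4]]
        # Skip rows that lack a name or a gender.
        if len(cells) < 2 or cells[0] is None or cells[1] is None:
            continue
        yield NameRecord(*cells)


# Made-up sample rows, not taken from the real data.
sample = io.StringIO("张伟,男,广东,汉\n李娜,女\n")
records = list(parse_rows(sample))
```

Rows that carry only name and gender leave `province` and `nation` as `None`, so later analysis can filter on whichever fields are available.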

Data formatting

grep ",女" data/chinese_name_gender_0*.csv|cut -d, -f1|cut -d: -f2|sort|uniq > data/female.txt

grep ",男" data/chinese_name_gender_0*.csv|cut -d, -f1|cut -d: -f2|sort|uniq > data/male.txt
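The two shell pipelines above collect the unique names for each gender into sorted text files. A rough Python equivalent is sketched below, assuming the CSV files are UTF-8 encoded with the name in the first column and the gender in the second. The function names `split_names_by_gender`, `read_rows`, and `write_name_lists` are hypothetical. Unlike `grep`, this version compares the gender column exactly rather than matching `,女` or `,男` anywhere in the line, and Python sorts by code point, which may differ from the locale-dependent order produced by `sort`.

```python
import csv
import glob
import os
from typing import Iterable, Iterator, List, Sequence, Tuple


def split_names_by_gender(rows: Iterable[Sequence[str]]) -> Tuple[List[str], List[str]]:
    """Return (female_names, male_names), each deduplicated and sorted."""
    female, male = set(), set()
    for row in rows:
        if len(row) < 2:
            continue  # skip rows without a gender column
        name, gender = row[0].strip(), row[1].strip()
        if gender == "女":
            female.add(name)
        elif gender == "男":
            male.add(name)
    return sorted(female), sorted(male)


def read_rows(pattern: str) -> Iterator[List[str]]:
    """Yield CSV rows from every file matching the glob pattern (UTF-8 assumed)."""
    for path in sorted(glob.glob(pattern)):
        with open(path, encoding="utf-8", newline="") as f:
            yield from csv.reader(f)


def write_name_lists(pattern: str, out_dir: str) -> None:
    """Write female.txt and male.txt into out_dir, one name per line."""
    female, male = split_names_by_gender(read_rows(pattern))
    for filename, names in (("female.txt", female), ("male.txt", male)):
        with open(os.path.join(out_dir, filename), "w", encoding="utf-8") as out:
            out.writelines(name + "\n" for name in names)
```

Calling `write_name_lists("data/chinese_name_gender_0*.csv", "data")` from the repository root would then produce the same two output files as the shell commands, under the assumptions stated above.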

Testing

python test.py
