Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

线性回归预测下一年人均年收入 #3

Open
yijingping opened this issue Dec 12, 2016 · 1 comment
Open

线性回归预测下一年人均年收入 #3

yijingping opened this issue Dec 12, 2016 · 1 comment

Comments

@yijingping
Copy link
Member

No description provided.

@yijingping
Copy link
Member Author

由于建档立卡录入错误,需要对样本数据进行筛选
1)人均年收入过高(大于7000)、过低(小于等于0)的删除
采用此方法后,建模数据从71648条变为70864条,减少784条,占1.1%,效果提高5%。

误差率 准确度(筛选前) 准确度(筛选后)
10% 59.82% 64.70%
20% 80.68% 82.98%
30% 88.71% 89.95%
40% 91.54% 92.47%
50% 93.19% 94.09%
60% 94.32% 95.12%
70% 95.09% 95.88%
80% 95.72% 96.39%
90% 96.23% 96.83%
100% 96.69% 97.13%

2)人均年收入除以100后,查看分布,少于5次的不纳入模型
采用此方法后,建模数据从70915条变为70864条,减少51条,仅占0.07%,对准确度的提升也微乎其微,甚至有的还下降,因此弃用。

误差率 准确度(筛选后) 准确度(二次筛选后)
10% 64.70% 64.85%
20% 82.98% 83.27%
30% 89.95% 89.77%
40% 92.47% 92.34%
50% 94.09% 93.99%
60% 95.12% 94.99%
70% 95.88% 95.77%
80% 96.39% 96.32%
90% 96.83% 96.75%
100% 97.13% 97.07%

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant