Abstract
The General Society Survey(GSS) is a kind of government-funded survey which aims at examining the Socio-economic status, quality of life, and structure of contemporary society. GSS dataset is regarded as one of the authoritative source for the government and organization practitioners to make data-driven policies. The previous analytic approaches for GSS dataset are designed by combining expert knowledges and simple statistics. In this paper, we proposed a comprehensive data management and data mining approach for GSS datasets. The approach is designed to be operated in a two-phase manner: a data management phase which can improve the quality of GSS data by performing attribute preprocessing and filter-based attribute selection; a data mining phase which can extract hidden knowledges from the dataset by performing data mining analysis including prediction analysis, classification analysis, association analysis and clustering analysis. By leveraging the power of data mining techniques, our proposed approach can explore knowledges in a fine-grained manner with minimum human interference. Experiments on Chinese General Social Survey dataset are conducted at the end to evaluate the performance of our approach.
Original language | English |
---|---|
Title of host publication | Proceedings of the 4th International Conference on Crowd Science and Engineering, ICCSE 2019 |
Publisher | Association for Computing Machinery |
Pages | 195-203 |
Number of pages | 9 |
ISBN (Electronic) | 9781450376402 |
DOIs | |
State | Published - 18 Oct 2019 |
Event | 4th International Conference on Crowd Science and Engineering, ICCSE 2019 - Jinan, China Duration: 18 Oct 2019 → 21 Oct 2019 |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Conference
Conference | 4th International Conference on Crowd Science and Engineering, ICCSE 2019 |
---|---|
Country/Territory | China |
City | Jinan |
Period | 18/10/19 → 21/10/19 |
Bibliographical note
Publisher Copyright:© 2019 Association for Computing Machinery.
Keywords
- Data management
- Data mining
- Decision support systems
- Knowledge discovery
- Society survey