IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: The UCI KDD Archive of Large Data Sets for Data Mining Research and Experimentation

Citation Type: Journal Article

Publication Year: 2000

Abstract: Advances in data collection and storage have allowed organizations to create massive, complex and heterogeneous databases, which have stymied traditional methods of data analysis. This has led to the development of new analytical tools that often combine techniques from a variety of fields such as statistics, computer science, and mathematics to extract meaningful knowledge from the data. To support research in this area, UC Irvine has created the UCI Knowledge Discovery in Databases (KDD) Archive (http://kdd.ics.uci.edu) which is a new online archive of large and complex data sets that encompasses a wide variety of data types, analysis tasks, and application areas. This article describes the objectives and philosophy of the UCI KDD Archive. We draw parallels with the development of the UCI Machine Learning Repository and its effect on the Machine Learning community.

User Submitted?: No

Authors: Bay, Stephen D.; Smyth, Padhraic; Pazzani, Michael J.; Kibler, Dennis

Periodical (Full): SIGKDD Explorations

Issue: 2

Volume: 2

Pages: 14-181

Data Collections: IPUMS USA

Topics: Methodology and Data Collection

Countries:

IPUMS NHGIS NAPP IHIS ATUS Terrapop