May 14, 2024

Learning to Restructure Tables Automatically

Key Points

Key points are not available for this paper at this time.

Abstract

By now, it is widely-accepted folk wisdom that "half of the time in any data analysis project is spent wrangling the data". Analytic algorithms and tools-built on mathematical foundations of matrices and relations-require their data to be lined up in particular rows and columns. In the relational model (known in data science circles as "tidy data"), each row is an independent observation, and each column is a distinct attribute of the phenomenon described by the data. While there are many thorny aspects to data wrangling, perhaps none is more basic than the challenge of getting data reorganized, positionally, into the right form for analysis.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Joseph M. Hellerstein (Tue,) studied this question.

www.synapsesocial.com/papers/68e6a3b5b6db6435876270c4 — DOI: https://doi.org/10.1145/3665252.3665268

Authors

Joseph M. Hellerstein

Journals

ACM SIGMOD Record

Actions

Institutions

University of California, Berkeley

Berkeley College

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Learning to Restructure Tables Automatically

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion