Column2Vec: Structural Understanding via Distributed Representations of Database Schemas
Abstract
We present Column2Vec, a distributed representation of database columns based on column metadata. Our distributed representation has several applications. Using known names for groups of columns (i.e., a table name), we train a model to generate an appropriate name for columns in an unnamed table. We demonstrate the viability of our approach using schema information collected from open source applications on GitHub.
- Publication:
-
arXiv e-prints
- Pub Date:
- March 2019
- DOI:
- 10.48550/arXiv.1903.08621
- arXiv:
- arXiv:1903.08621
- Bibcode:
- 2019arXiv190308621M
- Keywords:
-
- Computer Science - Databases;
- Computer Science - Machine Learning