TREB: a BERT attempt for imputing tabular data imputation

Wang, Shuyue; Zhou, Wenjun; Jiang, Han drk-m-s; Wang, Shuo; Zheng, Ren

Computer Science > Machine Learning

arXiv:2410.00022 (cs)

[Submitted on 16 Sep 2024]

Title:TREB: a BERT attempt for imputing tabular data imputation

Authors:Shuyue Wang, Wenjun Zhou, Han drk-m-s Jiang, Shuo Wang, Ren Zheng

View PDF HTML (experimental)

Abstract:TREB, a novel tabular imputation framework utilizing BERT, introduces a groundbreaking approach for handling missing values in tabular data. Unlike traditional methods that often overlook the specific demands of imputation, TREB leverages the robust capabilities of BERT to address this critical task. While many BERT-based approaches for tabular data have emerged, they frequently under-utilize the language model's full potential. To rectify this, TREB employs a BERT-based model fine-tuned specifically for the task of imputing real-valued continuous numbers in tabular datasets. The paper comprehensively addresses the unique challenges posed by tabular data imputation, emphasizing the importance of context-based interconnections. The effectiveness of TREB is validated through rigorous evaluation using the California Housing dataset. The results demonstrate its ability to preserve feature interrelationships and accurately impute missing values. Moreover, the authors shed light on the computational efficiency and environmental impact of TREB, quantifying the floating-point operations (FLOPs) and carbon footprint associated with its training and deployment.

Comments:	12 pages, 7 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2410.00022 [cs.LG]
	(or arXiv:2410.00022v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.00022

Submission history

From: Shuyue Wang [view email]
[v1] Mon, 16 Sep 2024 01:47:22 UTC (3,553 KB)

Computer Science > Machine Learning

Title:TREB: a BERT attempt for imputing tabular data imputation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:TREB: a BERT attempt for imputing tabular data imputation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators