TableLlama: Towards Open Large Generalist Models for Tables

Zhang, Tianshu; Yue, Xiang; Li, Yifei; Sun, Huan

Computer Science > Computation and Language

arXiv:2311.09206 (cs)

[Submitted on 15 Nov 2023 (v1), last revised 4 Apr 2024 (this version, v3)]

Title:TableLlama: Towards Open Large Generalist Models for Tables

Authors:Tianshu Zhang, Xiang Yue, Yifei Li, Huan Sun

View PDF HTML (experimental)

Abstract:Semi-structured tables are ubiquitous. There has been a variety of tasks that aim to automatically interpret, augment, and query tables. Current methods often require pretraining on tables or special model architecture design, are restricted to specific table types, or have simplifying assumptions about tables and tasks. This paper makes the first step towards developing open-source large language models (LLMs) as generalists for a diversity of table-based tasks. Towards that end, we construct TableInstruct, a new dataset with a variety of realistic tables and tasks, for instruction tuning and evaluating LLMs. We further develop the first open-source generalist model for tables, TableLlama, by fine-tuning Llama 2 (7B) with LongLoRA to address the long context challenge. We experiment under both in-domain setting and out-of-domain setting. On 7 out of 8 in-domain tasks, TableLlama achieves comparable or better performance than the SOTA for each task, despite the latter often has task-specific design. On 6 out-of-domain datasets, it achieves 5-44 absolute point gains compared with the base model, showing that training on TableInstruct enhances the model's generalizability. We open-source our dataset and trained model to boost future work on developing open generalist models for tables.

Comments:	NAACL 2024 long paper
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
Cite as:	arXiv:2311.09206 [cs.CL]
	(or arXiv:2311.09206v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.09206

Submission history

From: Tianshu Zhang [view email]
[v1] Wed, 15 Nov 2023 18:47:52 UTC (959 KB)
[v2] Thu, 21 Mar 2024 17:56:37 UTC (20,715 KB)
[v3] Thu, 4 Apr 2024 17:10:25 UTC (21,178 KB)

Computer Science > Computation and Language

Title:TableLlama: Towards Open Large Generalist Models for Tables

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:TableLlama: Towards Open Large Generalist Models for Tables

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators