STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting

Wang, Hongjun; Chen, Jiyuan; Pan, Tong; Dong, Zheng; Zhang, Lingyu; Jiang, Renhe; Song, Xuan

Computer Science > Machine Learning

arXiv:2410.00385 (cs)

[Submitted on 1 Oct 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Title:STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting

Authors:Hongjun Wang, Jiyuan Chen, Tong Pan, Zheng Dong, Lingyu Zhang, Renhe Jiang, Xuan Song

View PDF HTML (experimental)

Abstract:Traffic forecasting is a cornerstone of smart city management, enabling efficient resource allocation and transportation planning. Deep learning, with its ability to capture complex nonlinear patterns in spatiotemporal (ST) data, has emerged as a powerful tool for traffic forecasting. While graph neural networks (GCNs) and transformer-based models have shown promise, their computational demands often hinder their application to real-world road networks, particularly those with large-scale spatiotemporal interactions. To address these challenges, we propose a novel spatiotemporal graph transformer (STGformer) architecture. STGformer effectively balances the strengths of GCNs and Transformers, enabling efficient modeling of both global and local traffic patterns while maintaining a manageable computational footprint. Unlike traditional approaches that require multiple attention layers, STG attention block captures high-order spatiotemporal interactions in a single layer, significantly reducing computational cost. In particular, STGformer achieves a 100x speedup and a 99.8\% reduction in GPU memory usage compared to STAEformer during batch inference on a California road graph with 8,600 sensors. We evaluate STGformer on the LargeST benchmark and demonstrate its superiority over state-of-the-art Transformer-based methods such as PDFormer and STAEformer, which underline STGformer's potential to revolutionize traffic forecasting by overcoming the computational and memory limitations of existing approaches, making it a promising foundation for future spatiotemporal modeling tasks.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB)
Cite as:	arXiv:2410.00385 [cs.LG]
	(or arXiv:2410.00385v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.00385

Submission history

From: Hongjun Wang [view email]
[v1] Tue, 1 Oct 2024 04:15:48 UTC (10,095 KB)
[v2] Tue, 15 Oct 2024 05:44:29 UTC (9,203 KB)

Computer Science > Machine Learning

Title:STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators