Multimodal Deep Learning for Robust Road Attribute Detection

Authors: An Tran, See-Kiong Ng

ACM Transactions on Spatial Algorithms and Systems, Volume 9, Issue 4

Article No.: 27, Pages 1 - 25

https://doi.org/10.1145/3618108

Published: 20 November 2023

Abstract

Automatic inference of missing road attributes (e.g., road type and speed limit) for enriching digital maps has attracted significant research attention in recent years. A number of machine learning-based approaches have been proposed to detect road attributes from GPS traces, dash-cam videos, or satellite images. However, existing solutions mostly focus on a single modality without modeling the correlations among multiple data sources. To bridge this gap, we present a multimodal road attribute detection method, which improves robustness by performing pixel-level fusion of crowdsourced GPS traces and satellite images. A GPS trace is usually given as a sequence of locations, bearings, and speeds. To align it with satellite imagery in the spatial domain, we render GPS traces into a sequence of multi-channel images that simultaneously capture, at each pixel, the global distribution of the GPS points, the local distribution of vehicles' moving directions and speeds, and their temporal changes. Unlike previous GPS-based road feature extraction methods, our proposed GPS rendering does not require map matching in the data preprocessing step. Moreover, our multimodal solution addresses single-modal challenges such as occlusions in satellite images and data sparsity in GPS traces by learning the pixel-wise correspondences among different data sources. In addition, we observe that geographic objects and their attributes in the map are not isolated but correlated with each other. Thus, if a road is partially labeled, the existing information can greatly help in inferring the missing attributes. To make full use of this information, we extend our model and discuss the possibilities for further performance improvement when partially labeled map data is available. Extensive experiments have been conducted on two real-world datasets from Singapore and Jakarta. Compared with previous work, our method improves the detection accuracy of road attributes by a large margin.
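
The GPS rendering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `render_gps_traces`, the choice of four channels (point count, mean sine and cosine of bearing, mean speed), the equal-width time bins, and the tile size are all assumptions made for illustration, and the paper's actual channels may differ. The sketch only shows how raw GPS points can be rasterized onto the pixel grid of a satellite tile without map matching.

```python
import numpy as np


def render_gps_traces(points, bounds, size=64, n_time_bins=2):
    """Rasterize GPS points into an array of shape (T, 4, size, size).

    points: (N, 5) array of [lat, lon, bearing_deg, speed, timestamp].
    bounds: (lat_min, lat_max, lon_min, lon_max) of the image tile.
    Channels (illustrative assumption): point count, mean sin(bearing),
    mean cos(bearing), mean speed.
    """
    points = np.asarray(points, dtype=float).reshape(-1, 5)
    lat_min, lat_max, lon_min, lon_max = bounds
    lat, lon, bearing, speed, ts = points.T

    # Keep only the points that fall inside the tile.
    inside = (lat >= lat_min) & (lat < lat_max) & (lon >= lon_min) & (lon < lon_max)
    lat, lon, bearing, speed, ts = (a[inside] for a in (lat, lon, bearing, speed, ts))

    shape = (n_time_bins, size, size)
    count = np.zeros(shape)
    sin_sum = np.zeros(shape)
    cos_sum = np.zeros(shape)
    speed_sum = np.zeros(shape)

    if ts.size > 0:
        # Map coordinates to pixel indices (row 0 is the northern edge).
        rows = ((lat_max - lat) / (lat_max - lat_min) * size).astype(int)
        cols = ((lon - lon_min) / (lon_max - lon_min) * size).astype(int)
        rows = np.clip(rows, 0, size - 1)
        cols = np.clip(cols, 0, size - 1)

        # Assign each point to one of n_time_bins equal-width time bins.
        span = ts.max() - ts.min()
        if span > 0:
            tbin = ((ts - ts.min()) / span * n_time_bins).astype(int)
            tbin = np.minimum(tbin, n_time_bins - 1)
        else:
            tbin = np.zeros(ts.shape, dtype=int)

        # Accumulate per-pixel sums; np.add.at handles repeated indices.
        rad = np.deg2rad(bearing)
        idx = (tbin, rows, cols)
        np.add.at(count, idx, 1.0)
        np.add.at(sin_sum, idx, np.sin(rad))
        np.add.at(cos_sum, idx, np.cos(rad))
        np.add.at(speed_sum, idx, speed)

    # Convert sums to per-pixel means; empty pixels stay at zero.
    n = np.maximum(count, 1.0)
    return np.stack([count, sin_sum / n, cos_sum / n, speed_sum / n], axis=1)
```

The resulting array shares its spatial grid with the satellite tile, so the two modalities could, for example, be concatenated along the channel axis before being passed to a convolutional model. Encoding bearing as sine and cosine is a design choice of this sketch that avoids the discontinuity between 359 and 0 degrees.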
