skip to main content
research-article

Multimodal Deep Learning for Robust Road Attribute Detection

Published: 20 November 2023 Publication History

Abstract

Automatic inference of missing road attributes (e.g., road type and speed limit) for enriching digital maps has attracted significant research attention in recent years. A number of machine learning-based approaches have been proposed to detect road attributes from GPS traces, dash-cam videos, or satellite images. However, existing solutions mostly focus on a single modality without modeling the correlations among multiple data sources. To bridge this gap, we present a multimodal road attribute detection method, which improves the robustness by performing pixel-level fusion of crowdsourced GPS traces and satellite images. A GPS trace is usually given by a sequence of location, bearing, and speed. To align it with satellite imagery in the spatial domain, we render GPS traces into a sequence of multi-channel images that simultaneously capture the global distribution of the GPS points, the local distribution of vehicles’ moving directions and speeds, and their temporal changes over time, at each pixel. Unlike previous GPS-based road feature extraction methods, our proposed GPS rendering does not require map matching in the data preprocessing step. Moreover, our multimodal solution addresses single-modal challenges such as occlusions in satellite images and data sparsity in GPS traces by learning the pixel-wise correspondences among different data sources. On top of this, we observe that geographic objects and their attributes in the map are not isolated but correlated with each other. Thus, if a road is partially labeled, then the existing information can be of great help on inferring the missing attributes. To fully use the existing information, we extend our model and discuss the possibilities for further performance improvement when partially labeled map data is available. Extensive experiments have been conducted on two real-world datasets in Singapore and Jakarta. Compared with previous work, our method is able to improve the detection accuracy on road attributes by a large margin.

References

[1]
DiDi Chuxing. 2022. GAIA Open Dataset Initiative. Retrieved from https://outreach.didichuxing.com/research/opendata/en/
[2]
Mapbox. 2022. Static Images API. Retrieved from https://docs.mapbox.com/api/maps/static-images/
[3]
Kaggle. 2022. Uber Pickups in New York City. Retrieved from https://www.kaggle.com/fivethirtyeight/uber-pickups-in-new-york-city
[4]
Heba Aly, Anas Basalamah, and Moustafa Youssef. 2016. Robust and Ubiquitous smartphone-based lane detection. Pervas. Mobile Comput. 26 (2016), 35–56.
[5]
Heba Aly and Moustafa Youssef. 2015. semMatch: Road semantics-based accurate map matching for challenging positioning data. In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 5:1–5:10.
[6]
Favyen Bastani, Songtao He, Sofiane Abbar, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, and Sam Madden. 2018. Machine-assisted map editing. In Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 23–32.
[7]
Favyen Bastani, Songtao He, Sofiane Abbar, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Sam Madden, and David DeWitt. 2018. Roadtracer: Automatic extraction of road networks from aerial images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4720–4728.
[8]
Favyen Bastani, Songtao He, Satvat Jagwani, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Sam Madden, and Mohammad Amin Sadeghi. 2021. Updating street maps using changes detected in satellite imagery. In Proceedings of the 29th International Conference on Advances in Geographic Information Systems. 53–56.
[9]
Favyen Bastani, Songtao He, Satvat Jagwani, Edward Park, Sofiane Abbar, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Sam Madden, and Mohammad Amin Sadeghi. 2021. Inferring and improving street maps with data-driven automation. Commun. ACM 64, 11 (2021), 109–117.
[10]
Favyen Bastani and Samuel Madden. 2021. Beyond road extraction: A dataset for map update using aerial images. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 11905–11914.
[11]
James Biagioni and Jakob Eriksson. 2012. Map inference in the face of noise and disparity. In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 79–88.
[12]
Gabriel Cadamuro, Aggrey Muhebwa, and Jay Taneja. 2018. Assigning a grade: Accurate measurement of road quality using satellite imagery. Retrieved from https://arXiv:1812.01699
[13]
Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. 2017. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40, 4 (2017), 834–848.
[14]
Yihua Chen and John Krumm. 2010. Probabilistic modeling of traffic lanes from GPS traces. In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 81–88.
[15]
Gordon Christie, Neil Fendley, James Wilson, and Ryan Mukherjee. 2018. Functional map of the world. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). 6172–6180.
[16]
Soheil Feizi, Gerald Quon, Mariana Recamonde-Mendoza, Muriel Medard, Manolis Kellis, and Ali Jadbabaie. 2019. Spectral alignment of graphs. IEEE Trans. Netw. Sci. Eng. 7, 3 (2019), 1182–1197.
[17]
Stefan Funke, Robin Schirrmeister, and Sabine Storandt. 2015. Automatic extrapolation of missing road network data in OpenStreetMap. In Proceedings of the 2nd International Conference on Mining Urban Data-Volume 1392. 27–35.
[18]
M. Haklay and P. Weber. 2008. OpenStreetMap: User-generated street maps. IEEE Pervas. Comput. 7, 4 (2008), 12–18.
[19]
Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2009. Kernel smoothing methods. In The Elements of Statistical Learning. Springer, Berlin, 191–218.
[20]
Songtao He and Hari Balakrishnan. 2022. Lane-level street map extraction from aerial imagery. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2080–2089.
[21]
Songtao He, Favyen Bastani, Satvat Jagwani, Edward Park, Sofiane Abbar, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Samuel Madden, and Mohammad Amin Sadeghi. 2020. RoadTagger: Robust road attribute inference with graph neural networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 10965–10972.
[22]
Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. Retrieved from https://arXiv:1704.04861
[23]
Gao Huang, Zhuang Liu, Laurens Van Der Maaten, and Kilian Q. Weinberger. 2017. Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). 4700–4708.
[24]
Xiaocheng Huang, Yifang Yin, Simon Lim, Guanfeng Wang, Bo Hu, Jagannadan Varadarajan, Shaolin Zheng, Ajay Bulusu, and Roger Zimmermann. 2019. Grab-Posisi: An extensive real-life GPS trajectory dataset in southeast asia. In Proceedings of the ACM SIGSPATIAL International Workshop on Prediction of Human Mobility. 1–10.
[25]
Z. Jan, B. Verma, J. Affum, S. Atabak, and L. Moir. 2018. A convolutional neural network based deep learning technique for identifying road attributes. In Proceedings of the International Conference on Image and Vision Computing New Zealand. 1–6.
[26]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems. MIT Press, 1097–1105.
[27]
Jun Li, Qiming Qin, Jiawei Han, Lu-An Tang, and Kin Hou Lei. 2015. Mining trajectory data and geotagged data in social media for road map inference. Trans. GIS 19, 1 (2015), 1–18.
[28]
Yin Lou, Chengyang Zhang, Xing Xie, Yu Zheng, Wei Wang, and Yan Huang. 2009. Map-matching for low-sampling-rate GPS trajectories. In Proceedings of the ACM SIGSPATIAL Internetional Conference on Advances in Geographical Information Systems.
[29]
Paul Newson and John Krumm. 2009. Hidden Markov map matching through noise and sparseness. In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 336–343.
[30]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-assisted Intervention. 234–241.
[31]
Sijie Ruan, Cheng Long, Jie Bao, Chunyang Li, Zisheng Yu, Ruiyuan Li, Yuxuan Liang, Tianfu He, and Yu Zheng. 2020. Learning to generate maps from trajectories. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 890–897.
[32]
Tao Sun, Zonglin Di, Pengyu Che, Chun Liu, and Yin Wang. 2019. Leveraging Crowdsourced GPS data for road extraction from aerial imagery. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19). 7509–7518.
[33]
Karl Van Winden, Filip Biljecki, and Stefan Van der Spek. 2016. Automatic update of road attributes by mining GPS tracks. Trans. GIS 20, 5 (2016), 664–683.
[34]
John E. Vargas-Munoz, Shivangi Srivastava, Devis Tuia, and Alexandre X. Falcao. 2020. OpenStreetMap: Challenges and opportunities in machine learning and remote sensing. IEEE Geosci. Remote Sens. Mag. 9, 1 (2020), 184–199.
[35]
Hangbin Wu, Zeran Xu, and Guangjun Wu. 2019. A novel method of missing road generation in city blocks based on big mobile navigation trajectory data. ISPRS Int. J. Geo-Info. 8, 3 (2019), 142.
[36]
Wei Yang, Tinghua Ai, and Wei Lu. 2018. A method for extracting road boundary information from Crowdsourcing vehicle GPS trajectories. Sensors 18, 4 (2018), 1261.
[37]
Yifang Yin, Rajiv Ratn Shah, Guanfeng Wang, and Roger Zimmermann. 2018. Feature-based map matching for low-sampling-rate GPS trajectories. ACM Trans. Spatial Algor. Syst. 4, 2 (2018).
[38]
Yifang Yin, Rajiv Ratn Shah, and Roger Zimmermann. 2016. A general feature-based map matching framework with trajectory simplification. In Proceedings of the ACM SIGSPATIAL International Workshop on GeoStreaming. 7:1–7:10.
[39]
Yifang Yin, An Tran, Ying Zhang, Wenmiao Hu, Guanfeng Wang, Jagannadan Varadarajan, Roger Zimmermann, and See-Kiong Ng. 2021. Multimodal fusion of satellite images and crowdsourced GPS traces for robust road attribute detection. In Proceedings of the 29th International SIGSPATIAL Conference on Advances in Geographic Information Systems. 107–116.
[40]
Yifang Yin, Jagannadan Varadarajan, Guanfeng Wang, Xueou Wang, Dhruva Sahrawat, Roger Zimmermann, and See-Kiong Ng. 2020. A multi-task learning framework for road attribute updating via joint analysis of map data and GPS traces. In Proceedings of the World Wide Web Conference. 2662–2668.
[41]
Yu Zhang, Ying Wei, and Qiang Yang. 2018. Learning to multitask. Adv. Neural Info. Process. Syst. 31 (2018).
[42]
Lichen Zhou, Chuang Zhang, and Ming Wu. 2018. D-LinkNet: LinkNet with pretrained encoder and dilated convolution for high resolution satellite imagery road extraction. In Proceedings of the Computer Vision and Pattern Recognition Workshops. 182–186.
[43]
Clément Zinoune, Philippe Bonnifait, and Javier Ibañez-Guzmán. 2012. Detection of missing roundabouts in maps for driving assistance systems. In Proceedings of the IEEE Intelligent Vehicles Symposium. IEEE, 123–128.

Cited By

View all
  • (2025)Deep learning for cross-domain data fusion in urban computing: Taxonomy, advances, and outlookInformation Fusion10.1016/j.inffus.2024.102606113(102606)Online publication date: Jan-2025
  • (2024)Multimodal Sensing for Predicting Real-time Biking Behavior based on Contextual Information2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)10.1109/PerComWorkshops59983.2024.10503501(441-444)Online publication date: 11-Mar-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Spatial Algorithms and Systems
ACM Transactions on Spatial Algorithms and Systems  Volume 9, Issue 4
December 2023
218 pages
ISSN:2374-0353
EISSN:2374-0361
DOI:10.1145/3633511
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 November 2023
Online AM: 02 September 2023
Accepted: 26 August 2023
Revised: 29 November 2022
Received: 31 January 2022
Published in TSAS Volume 9, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Road attributes
  2. satellite images
  3. digital maps
  4. GPS trajectories

Qualifiers

  • Research-article

Funding Sources

  • Grab-NUS AI Lab
  • GrabTaxi Holdings Pte. Ltd.
  • National University of Singapore
  • Industrial Postgraduate Program
  • Economic Development Board of Singapore
  • Singapore Ministry of Education Academic Research Fund Tier 2

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)261
  • Downloads (Last 6 weeks)19
Reflects downloads up to 14 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2025)Deep learning for cross-domain data fusion in urban computing: Taxonomy, advances, and outlookInformation Fusion10.1016/j.inffus.2024.102606113(102606)Online publication date: Jan-2025
  • (2024)Multimodal Sensing for Predicting Real-time Biking Behavior based on Contextual Information2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)10.1109/PerComWorkshops59983.2024.10503501(441-444)Online publication date: 11-Mar-2024

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media