tutorial

AI for Immersive Metaverse Experience

Published: 04 January 2023 Publication History

Abstract

The metaverse has received considerable attention in recent times, with several big technology companies investing in the concept. Accenture defines the metaverse as “an evolution of the Internet that enables a user to move beyond ‘browsing’ to ‘inhabiting’ in a persistent, shared experience that spans the spectrum of our real world to the fully virtual and in between”. The evolution that the metaverse brings can be seen along three dimensions: 1) a shift towards spatial experiences, which include 2D, 3D, augmented, virtual, and mixed reality immersive experiences; 2) shared co-presence, where users experience a persistent shared space with a sense of co-presence with others; and 3) trusted identities and transactions, which address the fake identities, products, and transactions present in today’s internet.
For example, a retail marketplace in the metaverse could be an immersive spatial experience where users shop along with family and friends who join virtually in the same environment. The sense of shared co-presence lets them discuss products in real time, and persistence lets them come back to the same space. This evolution opens an enormous opportunity to rethink the digital experiences that future applications will offer people. AI will be the core engine that makes these experiences richer, more immersive, and more engaging. The role of AI in the metaverse is broad; in this tutorial, however, we focus on two areas where AI will play a major role in shaping the form and function of the metaverse: 1) bringing more realism to the metaverse through high-fidelity immersive content generated by AI techniques, and 2) enhancing user interactions by making the interaction modes more intelligent.



Published In

CODS-COMAD '23: Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD)
January 2023
357 pages
ISBN:9781450397971
DOI:10.1145/3570991
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. 3D Computer Vision
  2. Metaverse
  3. Neural network generators
  4. Procedural Content Generation

Qualifiers

  • Tutorial
  • Research
  • Refereed limited

Conference

CODS-COMAD 2023

Acceptance Rates

Overall Acceptance Rate 197 of 680 submissions, 29%


Cited By

  • (2024) Natural Language Processing Influence on Digital Socialization and Linguistic Interactions in the Integration of the Metaverse in Regular Social Life. Electronics 13:7 (1331). DOI: 10.3390/electronics13071331. Online publication date: 2-Apr-2024
  • (2024) Engaging recently incarcerated and gang affiliated Black and Latino/a young adults in designing social collocated applications for mixed reality smart glasses through community-based participatory design workshops. Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems (1-17). DOI: 10.1145/3613904.3642895. Online publication date: 11-May-2024
  • (2024) Beyond Reality: The Pivotal Role of Generative AI in the Metaverse. IEEE Internet of Things Magazine 7:4 (126-135). DOI: 10.1109/IOTM.001.2300174. Online publication date: Jul-2024
  • (2023) Redefining E-Commerce Experience. International Journal on Semantic Web and Information Systems 19:1 (1-24). DOI: 10.4018/IJSWIS.334123. Online publication date: 28-Nov-2023
