Structure from Articulated Motion: Accurate and Stable Monocular 3D Reconstruction without Training Data

Kovalenko, Onorina; Golyanik, Vladislav; Malik, Jameel; Elhayek, Ahmed; Stricker, Didier

doi:10.3390/s19204603

arXiv:1905.04789 (cs)

[Submitted on 12 May 2019 (v1), last revised 12 Nov 2019 (this version, v2)]

Abstract: Recovery of articulated 3D structure from 2D observations is a challenging computer vision problem with many applications. Current learning-based approaches achieve state-of-the-art accuracy on public benchmarks but are restricted to the specific types of objects and motions covered by the training datasets. Model-based approaches do not rely on training data but show lower accuracy on these datasets. In this paper, we introduce a model-based method called Structure from Articulated Motion (SfAM), which can recover multiple object and motion types without training on extensive data collections. At the same time, it performs on par with learning-based state-of-the-art approaches on public benchmarks and outperforms previous non-rigid structure from motion (NRSfM) methods. SfAM is built upon a general-purpose NRSfM technique while integrating a soft spatio-temporal constraint on the bone lengths. We use an alternating optimization strategy to recover optimal geometry (i.e., bone proportions) together with 3D joint positions by enforcing bone-length consistency over a series of frames. SfAM is highly robust to noisy 2D annotations, generalizes to arbitrary objects and does not rely on training data, as shown in extensive experiments on public benchmarks and real video sequences. We believe that it brings a new perspective to the domain of monocular 3D recovery of articulated structures, including human motion capture.
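The bone-length consistency idea mentioned in the abstract can be illustrated with a toy sketch. This is not the SfAM implementation: it omits the NRSfM data term and the 2D reprojection entirely, and the function names (`bone_lengths`, `refine`), the `weight` parameter, and the blending update are assumptions made for illustration only. It alternates between estimating one constant length per bone (the mean over frames, which is the least-squares optimum for fixed joints) and softly pulling every per-frame bone toward that length while keeping its direction.

```python
import numpy as np

def bone_lengths(joints, bones):
    """Per-frame bone lengths.

    joints: array of shape (F, J, 3), F frames and J joints.
    bones:  list of (parent, child) joint index pairs.
    Returns an array of shape (F, B).
    """
    parents = [p for p, _ in bones]
    children = [c for _, c in bones]
    return np.linalg.norm(joints[:, children] - joints[:, parents], axis=-1)

def refine(joints, bones, weight=0.5, iters=5):
    """Toy alternation (hypothetical, not the paper's solver).

    Assumes `bones` forms a tree listed parent-before-child, so that
    rebuilding joints in list order visits every parent first.
    """
    joints = np.array(joints, dtype=float)  # work on a copy
    for _ in range(iters):
        # Step 1: joints fixed. The constant length per bone that minimizes
        # the squared deviation over all frames is the mean over frames.
        target = bone_lengths(joints, bones).mean(axis=0)

        # Step 2: lengths fixed. Blend each per-frame length toward the
        # target (soft constraint) while keeping the bone direction.
        scaled = []
        for b, (p, c) in enumerate(bones):
            vec = joints[:, c] - joints[:, p]
            norm = np.linalg.norm(vec, axis=-1, keepdims=True)
            blended = (1.0 - weight) * norm + weight * target[b]
            scaled.append(vec / np.maximum(norm, 1e-12) * blended)

        # Rebuild joint positions from the root outward.
        for b, (p, c) in enumerate(bones):
            joints[:, c] = joints[:, p] + scaled[b]
    return joints, bone_lengths(joints, bones).mean(axis=0)

# Usage on random data: 6 frames, 4 joints, joint 0 is the root.
rng = np.random.default_rng(0)
bones = [(0, 1), (1, 2), (1, 3)]
joints = rng.normal(size=(6, 4, 3))
refined, lengths = refine(joints, bones, weight=0.5, iters=3)
```

With `weight=1.0` the constraint becomes hard and every frame takes exactly the mean length after one pass; smaller values leave room for a data term, which a real method would need in order to stay consistent with the 2D observations.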

Comments:	21 pages, 8 figures, 2 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1905.04789 [cs.CV]
	(or arXiv:1905.04789v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.04789
Journal reference:	Sensors 2019, 19(20), 4603
Related DOI:	https://doi.org/10.3390/s19204603

Submission history

From: Onorina Kovalenko
[v1] Sun, 12 May 2019 20:33:49 UTC (5,460 KB)
[v2] Tue, 12 Nov 2019 13:08:08 UTC (7,225 KB)
