IAPR MVA 2015

May 18-22, 2015 @ Tokyo, Japan

Towards 3D Human Posture Estimation Using Multiple Kinects Despite Self-Contacts

Geodesic distance graph edges: a) Naive (left, note the invalid edge near Worker 1’s feet) b) MVS refined (right)

Abstract

We present a marker-less human motion capture system that uses multiple RGB-D cameras to estimate the 3D posture of multiple people online at interactive rates in an indoor workspace measuring approximately 5m × 5m × 2m. An interesting aspect of this work is how we handle the self-contact problem. We propose a novel multi-view voting scheme (MVS) to fuse measurements from different 2D or 3D algorithms. As a proof of concept, we present an MVS implementation that fuses optical flow images from each view and labels points in the current instance using the previously estimated posture. These labels allow us to trim invalid edges in a geodesic distance graph model and improve localization of geodesic extrema corresponding to the head, hands and feet for posture estimation. The system performs at ∼8.3 Hz with a cumulative latency of ∼570.40 ms and a projected median localization error of ∼0.149 m. In addition, we propose a new multi-view Kinect and Vicon publicly accessible motion capture dataset for validation and benchmarks.

Files

Dataset Links

Bibtex

@InProceedings{phan2015towards,
  Title       = {Towards 3D Human Posture Estimation Using Multiple Kinects Despite Self-Contacts},
  Author      = {Phan, Andrew and Ferrie, Frank P.},
  Booktitle   = {15th IAPR International Conference on Machine Vision Applications},
  Pages       = {567--571},
  Year        = {2015}
}