Real-time gesture recognition from depth data through key poses learning and decision forests
Leandro Miranda, Thales Vieira, Dimas Martinez, Thomas Lewiner, Antônio Wilson Vieira, Mario F. M. Campos
Sibgrapi 2012 (XXV Conference on Graphics, Patterns and Images): pp. 268-275 (august 2012)
Selected for publication in Pattern Recognition Letters
Selected for publication in Pattern Recognition Letters
Abstract:
Human gesture recognition is a challenging task with many applications. The popularization of real time depth sensors even diversifies potential applications to end-user natural user interface (NUI). The quality of such NUI highly depends on the robustness and execution speed of the gesture recognition. This work introduces a method for real-time gesture recognition from a noisy skeleton stream, such as the ones extracted from Kinect depth sensors. Each pose is described using a tailored angular representation of the skeleton joints. Those descriptors serve to identify key poses through a multi-class classifier derived from Support Vector learning machines. The gesture is labeled on-the-fly from the key pose sequence through a decision forest, that naturally performs the gesture time warping and avoids the requirement for an initial or neutral pose. The proposed method runs in real time and shows robustness in several experiments.Downloads:
PDF paper (1.8 MB)Keynote presentation (85 MB)
Slides in PDF (10.1 MB)
movie (47.5 MB)
YouTube
BibTeX:
@inproceedings{gesture_learning_sibgrapi,author = {Leandro Miranda and Thales Vieira and Dimas Martinez and Thomas Lewiner and Antônio Wilson Vieira and Mario F. M. Campos},
title = {Real-time gesture recognition from depth data through key poses learning and decision forests},
year = {2012},
month = {august},
booktitle = {Sibgrapi 2012 (XXV Conference on Graphics, Patterns and Images)},
pages = {268--275},
publisher = {IEEE},
address = {Ouro Preto, MG},
doi = {10.1109/SIBGRAPI.2012.44},
url = {\url{http://thomas.lewiner.org/pdfs/gesture_learning_sibgrapi.pdf}}
}