Article Contents

Article Navigation> JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY> 2017> 26(2): 215-222

Shuyang Li, Wei Liang, Qun Zhang. Human-Object Interaction Recognition Based on Modeling Context[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2017, 26(2): 215-222. doi: 10.15918/j.jbit1004-0579.201726.0210

Citation:

Shuyang Li, Wei Liang, Qun Zhang. Human-Object Interaction Recognition Based on Modeling Context[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2017, 26(2): 215-222.doi:10.15918/j.jbit1004-0579.201726.0210

Citation:

Shuyang Li, Wei Liang, Qun Zhang. Human-Object Interaction Recognition Based on Modeling Context[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2017, 26(2): 215-222.doi:10.15918/j.jbit1004-0579.201726.0210

PDF( 5243 KB)

Human-Object Interaction Recognition Based on Modeling Context

doi:10.15918/j.jbit1004-0579.201726.0210

School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China

Received Date:2016-07-25

Abstract

Abstract

This paper proposes a method to recognize human-object interactions by modeling context between human actions and interacted objects. Human-object interaction recognition is a challenging task due to severe occlusion between human and objects during the interacting process. Since that human actions and interacted objects provide strong context information, i.e. some actions are usually related to some specific objects, the accuracy of recognition is significantly improved for both of them. Through the proposed method, both global and local temporal features from skeleton sequences are extracted to model human actions. In the meantime, kernel features are utilized to describe interacted objects. Finally, all possible solutions from actions and objects are optimized by modeling the context between them. The results of experiments demonstrate the effectiveness of our method.
- human-object interaction,
- action recognition,
- object recognition,
- modeling context

FullText(HTML)

References (16)

References

[1]	Marszalek M, Laptev I, Schmid C. Actions in context[C]//Computer Vision and Pattern Recognition, IEEE Conference on, 2009:2929-2936.
[2]	Sun J, Wu X, Yan S, et al. Hierarchical spatio-temporal context modeling for action recognition[C]//Computer Vision and Pattern Recognition, IEEE Conference on,2009:2004-2011.
[3]	Wei P, Zhao Y, Zheng N, et al. Modeling 4D human-object interactions for event and object recognition[C]//IEEE International Conference on Computer Vision, 2013:3272-3279.
[4]	Yao B, Li F F. Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses[J]. IEEE Transactions on Software Engineering, 2012, 34(9):1691-1703.
[5]	Yao B, Li F F. Modeling mutual context of object and human pose in human-object interaction activities[M]//Modeling Mutual Context of Object and Human Pose in Human, 2010:17-24.
[6]	Chang A X, Funkhouser T, Guibas L, et al. ShapeNet:an information-rich 3D model repository[J]. arXiv preprint arXiv:1512.03012,2015. https://arxiv.org/abs/1512.03012.
[7]	Xia L, Chen C C, Aggarwal J K. View invariant human action recognition using histograms of 3D joints[C]//Computer Vision and Pattern Recognition Workshops, 2012 IEEE Computer Society Conference on, 2012:20-27.
[8]	Bo Liefeng, Sminchisescu Cristian. Efficient match kernel between sets of features for visual recognition[C]//Advances in Neural Information Processing Systems 22, Conference on Neural Information Processing Systems 2009. Proceedings of A Meeting Held 7-10 December 2009, Vancouver, British Columbia, Canada, 2009:135-143.
[9]	Grauman K, Darrell T. The pyramid match kernel:discriminative classification with sets of image features[C]//IEEE International Conference on Computer Vision, 2015,2:1458-1465.
[10]	Yu K, Xu W, Gong Y. Deep learning with kernel regularization for visual recognition[J]. Advances in Neural Information Processing Systems, 2009,21(1):1889-1896.
[11]	Bo Liefeng, Ren Xiaofeng, Fox Dieter. Kernel descriptors for visual recognition[C]//Advances in Neural Information Processing Systems 23, Conference on Neural Information Processing Systems 2010. Proceedings of A Meeting Held 6-9 December 2010, Vancouver, British Columbia, Canada, 2010:244-252.
[12]	Nam C, Park J C, Kim D S. Indoor human activity recognition with contextual cues in videos[C]//International Conference on Ubiquitous Information Management and Communication, ACM, 2016:1-4.
[13]	Gupta A, Kembhavi A, Davis L S. Observing human-object interactions:using spatial and functional compatibility for recognition.[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2009, 31(10):1775-89.
[14]	Ller M, Der T. Motion templates for automatic classification and retrieval of motion capture data[C]//ACM Siggraph/eurographics Symposium on Computer Animation, SCA 2006, Vienna, Austria, 2006:137-146.
[15]	Rabiner L R. A Tutorial on hidden Markov models and selected applications in speech recognition[J]. Proceedings of the IEEE, 1989, 77(2):257-286.
[16]	Wang J, Liu Z, Wu Y. Learning actionlet ensemble for 3D human action recognition[M]//Human Action Recognition with Depth Cameras. Berlin:Springer International Publishing, 2014:11-40.

Relative Articles

Supplements (0)

Cited By

Proportional views

Proportional views

通讯作者:陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Get Citation

PDF

XML

Article Metrics

Article views (575) PDF downloads(849)

Human-Object Interaction Recognition Based on Modeling Context

doi:10.15918/j.jbit1004-0579.201726.0210

Abstract

References

Proportional views

Catalog

通讯作者:陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Human-Object Interaction Recognition Based on Modeling Context

doi:10.15918/j.jbit1004-0579.201726.0210

Abstract

References

Proportional views

Catalog

通讯作者:陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content