Citation: | Shuyang Li, Wei Liang, Qun Zhang. Human-Object Interaction Recognition Based on Modeling Context[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2017, 26(2): 215-222.doi:10.15918/j.jbit1004-0579.201726.0210 |
[1] |
Marszalek M, Laptev I, Schmid C. Actions in context[C]//Computer Vision and Pattern Recognition, IEEE Conference on, 2009:2929-2936.
|
[2] |
Sun J, Wu X, Yan S, et al. Hierarchical spatio-temporal context modeling for action recognition[C]//Computer Vision and Pattern Recognition, IEEE Conference on,2009:2004-2011.
|
[3] |
Wei P, Zhao Y, Zheng N, et al. Modeling 4D human-object interactions for event and object recognition[C]//IEEE International Conference on Computer Vision, 2013:3272-3279.
|
[4] |
Yao B, Li F F. Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses[J]. IEEE Transactions on Software Engineering, 2012, 34(9):1691-1703.
|
[5] |
Yao B, Li F F. Modeling mutual context of object and human pose in human-object interaction activities[M]//Modeling Mutual Context of Object and Human Pose in Human, 2010:17-24.
|
[6] |
Chang A X, Funkhouser T, Guibas L, et al. ShapeNet:an information-rich 3D model repository[J]. arXiv preprint arXiv:1512.03012,2015. https://arxiv.org/abs/1512.03012.
|
[7] |
Xia L, Chen C C, Aggarwal J K. View invariant human action recognition using histograms of 3D joints[C]//Computer Vision and Pattern Recognition Workshops, 2012 IEEE Computer Society Conference on, 2012:20-27.
|
[8] |
Bo Liefeng, Sminchisescu Cristian. Efficient match kernel between sets of features for visual recognition[C]//Advances in Neural Information Processing Systems 22, Conference on Neural Information Processing Systems 2009. Proceedings of A Meeting Held 7-10 December 2009, Vancouver, British Columbia, Canada, 2009:135-143.
|
[9] |
Grauman K, Darrell T. The pyramid match kernel:discriminative classification with sets of image features[C]//IEEE International Conference on Computer Vision, 2015,2:1458-1465.
|
[10] |
Yu K, Xu W, Gong Y. Deep learning with kernel regularization for visual recognition[J]. Advances in Neural Information Processing Systems, 2009,21(1):1889-1896.
|
[11] |
Bo Liefeng, Ren Xiaofeng, Fox Dieter. Kernel descriptors for visual recognition[C]//Advances in Neural Information Processing Systems 23, Conference on Neural Information Processing Systems 2010. Proceedings of A Meeting Held 6-9 December 2010, Vancouver, British Columbia, Canada, 2010:244-252.
|
[12] |
Nam C, Park J C, Kim D S. Indoor human activity recognition with contextual cues in videos[C]//International Conference on Ubiquitous Information Management and Communication, ACM, 2016:1-4.
|
[13] |
Gupta A, Kembhavi A, Davis L S. Observing human-object interactions:using spatial and functional compatibility for recognition.[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2009, 31(10):1775-89.
|
[14] |
Ller M, Der T. Motion templates for automatic classification and retrieval of motion capture data[C]//ACM Siggraph/eurographics Symposium on Computer Animation, SCA 2006, Vienna, Austria, 2006:137-146.
|
[15] |
Rabiner L R. A Tutorial on hidden Markov models and selected applications in speech recognition[J]. Proceedings of the IEEE, 1989, 77(2):257-286.
|
[16] |
Wang J, Liu Z, Wu Y. Learning actionlet ensemble for 3D human action recognition[M]//Human Action Recognition with Depth Cameras. Berlin:Springer International Publishing, 2014:11-40.
|