Welcome to Journal of Beijing Institute of Technology
Volume 26Issue 2
.
Turn off MathJax
Article Contents
Shuyang Li, Wei Liang, Qun Zhang. Human-Object Interaction Recognition Based on Modeling Context[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2017, 26(2): 215-222. doi: 10.15918/j.jbit1004-0579.201726.0210
Citation: Shuyang Li, Wei Liang, Qun Zhang. Human-Object Interaction Recognition Based on Modeling Context[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2017, 26(2): 215-222.doi:10.15918/j.jbit1004-0579.201726.0210

Human-Object Interaction Recognition Based on Modeling Context

doi:10.15918/j.jbit1004-0579.201726.0210
  • Received Date:2016-07-25
  • This paper proposes a method to recognize human-object interactions by modeling context between human actions and interacted objects. Human-object interaction recognition is a challenging task due to severe occlusion between human and objects during the interacting process. Since that human actions and interacted objects provide strong context information, i.e. some actions are usually related to some specific objects, the accuracy of recognition is significantly improved for both of them. Through the proposed method, both global and local temporal features from skeleton sequences are extracted to model human actions. In the meantime, kernel features are utilized to describe interacted objects. Finally, all possible solutions from actions and objects are optimized by modeling the context between them. The results of experiments demonstrate the effectiveness of our method.
  • loading
  • [1]
    Marszalek M, Laptev I, Schmid C. Actions in context[C]//Computer Vision and Pattern Recognition, IEEE Conference on, 2009:2929-2936.
    [2]
    Sun J, Wu X, Yan S, et al. Hierarchical spatio-temporal context modeling for action recognition[C]//Computer Vision and Pattern Recognition, IEEE Conference on,2009:2004-2011.
    [3]
    Wei P, Zhao Y, Zheng N, et al. Modeling 4D human-object interactions for event and object recognition[C]//IEEE International Conference on Computer Vision, 2013:3272-3279.
    [4]
    Yao B, Li F F. Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses[J]. IEEE Transactions on Software Engineering, 2012, 34(9):1691-1703.
    [5]
    Yao B, Li F F. Modeling mutual context of object and human pose in human-object interaction activities[M]//Modeling Mutual Context of Object and Human Pose in Human, 2010:17-24.
    [6]
    Chang A X, Funkhouser T, Guibas L, et al. ShapeNet:an information-rich 3D model repository[J]. arXiv preprint arXiv:1512.03012,2015. https://arxiv.org/abs/1512.03012.
    [7]
    Xia L, Chen C C, Aggarwal J K. View invariant human action recognition using histograms of 3D joints[C]//Computer Vision and Pattern Recognition Workshops, 2012 IEEE Computer Society Conference on, 2012:20-27.
    [8]
    Bo Liefeng, Sminchisescu Cristian. Efficient match kernel between sets of features for visual recognition[C]//Advances in Neural Information Processing Systems 22, Conference on Neural Information Processing Systems 2009. Proceedings of A Meeting Held 7-10 December 2009, Vancouver, British Columbia, Canada, 2009:135-143.
    [9]
    Grauman K, Darrell T. The pyramid match kernel:discriminative classification with sets of image features[C]//IEEE International Conference on Computer Vision, 2015,2:1458-1465.
    [10]
    Yu K, Xu W, Gong Y. Deep learning with kernel regularization for visual recognition[J]. Advances in Neural Information Processing Systems, 2009,21(1):1889-1896.
    [11]
    Bo Liefeng, Ren Xiaofeng, Fox Dieter. Kernel descriptors for visual recognition[C]//Advances in Neural Information Processing Systems 23, Conference on Neural Information Processing Systems 2010. Proceedings of A Meeting Held 6-9 December 2010, Vancouver, British Columbia, Canada, 2010:244-252.
    [12]
    Nam C, Park J C, Kim D S. Indoor human activity recognition with contextual cues in videos[C]//International Conference on Ubiquitous Information Management and Communication, ACM, 2016:1-4.
    [13]
    Gupta A, Kembhavi A, Davis L S. Observing human-object interactions:using spatial and functional compatibility for recognition.[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2009, 31(10):1775-89.
    [14]
    Ller M, Der T. Motion templates for automatic classification and retrieval of motion capture data[C]//ACM Siggraph/eurographics Symposium on Computer Animation, SCA 2006, Vienna, Austria, 2006:137-146.
    [15]
    Rabiner L R. A Tutorial on hidden Markov models and selected applications in speech recognition[J]. Proceedings of the IEEE, 1989, 77(2):257-286.
    [16]
    Wang J, Liu Z, Wu Y. Learning actionlet ensemble for 3D human action recognition[M]//Human Action Recognition with Depth Cameras. Berlin:Springer International Publishing, 2014:11-40.
  • 加载中

Catalog

    通讯作者:陈斌, bchen63@163.com
    • 1.

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (575) PDF downloads(849) Cited by()
    Proportional views
    Related

    /

      Return
      Return
        Baidu
        map