Welcome to Journal of Beijing Institute of Technology
Volume 26Issue 4
.
Turn off MathJax
Article Contents
Senlin Luo, Qianrou Chen, Jia Guo, Ji Zhang, Limin Pan. Microblog Summarization via Enriching Contextual Features Based on Sentence-Level Semantic Analysis[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2017, 26(4): 505-516. doi: 10.15918/j.jbit1004-0579.201726.0410
Citation: Senlin Luo, Qianrou Chen, Jia Guo, Ji Zhang, Limin Pan. Microblog Summarization via Enriching Contextual Features Based on Sentence-Level Semantic Analysis[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2017, 26(4): 505-516.doi:10.15918/j.jbit1004-0579.201726.0410

Microblog Summarization via Enriching Contextual Features Based on Sentence-Level Semantic Analysis

doi:10.15918/j.jbit1004-0579.201726.0410
  • Received Date:2016-11-18
  • A novel microblog summarization approach via enriching contextual features on sentence-level semantic analysis is proposed in this paper. At first, a Chinese sentential semantic model (CSM) is employed to analyze the semantic structure of each microblog sentence. Then,a combination of sentence-level semantic analysis and latent dirichlet allocation is utilized to acquire extra features and related words to enrich the collection of microblog messages. The simlilarites between the two sentences are calculated based on the enriched features. Finally, the semantic weight and relation weight are calculated to select the most informative sentences, which form the final summary for microblog messages.Experimental results demonstrate the advantages of our proposed approach. The results indicate that introducing sentence-level semantic analysis for context enrichment can better represent sentential semantic.The proposed criteria,namely, semantic weight and relation weight enhance summary result. Furthermore, CSM is a useful framework for sentence-level semantic analysis.
  • loading
  • [1]
    Long Rui, Wang Haofen, Chen Yuqiang, et al. Towards effective event detection, tracking and summarization on microblog data[C]//International Conference on Web-Age Information Management. Berlin, Heidelberg:Springer, 2011:652-663.
    [2]
    Hua W, Zhang Yanqing. Threshold and associative based classification for social spam profile detection on twitter[C]//Semantics, Knowledge and Grids (SKG), 2013 Ninth International Conference on IEEE, Beijing, China,2013.
    [3]
    Olariu A. Clustering to improve microblog stream summarization[C]//Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 201214th International Symposium on IEEE, Omaha, NE, USA,2012.
    [4]
    Khan M A H, Bollegala D, Liu Guangwen,et al. Multi-tweet summarization of real-time events[C]//Social Computing (SocialCom), 2013 International Conference on IEEE, Washington, DC, USA,2013.
    [5]
    Zhang Renxian, Li Weijie, Gao Dehong, et al. Automatic twitter topic summarization with speech acts[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2013, 21(3):649-658.
    [6]
    Blei D M, Ng A Y, Jordan M I. Latent dirichlet allocation[J]. Journal of Machine Learning Research, 2003, 3:993-1022.
    [7]
    Luo Senlin, Han Lei, Pan Limin, et al. Construction method of Chinese sentential semantic structure[J]. Journal of Beijing Institute of Technology, 2015, 24(1):110-117.
    [8]
    Harabagiu S M, Hickl A. Relevance modeling for microblog summarization[C]//International AAAI Conference on Web and Social Media, Barcelona, Spain, 2011.
    [9]
    Liu Yinbin, Shi Mengyao. The influence factors to the enterprise microblogs[C]//Computing Technology and Information Management (ICCM), 20128th International Conference on IEEE, Seoul, Korea,2012.
    [10]
    Wang Xuming, ZuoWanli, Wang Ying, et al. Microblog comments sentiment analysis based on extended emotional lexicon[C]//International Conference on Information Computing and Applications. Springer, Berlin:Heidelberg, 2013:385-397.
    [11]
    Sharifi B, Hutton M A, Kalita J K. Experiments in microblog summarization[C]//Social Computing (SocialCom), 2010 IEEE Second International Conference on IEEE, Minneapolis, MN, USA,2010.
    [12]
    Sharifi B P, Inouye D I, Kalita J K. Summarization of twitter microblogs[J]. The Computer Journal, 2013, 57(3):378-402.
    [13]
    Chakrabarti D, Punera K. Event summarization using tweets[J]. ICWSM, 2011, 11:66-73.
    [14]
    Inouye D, Kalita J K. Comparing twitter summarization algorithms for multiple post summaries[C]//Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on IEEE, Boston, USA,2011.
    [15]
    Vanderwende L, Suzuki H, Brockett C, et al. Beyond SumBasic:Task-focused summarization with sentence simplification and lexi-cal expansion[J]. Information Processing & Management, 2007, 43(6):1606-1618.
    [16]
    Mihalcea T R, Textrank P T. Bringing order into texts[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing, Barcelona, Spain, 2004.
    [17]
    Erkan G, Radev D R. Lexrank:Graph-based lexical centrality as salience in text summarization[J]. Journal of Artificial Intelligence Research, 2004, 22:457-479.
    [18]
    Liu Fei, Liu Yang, WengFuliang. Why is sxsw trending?:exploring multiple text sources for twitter topic summarization[C]//Workshop on Languages in Social Media. Association for Computational Linguistics, Stroudsburg, PA, USA,2011.
    [19]
    Gao Dehong, Li Wenjie, CaiXiaoyan, et al. Sequential summarization:A full view of twitter trending topics[J]. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2014, 22(2):293-302.
    [20]
    Chen Yan, Zhang Xiaoming, Li Zhoujun, et al. Search engine reinforced semi-supervised classification and graph-based summarization of microblogs[J]. Neurocomputing, 2015, 152:274-286.
    [21]
    He Yanxiang, Su Wen, Tian Ye, et al. Summarizing microblogs on network hot topics[C]//Internet Technology and Applications (iTAP), 2011 International Conference on IEEE, Wuhan, China, 2011.
    [22]
    Bian Jingwen, Yang Yang, Chua T S. Multimedia summarization for trending topics in microblogs[C]//the 22nd ACM International Conference on Information & Knowledge Management ACM, New York, USA,2013.
    [23]
    Luo Senlin, Liu Yingying, Feng Yang, et al. Method of building BFS-CTC:A Chinese tagged corpus of sentential semantic structure[J]. Transactions of Beijing Institute of Technology, 2012, 32(3):311-315. (in Chinese)
    [24]
    Heinrich G. Parameter estimation for text analysis[R]. Germany:University of Leipzig, 2008.
    [25]
    Phan X H, Nguyen C T. GibbsLDA++:A C/C++ implementation of latent Dirichlet allocation (LDA)[EB/OL].[2013/04/15]. http://gibbslda.sourceforge.net/.
    [26]
    Lin C Y. Rouge:a package for automatic evaluation of summaries[C]//Text Summarization Branches out:Proceedings of the ACL-04 workshop, Barcelona, Spain, 2004.
    [27]
    Sharifi B, Hutton M A, Kalita J. Summarizing microblogs automatically[C]//Human Language Technologies:The 2010 Annual Con-ference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, California, USA, 2010.
    [28]
    Gong Yihong, Xin Liu. Generic text summarization using relevance measure and latent semantic analysis[C]//the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, USA, 2001.
    [29]
    Arora R, Ravindran B. Latent dirichlet allocation based multi-document summarization[C]//The Second Workshop on Analytics for Noisy Unstructured Text Data ACM, New York, NY, USA, 2008.
  • 加载中

Catalog

    通讯作者:陈斌, bchen63@163.com
    • 1.

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (934) PDF downloads(527) Cited by()
    Proportional views
    Related

    /

      Return
      Return
        Baidu
        map