Welcome to Journal of Beijing Institute of Technology
Volume 28Issue 2
.
Turn off MathJax
Article Contents
Lerong Ma. Entity Burst Discriminative Model for Cumulative Citation Recommendation[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2019, 28(2): 356-364. doi: 10.15918/j.jbit1004-0579.18141
Citation: Lerong Ma. Entity Burst Discriminative Model for Cumulative Citation Recommendation[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2019, 28(2): 356-364.doi:10.15918/j.jbit1004-0579.18141

Entity Burst Discriminative Model for Cumulative Citation Recommendation

doi:10.15918/j.jbit1004-0579.18141
  • Received Date:2018-08-02
  • Knowledge base acceleration-cumulative citation recommendation (KBA-CCR) aims to detect citation-worthiness documents from a chronological stream corpus for a set of target entities in a knowledge base. Most previous works only consider a number of semantic features between documents and target entities in the knowledge base, and then use powerful machine learning approaches such as logistic regression to classify relevant documents and non-relevant documents. However, the burst activities of an entity have been proved to be a significant signal to predict potential citations. In this paper, an entity burst discriminative model (EBDM) is presented to substantially exploit such burst features. The EBDM presents a new temporal representation based on the burst features, which can capture both temporal and semantic correlations between entities and documents. Meanwhile, in contrast to the bag-of-words model, the EBDM can significantly decrease the number of non-zero entries of feature vectors. An extensive set of experiments were conducted on the TREC-KBA-2012 dataset. The results show that the EBDM outperforms the performance of the state-of-the-art models.
  • loading
  • [1]
    Shen Wei, Wang Jianyong, Han Jiawei. Entity linking with a knowledge base:issues, techniques, and solutions[J]. Knowledge and Data Engineering, IEEE Transactions on, 2015,27(2):443-460.
    [2]
    Yih Wentau, Chang Mingwei, He Xiaodong, et al. Semantic parsing via staged query graph generation:question answering with knowledge base[C]//Association for Computational Linguistics (ACL), 2015.
    [3]
    Nambi S N A U, Sarkar C, Prasad R V, et al. A unified semantic knowledge base for IoT[C]//2014 IEEE World Forum on Internet of Things (WF-IoT), March, 2014:575-580.
    [4]
    Krisztian Balog, Heri Ramampiaro. Cumulative citation recommendation:classification vs. ranking[C]//SIGIR, ACM, 2013:941-944.
    [5]
    Wang Jingang, Liao Lejian, Song Dandan, et al. Resorting relevance evidences to cumulative citation recommendation for knowledge base acceleration[C]//WAIM, 2015.
    [6]
    Zellig S Harris. Distributional structure[J]. Word, 1954,10(2-3):146-162.
    [7]
    Richard Berendsen, Edgar Meij, Daan Odijk, et al. The University of Amsterdam at TREC 2012[C]//TREC, NIST, 2012.
    [8]
    Krisztian Balog, Heri Ramampiaro, Naimdjon Takhirov, et al. Multi-step classification approaches to cumulative citation recommendation[C]//OAIR, ACM, 2013:121-128.
    [9]
    Ludovic Bonnefoy, Vincent Bouvier, Patrice Bellot. A weakly-supervised detection of entity central documents in a stream[C]//SIGIR, ACM, 2013:769-772.
    [10]
    Zhou Mianwei, Chang Kevin Chen-Chuan. Entity-centric document filtering:boosting feature mapping through meta-features[C]//CIKM, ACM, 2013:119-128.
    [11]
    Liu Xitong, Darko Jerrey, Fang Hui. A related entity based approach for knowledge base acceleration[C]//TREC, NIST, 2013.
    [12]
    Gebrekirstos G Gebremeskel, He Jiyin, Arjen P de Vries, et al. Cumulative citation recommendation:a feature-aware comparison of approaches[C]//Database and Expert Systems Applications (DEXA), IEEE, 2014:193-197.
    [13]
    Dalton Jerrey, Dietz Laura. Bi-directional linkability from wikipedia to documents and back again:Umass at TREC 2012 knowledge base acceleration track[C]//TREC, NIST, 2012.
    [14]
    Witter Z, Tompkins C, Small S G. Sawus:Siena's automatic wikipedia update system[C]//TREC, NIST, 2012.
    [15]
    Wang Jingang, Song Dandan, Lin Chin-Yew, et al. Bit and msra at treckbaccr track 2013[C]//TREC, NIST, 2013.
    [16]
    Ma Lerong, Song Dandan, Liao Lejian, et al. PSVM:a preference-enhanced SVM model using preference data for classification[J]. Sci China Inf Sci, 2017, 60:122103.
    [17]
    Ma Lerong, Song Dandan, Liao Lejian, et al. A latent entity-document class mixture of experts model for cumulative citation recommendation[J]. Tsinghua Science and Technology, 2018, 23(6):660-670.
    [18]
    Ma Lerong, Song Dandan, Liao Lejian, et al. A joint deep model of entities and documents for cumulative citation recommendation, Cluster Comput (2017)[EB/OL].[2017-10-17]. https://doi.org/10.1007/s10586-017-1273-x.
    [19]
    Yang Yiming, Pierce Tom, Carbonell Jaime. A study of retrospective and on-line event detection[C]//SIGIR, ACM, 1998:28-36.
    [20]
    Kleinberg Jon. Bursty and hierarchical structure in streams[C]//KDD, ACM, 2002:91-101.
    [21]
    He Qi, Chang Kuiyu, Lim Ee-Peng, et al. Bursty feature representation for clustering text streams[C]//SDM, SIAM, 2007:491-496.
    [22]
    He Qi, Chang Kuiyu, Lim Ee-Peng. Using burstiness to improve clustering of topics in news streams[C]//ICDM, IEEE, 2007:493-498.
    [23]
    Zhao Xin Wayne, Chen Rishan, Fan Kai, et al. A novel burst-based text representation model for scalable event detection[C]//ACL, ACL, 2012:43-47.
    [24]
    Allan James. Introduction to topic detection and tracking[C]//Topic Detection and Tracking, volume 12 of The Information Retrieval Series, Springer, US, 2002:1-16.
    [25]
    Andrew Y Ng, Michael I Jordan. On discriminative vs. generative classifiers:a comparison of logistic regression and naive Bayes[C]//Dietterich T G, Becker S, Ghahramani Z, eds. Advances in Neural Information Processing Systems 14, MIT Press, 2002:841-848.
    [26]
    Genkin Alexander, Lewis David D, Madigan David. Large-scale Bayesian logistic regression for text categorization[J]. Technometrics, 2007,49(3):291-304.
    [27]
    Michail Vlachos, Christopher Meek, Zografoula Vagena, et al. Identifying similarities, periodicities and bursts for online search queries[C]//SIGMOD, ACM, 2004:131-142.
    [28]
    Brain Kjersten, Paul McNamee. The hltcoe approach to the TREC 2012 kba track[C]//TREC, NIST, 2012.
  • 加载中

Catalog

    通讯作者:陈斌, bchen63@163.com
    • 1.

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (416) PDF downloads(233) Cited by()
    Proportional views
    Related

    /

      Return
      Return
        Baidu
        map