[1]郑飘飘,万健,司华友. 基于评论的热点新闻事件识别方法研究[J].浙江科技学院学报,2019,(05):392-399.
 ZHENG Piaopiao,WAN Jian,SI Huayou. Research on methodology of identifying hot news event based on comments[J].,2019,(05):392-399.
点击复制

 基于评论的热点新闻事件识别方法研究()
分享到:

《浙江科技学院学报》[ISSN:1001-3733/CN:61-1062/R]

卷:
期数:
2019年05期
页码:
392-399
栏目:
出版日期:
2019-10-29

文章信息/Info

Title:
 Research on methodology of identifying hot news event based on comments
文章编号:
1671-8798(2019)05-0392-08
作者:
 郑飘飘万健司华友
 浙江科技学院 信息与电子工程学院,杭州 310023;杭州电子科技大学 计算机学院,杭州 310018
Author(s):
 ZHENG Piaopiao WAN Jian SI Huayou
 School of Information and Electronic Engineering, Zhejiang University of Science and Technology,Hangzhou 310023, Zhejiang, China;School of Computer Science and Technology,Hangzhou Dianzi University, Hangzhou 310018, Zhejiang, China
关键词:
 新闻评论事件识别信息抽取
分类号:
TP391.43
文献标志码:
A
摘要:
 随着互联网的普及,非结构化文本数据的规模不断扩大且越来越多地用于大众传播。因此,从海量数据抽取热点信息已成为一个重要的研究课题。针对新闻的热点挖掘进行方法改进及分析,结合新闻及事件模型,使用TextRank算法提取关键词,运用相似度计算方法,提出了一种基于评论的热点新闻事件识别方法。研究结果表明该方法具有一定的可行性。

参考文献/References:

[1]KLUEGL P, TOEPFER M, BECK P D, et al. UIMA Ruta: rapid development of rulebased information extraction applications[J].Natural Language Engineering,2016,22(1):1.
[2]YIM W, DENMAN T, KWAN S W, et al. Tumor information extraction in radiology reports for hepatocellular carcinoma patients[C]//2016 Joint Summits on Translational Science. San Franciso:AMIA,2016:455.
[3]WI C I, SOHN S, ROLFES M C, et al. Application of a natural language processing algorithm to asthma ascertainment. An automated chart review[J].American Journal of Respiratory and Critical Care Medicine,2017,196(4):430.
[4]AFZAL N, MALLIPEDDI V P, SOHN S, et al. Natural language processing of clinical notes for identification of critical limb ischemia[J].International Journal of Medical Informatics,2018,111:83.
[5]YAZDANI S, FALLET S, VESIN J M. A novel shortterm event extraction algorithm for biomedical signals[J].IEEE Transactions on Biomedical Engineering,2018,65(4):754.
[6]DAVE K, LAWRENCE S, PENNOCK D M. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews[C]//Proceedings of the 12th international conference on World Wide Web. Budapest:ACM,2003:519.
[7]ALLAN J, CARBONELL J G, DODDINGTON G, et al. Topic detection and tracking pilot study final report[C]//Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop. San Francisco: Morgan Kaufmann Publishers,1998:194.
[8]CAP′O M, PE′REZ A, LOZANO J A. An efficient approximation to the Kmeans clustering for massive data[J].KnowledgeBased Systems,2017,117:56.
[9]CUI X, ZHU P, YANG X, et al. Optimized big data Kmeans clustering using MapReduce[J].The Journal of Supercomputing, 2014,70(3):1249.
[10]STEINBACH M, KARYPIS G, KUMAR V. A comparison of document clustering techniques[C]//KDD2000 Workshop on Text Mining. Boston:SIGKDD,2000:1.
[11]GARRIDO A L, BUEY M G, ESCUDERO S, et al. The genie project:a semantic pipeline for automatic document categorisation[C]//The 10th International Conference on Web Information Systems and Technologies. Barcelona, Spain:WEBIST,2014:161.
[12]LEFEVER E, HOSTE V. A classificationbased approach to economic event detection in dutch news text[C]//Tenth International Conference on Language Resources and Evaluation (LREC’16). Portoroz, Slovenia: European Language Resources Association (ELRA),2016:330.
[13]JACOBS G, LEFEVER E, HOSTE V. Economic event detection in companyspecific news text[C]//The 56th Annual Meeting of the Association for Computational Linguistics. Melbourne, Australia: Association for Computational Linguistics,2018:1.
[14]YANG Z, LI Q, WENYIN L, et al. Shared multiview data representation for multidomain event detection[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2019,99:1.
[15]RUPNIK J, MUHICA, LEBAN G, et al. News across languages: crosslingual document similarity and event tracking[J].Journal of Artificial Intelligence Research,2016,55(1):283.
[16]CUI X, ZHU P, YANG X, et al. Optimized big data Kmeans clustering using MapReduce[J].The Journal of Supercomputing,2014,70(3):1249.
[17]POWERS D MW. Evaluation: from precision, recall and Fmeasure to ROC, informedness, markedness and correlation[J].Journal of Machine Learning Technologies,2011,2(1):37.

相似文献/References:

[1]张 伟,沈亚华. 气相色谱法测定神香苏合丸中冰片的含量 [J].浙江科技学院学报,2010,(02):85.
 ZHANG Wei,SHEN Ya-hua.Determination of content of borneol in Shenxiang Suhe pill by GC[J].,2010,(05):85.
[2]张银南. 微机接口实验辅助教学系统设计 [J].浙江科技学院学报,2010,(02):107.
 ZHANG Yin-nan.Design of assisted instruction system on computer interface experiment[J].,2010,(05):107.
[3]陈烨.小球在粘滞液体中运动情况的探讨 [J].浙江科技学院学报,2001,(02):1.
 Chen Ye.Discussion of the globule movement in the liquid of viscosity[J].,2001,(05):1.
[4]彭荷芬.翻译中的意义与阐释 [J].浙江科技学院学报,2001,(02):50.
 Peng He-fen.Meaning and interpretation[J].,2001,(05):50.
[5]李明.最可几速率与波耳兹曼因子 [J].浙江科技学院学报,2000,(02):10.
 Li Ming.The most probable speed and Boltzmann factor[J].,2000,(05):10.
[6]陶松垒 冯全宏 邱春芳 张志坚.塑料排水板加固水下软基的方法及设备 [J].浙江科技学院学报,2000,(03):33.
 Tao Songlei,Feng Quanhong,Qiu Chunfang,et al.Methods and devices for reinforcement of underwater soft\|ground with plastic drainage plate[J].,2000,(05):33.
[7]诸森儿 谢列卫 夏怡新 马莉萍.院级实验室管理体制改革的思考与实践 [J].浙江科技学院学报,2000,(03):40.
 Zhu Shener,Xie Liewei,Xia Yixin,et al.Reflection and practice on reformation of the college laboratory management system[J].,2000,(05):40.
[8]彭鸿广.研发竞赛中参与人的策略与发起者的收益研究 [J].浙江科技学院学报,2011,(03):234.[doi:10.3969/j.issn.1671-8798.2011.03.014]
 Peng Hong-guang.Contestants strategy choices and sponsor‘s revenue in R&D contest[J].,2011,(05):234.[doi:10.3969/j.issn.1671-8798.2011.03.014]
[9]陈烨.探测电路与导电介质对模拟静电场测绘的影响 [J].浙江科技学院学报,2000,(04):1.
 Chen Ye.The effect of measurement in simulating electrostatic field made by plumb circuit and conducting dielectric[J].,2000,(05):1.
[10]周文杰.分析流行潮头的大众化趋势 [J].浙江科技学院学报,2000,(04):60.
 Zhou Wen-jie.Popular tendency of prevalence trend[J].,2000,(05):60.

备注/Memo

备注/Memo:
收稿日期: 2019-05-02基金项目: 国家自然科学基金项目(61572163)通信作者: 万健(1969—),男,福建省泉州人,教授,博士,主要从事云计算大数据研究。
更新日期/Last Update: