您的位置:首页  > 论文页面

基于实时爬取的提醒服务

发表时间:2013-08-31  浏览量:1126  下载量:557
全部作者: 杨同峰,马军
作者单位: 山东大学计算机科学与技术学院
摘 要: 随着互联网信息的爆炸性增长和人们对互联网依赖的日益加强,传统的通过用户主动浏览网页获取信息的模式,很难满足用户对信息实时性和完整性的需求。提出一个通知服务:首先由用户指定需要从中获取信息的普通网页,系统半自动地生成提取信息模板,并构建用户配置文件。系统对配置中的网页实时监控,发现新的信息后,根据用户指定的方式,将更新的信息推送给用户。该系统支持邮件、短信、客户端等多种推送方式,允许用户随时随地获取最新的信息。系统同时给出了根据用户历史数据获取用户兴趣并推送相应消息的功能。最后,对该系统的实时性和有效性进行评测。
关 键 词: 计算机应用;信息检索;实时爬取;信息抽取
Title: A notification service based on real-time crawling
Author: YANG Tongfeng, MA Jun
Organization: College of Computer Science & Technology, Shandong University
Abstract: With the explosive growth of the internet information and the increasing dependence of people on web, the traditional model that user browsing web pages actively can not satisfy the real-time and full the demand on information. In this paper, a notification system was proposed: user specified the web page whose information needed to be obtained at first, then the system semi-automatically created the extraction template, and added to user configuration, and the system monitored all the pages that users specified, sended message to users by the way they wanted. This system supported many notification methods such as email, short message, mobile application, and allowed the users to obtain the new information. User’s interest was detected with their history data, and messages of related topics were recommended to them as well. At last, the real-time performance and effectiveness of the system was evaluated.
Key words: computer application; information retrieval; real-time crawling; information extraction
发表期数: 2013年8月第16期
引用格式: 杨同峰,马军. 基于实时爬取的提醒服务[J]. 中国科技论文在线精品论文,2013,6(16):1494-1500.
 
0 评论数 0
暂无评论
友情链接