
Research on a Content Extraction Engine for Print PDL Files Based on the MSH Algorithm in Print Auditing Systems

Published: 2016-05-31
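Authors: LIU Sipei, GUO Yanhui
Affiliations: Information Security Center, Beijing University of Posts and Telecommunications (both authors)
Abstract: To improve the real-time performance and efficiency of extracting the content of print job files while they are in transit, this work surveys print auditing technology in China and abroad, analyzes in detail the characteristics of several print job file protocols, and builds a feature set from them. It then presents the framework design of a content extraction engine for multiple print job file formats based on the MSH (max-shift Horspool) algorithm, together with its implementation and testing. The experimental results demonstrate the accuracy of page description language (PDL) content extraction and show improved print auditing efficiency, offering guidance for later research on distributed print auditing systems. The content extraction method also provides a reference for computer crime forensics and after-the-fact source tracing.
Keywords: computer applications; network printing; print auditing; page description language; pattern matching algorithm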
Title: Information extraction from printer PDL files of various formats based on max-shift Horspool algorithm
Author: LIU Sipei, GUO Yanhui
Organization: School of Computer Science, Beijing University of Posts and Telecommunications
Abstract: To extract the contents of print jobs while print files are in transit, and to improve the real-time performance and efficiency of print auditing, this paper proposes a framework for extracting content from printer page description language (PDL) files of various formats, based on the max-shift Horspool (MSH) algorithm. The design follows a survey of print auditing technology at home and abroad and an analysis of the features of various print job protocols, from which a feature set is established. The design is then implemented and tested. The results show that content is extracted accurately from printer PDL files and that the method improves the efficiency of print auditing, which offers guidance for research on distributed print auditing systems. The extraction method also provides a reference for computer forensics and for after-the-fact source tracing.
Key words: computer applications; network printing; print audit; page description language; string matching algorithm
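Issue: No. 10, May 2016
Citation: LIU Sipei, GUO Yanhui. Research on a content extraction engine for print PDL files based on the MSH algorithm in print auditing systems[J]. 中国科技论文在线精品论文, 2016, 9(10): 988-996.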
 