Application of Apriori Algorithm in Finding User’s Webpage Browsing Mode
Abstract: The log file of web server which recorded a large number of user’s visiting webpage information, and how to analyze these data and discover the user’s webpage browsing mode such as the webpages which users’ interested in browsing and the best page composition so as to provide a good decision support for merchants has become increasingly important. In this paper, Apriori algorithm was used to mine the log data of recording use’s accessing information for finding the regular pattern of user’s browsing the webpage. Firstly, this paper made data preprocessing to the log data for extracting one session access record of user. Secondly, the Apriori algorithm was used to mine these record data, considering the feature of these data, the paper made litter improvement for the algorithm at the matching of k-candidate set and the transaction. The experimental results showed that the performance of the improved algorithm in handling a large amount of data has a good improvement. Finally, this paper analysed the rules by excavating, and through these rules, some browsing modes were found, which provided decision supports for merchants.
文章引用: 魏 林 , 刘建毅 , 王 枞 (2013) Apriori算法在发现用户网页浏览模式上的应用。 软件工程与应用， 2， 125-130. doi: 10.12677/SEA.2013.26022
 朱扬勇, 周欣, 施伯乐 (2000) 规则型数据采掘工具集 AMINER. 高技术通讯, 3, 19-22.
 朱靖君, 吴海燕, 高国柱等 (2010) 一种基于日志分析的 Web负我测试方法. 计算机工程, 23, 25-27.
 季成, 李晓东, 袁坚等 (2010) 基于k-means算法的DNS查询模式分析. 清华大学学报: 自然科学版, 4, 601-604.
 杨文兵 (2010) 基于Rough集理论的入侵检测方法研究. 硕士论文, 南昌大学, 南昌.
 许晓东, 李柯, 朱士瑞 (2010) Web使用挖掘中Apriori算法的改进研究. 计算机工程与设计, 3, 539-541.
 李燕, 冯博琴, 鲁晓锋 (2009) Web日志挖掘中的数据预处理技术. 计算机工程, 22, 44-46.
 周爱武, 程博, 李孙长等 (2010) Web日志挖掘中的会话识别方法. 计算机工程与设计, 5, 936-938.
 Hall, M., Frank, E., Holmes, G., et al. (2009) The WEKA data mining software: An update. ACM SIGKDD Explorations Newsletter, 11, 10-18.