下载此文档

基于web日志挖掘的用户访问兴趣分析word论文.docx


文档分类:IT计算机 | 页数:约63页 举报非法文档有奖
1/63
下载提示
  • 1.该资料是网友上传的,本站提供全文预览,预览什么样,下载就什么样。
  • 2.下载该文档所得收入归上传者、原创者。
  • 3.下载的文档,不会出现我们的网址水印。
1/63 下载此文档
文档列表 文档介绍
Abstract
With the rapid development of the technology, the amount of information on the has reached an unprecedented scale. People can get any information they want whether from puter or mobile phone. How to get more useful information quickly and accurately from the massive data and how to explore the potential valuable knowledge and patterns to make the more intelligent so that people can get better experience has e a serious problem in the era. In this context Web data mining emerged as one of the effective ways to solve this problem.
There are three areas in web data mining including web content mining, web structure mining and web log mining. The main background of this thesis is web log mining. Since the web log data is of high-dimensional, massive, semi-structured or unstructured characteristics, traditional data mining algorithms can not meet the performance requirements. So the particle swarm algorithm of the swarm intelligence is applied to the user clustering. Studies have shown that the algorithm has better performance on high-dimensional data than tradition clustering algorithms.
This thesis firstly researches on the basic principles of classic cluster algorithm and Particle Swarm Optimization (PSO) algorithm. And then analyzes pares the advantages and disadvantages between several classic user clustering algorithms and particle swarm clustering algorithm. Secondly, for the problems of existing clustering algorithm such as easy to fall into local optimum result and instability on high-dimensional data, an improved PSO algorithm based on K-means is proposed. By defining the divergence to determine the timing of K-means algorithm operation, the new algorithm makes full use of the local search capability of K-means and the global search capability of PSO to accelerate the convergence speed and also improve the results accuracy. Thirdly, the thesis introduces the concept of fitness variance to make inertia weight in particle swarm algorithm adjust i

基于web日志挖掘的用户访问兴趣分析word论文 来自淘豆网m.daumloan.com转载请标明出处.

相关文档 更多>>
非法内容举报中心
文档信息
  • 页数63
  • 收藏数0 收藏
  • 顶次数0
  • 上传人wz_198613
  • 文件大小2.10 MB
  • 时间2018-02-24
最近更新