用户名: 密码: 验证码:
推荐系统的协同过滤算法与应用研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
随着网络技术的应用和普及、电子商务的迅猛发展,越来越多的信息充斥在网络之上。如何在众多的资源中找到适合自己需求的信息,成为众多学者、专家和网络用户关心的核心问题之一。推荐系统在这样的背景下应运而生。
     协同过滤技术是推荐系统(Recommender System)最为核心的技术之一,也是目前应用最为广泛和成功的技术。本文以推荐系统的协同过滤算法为研究目标,旨在解决协同过滤算法在应用中所遇到的稀疏性问题、冷启动问题、用户的信任问题等关键问题。针对推荐系统的协同过滤算法,本文在以下几个方面作了相应的理论研究和应用工作:
     1、对目前推荐系统的总体发展进行了综述,从心理学和认知科学的角度探讨了个性概念的界定,试图从信息科学以外的角度探讨推荐系统构建是否可行及其意义;总结归类了现有的推荐技术,指出其各自的特点、适用范围;在此基础上对推荐系统实现的体系结构和模块进行了概括,为进一步的应用工作提供理论指导;最后对协同过滤算法目前的研究进展进行总结、分类,指出存在的问题,为下一步的研究奠定理论基础。
     2、提出了稀疏矩阵下的一种基于最近邻居评价的改进的协同过滤算法。首先通过余弦相似度计算公式,为目标用户选取最近邻居集,然后根据每个最近邻居对项目的评价与用户间的相似度,产生一个对项目的预测值,所有最近邻用户对项目的加权预测值就构成了一个虚拟的最近邻评价矩阵。由于这个矩阵体现了与目标评价在空间上的相似性,从而将目标评价的预测问题转化到一个最近邻评价矩阵上进行预测计算。和历史评价矩阵相比,虚拟最近邻评价矩阵不但规模比较小,而且包含了用户维上和项目维上对目标评价最有价值的信息。最后再根据虚拟的最近邻居评价矩阵,进行加权均值预测。实验表明,本文提出的针对稀疏矩阵的改进算法,在精度上优于传统算法,尤其在最近邻居个数较少的情况下,精度有较大提高。
     3、提出基于项目关键词预测与协同过滤相结合的混合推荐算法。分析了在系统中项目的内容信息不够丰富的情况下,如何应用基于项目关键词预测与协同过滤技术相结合的问题。首先把项目的关键词进行二进制代码表示,以达到对项目内容进行形式化描述的目的,然后通过winnow算法,对用户的评价进行初步预测,得到用户的预测矩阵;为了确保用户评价矩阵在空间上同目标评价的相似性,构造了两个约束参数——用户评价密度α_i和预测精度β_i,通过参数的设定和遴选,在满足约束条件的评价数据集上应用协同过滤算法。实验表明,加入约束条件的混合推荐算法远优于传统的协同过滤算法以及没有任何约束条件的混合推荐算法。
     4、提出了把信任引入协同过滤推荐系统的构想,构建了一种基于信任的协同过滤推荐算法,在对信任进行了形式化的定义和描述的基础上,构造了协同过滤推荐系统中的两种信任模型——局部信任(local trust)和全局信任(global trust),并分别指出两种信任的区别,确定了各自的影响因素,通过这两种可计算的信任模型可以对系统中的用户的信任程度进行不同范围内的度量;进而,提出了一种基于信任因子的协同过滤推荐算法,并通过实验验证了算法的有效性和优越性。最后的实验同样分析了两种信任的分布特性,通过与相似度的分布的对比,可以得出结论:在推荐系统中对用户信任的研究是有意义的,信任是同相似度不同的对最后的推荐产生影响的重要因素之一。
     5、构建了基于事例推理(CBR)的推荐系统框架模型。对基于事例的推理和协同过滤的推荐过程进行了比较,指出异同,进而把基于事例的推理的过程与推荐系统相结合,应用基于事例的推理过程更好地更新用户的档案,跟踪用户的兴趣变化,提高系统的学习能力。在框架模型中,本文对推荐系统的各部分作了充分的总结和说明,为今后的进一步理论和实践研究奠定基础,同时实现了一个以电影推荐为例的电影推荐系统。
     通过上述的研究工作,从一定程度上解决了推荐系统的协同过滤算法所遇到的稀疏性问题、冷启动问题、信任问题,从而从一定程度上推动协同过滤算法的理论研究和应用研究的进展。
With the fast development of Internet and applications of E-Commerce, more and more information swirles in the net. To get the right information from the information sea has become one of the key issues nowadays for the researchers, experts and the Internet users. Personalized Recommender Systems emerge under the background of this, which becomes the research focus in the domestic and overseas.
     Collaborative filtering algorithm is the most key technology in the personalized recommender systems, which has got the most success and wild applications. This dissertation takes collaborative filtering algorithm of personalized recommender system as the research project to deal with the sparsity problem, cold start problem and trust problem, etc. Research work are taken as following:
     (1) Review the research development of personalized recommender system and discuss the concept of personal from the angle of recognization and phsycology to give some advice for the configuration of personalized recommender systems; give a general analysis of recommender technologies and indicate their individual characters and application fields; analyze the system constructure and modules to give some instructions for applications; at last, give a division of the collaborative filtering and indicate their challenges for research work.
     (2) Put forward an improved collaborative filtering algorithms based on nearest neighbor rating matrix for the sparsity problem of collaborative filtering algorithm. First, get the nearest neighbor rating matrix through Cosine similarity calculation metrix and produce a prediction for items according to similarity between users and neighbors' ratings. Then all nearest neighbors' prediction ratings forms a virtual nearest neighbor rating matrix. This matrix takes on a similarity with the active user rating and the problem of prediction for the active user could be transfer to the matrix. Compared with history rating matrix, virtual nearest neighbors' rating matrix has small scale and contains the most useful information. At last, a prediction based on weights of similarity between users and ratings is produced. Experiments improved the improved algorithm this paper advanced and especially when the rating matrix is very sparse, prediction accuracy is much better.
     (3) Advanced a hybrid recommender algorithm based on the combination of collaborative filtering and item keywords based prediction. Analyze the combination problem of item keywords based prediction and collaborative filtering when item keywords information is not adequate. Through the content abstraction of the items, items in the recommender system are represented by 0 and 1. After this users' prediction for the items are got by using winnow algorithm. To guarantee the accuracy of the prediction matrix, two constraint parametersα_i (which indicate the rating number of each user) andβ_i (number of the prediction which is accurate ) are used to filter the prediction. Only those users' predictionwhich is accurate could be into next step filtering——collaborative filtering. Experimentimproved that the hybrid recommender algorithm with the constraint parameter is much superior to the traditional collaborative filtering algorithm and the algorithm without the constraint parameters. The hybrid algorithm this dissertation put forward improved the cold start problem from certain extent.
     (4) Put forward an assumption that introduce social trust into recommender systems andconstruct a collaborative filtering algorithm based on trust. Configuate two trust models——global trust and local trust under the foundation of formatlate description of the trust and then indicate their effective parameters. Through this two calculatable trust model, user trust could be measured and a collaborative filtering algorithm based on trust is then advanced. The expeiriment proved the superiority of the algorithm. In the experiment the distribution of the trust of two trust model and the distribution of similarity are also be demonstrated which could be safely concluded that trust is a parameter that affect the final recommendation which is very different between the similarity. So the assumption advanced at first is very meaningful.
     (5) Construct a general model of the personalized recommender system based on CBR. Comparing the difference between CBR and CF, the similarity and difference is generalized. Then CBR and CF are combined together which using CBR to improve the learning ability of the personalized recommender system. A personalized film recommender prototype system is also developed at last which could prove the models and algorithms this dissertation put forward.
引文
[1]John Riedl,Joseph Konstan.Word of Mouse.Warner Books Publishing House.200(?).
    [2]Goldberg,D.,Nichols,D.,Oki,B.M.,& Terry,D."Using collaborative(?)ring to weave an information tapestry",Communications of the ACM,Vol.35,No.12,pp61-70,1992.
    [3]Breese J.,Herckman D.,Kadie C.Empirical analysis of predictive algorithms for collaborative filtering,In proceedings of Fourteenth Conference on Uncertainty AI,1998.
    [4]John Riedl,Joseph Konstan.Word of Mouth-The Marketing Power of Collaborative Filtering.Warner Boooks,2002.
    [5]郝先成,赵德干,尹国成.用于电子商务中的数据挖掘技术研究.小型微型计算机系统,2007,(9):pp.785-788.
    [6]刑春晓,高凤荣.适应用户兴趣变化的个性化推荐算法.计算机研究与发展,2006,44(2):pp.296-301.
    [7]孙守义,王蔚,(?)种基于用户聚类的协同过滤个性化图书推荐系统.现代情报,2007(11):pp.139-142,2007.
    [8]陈静,蔡鸿明,徐博艺.网站内容管理及个性化网页系统的研究与实现.计算机应用与软件,2007,24(9):pp.14-16.
    [9]邢东山,沈钧毅.基于Web日志的因特网协作推荐系统的研究,西安交通大学学报,2002(36):2,1271-1274.
    [10]Lorraine Mc Ginty and Barry Smyth,Evaluating Preference-Based Feedb(?)ck in Recommender Systems,AICS 2002,LNAI 2464,pp.209-214,2002.
    [11]李欣璐,刘鲁.基于协同过滤的银行产品推荐系统建模.计算机与数字工程,2007,35(9):pp.6-11.
    [12]余力,刘鲁,罗掌华.我国电子商务推荐策略的比较分析,系统工程理论与实践,2004(8),pp.96-101.
    [13]Badrul Sarwar,George Karypis,Joseph Konstan,John Ried,Item-based C(?)llaborative Filtering Recommendation Algorithms,www.cs.umn.edu/Research/GroupLens/papers/pd(?)www10_sarwar.pdf
    [14]Shardanand,U.,Maes P.Social Information Filtering:Algorithms for automating "Word of Mouth",In proceedings of CHI ' 95,pp.210-217.
    [15]Soohong Daniel Park,Cheolju Hwang,Glenn A.Adams,Video Recommendation Based on User Preference on the Web,http://www.w3.org/2007/08/video/positions/SAMSUNG.ndf.
    [16]Ken Goldberg,Tavi Nathosan,Ephrat Bitton.Jester 4.0:Jokes for your sense of hummer,http://eigentaste.berkeley.edu/user/index.php
    [17]Terveen L.,Hill W.,Amento B.,McDonald,PHOAKS:A System for Sharing Recommendations,Communications of ACM,Vol.40(3),pp.59-62,1997.
    [18]余力.电子商务个性化--理论、方法与应用,清华大学出版社,2007年1月。
    [19]刘平峰,聂规划,陈冬林.电子商务推荐系统研究综述.情报杂志,2007,(9):pp.45-49.
    [20]郝先成,赵德干,尹国成.用于电子商务中的数据挖掘技术研究[J].小型微型计算机系统,2007,(9):pp.785-788.
    [21]赵超超.基于用户与基于项目结合的个性化推荐算法.内蒙古农业大学学报(社会科学版),2007,(6):pp.139-141。
    [22]马丽.基于群体兴趣偏向度的数字图书馆协同过滤技术研究.现代图书情报技术,2007,(9):pp.19-22.
    [23]Badrul Munir Sarwar.Sparsity,Scalability and Distribution in Recommender Systems.Ph.D dissertation,University of Minisota,2001.
    [24]余力,董斯维,郭斌.电子商务推荐攻击研究.计算机科学,2007,34(5):pp.134-138.
    [25]Cheng Yang,Usama Fayyad t,Paul S.Bradley.Efficient Discovery of Error-Tolerant Frequent Itemsets in High Dimensions,KDD 2001,pp.194-203,2001.
    [26]郑雪.人格心理学,广东高等教育出版社,2004.
    [27]孔繁胜.知识库系统原理.浙江大学出版社,2000.
    [28]何钦铭,王中康.机器学习与知识获取,器械工业出版社,2000.
    [29]N.J.Nilsson 译.人工智能,机械工业出版社,2000.
    [30]蔡智兴,徐光祐.人工智能及其应用(研究生用书).清华大学出版社,2004.
    [31]石纯一.人工智能原理.清华大学出版社,1993.
    [32]史忠植.高级人工智能,科学出版社,1998.
    [33]边肇祺.模式识别.清华大学出版社,2000.
    [34]傅京孙.模式识别及其应用,科学出版社,1983.
    [35]郑南京.计算机视觉与模式识别,国防工业出版社,1998
    [36]Resnick,P.,Iacovou,N.,Sucack,M.,Bergstrom,P.,and Riedl,J.Grouplens:A open architechture for collaborative filtering of NetNews.In proceedings of CSCW' 94,Chaple Hill,NC,1994
    [37]Konstan,J.,Miller B.,Maltz D.,Herlocker J.,Gordon L.,Riedl J.Grouplens:Applying Collaborative filtering to Usenet News,Communications of the ACM,Vol.40(3),pp77-87,1996.
    [38]Miller B.,Riedl J.,Konstan J.Experiences with Grouplens:Making Usenet Useful again,in proceedings of the Usenix Technical Conference,pp.356-362,2001.
    [39]Zan Huang,Wingyan Chung,Thian-Huat Ong,Hsinchun Chen.A Graph-based Recommender System for Digital Library,JCDL'02,2002,
    [40]Byungyeon Hwangl,Euichan Kiml,and Bogiu Lee.An Efficient Intelligent Agent System for Automated Recommendation in Electronic Commerce,ISMIS 2002,pp.403-411,2002.
    [41]JonathanL.Herlocker,Joseph A.Konstan,Loren G.Terveen,John T.Riedl.Evaluatin Collaborative Filtering recommender systems,ACM Transactions on information systems,Vil.22,No.1,2004,pp.5-53.
    [42]Agrawal R.,Imielinski T.,Swami A.Mining Associations Between Sets of Items in Large Databases,Proc of the ACM SIGMOD International Conference on Management of Data,pp.207-216,1993.
    [43]李勇,徐振宁.Internet个性化信息服务研究综述,计算机工程与应用,2002(38) 19,pp.183-187.
    [44]Henry Liebenman,Letizia,An AgentThatAssists Web Browsing,International Joint Conference on Artificial Intelligence,pp.189-195,1995.
    [45]Mooney R.J.,Roy L.,Content-based Book Recommending Using Learning for Text Catgorization,Proceedings of the Fifth ACM Conference on Digital Libraries,pp.195-204,2004.
    [46]Steve Lawrence,Lee Giles,Kurt Bollacker.Citeseer.http://citeseer.ist.psu.edu/cs.
    [47]David Goldberg,David Nicols.Using collaborative filtering to weave an information Tapstry,Communications of the ACM,vol(35) 12,pp.61-70,1992.
    [48]Grouplens Research,Movielens:film recommendations,http://movielens.umn.edu
    [49]Tong Zhang,Vijay S.lyengar.Recommender systems using linear classifiers,Journal of Machine Learning Research,vol.(2),pp.314-334.
    [50]Pryor,M.,The effects of singular value decomposition on collaborative fitering,Dartmouth College CS,PCS-TR,pp.395-405,1998.
    [51]Lu,H.,Lu,Z.,and Li,Y.,″EVS:Enhanced Vector Similarity for Collaborative Information recommendation based on SVD,″ In International ICSC-NAISOCongress on Computaional Intelligence:Methods And Applications(CIMA 2001),2001.
    [52]张娜,何建民.基于项目与客户聚类的协同过滤推荐方法.合肥工业大学学报(自然科学版),2007,30(9):pp.1159-1162.
    [53]王建东,叶飞跃,冯新宇,张有东.一种基于用户聚类的协同过滤推荐算法.系统工程与电子技术,2007,29(7):pp.1178-1182.
    [54]辛治运,马兆丰,顾明.服务于定向信息推荐的模糊聚类协同推荐算法.计算机科学,2007,(9).
    [55]王惠敏,聂规划.基于模糊聚类和资源平滑的协同过滤推荐.情报杂志,2007,(7):pp.1-3.
    [56]Andreas Nürnberger,Marcin Detyniecki.Weighted Self-Organizing Maps:Incorporating User Feedback,ICANN/ICONIP 2003,pp.883-890,2003.
    [57]Myung Won Kim,Eun Ju Kim,and Joung Woo Ryu.A Collaborative Recommendation Based on Neural Networks,DASFAA 2004,pp.425-430,2004.
    [58]Su-Jeong Ko and Jiawei Han.Mining Typical Preferences of Collaborative User Groups,ER 2003,pp.419-432,2003.
    [59]李涛,王建东.基于多层相似性用户聚类的推荐算法.南京航空航天大学学报,2006,(6).
    [60]孙多.基于兴趣度的聚类协同过滤推荐系统的设计.安徽大学学报(自然科学版),2007,31(5):pp.19-22.
    [61]杨风召,白慧.电子商务推荐系统的算法与模型分析.情报杂志,vol34(9):pp.128-131,2007年12月.
    [62]Tae Hyup Roh,Kyong Joo Oh,Ingoo Han.The collaborative filtering recommendation based on SOM cluster indexing CBR,Expert Systems with Applications,Vol.25,pp413-423,2003.
    [63]Choonbo Kim,Jumtae Kim,A Recommendation Algorithm Using Multi-Level Association Rules,Proceedings of the IEEE/WIC International Conference on Web intelligence,2003
    [64]Oyanagi,S.,Kubot,K.,Application of Matrix Clustering to Web Log Analysiys and Access Prediction,Proceedings of the WebKDD Workshop,pp.13-21,San Francisco,CA,2005.
    [65]马书刚,郭娜,崔忠强.新客户在商务站点中的个性化推荐.微计算机应用,2007,28(9):pp.935-938.
    [66]孙小华,协同过滤推荐系统的稀疏性与冷启动问题研究,博士论文,浙江大学.
    [67]Blabanovic M.,Shohalm Y..Fab:Content based collaborative recommendation,Communication of the ACM,40(3):pp.66-72,1997.
    [68]麦永浩.基于内容预测和项目评分的协同过滤推荐,计算机应用,24(1):111-113,2004
    [69]Byeong Man Kim,QingLi,Jong-Wan Kim,A New collaborative recommender system addressing three problems,Proceedings of PRICAI,pp.495-504,2004.
    [70]Paolo Massa,Bobby Bhattacharjee.Using trust in recommender systems:an experimental analysis.Proceedings of 2~(nd) International Conference on Trust Managment,Oxford,England,2004.
    [71]Billsus,D.and Pazzani,M.,Learning Collaborative Information Filters,In MachineLearning:Proceedings of the Fifteenth International Conference,Morgan Kaufmann Publishers,San Francisco,CA,1998,pp.46-54.
    [72]Paolo Masse,Paolo Avesani.Trust-aware collaborative filtering for recommender systems.To Appear in:Proceedings of International Conference on Cooperative Information Systems,Agia Napa,Cyprus,25 Oct - 29Oct 2004.
    [73]John O'Donovan,Barry Smyth.Trust in recommender systems,Proceedings of IUI'05,pp.167-174,2005.
    [74]Beth T,Borcherding M,Klein B.Valuation of trust in open network.Gollmann D,ed.Proceedings of the European Symposium on Research in Security(ESORICS).Brighton:Springer-Verlag,1994.3-18.
    [75]Miquel Montaner,Beatriz Lopez,Josep Liuis de la Rosa.Developing trust in recommender agents.In Proceedings of the first international joint conference on Autonomous agents and multiagent systems,pages 304- 305.ACM Press,2002.
    [76]Jφsang A,Knapskog SJ.A metric for trusted systems.Global IT Security.Wien:Austrian Computer Society,1998.541-549.
    [77]Gambetta D.Can we trust trust? Gambetta D,ed.Trust:Making and Breaking Cooperative Relations.Basil Blackweil:Oxford Press,pp.213-237,1990..
    [78]D.H.McKnight & N.L.Chervany.(1996).The Meaning of Trust.Technical Report MISRC Working Paper Series 96-04,University of Minnesota.Management Information Systems Research Center.
    [79]贾佳,曲向丽,杨文婧.信任管理相关技术研究,通讯和计算机,vol(3):23,pp.37-42.
    [80]A.Jφsang.(1999).An Algebra for Assessing Trust inCertification Chains.In:J.Kochmar,ed.Proceedings ofthe Network and Distributed Systems Security Symposium(NDSS'99).The Internet Society.
    [81]C.Castelfranchi,R.Falcone.Principles of Trust for MAS:Cognitive Anatomy,Social Importance,and Quantification.Proceedings of the third International Conference on Multi-Agent Systems,1998.
    [82]T.Grandison,M.Sloman A Survey of Trust inInternet Applications.IEEE Communications Survey and Tutorials,3,2000.
    [83]Jonathan L.Herlocker,Joseph A.Konstan,Loren G.Terven John Riedl.Evaluating collaborative filtering recommender systems.ACM Transactions on Information Systems,vol(22):1,pp.5-53.
    [84]Ekkawut Rojsattarat,Nuanwan Soonthornphisaj.Hybrid Recommendation:Combining Content-Based Prediction and Collaborative Filtering.IDEAL,Springer-Verlag Berlin Heidelberg pp.337.344,2003.
    [85]Robin Burke.A Case-Based Reasoning Approach to Collaborative Filtering,EWCBR,Springer-Verlag Berlin Heidelberg pp.370-379,2000.
    [86]Sonny Han Seng Chee,Jiawei Han,Ke Wang.Rec Tree:An Efficient Collaborative Filtering Method,Da WaK 2001,pp.141-151,2001.
    [87]郭艳红,邓贵仕.基于事例的推理(CBR)研究综述.计算机工程与应用,2004(21),pp.1-5.
    [88]柳炳祥.盛昭翰.基于案例推理的企业危机预警系统设计.中国软科学,2003(3):67-70.
    [89]黎铭,薛晓冰.基于多示例学习web目录页面推荐,软件学报,vol(15):9,pp.1328-1335,2004.
    [90]Ramon Lopez de Mantaras.Case-based Reasoning.Springer-Verlag Heidelberg,2001.
    [91]Hinrichs T.R.and Koiodner J.The roles of adaptation in case-based design.Proceedings Case-Based Reasoning Workshop,Washington,121- 132,1991.
    [92]Watoson.l.Case-based reasoning is a methodology not a technology.Knowledge-based System.Volume:12,Issue:5-6,October,1999,pp.303-308.
    [93]Aamodt A.and Plaza E.(1994).Case-based reasoning:foundational issues,methodological variations and system approaches.AI Communications 7(1),39-59.
    [94]Ko lodner J L.Improving human decision making th rough cased based reasoning techniques[J].AIM agazine,1991,12(3):52259.
    [95]Abdus Salam Khan,Achim Hoffmann.Acquiring Adaptation Knowledge for CBR with MIKAS.Springer-Verlag Heidelberg,2001.
    [96]Guo Yanhong,Deng Guishi,Using CBR to Improve Collaborative filtering recommendersystem in E-Commerce,International conference on Service Systems and Service Management,Beijing,pp.700-704,2004.
    [97]Guo Yanhong,Deng Guishi.Using CBR and Social Trust to Improve the Performance of Recommender System in E-Commerce.The second IEEE International Conference on Innovative Computing,Information and Control,Japan,2007.
    [98]Cain T.,Pazzani M.J.and Silverstein G.Using domain knowledge to influence similarity judgment.Proceedings Case-Based Reasoning Workshop,Washington Morgan Kaufmann,191-202,1991.
    [99]Stanfili C.Memory-based reasoning applied to English pronunciation.Proceedings AAAI-87,Seattle,577-581,1987.
    [100]Surma J.Enhancing similarity measures with domain specific knowledge.Preprints Second European Workshop on Case-Based Reasoning,AcknoSoft Press,365-372,1994.
    [101]吴吉义,林志洁,龚祥国.基于协同过滤的移动电子商务推荐系统若干研究.电子技术应用,2007,(1):pp.5-8.
    [102]Peng Han,Bo Xie,Fan Yang,Ruimin Shen.A scalable P2P recommender system based on distributed collaborative filtering,Expert Systems with Applications Vol.27,pp203-210,2004.
    [103]Jonathan L.nerlocker,Joseph A.Konstan,John Riedl.Explaining Collaborative Filtering Recommendations.www.citeseer.ist.psu.edu/herlocker00explaining.html

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700