UT Repository, The University of Tokyo
 


Please use this identifier to cite or link to this item: http://hdl.handle.net/2261/28816

Title: A Study of Methods for Extracting the Boundaries of Web Spam
Other Titles: ウェブスパムの境界抽出手法に関する研究
Authors: Chung, Young joo
Authors (other language): 鄭, 容朱
Keywords: Web spam
Link analysis
Link hijacking
Information retrieval
Issue Date: Mar-2008
Abstract: As search result ranking becomes increasingly important for attracting visitors and generating profit, more and more people try to mislead search engines in order to obtain higher rankings. Because link-based ranking algorithms are important tools for current search engines, web spammers put significant effort into manipulating the link structure of the Web, a practice known as link spamming. Link hijacking is one link spamming technique: by hijacking links from normal sites to target spam sites, spammers make search engines believe that normal sites endorse spam sites. In this research, we propose link analysis techniques for detecting link-hijacked sites using modified PageRank algorithms. We tested our methods on a large-scale Japanese web archive and evaluated their accuracy.
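Description: Report number: ; Date of degree conferral: 2008-03-24 ; Degree level: Master's ; Degree type: Master of Information Science and Technology ; Diploma number: ; Graduate school and department: Graduate School of Information Science and Technology, Department of Information and Communication Engineering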
URI: http://hdl.handle.net/2261/28816
Appears in Collections: 1244025 Master's Theses (Department of Information and Communication Engineering)
025 Master's Theses

Files in This Item:

File: 48066450.pdf ; Size: 2.73 MB ; Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 
