|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object pt.tumba.links.HITS
public class HITS
Kleinberg's hypertext-induced topic selection (HITS) algorithm is a very popular and effective algorithm to rank documents based on the link information among a set of documents. The algorithm presumes that a good hub is a document that points to many others, and a good authority is a document that many documents point to. Hubs and authorities exhibit a mutually reinforcing relationship: a better hub points to many good authorities, and a better authority is pointed to by many good hubs. Because the HITS algorithm ranks documents only depending on the in-degree and out-degree of links, it will cause problems in some cases. For example, Improved Algorithms for Topic Distillation in a Hyperlinked Environment presents two problems: mutually reinforcing relationships between hosts and topic drift. Both can be solved or alleviated by adding weights to documents. The first problem can be solved by giving the documents from the same host much less weight, and the second problem can be alleviated by adding weights to edges based on text in the documents or their anchors.
Field Summary | |
---|---|
private java.util.Map |
authorityScores
A Map containing the Authority score for each page |
private WebGraph |
graph
The data structure containing the Web linkage graph |
private java.util.Map |
hubScores
A Map containing the Hub score for each page |
Constructor Summary | |
---|---|
HITS(WebGraph graph)
Constructor for HITS |
Method Summary | |
---|---|
private java.lang.Double |
authorityScore(java.lang.Integer id)
Returns the Authority score value associated with a given link identifyer. |
java.lang.Double |
authorityScore(java.lang.String link)
Returns the Authority score associated with a given link |
void |
computeHITS()
Computes the Hub and Authority scores for all the nodes in the Web Graph. |
void |
computeHITS(int numIterations)
Computes the Hub and Authority scores for all the nodes in the Web Graph. |
private java.lang.Double |
hubScore(java.lang.Integer id)
Returns the Hub score value associated with a given link identifyer. |
java.lang.Double |
hubScore(java.lang.String link)
Returns the Hub score associated with a given link |
void |
initializeAuthorityScore(java.lang.Integer id,
double value)
Initializes Authority score associated with a given link identifyer. |
void |
initializeAuthorityScore(java.lang.String link,
double value)
Initializes the Authority score associated with a given link. |
void |
initializeHubScore(java.lang.Integer id,
double value)
Initializes Hub score associated with a given link identifyer. |
void |
initializeHubScore(java.lang.String link,
double value)
Initializes the Hub score associated with a given link. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
private WebGraph graph
private java.util.Map hubScores
Map
containing the Hub score for each page
private java.util.Map authorityScores
Map
containing the Authority score for each page
Constructor Detail |
---|
public HITS(WebGraph graph)
graph
- The data structure containing the Web linkage graphMethod Detail |
---|
public void computeHITS()
public void computeHITS(int numIterations)
iter
- The maximum number of iterations for the algorithmpublic java.lang.Double hubScore(java.lang.String link)
link
- The url for the link
private java.lang.Double hubScore(java.lang.Integer id)
WebGraph
to
represent the Web graph for efficiency reasons.
link
- An identifyer for the link
WebGraph.IdentifyerToURL()
public void initializeHubScore(java.lang.String link, double value)
link
- The url for the linkvalue
- The Hub score to assignpublic void initializeHubScore(java.lang.Integer id, double value)
WebGraph
to
represent the Web graph for efficiency reasons.
link
- An identifyer for the linkvalue
- The Hub score to assignWebGraph.IdentifyerToURL()
public java.lang.Double authorityScore(java.lang.String link)
link
- The url for the link
private java.lang.Double authorityScore(java.lang.Integer id)
WebGraph
to
represent the Web graph for efficiency reasons.
link
- An identifyer for the link
WebGraph.IdentifyerToURL()
public void initializeAuthorityScore(java.lang.String link, double value)
link
- The url for the linkvalue
- The Authority score to assignpublic void initializeAuthorityScore(java.lang.Integer id, double value)
WebGraph
to
represent the Web graph for efficiency reasons.
link
- An identifyer for the linkvalue
- The Authority score to assignWebGraph.IdentifyerToURL()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |