Tugas5 Komputer&masyarakat
PageRanking WordNet Synsets: An Application to
Opinion Minin
Penulis :andrea Esuli and Fabrizio Sebastiani
Several works have recently tackled
the automateddetermination of term polarity.
Hatzivassiloglou andMcKeown (1997)
determine the polarity of adjec-tives by mining pairs of conjoined adjectives
fromtext, and observing that conjunctions such as andtend to conjoin adjectives
of the same polarity whileconjunctions such as but tend to conjoin adjectivesof
opposite polarity.
Turney and Littman (2003)
de-termine the polarity of generic terms by computingthe pointwise mutual
information (PMI) between thetarget term and each of a set of “seed” terms
ofknown positivity or negativity, where the marginaland joint probabilities
needed for PMI computationare equated to the fractions of documents from agiven
corpus that contain the terms, individually orjointly. Kamps et al.
(2004) determine the polarityof
adjectives by checking whether the target adjec-tive is closer to the term good
or to the term badin the graph induced on WordNet by the synonymyrelation.
Kim and Hovy (2004) determine the po-larity of
generic terms by means of two alternativelearning-free methods that use two
sets of seed termsof known positivity and negativity, and are basedon the
frequency with which synonyms of the targetterm also appear in the respective
seed sets. Amongthese works, (Turney and Littman, 2003) has provenby far the
most effective, but it is also by far the mostcomputationally intensive
The Mathematics Behind Google’s PageRank
Penulis: Ilse Ipsen
URL : https://ipsen.math.ncsu.edu/ps/slides_man.pdf
Google Matrix
Convex combination G = α S + (1 − α ) 1 1 v T | {z }
rank 1 • Stochastic matrix S • Damping factor 0 ≤ α < 1 e.g. α = .85 •
Column vector of all ones 1 1 • Personalization vector v ≥ 0 k v k 1 = 1 Models
teleportation
Properties of Matrix S
• Row i of S: Links from page i to other pages •
Column i of S: Links into page i • S is a stochastic matrix: All elements in [0
, 1] Elements in each row sum to 1 • Dominant left eigenvector: ω T S = ω T ω ≥
0 k ω k 1 = 1 • ω i is probability of visiting page i • But: ω not unique
Google PageRank
PENULIS:???
URL : https://www.mathworks.com/content/dam/mathworks/mathworks-dot-com/moler/exm/chapters/pagerank.pdf
Further Reading Further reading on matrix
computation includes books by Demmel [?], Golub and Van Loan [?], Stewart [?,
?], and Trefethen and Bau [?]. The definitive references on Fortran matrix
computation software are the LAPACK Users’ Guide and Web site [?]. The Matlab
sparse matrix data structure and operations are described in [?]. Information
available on Web sites about PageRank includes a brief explanation at Google
[?], a technical report by Page, Brin, and colleagues [?], and a comprehensive
survey by Langville and Meyer [?]. Recap %% Page Rank Chapter Recap % This is
an executable program that illustrates the statements % introduced in the Page
Rank Chapter of "Experiments in MATLAB". % You can access it with % %
pagerank_recap % edit pagerank_recap % publish pagerank_recap % % Related EXM
programs % % surfer % pagerank %% Sparse matrices 10 Chapter 7. Google PageRank
n = 6 i = [2 6 3 4 4 5 6 1 1] j = [1 1 2 2 3 3 3 4 6] G = sparse(i,j,1,n,n)
spy(G) %% Page Rank p = 0.85; delta = (1-p)/n; c = sum(G,1); k = find(c~=0); D
= sparse(k,k,1./c(k),n,n); e = ones(n,1);j I = speye(n,n); x = (I - p*G*D)\e; x
= x/sum(x) %% Conventional power method z = ((1-p)*(c~=0) + (c==0))/n; A =
p*G*D + e*z; x = e/n; oldx = zeros(n,1); while norm(x - oldx) > .01 oldx =
x; x = A*x; end x = x/sum(x) %% Sparse power method G = p*G*D; x = e/n; oldx =
zeros(n,1); while norm(x - oldx) > .01 oldx = x; x = G*x + e*(z*x); end x =
x/sum(x) %% Inverse iteration x = (I - A)\e; x = x/sum(x) %% Bar graph bar(x)
title(’Page Rank’)
A Survey and Comparative Study of Different PageRank
Algorithms
Penulis : Tahseen A. Jilani University of Karachi Karachi,
Pakistan
Ubaida Fatima
NED University Karachi, Pakistan
Mirza Mahmood Baig NED University Karachi, Pakistan
Saba Mahmood IoBM Karachi, Pakistan
url : https://www.researchgate.net/publication/281170066_A_Survey_and_Comparative_Study_of_Different_PageRank_Algorithms
In contemporary
computer generation, internet has become
indispensable in our lives and cognizance is only a
click away.
We just open our desired search engines, like Google, Yahoo,
Bing, and the search engine will show the webpages
appropriate
for our search.
Google’s humongous
triumph as a search engine
can be
ascribing to numerous elements, including its naiveness,
acceleration and ease of employ. Conversely, the
most eminent
grounds for their triumph
is
due to their search
algorithm;
contrast
to other search
engines; Google provides
the most
significant ensue first.
Google as a search engine inevitably to be able to
execute two
errands. First it requires to acquire and retain all
of the webpages
it is able to; this attain by crawling the web ad
indexing the data
that it encounters. Second, it requires being able
to figure out the
order of pages resumed
by any search
survey. This is
accomplished through
Google’s PageRank algorithm as
proposed
by Wills[1], which
assesses each webpage
and its
status relative to other webpages. “Significance”, as defined by
PageRank algorithm, is subject on the number of other pages,
linking to a webpage. This is a prejudiced factor
which includes
the importance of
other pages, meaning that a high importance
webpage will impart more importance to a linked
page than an
irrelevant page
The anatomy of a large-scale hypertextual Web search
engine
Penulis : omputer Networks and ISDN Systems 30 (
1998) 107- 117
URL : http://www.cse.fau.edu/~xqzhu/courses/cap6777/google.search.engine.pdf
The Web creates new challenges for information
retrieval. The amount of information on the Web is growing rapidly, as well as
the number of new users inexperienced in the art of Web research. People are ’
Corresponding author. ’ There are two versions of this paper - a longer full
version and a shorter printed version. The full version is available on the Web
and the conference CD-ROM. ’ E-mail: (sergey, page] @cs.stanford.edu likely to
surf the Web using its link graph, often starting with high quality human
maintained indices such as Yahoo! 3 or with search engines. Human maintained
lists cover popular topics effectively but are subjective, expensive to build
and maintain, slow to improve, and cannot cover all esoteric topics. Automated
search engines that rely on keyword matching usually return too many low
quality matches. To make matters worse, some advertisers attempt to gain
people’s attention by taking measures meant to mislead ’ http://www.yahoo.com
0169-7552/9X/$19.00 0 1998 Published by Elsevier Science B.V. All rights
reservedautomated search engines. We have built a large-scale search engine
which addresses many of the problems of existing systems. It makes especially
heavy use of the additional structure present in hypertext to provide much
higher quality search results. We chose our system name, Google, because it is
a common spelling of googol, or 10100 and fits well with our goal of building
very large-scale search engines.
Comments
Post a Comment