EFFICIENT SEARCHING USING PAGE RANK
#1

Presented by:
Amit Kumar Pathak

[attachment=15093]
EFFICIENT SEARCHING USING PAGE RANK
INTRODUCTION
Search Engines have become an integral part of the World Wide Web.
One such search engine proposed by Larry Page and Sergey Brin known as GOOGLE.
Most widely accepted search engine.
Works on its own unique patented algorithm known as Page Rank.
WHAT IS PAGE RANK ??
PageRank is a numeric value.
It represents how important a page is on the web.
PageRank is Google's way of deciding a page's importance.
It isn't the only factor that Google uses to rank pages, but it is an important one.
When one page links to another page, it is effectively casting a vote for the other page.
The more votes that are cast for a page, the more important the page must be.
The importance of the page that is casting the vote also determines how important the vote itself is.
Google calculates a page's importance from the votes cast for it.
HISTORY
PageRank was developed at Stanford University by Larry Page and later Sergey Brin as part of a research project about a new kind of search engine.
Project started in 1995 and led to a functional prototype, named Google, in 1998.
PageRank is based on citation analysis that was developed in the 1950s by Eugene Garfield at the University of Pennsylvania
Google's founders have cited Garfield's work in their original paper.
Web page link analysis first developed by Jon Kleinberg and his team.
PAGE RANK CONCEPT
The occurrence of a search phrase within a document is the ethical ranking technique of most search engines.
For the purpose of better search results the concept of page link popularity was developed.
This also make search engines resistant against automatically generated web pages based upon the analysis of content specific ranking criteria.
Following this concept, the number of inbound links for a document measures its general importance.
Hence, a web page is generally more important, if many other web pages page link to it.
WHY PAGE RANK IS DIFFERENT
Page Rank is not simply based upon the total number of inbound links.
In Page Rank concept, all inbound links are not equal.
The rank of the documents which page link to the main page is also important.
PageRank is based on the linking structure of the whole web.
DESCRIPTION OF PAGE RANK CALCULATION
Page Rank works largely by counting citations or back links to a given page.
Page Rank then extends this idea by not counting links from all pages equally, and by normalizing by the number of links on a page.
We assume page A has pages T1...Tn which point to it (i.e., are citations).
The parameter d is a damping factor which can be set between 0 and 1 (usually 0.85).
C(A) is defined as the number of links going out of page A.
Then Page Rank of page A is given as:
PR(A) = (1-d) + d (PR(T1)/C(T1) +...+PR(Tn)/C(Tn))
DAMPING FACTOR
The Page Rank theory holds that even an imaginary surfer who is randomly clicking on links will eventually stop clicking.
The probability, at any step, that the person will continue is a damping factor d.
This is generally set to 0.85
The damping factor is subtracted from 1.
This term is then added to the product of the damping factor and the sum of the incoming Page Rank scores.
So any page's Page Rank is derived in large part from the Page Ranks of other pages.
The damping factor adjusts the derived value downward.
The sum of all Page Ranks will be one due to probability distribution over the web pages.
The sum of all page ranks converges to the total number of web pages.
Page Rank Algorithm
PAGE RANK ALGORITHM
PR(A) = (1-d) + d (PR(T1)/C(T1) +...+PR(Tn)/C(Tn))
PR(A) is the Page Rank of page A,
PR(Ti) is the Page Rank of pages Ti which page link to page A,
C(Ti) is the number of outbound links on page Ti and
d is a damping factor.
EXAMPLE
We regard a small web consisting of three pages A, B and C.
Page A links to the pages B and C, page B links to page C and page C links to page A.
Here we assume d=0.50 to keep the calculation simple
PR(A) = 0.5 + 0.5 PR© PR(B) = 0.5 + 0.5 (PR(A) / 2) PR© = 0.5 + 0.5 (PR(A) / 2 + PR(B))
On solving these, we get:
PR(A) = 14/13 = 1.07692308 PR(B) = 10/13 = 0.76923077 PR© = 15/13 = 1.15384615
Sum of these=3=number of web pages
AUTHORS FINDINGS
FACTORS AFFECTING PAGE RANK

Visibility of a link
Position of a page link within a document
Distance between web pages
Importance of a linking page
Up-to-dateness of a linking page.
FLAWS IN PAGE RANK
Page Rank can be manipulated. e.g. Google Bombs.
Using “Link Farming”
HOW TO IMPROVE YOUR WEBSITES PAGE RANK
Update your website every day by adding more unique content.
Provide inside linking to your website.
Create sitemap for your website.
Trade your page link with other web owners.
CONCLUSION
Frequent content updates don’t improve Page Rank automatically. Content is not part of the PR calculation.
Sub-directories don’t necessarily have a lower Page Rank than root-directories.
Wikipedia links don’t improve PageRank automatically.
Efficient internal onsite linking has an impact on PageRank.
Links from and to high quality related sites have an impact on Page Rank.
Multiple votes to one page link from the same page cost as much as a single vote.
Reply

Important Note..!

If you are not satisfied with above reply ,..Please

ASK HERE

So that we will collect data for you and will made reply to the request....OR try below "QUICK REPLY" box to add a reply to this page
Popular Searches: mg university btech rank holders, page rank algorithm pdf, academic rank of texas, rank list of mba colleges in india, where does the university of louisville rank, sitemap changefreq, rank of ieee,

[-]
Quick Reply
Message
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Possibly Related Threads...
Thread Author Replies Views Last Post
  An Efficient Algorithm for Mining Frequent Patterns full report project topics 3 4,735 01-10-2016, 10:02 AM
Last Post: Guest
  Efficient Graph-Based Image Segmentation seminar class 2 3,331 02-02-2013, 01:58 PM
Last Post: seminar details
  Efficient Broadcasting in Mobile Ad Hoc Networks project report tiger 11 9,996 20-12-2012, 12:30 PM
Last Post: seminar details
  Energy-Efficient Routing in Mobile Ad Hoc Networks: Mobility-Assisted Case (Java) project topics 1 1,740 12-11-2012, 12:42 PM
Last Post: seminar details
  EFFICIENT ROUTING IN INTERMITTENTLY CONNECTED MOBILE NETWORKS: THE MULTIPLE COPY CASE electronics seminars 6 4,762 23-02-2012, 04:47 PM
Last Post: seminar paper
  EFFICIENT RESOURCE ALLOCATION FOR WIRELESS MULTICAST - MOBILE COMPUTING electronics seminars 5 4,576 15-02-2012, 02:25 PM
Last Post: seminar paper
  NEPHELE: EFFICIENT PARALLEL DATA PROCESSING IN THE CLOUD smart paper boy 4 3,438 30-01-2012, 12:40 PM
Last Post: seminar addict
  BloomCast: Efficient and Effective Full-Text Retrieval in Unstructured P2P Networks Projects9 0 789 23-01-2012, 05:14 PM
Last Post: Projects9
  An Efficient Approach for Mobile Asset Tracking Using Contexts Projects9 0 964 23-01-2012, 05:13 PM
Last Post: Projects9
  Efficient Server Provisioning with Control for End-to-End Response Time Guarantee on Projects9 0 886 23-01-2012, 04:58 PM
Last Post: Projects9

Forum Jump: