Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Removed  





2 External links modified  
2 comments  




3 Minor error in "Damping Factor"  
1 comment  




4 Damping factor for biological data  
3 comments  




5 Examples are Wrong  
2 comments  




6 Variables lack clear names  
1 comment  




7 site:example.com  
1 comment  




8 reference 25 is old  
4 comments  




9 "Supplemental Result" listed at Redirects for discussion  
1 comment  




10 Merge PageRank algorithm in biochemistry  
3 comments  













Talk:PageRank




Page contents not supported in other languages.  









Article
Talk
 

















Read
Edit
Add topic
View history
 








Tools
   


Actions  



Read
Edit
Add topic
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Get shortened URL
Download QR code
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Removed[edit]

Google uses an automated web spider called Googlebot to actually count links and gather other information on web pages.

It is by no means clear that the counting can be said to be done by Googlebot, and it is not intuitively a spidering operation, more likely a feature of the database to which the spidering software stores its flies. Therefore this needs a citation to be in the article. Clearly what parts of the Google infrastructure are called "Googlebot" is up to Google, however if it extends too far, the description needs to be changed. All the best: Rich Farmbrough13:25, 11 August 2014 (UTC).

External links modified[edit]

Hello fellow Wikipedians,

I have just added archive links to one external link on PageRank. Please take a moment to review my edit. You may add {{cbignore}} after the link to keep me from modifying it, if I keep adding bad data, but formatting bugs should be reported instead. Alternatively, you can add {{nobots|deny=InternetArchiveBot}} to keep me off the page altogether, but should be used as a last resort. I made the following changes:

When you have finished reviewing my changes, please set the checked parameter below to trueorfailed to let others know (documentation at {{Sourcecheck}}).

This message was posted before February 2018. After February 2018, "External links modified" talk page sections are no longer generated or monitored by InternetArchiveBot. No special action is required regarding these talk page notices, other than regular verification using the archive tool instructions below. Editors have permission to delete these "External links modified" talk page sections if they want to de-clutter talk pages, but see the RfC before doing mass systematic removals. This message is updated dynamically through the template {{source check}} (last update: 5 June 2024).

Cheers.—cyberbot IITalk to my owner:Online 05:05, 31 March 2016 (UTC)[reply]

Minor error in "Damping Factor"[edit]

In "Damping Factor", after the two formulas, it states, "The difference between them is that the PageRank values in the first formula sum to one, while in the second formula each PageRank is multiplied by N and the sum becomes N." However (unless I'm blind), from the first equation to the second, only the first part has been multiplied by N. (Otherwise you'd have 1 - d + Nd(stuff).) With the second equation given as-is, the sum of page ranks is rather trickier than "N". Given that the equation is already acknowledged to be wrong, it's probably not urgent, but hey. — Preceding unsigned comment added by 98.102.161.228 (talk) 17:37, 22 August 2016 (UTC) The parameter of this code is good to be used in many problems..hoursguru.com[reply]

Damping factor for biological data[edit]

The article mentions 0.31 as the optimal value for d, however nowhere in the cited paper can I directly find 0.31. I can find a reference to an epsilon of 0.3 in Appendix 1.1, but I am not yet convinced this is equivalent to the damping factor d. Can someone clarify? --José Devezas (talk) 08:19, 19 December 2018 (UTC)[reply]

I also wonder whether the statement even makes sense. How can there be a single damping factor for all "biological data"? What application are we talking about? What is being modelled? I will remove the statement. If someone feels it adds value or explanatory power to the article, perhaps they can clarify it. GBMorris (talk) 13:25, 9 March 2020 (UTC)[reply]

I agree with the points raised by both of you and consider the removal of the statement a good decision. BernardoSulzbach (talk) 20:50, 9 March 2020 (UTC)[reply]

Examples are Wrong[edit]

everything links to A which means A is an important page, and A links to C, thus C is important as well.

   M = np.array([
       [1, 1, 1],  # * -> A
       [0, 0, 0], 
       [1, 0, 0]   # A -> C
   ])
  print(pagerank(M, 0.001, 0.85)) 
  array([[0.61536926],  
         [0.28131799],
         [0.10331275]])

B should be last but its second. Am I missing something?

I'm not sure which version of the algorithm this edit was referring to, but I've been working with it and it works; in this case, [1, 1, 1] should be impossible because A cannot link to itself. A links to C would be [0, 0, 1] on the A row if [A, B, C], and A links to both B and C would be [0, 1, 1]. I'm also not sure what the call to `pagerank(..., 0.001, ...)` is meant to do, since `0.001` doesn't make sense as the number of iterations (I'm guessing it would iterate once).
--D.g.lab. (talk) 16:31, 10 April 2021 (UTC)[reply]
I'm also not sure what was meant by the sum of (i, M_i, j) is 1 since half those variable names don't exist in the code. It could be said that the sum of every column in the matrix is 1?
The sum of the result vector, called v in the code, should also be approximately 1, I believe, since it contains percentages.
--D.g.lab. (talk) 16:39, 10 April 2021 (UTC)[reply]

Variables lack clear names[edit]

It's already difficult enough to understand as it is, why not clarify the variable names and add comments, as in any good code? For example:

   def pagerank(matrix, num_iterations: int = 100, damping: float = 0.85):
       size = matrix.shape[1]
       res_vector = [1/size] * size
       M_hat = (damping * matrix + (1 - damping) / size)
       for i in range(num_iterations):
           res_vector = M_hat @ res_vector # matrix multiplication
       return res_vector

Also, notice the lines in the article:

   v = np.random.rand(N, 1)
   v = v / np.linalg.norm(v, 1)

these insert random numbers in the multiplication vector. However, as far as I understand, this vector is supposed to start initially as `1/number_of_pages` (1/size) for each of its elements, which could be re-written

   v = [1/N] * N

or, using the new varible names,

   res_vector = [1/size] * size

According to this video on YouTube: Linear Algebra – Introduction to PageRank (4:21 timestamp).

I'm unsure why this was like this, so I'm not editing the page; however, if someone can confirm, the initial method seems "way too complicated" and there's no explanation for it. --D.g.lab. (talk) 16:31, 10 April 2021 (UTC)[reply]

site:example.com[edit]

😐site:example.com 2A00:F41:4828:36B6:0:4B:5D7E:4801 (talk) 23:52, 24 February 2022 (UTC)[reply]

reference 25 is old[edit]

the article states:'PageRank continues to be the basis of googles....'[25] reference 25 is over 10y old so it does not prove anything. 213.247.64.239 (talk) 14:32, 18 January 2023 (UTC)[reply]

@213.247.64.239
actual text from the article:
'While just one of many factors that determine the ranking of Google search results, PageRank continues to provide the basis for all of Google's web-search tools.' 213.247.64.239 (talk) 14:33, 18 January 2023 (UTC)[reply]
Google doesn't disclose the details of its ranking systems, but they have affirmed that PageRank remains one of the many signals they use. While there might be a more recent reference, it wouldn't be very precise. They don't disclose those kinds of details. While this article could use some clean up, it's not inaccurate in saying that Google still uses the algorithm. Michael Martinez (talk) 15:01, 18 January 2023 (UTC)[reply]
Google recently added new documentation that confirms PageRank is still used in their ranking systems although it has evolved since the original paper (in unspecified ways).Michael Martinez (talk) 13:01, 25 April 2023 (UTC)[reply]

The redirect Supplemental Result has been listed at redirects for discussion to determine whether its use and function meets the redirect guidelines. Readers of this page are welcome to comment on this redirect at Wikipedia:Redirects for discussion/Log/2024 April 25 § Supplemental Result until a consensus is reached. Utopes (talk / cont) 05:12, 25 April 2024 (UTC)[reply]

Most of the content on PageRank algorithm in biochemistry is redundant to material here, including a confusing and incomplete explanation of how PageRank works. It also hasn't had any meaningful updates since 2014. The small amount of material that actually pertains to PageRank's biochem applications should be merged here. Apocheir (talk) 01:56, 23 June 2024 (UTC)[reply]

I'm not sure merging the articles is the right thing to do. PageRank is based on citation analysis. To date no one has suggested merging PageRank into the Citation analysis - Wikipedia article. And I don't think that would be appropriate, either. While I agree there is some redundancy between the two PageRank articles, there are similar redundancies across Wikipedia. Cross-linking them and minimizing the duplication is the better solution in my opinion. Michael Martinez (talk) 03:08, 23 June 2024 (UTC)[reply]
I would recommend just taking this to AFD. InfiniteNexus (talk) 08:45, 26 June 2024 (UTC)[reply]

Retrieved from "https://en.wikipedia.org/w/index.php?title=Talk:PageRank&oldid=1231069962"

Categories: 
B-Class Computing articles
Mid-importance Computing articles
B-Class software articles
Mid-importance software articles
B-Class software articles of Mid-importance
All Software articles
All Computing articles
B-Class Internet articles
High-importance Internet articles
WikiProject Internet articles
B-Class Google articles
High-importance Google articles
Unknown-importance Alphabet articles
Alphabet task force articles
WikiProject Google articles
B-Class Marketing & Advertising articles
Low-importance Marketing & Advertising articles
WikiProject Marketing & Advertising articles
 



This page was last edited on 26 June 2024, at 08:45 (UTC).

Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



Privacy policy

About Wikipedia

Disclaimers

Contact Wikipedia

Code of Conduct

Developers

Statistics

Cookie statement

Mobile view



Wikimedia Foundation
Powered by MediaWiki