
Meidogger:ReyBrujo/Dumps/20070405

From Wikipedia

April 05, 2007

Articles with 5 or more external links as of April 05, 2007. Only articles in the main namespace that are not redirects are considered.

External links  Article ID  Article
11              8192        Frankfurt
10              9425        Earste Wrâldkriich
10              2339        Frysk op kompjûters
10              5783        Man
9               2500        George Orwell
7               9298        Publius Cornelius Tacitus
6               11034       Eastfrysk (Nederdútsk)
6               10029       Brugge
6               9289        Dútsk
5               7212        Ubuntu Linux
5               2674        Leonard Cohen
5               2765        Friezen
5               9791        Ryksuniversiteit Leien
5               2563        Fryske beweging
5               10819       ISO 639
5               10162       Meidogger/skipper/nij haadside
5               8540        Drafsport
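-- Count external links per main-namespace article, skipping redirects.
-- The query returns every article; the list above shows only totals of 5 or more.
SELECT COUNT(el_from) AS total, el_from, page_title
FROM externallinks
JOIN page ON externallinks.el_from = page.page_id
WHERE page_is_redirect = 0 AND page_namespace = 0
GROUP BY el_from, page_title
ORDER BY total DESC;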
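
The per-article count can be tried out without a MediaWiki database. The following is a minimal sketch, assuming Python's built-in sqlite3 module and two toy tables that model only the columns the query touches. All rows, the `links_per_article` helper and its `minimum` threshold are made up for illustration and are not part of the original dump.

```python
import sqlite3

# Toy stand-ins for the MediaWiki `page` and `externallinks` tables.
# Only the columns used by the query are modelled; all rows are invented.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_namespace INTEGER,
                   page_title TEXT, page_is_redirect INTEGER);
CREATE TABLE externallinks (el_from INTEGER, el_to TEXT);
""")
conn.executemany("INSERT INTO page VALUES (?, ?, ?, ?)", [
    (1, 0, "Alpha", 0),   # main-space article with three links
    (2, 0, "Beta", 0),    # main-space article with one link
    (3, 0, "Gamma", 1),   # redirect, must be skipped
    (4, 2, "Delta", 0),   # page outside the main namespace, must be skipped
])
links = [(1, f"http://example.org/a{i}") for i in range(3)]
links += [(2, "http://example.org/b")]
links += [(3, "http://example.org/c"), (4, "http://example.org/d")]
conn.executemany("INSERT INTO externallinks VALUES (?, ?)", links)


def links_per_article(conn, minimum=1):
    """Return (total, page_id, title) rows, most-linked article first.

    `minimum` is a hypothetical threshold added for this sketch.
    """
    return conn.execute("""
        SELECT COUNT(*) AS total, el_from, page_title
        FROM externallinks
        JOIN page ON page_id = el_from
        WHERE page_is_redirect = 0 AND page_namespace = 0
        GROUP BY el_from, page_title
        HAVING COUNT(*) >= ?
        ORDER BY total DESC
    """, (minimum,)).fetchall()


rows = links_per_article(conn)
```

Calling `links_per_article(conn, minimum=5)` on real data would apply the same cut-off as the listing, assuming the threshold was applied after the query rather than inside it.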

Sites linked 5 or more times as of April 05, 2007. Only articles in the main namespace are considered.

Link count  Site
780         http://fy.wikipedia.org
31          http://www.roman-emperors.org
13          http://www.tresoar.nl
10          http://www.leonardcohen.com
9           http://www.uitinlimburg.nl
9           http://orwell.ru
8           http://home.planet.nl
8           http://www.nsesoftware.nl
7           http://wikisource.org
6           http://www.elfelf.nl
6           http://home.hetnet.nl
6           http://www.stinseninfriesland.nl
6           http://www.soortenbank.nl
5           http://www2.pbf.nl
5           http://www.nobel.se
5           http://incubator.wikimedia.org
5           http://www.waarneming.nl
5           http://www.wapedia.org
5           http://www12.statcan.ca
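-- Count main-namespace links per site.
-- SUBSTRING_INDEX(el_to, '/', 3) keeps the text before the third slash, i.e. scheme and host.
SELECT COUNT(el_to) AS total, SUBSTRING_INDEX(el_to, '/', 3) AS search
FROM externallinks
JOIN page ON page.page_id = externallinks.el_from
WHERE page_namespace = 0
GROUP BY search
ORDER BY total DESC;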
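
The grouping key in the query above is the part of each URL before its third slash. A minimal sketch of the same reduction in plain Python follows; `site_of`, `count_sites` and the sample URLs are hypothetical names and data chosen for illustration, not taken from the dump.

```python
from collections import Counter


def site_of(url):
    """Mimic SUBSTRING_INDEX(url, '/', 3): keep the text before the third '/'.

    If the URL has fewer than three slashes it is returned unchanged,
    which matches the behaviour of the MySQL function.
    """
    return "/".join(url.split("/")[:3])


def count_sites(urls):
    """Count links per site and return (site, count) pairs, largest first."""
    return Counter(site_of(u) for u in urls).most_common()


# Invented sample links; two of them point at the same site.
sample = [
    "http://www.example.org/page/one",
    "http://www.example.org/page/two",
    "http://example.net",
]
counts = count_sites(sample)
```

Because only the prefix is compared, `http://example.org` and `http://www.example.org` would count as different sites, just as they would in the SQL version.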

Additional information

Some more information about this dump:

  • 4507 articles that are in the main space and not redirects
  • 5671 articles and redirects in the main space
  • 8743 pages in all namespaces
  • 1313 redirects in all namespaces
  • 8767 external links in all namespaces
  • 1866 external links in the main space
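
The four page figures in the list above could in principle come from a single pass over the page table. The page does not show how they were actually computed, so the following is only a sketch under that assumption. It uses Python's sqlite3 with an invented five-row table, and `page_counts` and its result keys are hypothetical names. The two external-link figures would need the externallinks table and are not covered here.

```python
import sqlite3

# Toy `page` table with only the two columns the counts depend on.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE page (page_id INTEGER PRIMARY KEY, "
             "page_namespace INTEGER, page_is_redirect INTEGER)")
conn.executemany("INSERT INTO page VALUES (?, ?, ?)", [
    (1, 0, 0),  # main-space article
    (2, 0, 0),  # main-space article
    (3, 0, 1),  # main-space redirect
    (4, 1, 0),  # talk page
    (5, 2, 1),  # redirect in the user namespace
])


def page_counts(conn):
    """Return the four page statistics as a dict.

    In SQLite a comparison evaluates to 0 or 1, so SUM() over a
    condition counts the rows that satisfy it.
    """
    row = conn.execute("""
        SELECT
            SUM(page_namespace = 0 AND page_is_redirect = 0),
            SUM(page_namespace = 0),
            COUNT(*),
            SUM(page_is_redirect = 1)
        FROM page
    """).fetchone()
    keys = ("main_non_redirect", "main_total", "all_pages", "all_redirects")
    return dict(zip(keys, row))


stats = page_counts(conn)
```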

Very probable spambot pages

If index.php appears in a page title, the page (often an article talk page) was very likely created by a spambot. Such pages should be deleted and, if possible, protected against re-creation.

Article ID  Article
10406       W/index.php

Possible spambot pages

Pages whose titles end with / and may have been created by spambots.

Article ID  Article
SELECT page_id, page_title, page_namespace
FROM page
WHERE page_title LIKE '%index.php%'
   OR page_title LIKE '%/wiki/%'
   OR page_title LIKE '%/w/%'
   OR page_title LIKE '%/';
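
The same four title patterns can also be checked outside the database, for example when screening a list of titles. This is a minimal sketch in Python; `SUSPICIOUS_FRAGMENTS` and `looks_like_spambot_title` are hypothetical names introduced here. The check is case-sensitive, whereas the case sensitivity of LIKE depends on the column's type and collation.

```python
# Substrings that the query above searches for anywhere in the title.
SUSPICIOUS_FRAGMENTS = ("index.php", "/wiki/", "/w/")


def looks_like_spambot_title(title):
    """Return True if the title matches any of the four LIKE patterns.

    Three patterns are "contains" checks; the fourth ('%/') matches
    titles that end with a slash.
    """
    return title.endswith("/") or any(
        fragment in title for fragment in SUSPICIOUS_FRAGMENTS
    )


# One title from the listing above and two ordinary article titles.
titles = ["W/index.php", "Frankfurt", "Meidogger/skipper/nij haadside"]
flagged = [t for t in titles if looks_like_spambot_title(t)]
```

A match is only a hint: legitimate subpages also contain slashes, which is why the patterns look for specific path fragments rather than any slash.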