Analytics/Archive/Wikistats2.0/Design
Appearance
This page contains historical information. It may be outdated or unreliable.
Data about how humans collaborate. Loads of it.
Data on wikistats right now
https://stats.wikimedia.org/EN/Sitemap.htm
Concepts
page types
namespace 0
not namespace 0
article (content)
talk
not talk or article
Revert types
sha matching
unknown (comment mentions partial revert but no sha match)
self-revert
User categories
bot
anonymous
registered
unknown (? this appears in the charts with no definition)
project
language
"start of project"
per-year totals
per-month totals
top ranking
reverters
reverted editors
reverted articles
reverted non-project pages
5+ edits
regions?
with speakers of a language
with a lot of traffic to a project / language
millions of speakers of a language
article count
views per hour
truncate / exclude
less than 10 edits per month
less than 10 articles per project
classic metrics
article count
new articles per day
edits per month
new editors
active editors (5+ / month)
very active editors (100+ / month)
article size
> 0.5kb
> 2kb
mean edits
bytes per article
database size
edits
size
words
links
internal
interwiki
external
redirects
quarterly rankings
edits per month thresholds
users: 1+, 3+, 5+, 10+, 100+, 250+, 1000+, 2500+, 10000+
bots: 5+, 10+, 100+, 1000+, 10000+
Dimensions:
project
- name
- creation
- language
- Regions associated
page
- project
- creation
- namespace
Region
- name
- estimated population
language
- name
Region-language
- Estimated number of spearkers
Wikipedia page https://stats.wikimedia.org/EN/Sitemap.htm
For a given month:
--- per project, article count, views per hour, regions ????? , speakers in million (probably sum over the regions listed), editors (5+ edits) per million spearker (for the regions), prim + sec speakers ????
-- Only Wikipedias which contain 10 or more articles and which received 10 or more edits in last month are listed above + list of not included
-- Links to other pages:
-- Summary https://stats.wikimedia.org/EN/SummarySIMPLE.htm
-- pageview, article count, new article per daym edits per month, active editors, very active editors, new editors, speakers (same as above), editors per million speakers + charts (namespace 0 only)
-- tables https://stats.wikimedia.org/EN/TablesWikipediaSIMPLE.htm
--Monthly counts & quterly rankings
-- editors data, articles data, data base data ????, links (internal, interwiki, image, external, redicrects)
-- Variation per month over last 6 month
-- Absolute number per month since project creation
-- Rank for this project in comparision to other projects with more than 1000 articles over every dimension, per month since project creation)
-- Edit actuvity levels per editor class and namespace
-- By bucket of number of edits (1, 2, 5, 10, 25, 100 ...)
-- Monthly since project creation
--current Distribution of number of article edits over registered editors (no bots) (1 : √10 : 10 : 10√10 : 100 : 100√10 : 1000 ...scale)
--current top 50 recently active wikipedians (no bot), by number of contributions
--current top 20 recently absent top editors, by number of contributions
--current anonymous users stats (not for enwiki - only percentage of total number of edits)
--current top 50 bots, by number of contributions
--monthly Articles containing at least one internal link, by number of characters (readable text, disregarding wiki- and html codes, hidden links, etc.; also headers do not count)
--monthly database records per namespace, categorised article and binaries (images, sound files...)
-- monthly Most edited articles (out of date)
-- charts https://stats.wikimedia.org/EN/ChartsWikipediaSIMPLE.htm
Charts starting at project start, one bar per month
Color coding (on/off toggle), % variation (toggle on/off)
Link to dedicated chart per wikipedia project + grand total
-- wikipedians
-- contributors -- https://stats.wikimedia.org/EN/TablesWikipediansContributors.htm
-- new wikipedians -- https://stats.wikimedia.org/EN/TablesWikipediansNew.htm
-- active wikipedians (5+ contribs this month) -- https://stats.wikimedia.org/EN/TablesWikipediansEditsGt5.htm
-- very active wikipedians (100+ contribs this month) -- https://stats.wikimedia.org/EN/TablesWikipediansEditsGt100.htm
--articles
-- count official https://stats.wikimedia.org/EN/TablesArticlesTotal.htm
-- count alternate (last updated jan 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesArticlesTotalAlt.htm
-- new article per day https://stats.wikimedia.org/EN/TablesArticlesNewPerDay.htm
-- edits per article https://stats.wikimedia.org/EN/TablesArticlesEditsPerArticle.htm
-- bytes per article (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesArticlesBytesPerArticle.htm
-- articles overs 0.5kb (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesArticlesGt512Bytes.htm
-- articles overs 2kb (%) (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesArticlesGt2048Bytes.htm
-- database
-- edits per month https://stats.wikimedia.org/EN/TablesDatabaseEdits.htm
-- database size (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseSize.htm
-- words (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseWords.htm
-- links
-- internal inks (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseLinks.htm
-- links to other wikipedias last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseWikiLinks.htm
-- binaries (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseImageLinks.htm
-- external llinks (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseExternalLinks.htm
-- redirects https://stats.wikimedia.org/EN/TablesDatabaseRedirects.htm
-- Other section: Comparisions
List of dedicated cahrts listed above
-- Overview over recent month: summarize every dedicated chart over 5 month for every project https://stats.wikimedia.org/EN/TablesRecentTrends.htm
Visits per day : https://stats.wikimedia.org/EN/TablesUsageVisits.htm (discontinued since 2004)
Page requests per day : https://stats.wikimedia.org/EN/TablesUsagePageRequest.htm (discontinued since 2004)
-- pageviews https://stats.wikimedia.org/EN/TablesPageViewsMonthlyCombined.htm
-- projects current status https://stats.wikimedia.org/EN/TablesCurrentStatusVerbose.htm
-- bot activity - editing https://stats.wikimedia.org/EN/BotActivityMatrixEdits.htm
-- bot activity -- creation only https://stats.wikimedia.org/EN/BotActivityMatrixCreates.htm
Edits & Reverts
--- Edits & reverts count http://stats.wikimedia.org/EN/PlotsPngEditHistoryTable.htm
-- Edits on namespace 0) & reverts using sha1 matching
-- since start of project, for each project
-- Registered / anonymous / bots (editor class) + total
-- Absolute and relative numbers - Charts
---- Edit and Revert Trends http://stats.wikimedia.org/EN/EditsRevertsEN.htm
-- Chart of number of edits per month since beginning of project + smoothed
-- Chart for revert ratio (#revert / #edits) per editor class + smoothed
-- Distribution of reverts
-- Namespace 0 + other namespaces (not detailed)
-- Special Unknown category
-- User class for: Reverted editor, reverted by, self revert
-- Different views: percentages across subdivisions or global, plus grand totals per year for each type
-- Top ranking
-- Most active reverters
-- most reverted editors
-- most reverted articles
-- most revertted other non-project pages
Thoughts and Conclusions
=====================
Dan
There are two main kinds of use cases I see in Wikistats:
* get lost clicking through stats and engage your mind to new questions. This kind of brainstorming is not possible with a rigid dashboard.
* find answers to specific questions. This kind of search is hard on wikistats unless you know where to go.
Other projects page