Jump to content

Analytics/Archive/Wikistats2.0/Design

From Wikitech
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.
This page contains historical information. It may be outdated or unreliable.

Data about how humans collaborate. Loads of it.

Data on wikistats right now

 https://stats.wikimedia.org/EN/Sitemap.htm

 Concepts
 page types
 namespace 0
 not namespace 0
 article (content)
 talk
 not talk or article
 Revert types
 sha matching
 unknown (comment mentions partial revert but no sha match)
 self-revert
 User categories
 bot
 anonymous
 registered
 unknown (? this appears in the charts with no definition)
 project
 language
 "start of project"
 per-year totals
 per-month totals
 top ranking
 reverters
 reverted editors
 reverted articles
 reverted non-project pages
 5+ edits
 regions?
 with speakers of a language
 with a lot of traffic to a project / language
 millions of speakers of a language
 article count
 views per hour
 truncate / exclude
 less than 10 edits per month
 less than 10 articles per project
 classic metrics
 article count
 new articles per day
 edits per month
 new editors
 active editors (5+ / month)
 very active editors (100+ / month)
 article size
 > 0.5kb
 > 2kb
 mean edits
 bytes per article
 database size
 edits
 size
 words
 links
 internal
 interwiki
 external
 redirects
 quarterly rankings
  edits per month thresholds
 users: 1+, 3+, 5+, 10+, 100+, 250+, 1000+, 2500+, 10000+
 bots: 5+, 10+, 100+, 1000+, 10000+

Dimensions:
    project
        - name
        - creation
        - language
        - Regions associated
    page
        - project
        - creation
        - namespace
        
        
    Region
        - name
        - estimated population
    language
        - name
    Region-language
       - Estimated number of spearkers



Wikipedia page https://stats.wikimedia.org/EN/Sitemap.htm
For a given month: 
   --- per project, article count, views per hour, regions ????? , speakers in million (probably sum over the regions listed), editors (5+ edits) per million spearker (for the regions), prim + sec speakers ????
            -- Only Wikipedias which contain 10 or more articles and which received 10 or more edits in last month are listed above + list of not included
   
   -- Links to other pages:
      
       -- Summary https://stats.wikimedia.org/EN/SummarySIMPLE.htm
               -- pageview, article count, new article per daym edits per month, active editors, very active editors, new editors,  speakers (same as above), editors per million speakers + charts (namespace 0 only)
      
       -- tables https://stats.wikimedia.org/EN/TablesWikipediaSIMPLE.htm
            --Monthly counts & quterly rankings
                -- editors data, articles data, data base data ????, links (internal, interwiki, image, external, redicrects)
                -- Variation per month over last 6 month
                -- Absolute number per month since project creation
                -- Rank for this project in comparision to other projects with more than 1000 articles over every dimension, per month since project creation)
           -- Edit actuvity levels per editor class and namespace
                -- By bucket of number of edits (1, 2, 5, 10, 25,  100 ...)
                -- Monthly since project creation
          --current Distribution of number of article edits over registered editors (no bots) (1 : √10 : 10 : 10√10 : 100 : 100√10 : 1000 ...scale)
          --current top 50 recently active wikipedians (no bot), by number of contributions
          --current top 20 recently absent top editors, by number of contributions
          --current anonymous users stats (not for enwiki - only percentage of total number of edits)
          --current top 50 bots, by number of contributions
          --monthly Articles containing at least one internal link, by number of characters (readable text, disregarding wiki- and html codes, hidden links, etc.; also headers do not count)
          --monthly  database records per namespace, categorised article and binaries (images, sound files...)
          -- monthly Most edited articles (out of date)
       -- charts https://stats.wikimedia.org/EN/ChartsWikipediaSIMPLE.htm
                  Charts starting at project start, one bar per month
                  Color coding (on/off toggle), % variation (toggle on/off)
                  Link to dedicated chart per wikipedia project + grand total 
         -- wikipedians
           -- contributors -- https://stats.wikimedia.org/EN/TablesWikipediansContributors.htm
           -- new wikipedians -- https://stats.wikimedia.org/EN/TablesWikipediansNew.htm
           -- active wikipedians (5+ contribs this month) -- https://stats.wikimedia.org/EN/TablesWikipediansEditsGt5.htm
           -- very active wikipedians (100+ contribs this month) --   https://stats.wikimedia.org/EN/TablesWikipediansEditsGt100.htm
        --articles
          -- count official https://stats.wikimedia.org/EN/TablesArticlesTotal.htm
          -- count alternate (last updated jan 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesArticlesTotalAlt.htm
          -- new article per day  https://stats.wikimedia.org/EN/TablesArticlesNewPerDay.htm
          -- edits per article  https://stats.wikimedia.org/EN/TablesArticlesEditsPerArticle.htm
          -- bytes per article (last updated 2010 for en, feb 2014 for most others)  https://stats.wikimedia.org/EN/TablesArticlesBytesPerArticle.htm
          -- articles overs 0.5kb (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesArticlesGt512Bytes.htm
          -- articles overs 2kb (%) (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesArticlesGt2048Bytes.htm
        -- database
          -- edits per month https://stats.wikimedia.org/EN/TablesDatabaseEdits.htm
          -- database size (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseSize.htm
          -- words (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseWords.htm
        -- links
          -- internal inks (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseLinks.htm
          -- links to other wikipedias last updated 2010 for en, feb 2014 for most others)  https://stats.wikimedia.org/EN/TablesDatabaseWikiLinks.htm
          -- binaries (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseImageLinks.htm
          -- external llinks (last updated 2010 for en, feb 2014 for most others) https://stats.wikimedia.org/EN/TablesDatabaseExternalLinks.htm
          -- redirects https://stats.wikimedia.org/EN/TablesDatabaseRedirects.htm
       
     
     
       
    -- Other section: Comparisions
       List of dedicated cahrts listed above
       -- Overview over recent month: summarize every dedicated chart over 5 month for every project https://stats.wikimedia.org/EN/TablesRecentTrends.htm
                  Visits per day : https://stats.wikimedia.org/EN/TablesUsageVisits.htm (discontinued since 2004)
                  Page requests per day : https://stats.wikimedia.org/EN/TablesUsagePageRequest.htm (discontinued since 2004)
      -- pageviews https://stats.wikimedia.org/EN/TablesPageViewsMonthlyCombined.htm
      -- projects current status https://stats.wikimedia.org/EN/TablesCurrentStatusVerbose.htm
      -- bot activity - editing https://stats.wikimedia.org/EN/BotActivityMatrixEdits.htm
      -- bot activity -- creation only https://stats.wikimedia.org/EN/BotActivityMatrixCreates.htm
      
      
       
       
       



Edits & Reverts
 --- Edits & reverts count http://stats.wikimedia.org/EN/PlotsPngEditHistoryTable.htm
         -- Edits  on namespace 0) & reverts  using sha1 matching
         -- since start of project, for each project
         -- Registered / anonymous / bots (editor class) + total
         -- Absolute and relative numbers - Charts
        
         
 ---- Edit and Revert Trends  http://stats.wikimedia.org/EN/EditsRevertsEN.htm
      -- Chart of number of edits per month since beginning of project + smoothed
      -- Chart for revert ratio (#revert / #edits) per editor class + smoothed
      --  Distribution of reverts
             -- Namespace 0 + other namespaces (not detailed)
             -- Special Unknown category
             -- User class for: Reverted editor, reverted by, self revert
             -- Different views: percentages across subdivisions or global, plus grand totals per year for each type
    -- Top ranking
        -- Most active reverters
        -- most reverted editors
        -- most reverted articles
        -- most revertted other non-project pages



Thoughts and Conclusions
=====================
Dan
There are two main kinds of use cases I see in Wikistats:
* get lost clicking through stats and engage your mind to new questions.  This kind of brainstorming is not possible with a rigid dashboard.
* find answers to specific questions.  This kind of search is hard on wikistats unless you know where to go.



Other projects page