CZ:Statistics: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Aleksander Stos
(human resources section, graphs to be added)
imported>Aleksander Stos
(human resources)
Line 3: Line 3:
At present, the graphs are scaled in working days. Here is the translation into the calendar dates.
At present, the graphs are scaled in working days. Here is the translation into the calendar dates.


1  : 2006-10-22
1  : 2006-10-22<br>
 
50 : 2006-12-11<br>
50 : 2006-12-11
100: 2007-01-30<br>
 
150: 2007-03-21<br>
100: 2007-01-30
200: 2007-05-10<br>
 
launch: 2007-03-28, i.e. 157th working day<br>
150: 2007-03-21
 
200: 2007-05-10
 
launch: 2007-03-28 (?), i.e. 157th working day
 
last pictured day = 2007-05-17
last pictured day = 2007-05-17


Line 26: Line 20:
</td></tr>
</td></tr>
<tr><td width="100%">
<tr><td width="100%">
The second graph shows number of all pages from all namespaces (e.g. userpages, talk pages and images are included, redirects are _not_). This is the green line. The blue line is the one from the first graph (i.e. the mainspace pages). What happened about 125th day? It was Saint Valentine's, 14/02/2007, when after slashdotting many new user registered (and were welcomed on their talk pages!). Notice that at the same time there was no significant growth in the mainspace. Apparently, the newly registered users were mainly watching (at the time there was no unregistered access to the wiki). Again, a more stable growth rate has been established after the launch.
The second graph shows number of all pages from all namespaces (e.g. userpages, talk pages and images are included, redirects are _not_). This is the green line. The blue line is the one from the first graph (i.e. the mainspace pages). What happened about 125th day? It was Saint Valentine's, 14/02/2007, when after slashdotting many new user registered (and were welcomed on their talk pages!). Notice that at the same time there was no parallel growth in the mainspace. Apparently, the newly registered users were mainly watching (at the time there was no unregistered access to the wiki). Again, a more stable growth rate has been established after the launch.
[[Image:Number_of_all_pages.png|thumb|left|600px|Fig. 2. Number of all pages from all namespaces without redirects (green) and articles (blue)]]
[[Image:Number_of_all_pages.png|thumb|left|600px|Fig. 2. Number of all pages from all namespaces without redirects (green) and articles (blue)]]
</td></tr>
</td></tr>
Line 45: Line 39:


==Human resources==
==Human resources==
The following graphs describe the human resources.  
The following graphs describe the CZ human resources.  
 
* How many authors edit each month? See Fig. 6 below.
 
<table width="100%">
<tr><td width="100%">
[[Image:Editing_users.png|thumb|left|600px|Fig. 6. Editing authors]]
</td></tr>
</table>
 
 
* How many users are active? If by "activity" we define at least 20 edits per month, and by "high activity" we understand at least 100 edits per month, then the answer is given by the Fig. 7 below.
 
<table width="100%">
<tr><td width="100%">
[[Image:Active_users.png|thumb|left|600px|Fig. 7. Active authors]]
</td></tr>
</table>
 
 
*  How many users you could meet here daily? This seems to be interesting measure of how 'vibrating' the community is. The answer is on Fig. 8.
 
<table width="100%">
<tr><td width="100%">
[[Image:Users_daily.png|thumb|left|600px|Fig. 8. Authors daily]]
</td></tr>
</table>
 
 
* Fig. 9: How many new authors arrive each month? This can be measured by counting new user pages. More substantial measure would be, however, to detect a new user on his first edit. Notice that in the period of self-registration (essentially, February 2007) the two measures largely coincide, as the new users were supposed to provide their bio.
 
<table width="100%">
<tr><td width="100%">
[[Image:New_users.png|thumb|left|600px|Fig. 9. New arrivals]]
</td></tr>
</table>


* Fig. 6. How many authors edit each month
* Fig. 7. How many new authors arrive each month. This can be measured by counting new user pages. More substantial measure would be, however, to detect a new user on his first edit. Notice that in the period of self-registration (essentially, February 2007) the two measures largely coincide, as the new users were supposed to provide their bio.
* Fig. 8. How many active users edit each month. By an active user here we mean an author that made more than 20 edits. Those who made more than 100 edits are called ''very'' active.
* Fig. 9. How many users you could meet here daily (on average).


<!--It was observed that if one counts the registered users then CZ's human resources of these figures are comparable to those of Wikipedias from the category "more than 25,000 entries" (as listed on the main page of the English Wikipedia). For example, CZ would be in order of magnitude of lt.wikipedia.org, hr.wikipedia.org (these were slightly smaller) or sr.wikipedia.org  (this one was slightly bigger).-->
<!-- These figures would be more meaningful if compared to some other successful and growing projects. A Wikipedia in some language, perhaps. Of course, not (yet) the English one --  a big and active wiki at this stage --one month after the launch-- would do.
To this end, suppose that we count the registered users only, considering that IP anonymous users, while numerous, globally make not too many edits (8-15%, depending on the wiki). Then it turns out that CZ human resources (in terms of the above figures) are comparable to Wikipedias from the category "more than 25,000 entries", as listed on the main page of the English Wikipedia. For example, as of April 2007, CZ would be in order of magnitude of lt.wikipedia.org, hr.wikipedia.org (these were slightly smaller) or sr.wikipedia.org  (this one was slightly bigger than CZ). Notice that there are 24 Wikipedias in the categories  "more  than 50000", "more than 100000" and "more than 250000" entries. -->

Revision as of 12:37, 18 May 2007

Here we present basic statistics concerning the project.

At present, the graphs are scaled in working days. Here is the translation into the calendar dates.

1  : 2006-10-22
50 : 2006-12-11
100: 2007-01-30
150: 2007-03-21
200: 2007-05-10
launch: 2007-03-28, i.e. 157th working day
last pictured day = 2007-05-17

Pages

The first graph shows the number of articles (technically speaking, all pages from mainspace without redirects). Observe an acceleration after the launch (about 150). As a trivia fact, one may notice a small jump about 50th day. What was it? On December 7, 2006, some viper articles were uploaded.

Fig. 1. Number of articles

The second graph shows number of all pages from all namespaces (e.g. userpages, talk pages and images are included, redirects are _not_). This is the green line. The blue line is the one from the first graph (i.e. the mainspace pages). What happened about 125th day? It was Saint Valentine's, 14/02/2007, when after slashdotting many new user registered (and were welcomed on their talk pages!). Notice that at the same time there was no parallel growth in the mainspace. Apparently, the newly registered users were mainly watching (at the time there was no unregistered access to the wiki). Again, a more stable growth rate has been established after the launch.

Fig. 2. Number of all pages from all namespaces without redirects (green) and articles (blue)

The third and forth figure present "global creation rate". It measures somehow the activity on the wiki expressed in new pages per day. The rate for "pure" articles (technically: mainspace without redirects) is depicted in blue; the green line corresponds to all pages (still, without redirects). This is calculated as the number of articles (pages, respectively) divided by the number of working days from the beginning.[1] Obviously, this is a "global average" and the recent creation rate is higher than the one on the beginning. This can be seen on the 5th and last graph of this section, which represents the creation rate for articles taking into account last 30 days only.

Fig. 3. Creation rate (articles per day)
Fig. 4. Overall creation rate in pages per day (green line); the blue line corresponds to articles' creation rate
Fig. 5. Recent creation rate (last 30 day counted)

Human resources

The following graphs describe the CZ human resources.

  • How many authors edit each month? See Fig. 6 below.
Fig. 6. Editing authors


  • How many users are active? If by "activity" we define at least 20 edits per month, and by "high activity" we understand at least 100 edits per month, then the answer is given by the Fig. 7 below.
Fig. 7. Active authors


  • How many users you could meet here daily? This seems to be interesting measure of how 'vibrating' the community is. The answer is on Fig. 8.
Fig. 8. Authors daily


  • Fig. 9: How many new authors arrive each month? This can be measured by counting new user pages. More substantial measure would be, however, to detect a new user on his first edit. Notice that in the period of self-registration (essentially, February 2007) the two measures largely coincide, as the new users were supposed to provide their bio.
Fig. 9. New arrivals


  1. Technical sidenote: on the third graph the first two days were truncated for the obvious reason: starting with, say, 25 pages --so with a realtively high creation rate-- is not very relevant, results in changing the scale and makes the graph less readable.