Do Distrowatch's statistics equate popularity?

By Oli on Thursday, 04th October 2007. More information. Comments.

Distrowatch's H.P.D. statistics are banded about by numerous factions trying to make the point that their distribution is the best. But do the numbers actually equate popularity and why do the fanboys care so much anyway?

Distrowatch's Hits-Per-Day table

For those not in the know, Distrowatch is a very excellent site that tracks the latest releases of dozens of different Linux and BSD distributions. For each distro, there is an information page. There is also a table on the main page (right) showing how many page views each information page gets per day.

While I have no problem with the presence of the table, I've serious reservations over the way some people interpret the data. I've begun to write this post so many times and never gotten around to finishing it. The reason it keeps floating to the top of my to-write pile is the volume of people flagrantly bandying HPD statistics around as if they mean something in the real world.

I'm going to say a few things that sound an awful lot like "Distribution X isn't really the most popular" but I'm going to try and give some plausible, even logical evidence to back those statements up so please don't skip straight to the comment section in an attempt to call me a Distribution Y fanboy. Let's see where this journey takes us...

What do the H.P.D. figures really mean?

H.P.D. stands for hits per day. This count represents how many unique computers have visited a distro's information page. This means that if one person sits on the Ubuntu information page and reloads over and over again, only one vote is tallied.

So in short, H.P.D. stats show how many people are visiting a certain Distrowatch page.

Why doesn't that equate popularity?

For starters, as I've just said, all conclusively tells us is how many people are visiting the page. It doesn't break that figure into people that are just coming from inside distrowatch.com, search engine hits or even the community forums for a distribution (in an effort to game the statistics).

  1. Search Engines

    I mention search engine hits because I believe them to be highly enlightening to the whole argument. Yes, I know this is fuzzy science but I believe there is logic behind my thinking. Let's compare the first two distributions in the table on Google. For quick comparison, I've highlighted the position of Distrowatch in green.

    PCLinuxOS vs Ubuntu on Google

    As you can see, the PCLinuxOS information page features a lot higher up. The Ubuntu info page falls way below "the fold" (most people would have to scroll down to see it) and is therefore a lot less likely to get visited from that search.

    To back this up with even fuzzier statistics, we can look at Google Trends (as Seopher did when trying to argue the other way). Ubuntu gets so many more searches than PCLinuxOS that the latter doesn't even peak off the chart's baseline. Assuming Distrowatch appears in equal measure in these searches, the position has a massive effect on the likelihood of clickage.

  2. Relative obscurity

    You also have to take into account what purpose people have in mind when they click through to the information page. Quite simply: they want to find out more about the distribution. So what are you getting at, Captain Obvious?

    Lots more people know what Ubuntu is all about. For Linux newcomers I would go as far as saying that Ubuntu is probably a synonym for Linux, just as many people assume a PC runs Windows. It's many people's first experience into the wide world of Linux distro. From that assumption you could conclude that fewer people click through to Ubuntu (or other, older distributions) because they're well known.

    The same applies for both people on search engines and people already on the Distrowatch site.

  3. Fanaticism

    Put simply, some distributions have users that like it too much. They're willing to be militant in the ways they promote their favourite distribution in the hope that they can get it more attention — including organising community members to visit polls, the Distrowatch page and other sites to inflate it's statistics.

    The proof required to make such a claim usually evaporates shortly after being discovered but plenty of people in this thread have anecdotal evidence.

    It wouldn't be so bad if it was just a positive campaign — it would still wreck the DW stats — but for the most part where there are people promoting, there are people smearing any criticism. I was lynched when I dared mention Ubuntu while reviewing another distro because they thought it was an irrelevant comparison. Sorry but if I think something is done better elsewhere, I'm going to tell you all about it.

    Giving example of something done better does help people differentiate between distributions and it also helps developers see what they should borrow for their own distribution.

    Anyway, you might argue that this should apply to all distributions and that all the fanboys cancel each other out but I believe that to be untrue. People just seem to get more fanatical about some distros than others. Who in their right mind would get their knickers in a twist over Red Hat Enterprise Linux?

In conclusion...

What can you use these statistics to show? Not too much — not accurately anyway. The fact is there are too many variables in the mix to say more than "Distribution X's DIstrowatch information page gets more hits than Distribution Y's". People that do try and use them as the prime basis for a distribution popularity value are misrepresenting the figures.

But why does it matter? Why do people insist on using irrelevant things like estimated popularity to compare distributions? If you've got something good to say about something a distro does, say it! Conversely, if there's something you think spoils the finished product, tell people.

Grav

Written by Oli on Thursday, 04 October 2007. Tagged with linux, distrowatch. Read 2066 times. If you liked it, please give it a digg.

1 to 10 of 11 < 1 2 >
#1 /* 11 months, 27 days ago */
Those numbers are just an indication of the level of interest. Ubuntu was instrumental in bringing a lot of people into Linux. Now, a lot of those newbies are looking around wondering if there is more to Linux than Ubuntu. The reports coming out of those trying PCLinuxOS have been mostly positive.

http://www.linuxtoday.com/news_story.php3?ltsn=200 ...

http://www.pclinuxos.com/forum/index.php?board=45. ...

Therefore the interest and the number of clicks. No conspiracy. No need to get all upset about it. Ah, it is you Ohly. So, you are still carrying that ax against PCLOS. Please, give it a rest.
#2 /* 11 months, 27 days ago */
It's obvious that Distrowatch's ranking system is skewed. If you look at all the distros that are primarily used for internet servers (Red Hat, CentOS), they rank very low. This is because they're out there doing their job as server machines, and don't have someone sitting behind them on a browser making visits to the Distrowatch site every day. Fedora ranks fairly well yet is basically considered the same as the two mentioned (no offense to Fedora fans).

I'd also like to suggest that since the use of live CD setups, it's become easier to fake out the ranking system. Someone can boot up a group of work or school computers very quickly using live CDs just to make hits on the site, and then revert them to their normal state.
#3 — Author comment /* 11 months, 27 days ago */
Justathought: perhaps I don't fully understand the mentality of finding out about distributions. If I read about a distro on LinuxToday or a review site, why would I go Distrowatch for more information? I would go straight to the distribution's home page but perhaps I'm just perverse like that.

I agree it's not a conspiracy. I'm not saying anything about the actual quality of any distro here or that certain distributions don'e deserve the traffic they get — quite the opposite. The issue I have is the statistics do not substantiate some people's claims, mainly: "this distro is more popular because it has more hits per day on Distrowatch!"

It may very well be true in some cases but it's severely flawed logic to take the HPDs and jump to a popularity ranking.
#4 /* 11 months, 27 days ago */
Justathought: perhaps I don't fully understand the mentality of finding out about distributions. If I read about a distro on LinuxToday or a review site, why would I go Distrowatch for more information? I would go straight to the distribution's home page but perhaps I'm just perverse like that.¨


Ohly, I agree completely with you, but apparently some people use Distrowatch as a sort of distro home page. See Jeff Cobb's first comment on the first link I linked for you. (Buy the way, why is the link broken? Let me try posting it again with taggs.)

The link


I agree it's not a conspiracy. I'm not saying anything about the actual quality of any distro here or that certain distributions don'e deserve the traffic they get — quite the opposite. The issue I have is the statistics do not substantiate some people's claims, mainly: "this distro is more popular because it has more hits per day on Distrowatch!"


Again, I agree that there may be some people getting the wrong idea about the meaning of the stats, but is it really that big of a deal? Why make such a fuss about a few misguided comments. I imagine that this article is part of the reason you posted this entry:

Is Ubuntu losing its crown to PCLinuxOS?

I think the author must have been surprised about the place PCLOS occupies in distrowatch's rankings and thought it was news enough to make an article about it. He even qualified his comments a bit.

It may very well be true in some cases but it's severely flawed logic to take the HPDs and jump to a popularity ranking.


You have to admit that, back when Ubuntu reach that place in distrowatch, the same thing happened with many over enthusiastic Ubuntu users. Big deal! No need to come up with conspiracy theories, or to rely on "anecdotal evidence¨ to paint a whole community as fanatical. Read the comments that were favorable to PCLOS on those articles and you will see, for the vastly most part, that they are very reasonable, level headed, and fair. The fact is that PCLinuxOS is grabbing a lot of attention because it has a lot to give. (Let me try to post that other link again as well.)

Happy PCLOS Users

Links fixed by Oli
#5 — Author comment /* 11 months, 27 days ago */
No the system really hates your links! I caught the raw data on my magical admin RSS feed so I'll squirt the right links in place for you.
#6 — Author comment /* 11 months, 27 days ago */
They point where they should now? Anyway

but is it really that big of a deal?
No but it just niggles away at me and the amount of people that read it in one place and re-blog it as if it's Gospel. As I said to begin with, this post has been brewing for a long time and yes it was the Tectonic post that tipped me over the edge -- but that really is one of the more balanced posts I've read over the past few months and not really the style of post this is targeting.

Re: fanatics, you're reading too deep into what I really said. Yes every community has them but they are usually a tiny, yet vocal, minority. Yeah plenty of people would love PCLinuxOS - it's a great distro! My only beef was the level of maturity shown against some criticisms.

That episode was just an example of fanaticism. I've had similar, yet smaller doses of it from other people in other places for other distros. Perhaps I'm just abrasive and deserve what I get but there you go.

The important part over what I was saying about them is that different communities have different levels of fanatics. The older and more regimented a product is, the less sexy it seems and the less likely you're going to have people swinging from the rafters screaming about it.
#7 /* 11 months, 27 days ago */
What I had rather see is a day to day listing of what Os's and their collective browsers hit the DW page...I'd find that information much more interesting than their top 100 list.
#8 /* 11 months, 27 days ago */
Thanks for your honesty, it is refreshing. But, I really hope you are able to find inspiration on some other topic. How about that OOXML thingy going on, he? What do you think? They are sure trying to cement the lock-in don't you think?

ODF and OOXML: Something New to Ponder

Sayonara, bee good, peace.
#9 — Author comment /* 11 months, 27 days ago */
Another good, yet irrelevant (here) example of proportionate fanaticism is the Apple community vs the Windows community... Hell even Apple vs Windows + Linux. They've practically got a religion going on over there!

The whole document standards debate is far bigger than choosing between two standards. It goes deep into the corruption of democracy through commercial lobbyists, something that is happening right across the board and something that something as socialist (in a good way) as FOSS has a hard time competing with because there just isn't the same amount of money on its side...

It would help if OpenOffice applications (et al FOSS MS-alternatives) were as good as their MS counterparts in terms of interface experience. You may disagree with me but I really think MS has made something quite beautiful and functional with Office 2007... They have the advantage of time but I guess it won't be long before Sun and IBM get OO and Lotus Symphony to the same level of user interface.

It's mainly that reason I haven't aired any views on OOXML vs ODF before. I think ODF is clearly the better standard but I totally sympathise with anybody for wanting to work in MS Office over OpenOffice. I'd get creamed if I pushed that sort of argument out as its own proper post =)
#10 /* 11 months, 26 days ago */
The was an artcile at reviewlinux.com inviting all to vote as the webmaster was comparing the distrowatch.com hit counter to actual people using the linux? I think this vote is not perfect a starting point and enough to raise a bit of conversation.

reviewlinux article: http://www.reviewlinux.com/?m=show&id=7516

Vote on frontpage: http://www.reviewlinux.com right side menu amoungst all the other stuff.
1 to 10 of 11 < 1 2 >

Don't just sit there like a lemon! Reply!

Got something to say? Now's the time to share it with the author and everybody else that reads this posting! Lemons need not apply.

edtBOX - xHTML: yes - bbcode:no
Home | Advertise | About | Contact | Legal © Oli Warner 2001—2007 Proud 9rules member