Ok, so smaller result sets show an MSN Search bias to IIS... what about a random selection of 1000 words? What happens then?
So one dictionary file and perl shuffle later.... et viola
The results might need a bit more processing... I think unreachable hosts (that currently drop into the 'Unknown' category) might be altering the results slightly.
But there's still a 7% IIS bias with this large set of completely random words
A few of the accented characters are causing problems, I'll sort out escaping the character codes asap.
Detailed Results:The full set of 1000 words creates a rather heavy page so the detailed per word analysis is now on a separate page and can be found here. Out of the full set of words (currently 984 actually) a startling trend can be seen by looking at the number of times Apache is the lowest counted webserver for each search engine and IIS the highest. Using MSN search you are three times as likely to get a result set containing more IIS server results than using Google, Teoma or Yahoo.
|
||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||||||||
Top 100 summaryThis is the total results totted up across all words between Google and
MSN search. As you can see the results for Google, Teoma and Yahoo are fairly consistent, but the MSN search results are significantly skewed. The numbers right at the bottom of the table show the total count of times where over the full list of words each search engine returned either the lowest number of Apache servers or the highest number of IIS servers
|
||||||||||||||||||||||||||||||||||||
Top 10 summaryThis shows the distribution of webservers just within the crucial top
ten front page. The results are inline with the top 100 results.
|
All content Copyright © 2005 Ivor Hewitt.
http://www.ivor.it - Technology - http://www.ivor.org - The Hedge.