Archive for the ‘Internet Stuff’ Category

New Search Engine! Cuil

Wednesday, July 30th, 2008

A new Search Engine was launched recently, Cuil, which intends to rival Google and claims to have indexed over 120 billion webpages.

Cuil LogoCuil is a product from some of Google’s finest former minds and is attracting alot of interest. Although we really shouldn’t draw comparisons between Cuil and Google this early after launch, we will. And by the way, Cuil is an old Irish word for knowledge.

Give Cuil a try at http://www.cuil.com

Cuil Features

Cuil looks quite good, but doesnt feel like a search engine to me. Infact Cuil feels more than a magazine. Cuil’s features include drilldown searching, roll-over definitions, tabbed searching and Google-style search suggestions. Tabbed searching in particular could be very useful for doing research.

This kind of excellant front-end innovation could allow Cuil to get a foothold in the highly competitive search industry providing the results from Cuil’s back-end are relevant.

Cuil Crawler: Twiceler

Cuil claims to have indexed 120 billion web pages, compared to Google’s 40 billion. Cuil crawls the web using Twiceler, it’s own custom built crawler which respects the robots.txt standard and even the crawl-delay directive which is a non-standard extension to reduce the crawler’s drain on your site’s bandwidth.

Cuil’s crawler, Twiceler, has indexed my site 63 times since Nov 2007 (8 months). Thats not alot.

Google on the otherhand has visited me 19,704 times in the same period. Google also has a variety of different crawlers ranging from the original Googlebot to Feedfetcher. So not only does Google appear alot more thorough than Cuil, but it also appears to be fresher with indexing being done on many different levels than Cuil.

It’s not clear to me at this time whether Cuil/Twiceler read sitemaps.

Cuil/Twiceler User Agent

Here’s the user agent string for Cuil’s crawler, Twiceler:

Mozilla/5.0 (Twiceler-0.9 http://www.cuill.com/twiceler/robot.html)

Cuil Results

The proof of the pudding lies in how accurate Cuil’s results are. Just because Cuil crawls something doesn’t mean they can make good sense of it.

So lets play around with Cuil and Google and try to find out about that episode of Seinfeld when George lies to a girl to impress her by saying he’s a Marine Biologist.

Googling ‘marine biology seinfeld’ will correctly bring up articles on the episode of Seinfeld ‘The Marine Biologist‘.

Cuil for ‘marine biology seinfeld’ and you get results about Marine Biologists, photos and profiles of Marine Biologists and a profile of Larry David, the creator of Seinfeld. So that first click gave us nothing.

The Cuil Explore By Category box has some links to Seinfeld Episodes and the episode is listed there. But when you click that, the search string now just gets longer and the Cuil results are in no way any more accurate to what we wanted. So that second click gave us nothing either. I’m sure on one of the subsequent pages the information we want is there, but I would’ve lost interest and Googled it by now.

So personally I think Cuil’s result relevancy right now is utter rubbish.

Cuil Management

Cuil’s Senior Management all seem to have long and prestigious careers in Silicon Valley; having been involved in major advances in search techniques. Most Cuil management have worked for Google at some point and have skills in Computer Science, Mathematics and/or AI.

Cuil claim to have written their search algorithms from scratch with the philosophy that all pages should be indexed and available for search, not just the popular pages. This is an interesting idea, but to be honest, pages that no-one reads, that few people link to and that probably havent been relevant for years are of little interest.

Maybe it would be good to use Cuil in the event that Google fails to find what you’re looking for. But then, how often does that happen? Its quite rare for me!

Most people who can’t find something using Google simply aren’t using it properly, so unless Cuil’s search and ranking algorithms are more intuitive and user-friendly, I don’t see the advantage.

Cuil vs. Google

Google have a long history of being very open and responsible with regard to competition, collaboration and the internet in general. But how do they feel about so many of their former employees launching Cuil in direct competition with them? Google said with regard to Cuil:

“[we welcome] competition that stimulates innovation and provides users with more choice.”

That seems very fair and sporting.

So, have any of you guys used Cuil? What do you think?

Firefox 3 BETA User Agent List

Sunday, May 4th, 2008

Here is a complete list of the 49 distinct Firefox 3 BETA User Agent that have visited my site from 2007-11-21 to 2008-05-03.

Firefox 3 BETA User Agent On Microsoft Windows

Mozilla/5.0 (Windows; U; Windows NT 5.0; en-GB; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5

Mozilla/5.0 (Windows; U; Windows NT 5.1; de; rv:1.9b4) Gecko/2008030714 Firefox/3.0b4

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-GB; rv:1.9b4) Gecko/2008030714 Firefox/3.0b4

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-GB; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9b1) Gecko/2007110904 Firefox/3.0b1

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9b2) Gecko/2007121120 Firefox/3.0b2

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9b3) Gecko/2008020514 Firefox/3.0b3

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9b4) Gecko/2008030714 Firefox/3.0b4

Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5

Mozilla/5.0 (Windows; U; Windows NT 5.1; es-ES; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5

Mozilla/5.0 (Windows; U; Windows NT 5.1; fr; rv:1.9b2) Gecko/2007121120 Firefox/3.0b2

Mozilla/5.0 (Windows; U; Windows NT 5.1; nb-NO; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5

Mozilla/5.0 (Windows; U; Windows NT 5.1; pl; rv:1.9b3) Gecko/2008020514 Firefox/3.0b3

Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5

Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.9b4) Gecko/2008030714 Firefox/3.0b4

Mozilla/5.0 (Windows; U; Windows NT 6.0; en-GB; rv:1.9b4) Gecko/2008030714 Firefox/3.0b4

Mozilla/5.0 (Windows; U; Windows NT 6.0; en-GB; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5

Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9b1) Gecko/2007110904 Firefox/3.0b1

Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9b3) Gecko/2008020514 Firefox/3.0b3

Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9b4) Gecko/2008030714 Firefox/3.0b4

Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.9b5) Gecko/2008032620 Firefox/3.0b5

Firefox 3 BETA User Agent On Apple Mac

Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.4; en-US; rv:1.9b1) Gecko/2007110903 Firefox/3.0b1

Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.4; en-US; rv:1.9b4) Gecko/2008030317 Firefox/3.0b4

Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-GB; rv:1.9b5) Gecko/2008032619 Firefox/3.0b5

Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9b1) Gecko/2007110903 Firefox/3.0b1

Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9b4) Gecko/2008030317 Firefox/3.0b4

Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9b5) Gecko/2008032619 Firefox/3.0b5

Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; fr; rv:1.9b5) Gecko/2008032619 Firefox/3.0b5

Mozilla/5.0 (Macintosh; U; PPC Mac OS X 10.4; en-US; rv:1.9b3) Gecko/2008020511 Firefox/3.0b3

Mozilla/5.0 (Macintosh; U; PPC Mac OS X 10.4; en-US; rv:1.9b5) Gecko/2008032619 Firefox/3.0b5

Mozilla/5.0 (Macintosh; U; PPC Mac OS X Mach-O 10.5; en; rv:1.9b2pre) Gecko/2007111101 Camino/2.0a1pre (like Firefox/3.0b2pre)

Firefox 3 BETA User Agent On Linux

Mozilla/5.0 (X11; U; Linux i686; en-GB; rv:1.9b4) Gecko/2008030318 Firefox/3.0b4

Mozilla/5.0 (X11; U; Linux i686; en-GB; rv:1.9b5) Gecko/2008041514 Firefox/3.0b5

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.14) Gecko/20080414 Firefox/3.0 blog/xinzhi.org

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9b2) Gecko/2007121016 Firefox/3.0b2

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9b3) Gecko/2008021416 Firefox/3.0b3

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9b4) Gecko/2008031317 Firefox/3.0b4

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9b5) Gecko/2008032619 Firefox/3.0b5

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9b5) Gecko/2008040514 Firefox/3.0b5

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9b5) Gecko/2008041514 Firefox/3.0b5

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9b5pre) Gecko/2008030913 Firefox/3.0b5pre (Swiftfox)

Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9pre) Gecko/2008040320 Firefox/3.0pre (Swiftfox)

Mozilla/5.0 (X11; U; Linux i686; nl; rv:1.9b5) Gecko/2008041514 Firefox/3.0b5

Mozilla/5.0 (X11; U; Linux i686; pl; rv:1.9b2) Gecko/2007121016 Firefox/3.0b2

Mozilla/5.0 (X11; U; Linux x86_64; en-GB; rv:1.9b5) Gecko/2008040514 Firefox/3.0b5

Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9b3) Gecko/2008021416 Firefox/3.0b3

Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9b3pre) Gecko/2008020509 Firefox/3.0b3pre

Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9b5) Gecko/2008040514 Firefox/3.0b5

Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9b5) Gecko/2008041515 Firefox/3.0b5

You can see here how Firefox’s BETA testing program has really taken off as only a few user agent were from the first build 3.0b1, but the vast majority are from the latest 3.0b4 and 3.0b5 builds.

Firefox 3 has some really great features, I’ve already become addicited to the idea of Googling stuff directly from the address bar, and the ability to see detailed site information by clicking the favicon is a genuis idea. But I have yet to deduce how Site Owner information is derived in Firefox 3. It doesnt seem to come from OWNER or AUTHOR META tags, so I’m puzzled. Anyone got any ideas?

BBC Flying Penguins April Fool - BBC iPlayer

Tuesday, April 1st, 2008

April Fools’ Day 2008 was pretty good I thought, Google made me laugh with Virgle and G’day and so did the BBC with this Flying Penguin April Fool commercial for iPlayer. The 90 second April Fool showed a newly discovered variety of Penguin that flies from the South Pole in Winter to live it up in the Amazon.

The first I heard of this was on The Telegraph website who got in on the joke and you know, for a split second I almost half caught myself maybe believing it despite knowing full well today is April Fools’ Day.

The Flying Penguins April Fool is the latest in a long line of BBC April Fools’ dating back to the 1950s and showed CGI penguins taking off in the Antarctic, flying over the tropics and eventually landing in thick Amazonian undergrowth.

BBC iPlayer Flying Penguin April FoolBBC iPlayer Flying Penguin April Fool

The Flying Penguin April Fool featured ex-Python Terry Jones and was created by blending library footage and footage shot on green-screen with CGI and was created entirely in-house at the BBC.

BBC iPlayer Flying Penguin April FoolBBC iPlayer Flying Penguin April Fool

Spaghetti Trees

In 1957 the BBC current affairs programme Panorama aired a 3min spoof report on how the Swiss were cultivating Spaghetti Trees as a April Fools’ Day joke. Hundreds of gullible viewers rang the BBC on April 2nd asking how to cultivate their own Spaghetti Trees and were told

“…place a sprig of spaghetti in a tin of tomato sauce and hope for the best.”

Success In Photographing A Fail Blog Fail

Monday, March 31st, 2008

Whilst driving about with Marcel (of Bromley) in Crawley yesterday we noticed this idiot who somehow succeeded in filling out a provisional license form but is a FAIL when it comes to attaching ‘learner plates’ correctly.

Learner Fail From FailBlog By Leyton Jay

Being a huge fan of The FAIL Blog I snapped a photo of this fail and sent it in this morning. Fail Blog published it this afternoon!

‘What is Fail Blog?‘ Well, it’s a momument to the human capacity to fail. It can be a fail in something easy, a fail in something hard or just a fail in the game of life itself. But Fail Blog is primary a Wordpress Blog so funny its made me cry on several occassions.

Here are some other great Fail Blog fails:Angular Fail
Asian Fail
About To Fail