After posting the Yahoo microsearch entry last night I had a thought. Google has already mastered web 1.0 style search. I use it all the time (even if I am not always thrilled with the results), rarely do I use Microsoft or Yahoo or anything else when I just want to do a quick search. So how can others compete? Even if Microsoft and Yahoo merge and Microhoo emerges, how can they take on Google. Well, with basic search I don’t think they can.
This is when Yahoo microsearch jumps up and starts waving its arms… “Why do a search on conference in San Francisco on Google and get this”:
“When you can do the same search on me, microsearch and get this:” (not sure why microsearch speaks in the first person)
Note the map, the timeline and the little bar sitting above it all. It shows the amount of extracted metadata - the really useful bits. In the above search it pulled out 16 vCards and 182 events. It also makes all this metadata goodness available as RDF if you click the little box logo
located at the top left.
Of course nothing stops Google from offering this (I am sure they are already thinking about it) which would be great. The thing that levels the playing field is the embedded structured data (microformats, RDF, RDFa, eRDF). This semantic data gives important information that doesn’t require amazing guesswork. No longer does a search engine have to guess what was meant, the author can explicitly say what they meant. That is an important difference. A critical difference.
I know this opens up a HUGE can of worms for spam, but with social networking and trust systems (FOAF, OpenID, OpenAuth) it can be counteracted. All embedded data will be indexed, but the new, future web search engines will have trust algorithms as their secret sauce. A new “Do I trust this?” or “Does my network trust this” filter will sit on top of this mass of information.
Google works with secret recipes that do a pretty good job with the huge mass of unstructured information, but when the importance of that starts to diminish where does that put Google? I am not going to shed any tears for them yet, it is still early days, but Yahoo microsearch is leading in the right direction.
The classic chicken and egg syndrome looks to finally be resolving itself; no one will go to the effort of embedding structured data if no one is using it or asking for it. Making this data a first class citizen will answer that problem. Microformats and Semantic Web metadata formats can easily live happily with each other and in fact can leverage each other strengths to make the web a more usable place for everyone.










February 25th, 2008 at 8:18 am
[…] I like David Peterson’s blog: yesterday I read an interesting article on how structured data (such as embedded meta or microformats) might take on Google. […]
February 26th, 2008 at 6:38 am
Being an Internet Dinosaur (I can remember software development BEFORE the web), I recall that is precisely what search engines used to do. In the days of Alta-Vista, the author would use meta-tags to describe the content (their original purpose).
February 26th, 2008 at 6:46 am
[…] Structured data - set to take on Google […]
February 26th, 2008 at 11:51 am
Thanks ParkinT for the comment :) It brought a smile to my face.
I also remember the good old days of AltaVista and the good old days of meta tags — and the fact that they actually worked!
That is an important challenge with embedded metadata, do we trust it? I think we are far enough along and can use social network like principals to filter out the rubbish and just return what “we” or our “trusted group” trusts.
March 15th, 2008 at 1:17 am
[…] a post last month I mused that Yahoo or Microsoft might be the first to embrace the Data Web/Semantic Web. […]
May 23rd, 2008 at 5:23 pm
[…] have previously blogged about the new Yahoo! search platform called SearchMonkey. It is a major step in the right […]