Blog Post RSS ?

Blogs » Web Tech » Structured data - set to take on Google
 

Structured data - set to take on Google

by David Peterson

After posting the Yahoo microsearch entry last night I had a thought. Google has already mastered web 1.0 style search. I use it all the time (even if I am not always thrilled with the results), rarely do I use Microsoft or Yahoo or anything else when I just want to do a quick search. So how can others compete? Even if Microsoft and Yahoo merge and Microhoo emerges, how can they take on Google. Well, with basic search I don’t think they can.

This is when Yahoo microsearch jumps up and starts waving its arms… “Why do a search on conference in San Francisco on Google and get this”:

Google search on conferences in San Francisco

“When you can do the same search on me, microsearch and get this:” (not sure why microsearch speaks in the first person)

microsearch on conferences in San Francisco

Note the map, the timeline and the little bar sitting above it all. It shows the amount of extracted metadata - the really useful bits. In the above search it pulled out 16 vCards and 182 events. It also makes all this metadata goodness available as RDF if you click the little box logo Semantic Web logo located at the top left.

Of course nothing stops Google from offering this (I am sure they are already thinking about it) which would be great. The thing that levels the playing field is the embedded structured data (microformats, RDF, RDFa, eRDF). This semantic data gives important information that doesn’t require amazing guesswork. No longer does a search engine have to guess what was meant, the author can explicitly say what they meant. That is an important difference. A critical difference.

I know this opens up a HUGE can of worms for spam, but with social networking and trust systems (FOAF, OpenID, OpenAuth) it can be counteracted. All embedded data will be indexed, but the new, future web search engines will have trust algorithms as their secret sauce. A new “Do I trust this?” or “Does my network trust this” filter will sit on top of this mass of information.

Google works with secret recipes that do a pretty good job with the huge mass of unstructured information, but when the importance of that starts to diminish where does that put Google? I am not going to shed any tears for them yet, it is still early days, but Yahoo microsearch is leading in the right direction.

The classic chicken and egg syndrome looks to finally be resolving itself; no one will go to the effort of embedding structured data if no one is using it or asking for it. Making this data a first class citizen will answer that problem. Microformats and Semantic Web metadata formats can easily live happily with each other and in fact can leverage each other strengths to make the web a more usable place for everyone.

If you liked this blog, share the love:

  • Save to Del.icio.us

This post has 5 responses so far

  1. […] I like David Peterson’s blog: yesterday I read an interesting article on how structured data (such as embedded meta or microformats) might take on Google. […]

     
  2. No longer does a search engine have to guess what was meant, the author can explicitly say what they meant. That is an important difference. A critical difference.

    Being an Internet Dinosaur (I can remember software development BEFORE the web), I recall that is precisely what search engines used to do. In the days of Alta-Vista, the author would use meta-tags to describe the content (their original purpose).

     
  3. […] Structured data - set to take on Google […]

     
  4. Thanks ParkinT for the comment :) It brought a smile to my face.

    I also remember the good old days of AltaVista and the good old days of meta tags — and the fact that they actually worked!

    That is an important challenge with embedded metadata, do we trust it? I think we are far enough along and can use social network like principals to filter out the rubbish and just return what “we” or our “trusted group” trusts.

     
  5. […] a post last month I mused that Yahoo or Microsoft might be the first to embrace the Data Web/Semantic Web. […]

     

Sponsored Links

Leave a response

You are not logged in, log in with your SitePoint Forum username and password.

-OR- Post Anonymously

* Make sure any code samples are escaped (i.e. ‘<b>’ becomes ‘&lt;b&gt;’).

If not logged in, your comments will be placed in a moderation queue. This means your comment may not appear until one of our moderators approves it.

SitePoint Marketplace

Buy and sell Websites, templates, domain names, hosting, graphics and more.

Logo Design, Web page Design and more!

99designs

  • Custom logo designs created ‘just for you’.
  • Pick the design you like best.
  • Only pay if you’re satisfied with the result.