Better search for the Twitter age

Search engines are cropping up to sift through constantly updated sites like Twitter, Flickr, and blogs, but they still have a long way to go.

By Jessi Hempel, writer

NEW YORK (Fortune) -- When Iran cracked down on journalists following its recent election, international focus turned to Twitter as citizen journalists posted 140-character reports and links to photos and videos to the site. Trouble was, it was hard to sift the useful and reliable nuggets of information from scores of tweets that included plenty of spam, useless remarks, and stray sentiments.

Few events more clearly define the newest problem on the Web: how to make sense of all the real-time information bubbling up from Twitter, Facebook, Flickr, blogs, and virtually every other self-publishing platform online. The sheer volume overwhelms.

Google (GOOG, Fortune 500) -- the tool consumers have relied on for a decade to organize the Internet for them -- isn't cutting it. At Google's Zeitgeist conference in May, co-founder Larry Page acknowledged the company has fallen behind Twitter, saying Google has "done a relatively poor job of doing things that work on a per second basis."

A fast-growing group of startups aspires to pick up the slack by offering real-time search results. With names like Collecta, OneRiot and Scoopler, these companies attempt to provide answers to the question: what is happening on the web right now? Nearly every week, a new one is added to the mix. IDC analyst Hadley Reynolds explains, "Yahoo, Microsoft and Google will take a while to figure out how to cope with this. There is definitely a window of business opportunity for startups."

But providing these results is not straightforward. Inevitably, there is tension between information that is most recent, stuff that is most popular, and a subjective concept of content that is most important. There is no perfect solution.

Information filtered only by time is nearly as unwieldy as the data stream itself. But once you begin to add filters, weighting the results according to the authority of the publisher, for example, or the rate at which they're spreading across the web, you risk missing important trends and information tidbits because they are not popular.

The bulk of this stream of constantly updating information comes through Twitter. The site's search engine, built on Summize, a startup Twitter purchased in 2008, turns up results filtered only by time. Collecta, a search firm started this month, also filters by time, but draws from blogs and other social media sites on the web as well.

Time might not be the magic filter for search results -- or the best way to make sense of the information stream. Many of these startups, like Scoopler, have developed algorithms that attempt to unearth not just the latest tweet, but also the most popular. Among the better known so far, OneRiot relies on a system called PulseRank that determines an item's rank by taking into account how fresh the information is, how much authority its author has, and how quickly it is spreading.
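To make the trade-off concrete, here is a minimal sketch of a score that blends the three signals named above: freshness, author authority, and how fast an item is spreading. It is not OneRiot's PulseRank, whose details are not described here; the `Item` fields, the weights, the one-hour half-life, and the sample data are all hypothetical values chosen for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Item:
    text: str
    age_seconds: float        # time since the item was posted
    author_authority: float   # hypothetical score between 0 and 1
    shares_per_minute: float  # hypothetical measure of how fast it spreads


def score(item, half_life_seconds=3600.0, w_fresh=0.4, w_auth=0.3, w_spread=0.3):
    """Blend freshness, authority, and spread into one number (illustrative weights)."""
    # Freshness halves every `half_life_seconds`.
    freshness = 0.5 ** (item.age_seconds / half_life_seconds)
    # Squash the spread rate into the range 0..1 so no single signal dominates.
    spread = 1.0 - math.exp(-item.shares_per_minute / 10.0)
    return w_fresh * freshness + w_auth * item.author_authority + w_spread * spread


def rank_by_time(items):
    """Newest first, the way a time-only filter orders results."""
    return sorted(items, key=lambda i: i.age_seconds)


def rank_by_score(items):
    """Highest blended score first."""
    return sorted(items, key=score, reverse=True)


# Made-up sample data, not real search results.
items = [
    Item("stray remark posted a minute ago", 60, 0.1, 0.0),
    Item("news story from an hour ago", 3600, 0.9, 20.0),
    Item("reference article from yesterday", 86400, 0.9, 0.0),
]

for item in rank_by_score(items):
    print(round(score(item), 3), item.text)
```

Under these assumed weights, the hour-old story from an authoritative, widely shared source outranks the minute-old remark, while a time-only ordering puts the remark first. Shifting the weights toward freshness reverses that, which is the tension between recency and importance in miniature.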

So far, however, none of these search engines is particularly good. The interfaces are largely gaudy and none of the results are reliably useful. The day after Bernie Madoff was sentenced to 150 years in prison, a quick search on a smattering of these sites revealed: a "goodbye Bernie" YouTube video, a tweet linking to a Daily Show video, and another tweet asking who should play Bernie Madoff in a potential movie.

Maybe even a short lag is what lets the web sort out which information is ultimately useful. A Google search the day after Madoff's sentencing yielded a Los Angeles Times news story on the hearing, followed by the fraudster's Wikipedia entry.


© 2009 Cable News Network. A Time Warner Company. All Rights Reserved.