If, like me, you’re tired of running into loads of spam on Technorati and Feedster these days, you know how exciting the prospect of blog-only searching by the two big tyke search engines is. Well Yahoo’s making overtures (no pun intended).
If, like me, you’re tired of running into loads of spam on Technorati and Feedster these days, you know how exciting the prospect of blog-only searching by the two big tyke search engines is. At Technorati, I even have old posts coming up as new, occasionally followed by no search results at all, one time and huge list on the same results the next time. If this fascinating Business Week article is any indication, these scalability issues are anything but going away.
Google would be the first usual suspect to introduce this functionality with it’s acquisition of Blogger, but looks like Yahoo has been making most of the first strides (e.g., Creative Commons search).
Now, Steve Rubel exposes Yahoo’s stealth blog/RSS search engine, which has since been taken down. From Rubel’s screenshot, it looks like Yahoo simply applied its web search engine to a corpus of blog posts, i.e. yet another Feedster.
Read all about these interests from SearchEngineWatch here.
This post is tagged Musings


2 Comments
Like most people, I have been extremely frustrated with Technorati’s unreliable server behavior, especially the search feature and the feature that keeps track of in-links. It’s not easy running such technology behind the scenes, even with the latest hardware because blog search engines, unlike Google, have to update their data continuously.
Google has Google dance which comes by every week or so and does its jig. Technorati on the other hand is hit constantly by blog posts, and it has to sort and share this data almost real time. It’s a different ball game.
For instance, last week with the London bombing, the traffic exploded, taxing the Technorati system. Instead of the usual 800,000 new posts, Technorati was on track yesterday to process 1.2 million of them. It’ll be a pain ever for a Yahoo or Google to figure that out.
and then of course there’s the “spam log” problem! http://www.geektronica.com/2005-06-30-the-strange-world-of-blogspot-spam-blogs
Incoming Links