Archive
Dear Lazy Web, Blog Search Engine
Dear Lazy Web,
We have a bunch of internal blogs at work, or we will soon. Problem is that we have many different ways for people to blog. We’re working on an official solution but there are other ways to blog. For example, some of the wiki’s have blog like features, some groups have set up their own servers, and many of the “collaboration” products out there have similar features. It would be really nice to allow people to post in the solution they like best, yet still have a central location for people to see what’s going on in our “blogosphere” so to speak.
A traditional aggregation solution like planet or Feedjack isn’t going to work because they won’t scale to the number of feeds we’d need to track. After a certain number of feeds are configured in the system its going to spend almost as much time (if not more) crawling the feeds as it would displaying them. Especially when you consider most of those feeds won’t have been updated, crawling all of them each time isn’t very efficient. Its become very clear that a solution more like Technorati is the direction we’d want to go. By only indexing sites when they “ping” it to tell it they have been updated the content can remain up to date without wasting time crawling pages that haven’t been updated.
I’m somewhat surprised that I wasn’t able to just find something to accomplish this task very quickly. It seems like it should already exist and a simple search over a freshmeat should have turned up several options.. I think I’m looking for the wrong things though because I haven’t found anything yet that does what I’d like it to. So dear lazy web what should I be looking for instead? I know it must be out there…