I'm the CEO of Tailrank. This is my old (personal) blog. See my new blog over at Feedblog.org. Tailrank is proudly hosted by ServerBeach.


Bittorrent.com Uses Nutch, Tomcat, and Java.

Wow. I just did a search on Bittorrent.com and got a nice pretty exception!

java.lang.NoClassDefFoundError
at net.nutch.analysis.NutchAnalysis.compound(NutchAnalysis.java:250)
at net.nutch.analysis.NutchAnalysis.parse(NutchAnalysis.java:115)
at net.nutch.analysis.NutchAnalysis.parseQuery(NutchAnalysis.java:39)
at net.nutch.searcher.Query.parse(Query.java:395)
at org.apache.jsp.search_jsp._jspService(search_jsp.java:85)

Some other nice little tidbits of information.

They're running Tomcat 4.1. I couldn't begin to imagine how horrible it must be to run on this version of Tomcat. The recent 5.5.4 release still has major issues.

They're also using JSP, with Nutch as their search engine.

Nutch is pretty decent; I've reviewed most of the code. At Rojo we have different requirements, and most existing search technologies like Nutch and Heritrix wouldn't really work in our environment.

The interesting thing is that I heard Ask Jeeves was providing the Bittorrent crawl data. Does this mean that Ask Jeeves runs Nutch? Of course, maybe Ask just provided a flat-file dump of their data and Bittorrent re-indexed it with Nutch.
