Bittorrent.com Uses Nutch, Tomcat, and Java.
Wow. I just did a search on Bittorrent.com and got a nice pretty exception!
java.lang.NoClassDefFoundError
at net.nutch.analysis.NutchAnalysis.compound(NutchAnalysis.java:250)
at net.nutch.analysis.NutchAnalysis.parse(NutchAnalysis.java:115)
at net.nutch.analysis.NutchAnalysis.parseQuery(NutchAnalysis.java:39)
at net.nutch.searcher.Query.parse(Query.java:395)
at org.apache.jsp.search_jsp._jspService(search_jsp.java:85)
Some other nice little tidbits of information.
They're running Tomcat 4.1. I couldn't begin to imagine how horrible it must be to run on this version of Tomcat. The recent 5.5.4 release still has major issues.
They're also running JSP, and they're using Nutch as their search engine.
Nutch is pretty decent. I've reviewed most of the code. At Rojo we have different requirements, and most of the other search technologies like Nutch and Heritrix wouldn't really work in our environment.
The interesting thing is that I heard Ask Jeeves was providing the Bittorrent crawl data. Does this mean that Ask Jeeves runs Nutch? Of course, maybe Ask just provided a flat file dump of their data and Bittorrent re-indexed it.