Wednesday, 15 June 2011

java - Connection Time out error for Crawler while using Jsoup for crawler -


I am trying to execute a crawler program from my office is a very basic one that is available on the Internet and Which works fine in my home PC However, when I'm trying to run the same program in my office PC, I am connecting with a timed error. I thought this was a proxy problem and some sites tried to reach the internal browser with eclipse and it worked fine too.

Document Doctor = Jsoup.connect ("http://flipkart.com/"). Timeout (0) .get ();

Please locate my stack trace

  Exception in the thread "main" java.net.ConnectException: The connection timed out: java. Connect on net. DualStackPlainSocketImpl .connect0 (native resident method) java.net.AbstractPlainSocketImpl.connect on java.net.AbstractPlainSocketImpl.connectToAddress on java.net.AbstractPlainSocketImpl.doConnect on java.net.DualStackPlainSocketImpl.socketConnect (unknown source) (unknown source) ) On java.net.PlainSocketImpl.connect (unknown source) on java.net.SocksSocketImpl.connect (unknown source) at java.net.Socket.connect (unknown source) at sun.net .netclient.ddo Connect (unknown sun) at sun.net.www.http.HttpClient.openServer (unknown source) at sun.net.www.http.HttpClient.openServer (unknown source) sun.net.www.http.HttpClient & Lt; Init & gt; (Unknown source at Sun.net.www.http.HttpClient.New at sun.net.www.http.HttpClient.New (unknown source) sun.net.www.protocol.http.HttpURLConnection.getNewH) Sun.net TtpClient (unknown source) at www.protocol.http Sun.co.uk at http: //www.protocol.plainConnect (unknown source) .www.protocol.http.HttpURLConnection.connect (or unknown source) at org.jsoup.helper.HttpConnection $ Response.execute (HttpConnection.java: 449) org.syntel.crawler.Crawler at org.jsoup org.jsoup.helper.HttpConnection.execute (HttpConnection.java181) at org.jsoup.helper.HttpConnection $ Response.execute (HttpConnection.java:434) Main on .helper.HttpConnection.get (HttpConnection.java:170) org.syntel.crawler.Crawler.processPage (Crawler.java:44) (Crawler.java20)  

How can I fix this problem?

@alkis suggested:

set up a user agent Try to FF You are probing a proxy using this other question:


No comments:

Post a Comment