Level: | SEVERE |
Message: | On crawl: NewsSohu
You must set the User-Agent and From HTTP header values to acceptable strings.
User-Agent: [software-name](+[info-url])[misc]
From: [email-address]
|
Exception: | org.archive.crawler.framework.exceptions.FatalConfigurationException: unacceptable 'user-agent' or 'from' (correct your configuration).
Stacktrace: org.archive.crawler.framework.exceptions.FatalConfigurationException: unacceptable 'user-agent' or 'from' (correct your configuration).
    at org.archive.crawler.datamodel.CrawlOrder.checkUserAgentAndFrom(CrawlOrder.java:458)
    at org.archive.crawler.framework.CrawlController.initialize(CrawlController.java:339)
    at org.archive.crawler.admin.CrawlJob.setupForCrawlStart(CrawlJob.java:853)
    at org.archive.crawler.admin.CrawlJobHandler.startNextJobInternal(CrawlJobHandler.java:1144)
    at org.archive.crawler.admin.CrawlJobHandler$3.run(CrawlJobHandler.java:1127)
    at java.lang.Thread.run(Thread.java:662)
|
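
The message above asks for a User-Agent of the form [software-name](+[info-url])[misc] and a From value that is an email address. The sketch below may help when checking candidate values before restarting the job. The class name CrawlerHeaderCheck, the two regular expressions, and the sample values are assumptions made for illustration; they approximate the format described in the message and are not confirmed to be the exact check performed in CrawlOrder.checkUserAgentAndFrom.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of the crawler: approximates the format
// that the alert message describes for the two header values.
final class CrawlerHeaderCheck {

    // Assumed pattern: a software name, then parentheses containing
    // "+" followed by an http(s) URL, with optional text around it.
    private static final Pattern USER_AGENT =
            Pattern.compile("\\S+.*\\(.*\\+https?://\\S+\\.\\S+.*\\).*");

    // Assumed pattern: something@host.tld with no whitespace.
    private static final Pattern FROM =
            Pattern.compile("\\S+@\\S+\\.\\S+");

    static boolean isAcceptableUserAgent(String value) {
        return value != null && USER_AGENT.matcher(value).matches();
    }

    static boolean isAcceptableFrom(String value) {
        return value != null && FROM.matcher(value).matches();
    }

    public static void main(String[] args) {
        // Example values only; substitute your own project URL and contact address.
        String userAgent = "Mozilla/5.0 (compatible; my-crawler/1.0 +http://example.org/crawler-info)";
        String from = "crawl-operator@example.org";

        System.out.println("user-agent acceptable: " + isAcceptableUserAgent(userAgent));
        System.out.println("from acceptable: " + isAcceptableFrom(from));
    }
}
```

Under these assumed patterns, a User-Agent that lacks a "+http://..." URL inside the parentheses, or a From value without an "@", would be rejected, which is consistent with the FatalConfigurationException reported for the 'user-agent' and 'from' settings.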