Heritrix Study Notes 1: Heritrix-defined codes

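This post is the blogger's own translation; please credit the source when reposting. If anything is translated poorly, please point it out so it can be corrected. Thanks.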

1 Successful DNS lookup
The DNS lookup succeeded.

0 Fetch never tried (perhaps protocol unsupported or illegal URI)
The fetch was never attempted (the protocol may be unsupported, or the URI may be illegal).

-1 DNS lookup failed
The DNS lookup failed.

-2 HTTP connect failed
The HTTP connection could not be established.

-3 HTTP connect broken
The HTTP connection was established but then broke.

-4 HTTP timeout (before any meaningful response received)
The HTTP request timed out before any meaningful response was received.

-5 Unexpected runtime exception; see runtime-errors.log
An unexpected runtime exception occurred; the details are recorded in runtime-errors.log.

-6 Prerequisite domain-lookup failed, precluding fetch attempt
A prerequisite failed: the DNS lookup for the domain did not succeed, so no fetch was attempted.

-7 URI recognized as unsupported or illegal
The URI was recognized as unsupported or illegal.

-8 Multiple retries all failed, retry limit reached
Every retry failed and the retry limit (which is configurable) was reached.

-50 Temporary status assigned URIs awaiting preconditions; appearance in logs may be a bug
A temporary status assigned to URIs that are still waiting on preconditions (such as DNS); seeing it in the logs may indicate a bug.

-60 Failure status assigned URIs which could not be queued by the Frontier (and may in fact be unfetchable)
A failure status assigned to URIs that the Frontier (the scheduler) could not queue; such URIs may in fact be unfetchable.

-61 Prerequisite robots.txt-fetch failed, precluding a fetch attempt
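A prerequisite failed: robots.txt itself could not be fetched, so no fetch was attempted. This differs from -9998, where the robots.txt rules forbid the fetch.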

-62 Some other prerequisite failed, precluding a fetch attempt
Some other prerequisite failed (not the DNS lookup of -6 or the robots.txt fetch of -61), so no fetch was attempted.

-63 A prerequisite (of any type) could not be scheduled, precluding a fetch attempt
A prerequisite of any type (not only DNS) could not be scheduled, so no fetch was attempted.
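
Several of the codes above report that a prerequisite failed rather than the fetch itself. As a minimal sketch, the Python snippet below groups those codes so a log reader can tell the two cases apart. The names `PREREQUISITE_FAILURES` and `prerequisite_failure` are made up for this illustration and are not part of Heritrix.

```python
from typing import Optional

# Codes whose descriptions in the list above say that a prerequisite
# failed or could not be scheduled. The wording is paraphrased from that
# list; the dictionary itself is a hypothetical helper, not a Heritrix API.
PREREQUISITE_FAILURES = {
    -6: "prerequisite domain lookup failed",
    -61: "prerequisite robots.txt fetch failed",
    -62: "some other prerequisite failed",
    -63: "a prerequisite could not be scheduled",
}


def prerequisite_failure(code: int) -> Optional[str]:
    """Return a short reason if `code` is a prerequisite failure, else None."""
    return PREREQUISITE_FAILURES.get(code)
```

For example, `prerequisite_failure(-61)` returns the robots.txt reason, while `prerequisite_failure(-2)` returns `None` because -2 describes the HTTP connection itself.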

-3000 Severe Java 'Error' conditions (OutOfMemoryError, StackOverflowError, etc.) during URI processing.
-4000 'chaff' detection of traps/content of negligible value applied
-4001 Too many link hops away from seed
-4002 Too many embed/transitive hops away from last URI in scope
-5000 Out of scope upon reexamination (only happens if scope changes during crawl)
-5001 Blocked from fetch by user setting
-5002 Blocked by a custom processor
-5003 Blocked due to exceeding an established quota
-5004 Blocked due to exceeding an established runtime
-6000 Deleted from Frontier by user
-7000 Processing thread was killed by the operator (perhaps because of a hung condition)
-9998 Robots.txt rules precluded fetch
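
A common use of these codes is to count how often each one occurs in a crawl log. The sketch below assumes a whitespace-separated log in which the second field of each line is the status code; that layout and the sample lines are assumptions made for illustration, so check them against the log actually produced. The function name `tally_status_codes` is hypothetical.

```python
from collections import Counter


def tally_status_codes(lines):
    """Count status codes, assuming the code is the second field of each line."""
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or truncated lines
        try:
            counts[int(fields[1])] += 1
        except ValueError:
            continue  # second field is not an integer; skip the line
    return counts


# Made-up sample lines, for illustration only.
SAMPLE = [
    "2024-01-01T00:00:00.000Z 1 53 dns:example.com P http://example.com/ text/dns",
    "2024-01-01T00:00:01.000Z 200 1024 http://example.com/ - - text/html",
    "2024-01-01T00:00:02.000Z -2 - http://example.org/ - - unknown",
    "2024-01-01T00:00:03.000Z -9998 - http://example.com/private L http://example.com/ unknown",
]

if __name__ == "__main__":
    for code, count in sorted(tally_status_codes(SAMPLE).items()):
        print(code, count)
```

Malformed lines are skipped rather than raising, which keeps the tally usable on partially written logs.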
HTTP codes
1xx Informational
100 Continue
101 Switching Protocols
2xx Successful
200 OK
201 Created
202 Accepted
203 Non-Authoritative Information
204 No Content
205 Reset Content
206 Partial Content
3xx Redirection
300 Multiple Choices
301 Moved Permanently
302 Found
303 See Other
304 Not Modified
305 Use Proxy
307 Temporary Redirect
4xx Client Error
400 Bad Request
401 Unauthorized
402 Payment Required
403 Forbidden
404 Not Found
405 Method Not Allowed
406 Not Acceptable
407 Proxy Authentication Required
408 Request Timeout
409 Conflict
410 Gone
411 Length Required
412 Precondition Failed
413 Request Entity Too Large
414 Request-URI Too Long
415 Unsupported Media Type
416 Requested Range Not Satisfiable
417 Expectation Failed
5xx Server Error
500 Internal Server Error
501 Not Implemented
502 Bad Gateway
503 Service Unavailable
504 Gateway Timeout
505 HTTP Version Not Supported
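
Since the Heritrix-defined codes and the HTTP codes share one numeric field, a small classifier helps when reading mixed output. The sketch below follows the two lists above: 1, 0, and negative values are Heritrix-defined, and 100 to 599 are HTTP classes. The function name `classify_status` and the returned labels are assumptions for this example.

```python
def classify_status(code: int) -> str:
    """Label a status code as Heritrix-defined or as an HTTP status class."""
    if code == 1:
        return "heritrix: successful DNS lookup"
    if code == 0:
        return "heritrix: fetch never tried"
    if code < 0:
        return "heritrix: negative code (failure, block, or temporary status)"

    # HTTP status classes, keyed by the hundreds digit.
    http_classes = {
        1: "Informational",
        2: "Successful",
        3: "Redirection",
        4: "Client Error",
        5: "Server Error",
    }
    name = http_classes.get(code // 100)
    if name is None:
        return "unknown"
    return f"http: {code // 100}xx {name}"
```

For example, `classify_status(404)` yields `"http: 4xx Client Error"`, and `classify_status(-5003)` falls into the negative Heritrix group.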