html格式图片转换为pdf,html转为pdf格式 或者 图片格式

工具地址:

安装工具后,用起来非常简单

linux版本:

wkhtmltopdf google.pdf

也可以直接转换HTML文件

./wkhtmltopdf --javascript-delay 2000 file:///home/zjx/function/html_result.html baidu4.pdf

--javascript-delay 2000  这个是延迟2000毫秒后,再保存,是给网页加载的时间。

WINDOWS 版本:

安装:wkhtmltox-0.12.3.2_msvc2013-win64.exe,cmd命令进入安装目录

运行:wkhtmltopdf.exe [参数,可选,可多个;wkhtmltopdf中文参数详解]

a.例子: wkhtmltopdf.exe –page-size A4 test.pdf

附上可用的选项:

Name:

wkhtmltopdf 0.12.4 (with patched qt)

Synopsis:

wkhtmltopdf [GLOBAL OPTION]... [OBJECT]...

Document objects:

wkhtmltopdf is able to put several objects into the output file, an object is

either a single webpage, a cover webpage or a table of content. The objects

are put into the output document in the order they are specified on the

command line, options can be specified on a per object basis or in the global

options area. Options from the Global Options section can only be placed in

the global options area

A page objects puts the content of a single webpage into the output document.

(page)? [PAGE OPTION]...

Options for the page object can be placed in the global options and the page

options areas. The applicable options can be found in the Page Options and

Headers And Footer Options sections.

A cover objects puts the content of a single webpage into the output document,

the page does not appear in the table of content, and does not have headers

and footers.

cover [PAGE OPTION]...

All options that can be specified for a page object can also be specified for

a cover.

A table of content object inserts a table of content into the output document.

toc [TOC OPTION]...

All options that can be specified for a page object can also be specified for

a toc, further more the options from the TOC Options section can also be

applied. The table of content is generated via XSLT which means that it can be

styled to look however you want it to look. To get an idea of how to do this

you can dump the default xslt document by supplying the

--dump-default-toc-xsl, and the outline it works on by supplying

--dump-outline, see the Outline Options section.

Description:

Converts one or more HTML pages into a PDF document, using wkhtmltopdf patched

qt.

Global Options:

--collate Collate when printing multiple copies

(default)

--no-collate Do not collate when printing multiple

copies

--cookie-jar Read and write cookies from and to the

supplied cookie jar file

--copies Number of copies to print into the pdf

file (default 1)

-d, --dpi Change the dpi explicitly (this has no

effect on X11 based systems) (default 96)

-H, --extended-help Display more extensive help, detailing

less common command switches

-g, --grayscale PDF will be generated in grayscale

-h, --help Display help

--htmldoc Output program html help

--image-dpi When embedding images scale them down to

this dpi (default 600)

--image-quality When jpeg compressing images use this

quality (default 94)

--license Output license information and exit

-l, --lowquality Generates lower quality pdf/ps. Useful to

shrink the result document space

--manpage Output program man page

-B, --margin-bottom Set the page bottom margin

-L, --margin-left Set the page left margin (default 10mm)

-R, --margin-right Set the page right margin (default 10mm)

-T, --margin-top Set the page top margin

-O, --orientation Set orientation to Landscape or Portrait

(default Portrait)

--page-height Page height

-s, --page-size Set paper size to: A4, Letter, etc.

(default A4)

--page-width Page width

--no-pdf-compression Do not use lossless compression on pdf

objects

-q, --quiet Be less verbose

--read-args-from-stdin Read command line arguments from stdin

--readme Output program readme

--title The title of the generated pdf file (The

title of the first document is used if not

specified)

--use-xserver Use the X server (some plugins and other

stuff might not work without X11)

-V, --version Output version information and exit

Outline Options:

--dump-default-toc-xsl Dump the default TOC xsl style sheet to

stdout

--dump-outline Dump the outline to a file

--outline Put an outline into the pdf (default)

--no-outline Do not put an outline into the pdf

--outline-depth Set the depth of the outline (default 4)

Page Options:

--allow Allow the file or files from the specified

folder to be loaded (repeatable)

--background Do print background (default)

--no-background Do not print background

--bypass-proxy-for Bypass proxy for host (repeatable)

--cache-dir Web cache directory

--checkbox-checked-svg Use this SVG file when rendering checked

checkboxes

--checkbox-svg Use this SVG file when rendering unchecked

checkboxes

--cookie Set an additional cookie (repeatable),

value should be url encoded.

--custom-header Set an additional HTTP header (repeatable)

--custom-header-propagation Add HTTP headers specified by

--custom-header for each resource request.

--no-custom-header-propagation Do not add HTTP headers specified by

--custom-header for each resource request.

--debug-javascript Show javascript debugging output

--no-debug-javascript Do not show javascript debugging output

(default)

--default-header Add a default header, with the name of the

page to the left, and the page number to

the right, this is short for:

--header-left='[webpage]'

--header-right='[page]/[toPage]' --top 2cm

--header-line

--encoding Set the default text encoding, for input

--disable-external-links Do not make links to remote web pages

--enable-external-links Make links to remote web pages (default)

--disable-forms Do not turn HTML form fields into pdf form

fields (default)

--enable-forms Turn HTML form fields into pdf form fields

--images Do load or print images (default)

--no-images Do not load or print images

--disable-internal-links Do not make local links

--enable-internal-links Make local links (default)

-n, --disable-javascript Do not allow web pages to run javascript

--enable-javascript Do allow web pages to run javascript

(default)

--javascript-delay Wait some milliseconds for javascript

finish (default 200)

--keep-relative-links Keep relative external links as relative

external links

--load-error-handling Specify how to handle pages that fail to

load: abort, ignore or skip (default

abort)

--load-media-error-handling Specify how to handle media files

that fail to load: abort, ignore or skip

(default ignore)

--disable-local-file-access Do not allowed conversion of a local file

to read in other local files, unless

explicitly allowed with --allow

--enable-local-file-access Allowed conversion of a local file to read

in other local files. (default)

--minimum-font-size Minimum font size

--exclude-from-outline Do not include the page in the table of

contents and outlines

--include-in-outline Include the page in the table of contents

and outlines (default)

--page-offset Set the starting page number (default 0)

--password HTTP Authentication password

--disable-plugins Disable installed plugins (default)

--enable-plugins Enable installed plugins (plugins will

likely not work)

--post Add an additional post field (repeatable)

--post-file Post an additional file (repeatable)

--print-media-type Use print media-type instead of screen

--no-print-media-type Do not use print media-type instead of

screen (default)

-p, --proxy Use a proxy

--radiobutton-checked-svg Use this SVG file when rendering checked

radiobuttons

--radiobutton-svg Use this SVG file when rendering unchecked

radiobuttons

--resolve-relative-links Resolve relative external links into

absolute links (default)

--run-script Run this additional javascript after the

page is done loading (repeatable)

--disable-smart-shrinking Disable the intelligent shrinking strategy

used by WebKit that makes the pixel/dpi

ratio none constant

--enable-smart-shrinking Enable the intelligent shrinking strategy

used by WebKit that makes the pixel/dpi

ratio none constant (default)

--stop-slow-scripts Stop slow running javascripts (default)

--no-stop-slow-scripts Do not Stop slow running javascripts

--disable-toc-back-links Do not link from section header to toc

(default)

--enable-toc-back-links Link from section header to toc

--user-style-sheet Specify a user style sheet, to load with

every page

--username HTTP Authentication username

--viewport-size <> Set viewport size if you have custom

scrollbars or css attribute overflow to

emulate window size

--window-status Wait until window.status is equal to this

string before rendering page

--zoom Use this zoom factor (default 1)

Headers And Footer Options:

--footer-center Centered footer text

--footer-font-name Set footer font name (default Arial)

--footer-font-size Set footer font size (default 12)

--footer-html Adds a html footer

--footer-left Left aligned footer text

--footer-line Display line above the footer

--no-footer-line Do not display line above the footer

(default)

--footer-right Right aligned footer text

--footer-spacing Spacing between footer and content in mm

(default 0)

--header-center Centered header text

--header-font-name Set header font name (default Arial)

--header-font-size Set header font size (default 12)

--header-html Adds a html header

--header-left Left aligned header text

--header-line Display line below the header

--no-header-line Do not display line below the header

(default)

--header-right Right aligned header text

--header-spacing Spacing between header and content in mm

(default 0)

--replace Replace [name] with value in header and

footer (repeatable)

TOC Options:

--disable-dotted-lines Do not use dotted lines in the toc

--toc-header-text The header text of the toc (default Table

of Contents)

--toc-level-indentation For each level of headings in the toc

indent by this length (default 1em)

--disable-toc-links Do not link from toc to sections

--toc-text-size-shrink For each level of headings in the toc the

font is scaled by this factor (default

0.8)

--xsl-style-sheet Use the supplied xsl style sheet for

printing the table of content

Page sizes:

The default page size of the rendered document is A4, but using this

--page-size optionthis can be changed to almost anything else, such as: A3,

Letter and Legal. For a full list of supported pages sizes please see

.

For a more fine grained control over the page size the --page-height and

--page-width options may be used

Reading arguments from stdin:

If you need to convert a lot of pages in a batch, and you feel that

wkhtmltopdf is a bit too slow to start up, then you should try

--read-args-from-stdin,

When --read-args-from-stdin each line of input sent to wkhtmltopdf on stdin

will act as a separate invocation of wkhtmltopdf, with the arguments specified

on the given line combined with the arguments given to wkhtmltopdf

For example one could do the following:

echo " qapplication.pdf" >> cmds

echo "cover google.com (software) qt.pdf" >> cmds

wkhtmltopdf --read-args-from-stdin --book < cmds

Specifying A Proxy:

By default proxy information will be read from the environment variables:

proxy, all_proxy and http_proxy, proxy options can also by specified with the

-p switch

:= "http://" | "socks5://"

:= (":" )? "@"

:= "None" | ? ? (":" )?

Here are some examples (In case you are unfamiliar with the BNF):

socks5://myproxyserver

None

Footers And Headers:

Headers and footers can be added to the document by the --header-* and

--footer* arguments respectfully. In header and footer text string supplied

to e.g. --header-left, the following variables will be substituted.

* [page] Replaced by the number of the pages currently being printed

* [frompage] Replaced by the number of the first page to be printed

* [topage] Replaced by the number of the last page to be printed

* [webpage] Replaced by the URL of the page being printed

* [section] Replaced by the name of the current section

* [subsection] Replaced by the name of the current subsection

* [date] Replaced by the current date in system local format

* [isodate] Replaced by the current date in ISO 8601 extended format

* [time] Replaced by the current time in system local format

* [title] Replaced by the title of the of the current page object

* [doctitle] Replaced by the title of the output document

* [sitepage] Replaced by the number of the page in the current site being converted

* [sitepages] Replaced by the number of pages in the current site being converted

As an example specifying --header-right "Page [page] of [toPage]", will result

in the text "Page x of y" where x is the number of the current page and y is

the number of the last page, to appear in the upper left corner in the

document.

Headers and footers can also be supplied with HTML documents. As an example

one could specify --header-html header.html, and use the following content in

header.html:

function subst() {

var vars = {};

var query_strings_from_url = document.location.search.substring(1).split('&');

for (var query_string in query_strings_from_url) {

if (query_strings_from_url.hasOwnProperty(query_string)) {

var temp_var = query_strings_from_url[query_string].split('=', 2);

vars[temp_var[0]] = decodeURI(temp_var[1]);

}

}

var css_selector_classes = ['page', 'frompage', 'topage', 'webpage', 'section', 'subsection', 'date', 'isodate', 'time', 'title', 'doctitle', 'sitepage', 'sitepages'];

for (var css_class in css_selector_classes) {

if (css_selector_classes.hasOwnProperty(css_class)) {

var element = document.getElementsByClassName(css_selector_classes[css_class]);

for (var j = 0; j < element.length; ++j) {

element[j].textContent = vars[css_selector_classes[css_class]];

}

}

}

}Page of

As can be seen from the example, the arguments are sent to the header/footer

html documents in get fashion.

Outlines:

Wkhtmltopdf with patched qt has support for PDF outlines also known as book

marks, this can be enabled by specifying the --outline switch. The outlines

are generated based on the tags, for a in-depth description of how this

is done see the Table Of Content section.

The outline tree can sometimes be very deep, if the tags where spread to

generous in the HTML document. The --outline-depth switch can be used to

bound this.

Table Of Content:

A table of content can be added to the document by adding a toc object to the

command line. For example:

wkhtmltopdf toc qstring.pdf

The table of content is generated based on the H tags in the input documents.

First a XML document is generated, then it is converted to HTML using XSLT.

The generated XML document can be viewed by dumping it to a file using the

--dump-outline switch. For example:

wkhtmltopdf --dump-outline toc.xml qstring.pdf

The XSLT document can be specified using the --xsl-style-sheet switch. For

example:

wkhtmltopdf toc --xsl-style-sheet my.xsl qstring.pdf

The --dump-default-toc-xsl switch can be used to dump the default XSLT style

sheet to stdout. This is a good start for writing your own style sheet

wkhtmltopdf --dump-default-toc-xsl

The XML document is in the namespace "" it has a

root node called "outline" which contains a number of "item" nodes. An item

can contain any number of item. These are the outline subsections to the

section the item represents. A item node has the following attributes:

* "title" the name of the section.

* "page" the page number the section occurs on.

* "link" a URL that links to the section.

* "backLink" the name of the anchor the section will link back to.

The remaining TOC options only affect the default style sheet so they will not

work when specifying a custom style sheet.

Contact:

If you experience bugs or want to request new features please visit

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值