html2text man page

html2text — manual page for html2text 2016.9.19


html2text [(filename|url) [encoding]]



show program's version number and exit

-h, --help

show this help message and exit


The default alt string for images with missing ones


pad the cells to equal column width in tables


wrap links during conversion


don't include any formatting for emphasis


use reference style links instead of inline links


don't include any formatting for links


protect links from line breaks surrounding them with angle brackets


don't include any formatting for images


Discard image data, only keep alt text


Write image tags with height and width attrs as raw html to retain dimensions

-g, --google-doc

convert an html-exported Google Document

-d, --dash-unordered-list

use a dash rather than a star for unordered list items

-e, --asterisk-emphasis

use an asterisk rather than an underscore for emphasized text

-b BODY_WIDTH, --body-width=BODY_WIDTH

number of characters per output line, 0 for no wrap

-i LIST_INDENT, --google-list-indent=LIST_INDENT

number of pixels Google indents nested lists

-s, --hide-strikethrough

hide strike-through text. only relevant when -g is specified as well


Escape all special characters.  Output is less readable, but avoids corner case formatting issues.


Format tables in HTML rather than Markdown syntax.


Ignore table-related tags (table, th, td, tr) while keeping rows.


Use a single line break after a block element rather than two line breaks. NOTE: Requires --body-width=0


Use unicode throughout document


Do not use automatic links wherever applicable


Do not skip internal links


Put links after each paragraph instead of document


Mark program code blocks with [code]...[/code]


What to do in case of decode errors.'ignore', 'strict' and 'replace' are acceptable values


February 2018 html2text 2016.9.19 User Commands