Strip all the remaining HTML tag when keepHtml option is disabled #31

tzi · 2018-01-29T21:03:59Z

From #29

SL-Gundam · 2018-01-30T02:49:37Z

Here are some examples of left over html after parsing emails with Markdownify. Some of them are a big mess html wise.
Would also really like it if something could be done about the large amount of empty lines in the end result like in Variant1

I tried cleaning them of any sensitive information. Let me know If i overlooked anything.

The files are paired
_HTML is the html before Markdownify
_Markdownify is what Markdownify made of it after processing using
$html2markdown = new Markdownify\ConverterExtra();
$html2markdown->setKeepHTML( FALSE );
$body = $html2markdown->parseString( $body );
Variant1_HTML.txt
Variant1_Markdownify.txt

Variant2_HTML.txt
Variant2_Markdownify.txt

Variant3_HTML.txt
Variant3_Markdownify.txt

Variant4_HTML.txt
Variant4_Markdownify.txt

Variant5_HTML.txt
Variant5_Markdownify.txt