Mailing List Archive

[Bug 8218] HTML URIs with linebreaks not parsed with Content-Transfer-Encoding: quoted-printable
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=8218

Sidney Markowitz <sidney@sidney.com> changed:

What |Removed |Added
----------------------------------------------------------------------------
CC| |sidney@sidney.com

--- Comment #1 from Sidney Markowitz <sidney@sidney.com> ---
Created attachment 5942
--> https://bz.apache.org/SpamAssassin/attachment.cgi?id=5942&action=edit
Test email

Can you attach a full test email that demonstrates the problem? I made your
snippet into an email with quoted-printable and ran

./spamassassin -t -D uri,message < uriwraptest.eml

Did I miss something? I am testing using current trunk.

Here is the debug output showing the correct URL being parsed:

Mar 15 20:23:08.229 [99779] dbg: message: _decode_header date: Thu, 2 May 2002
00:02:49 +1200
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header subject: foo
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header to: <bar@example.org>
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header from:
<baz@example.com>
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header message-id:
<INTM-6516584-3669405-2002.08.01-16.21.51--f@example.com>
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header mime-version: 1.0
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header content-type:
text/html; charset=US-ASCII
Mar 15 20:23:08.229 [99779] dbg: message: _decode_header
content-transfer-encoding: quoted-printable
Mar 15 20:23:08.230 [99779] dbg: message: main message type: text/html
Mar 15 20:23:08.235 [99779] dbg: message: ---- MIME PARSER START ----
Mar 15 20:23:08.235 [99779] dbg: message: parsing normal part
Mar 15 20:23:08.235 [99779] dbg: message: storing a body to memory
Mar 15 20:23:08.235 [99779] dbg: message: ---- MIME PARSER END ----
Mar 15 20:23:08.235 [99779] dbg: message: decoding quoted-printable
Mar 15 20:23:08.235 [99779] dbg: message: contains only US-ASCII characters,
declared US-ASCII, not decoding
Mar 15 20:23:08.235 [99779] dbg: message: HTML::Parser utf8_mode off (default,
assumed Unicode characters)
Mar 15 20:23:08.236 [99779] dbg: message: spaces (octets) in HTML: 3 out of 21,
chars!?
Mar 15 20:23:08.242 [99779] dbg: uri: canonicalizing html uri:
http://hashbltest.surbl.org/example_uri
Mar 15 20:23:08.242 [99779] dbg: uri: cleaned uri:
http://hashbltest.surbl.org/example_uri
Mar 15 20:23:08.242 [99779] dbg: uri: added host: hashbltest.surbl.org domain:
surbl.org

--
You are receiving this mail because:
You are the assignee for the bug.