Mailing List Archive

[Bug 7656] UTF8 rules, normalize_charset etc overhaul
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7656

--- Comment #12 from Henrik Krohns <apache@hege.li> ---
Bumping this bug. Comments? Monologs are getting a bit tiresome.. :-)

--
You are receiving this mail because:
You are the assignee for the bug.
[Bug 7656] UTF8 rules, normalize_charset etc overhaul [ In reply to ]
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7656

Giovanni Bechis <giovanni@paclan.it> changed:

What |Removed |Added
----------------------------------------------------------------------------
CC| |giovanni@paclan.it

--- Comment #13 from Giovanni Bechis <giovanni@paclan.it> ---
I am +1 to enable "normalize_charset 1" on 4.0.0 by default,
I have it enabled for a long time in production without any issues.

--
You are receiving this mail because:
You are the assignee for the bug.
[Bug 7656] UTF8 rules, normalize_charset etc overhaul [ In reply to ]
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7656

--- Comment #14 from Henrik Krohns <apache@hege.li> ---
Good to hear, I cast my official +1 for normalize_charset 1 too.

There doesn't seen to be any dependencies, Encode::Detect can still remain
optional and required HTML::Parser 3.46 is from 2005..

Will check if there's anything in tests that should be changed.

--
You are receiving this mail because:
You are the assignee for the bug.
[Bug 7656] UTF8 rules, normalize_charset etc overhaul [ In reply to ]
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7656

Bill Cole <billcole@apache.org> changed:

What |Removed |Added
----------------------------------------------------------------------------
CC| |billcole@apache.org

--- Comment #15 from Bill Cole <billcole@apache.org> ---
(In reply to Henrik Krohns from comment #12)
> Bumping this bug. Comments? Monologs are getting a bit tiresome.. :-)

+1

The minor pain of revamping rules that match non-ASCII characters is
compensated by the fact that this is a *normalization* and so reduces the
frequency of edge cases that escape rules written (perhaps inadvertently) to
depend on a particular subset of possible encodings. My personal experience
running SA instances that see a lot of non-ASCII messages is that enabling
normalize_charset is a best practice, and the default is basically tech debt.

As for requiring discussion on-list, these comments are sent to the dev list.
I'm going to bump it there to get the attention of anyone filtering out
Bugzilla mail (!? if that's a thing...) and will also post on the Users list to
get a broader audience.

--
You are receiving this mail because:
You are the assignee for the bug.