Normalize HTML entities into unicode characters (sanitize)