#
bfcb754e |
| 14-Oct-2010 |
Gustavo André dos Santos Lopes |
- Fixed get_next_char(), used by htmlentities/htmlspecialchars, accepting certain ill-formed UTF-8 sequences.
|
#
99b613cb |
| 12-Oct-2010 |
Gustavo André dos Santos Lopes |
- Added a 3rd parameter to get_html_translation_table. It now takes a charset hint, like htmlentities et al. - Fixed bug #49407 (get_html_translation_table doesn't handle UTF-8). - Fixe
- Added a 3rd parameter to get_html_translation_table. It now takes a charset hint, like htmlentities et al. - Fixed bug #49407 (get_html_translation_table doesn't handle UTF-8). - Fixed bug #25927 (get_html_translation_table calls the ' ' instead of '). - Fixed tests for get_html_translation_table and unified the Windows and non-Windows versions of the tests.
show more ...
|
#
4de6c3a9 |
| 12-Oct-2010 |
Gustavo André dos Santos Lopes |
- Added a 3rd parameter to get_html_translation_table. It now takes a charset hint, like htmlentities et al. - Fixed bug #49407 (get_html_translation_table doesn't handle UTF-8). - Fixe
- Added a 3rd parameter to get_html_translation_table. It now takes a charset hint, like htmlentities et al. - Fixed bug #49407 (get_html_translation_table doesn't handle UTF-8). - Fixed bug #25927 (get_html_translation_table calls the ' ' instead of '). - Fixed tests for get_html_translation_table and unified the Windows and non-Windows versions of the tests.
show more ...
|
#
f4a896c2 |
| 11-Oct-2010 |
Gustavo André dos Santos Lopes |
- PHP uses a big endian representation when it converts the code unit sequences to integers so as to store the entity maps. Code in traverse_for_entities assumed little endian. Fixe
- PHP uses a big endian representation when it converts the code unit sequences to integers so as to store the entity maps. Code in traverse_for_entities assumed little endian. Fixed. (in practice, due to the absence of unicode and entity mappings for multi-byte encodings -- except UTF-8 --, this doesn't matter, so the relevant code was commented out for performance reasons).
show more ...
|
#
7aa43a8d |
| 10-Oct-2010 |
Gustavo André dos Santos Lopes |
- Revamp of the decoding portion of html.c. - Dramatic improvements on the performance of html_entity_decode and htmlspecialchars_decode, as the string is now traversed only once. Speedups
- Revamp of the decoding portion of html.c. - Dramatic improvements on the performance of html_entity_decode and htmlspecialchars_decode, as the string is now traversed only once. Speedups of 20 to 25 times with Windows release builds and a ~250 characters string (for 2nd and subsequent calls). - Consistent behavior on html_entity_decode. For instance, the entity in "&<" would be decoded, but not "&é". Not anymore. The code path for "basic" and non-basic entities is now mostly shared. - Code of html_entity_decode and htmlspecialchars_decode is now shared. - [DOC] More consistent behavior of htmlspecialchars_decode. Instead of translating only <, >, &, ", ' and ', now e.g. ", ', ', ', etc. are also decoded. - [DOC] Previous translation of unicode code points in numerical entities was seriously broken. When the code points for some character were not the same in unicode and the target encoding, the behavior could be an erroneous translation (e.g. 0x80-0xA0 in win-1252) or no translation at all. Added unicode translation tables for all single-byte encodings. Entities are not translated for multi-byte entities, except for ASCII characters whose code points are shared. We could add the huge translation tables (several thousand elements) for those encodings in the future. - Fixed numerical entities that after # had text accepted by strcol being accepted. - Much more commented and well-structured code... - Tests for get_html_translation_table()) are broken. I stared fixing the tests, but then I realized it was completely helpless because get_html_translation_table() is broken by not handling multi-byte characters correctly.
show more ...
|
#
4e2fd7ee |
| 08-Oct-2010 |
Gustavo André dos Santos Lopes |
- Fixed a typo in rev #304208 (24 instead of 34/'"'). - Improved the test bug53021.phpt to reflect other fixes in rev #304208. - Updated NEWS to reflect other fixes in rev #304208.
|
#
dd5d1b2b |
| 08-Oct-2010 |
Gustavo André dos Santos Lopes |
- Fixed a typo in rev #304208 (24 instead of 34/'"'). - Improved the test bug53021.phpt to reflect other fixes in rev #304208. - Updated NEWS to reflect other fixes in rev #304208.
|
#
485a0943 |
| 08-Oct-2010 |
Gustavo André dos Santos Lopes |
- Fixed bug #53021 (In html_entity_decode, failure to convert numeric entities with ENT_NOQUOTES and ISO-8859-1).
|
#
df428304 |
| 08-Oct-2010 |
Gustavo André dos Santos Lopes |
- Fixed bug #53021 (In html_entity_decode, failure to convert numeric entities with ENT_NOQUOTES and ISO-8859-1).
|
#
cb500110 |
| 23-Sep-2010 |
Kalle Sommer Nielsen |
Fixed compiler warnings in the standard library
|
Revision tags: oci8-1.4.3, php-5.2.14, php-5.3.3, php-5.3.3RC3, php-5.2.14RC3, php-5.3.3RC2, php-5.2.14RC2, php-5.3.3RC1, php-5.2.14RC1 |
|
#
906dd4ea |
| 23-Mar-2010 |
Rasmus Lerdorf |
Switch default_charset, if not specified, from ISO-8859-1 to UTF-8 I have been wanting to make this change for years, but there is a small chance of BC issues, so it shouldn't go into a minor
Switch default_charset, if not specified, from ISO-8859-1 to UTF-8 I have been wanting to make this change for years, but there is a small chance of BC issues, so it shouldn't go into a minor release.
show more ...
|
#
73ba4956 |
| 12-Mar-2010 |
Moriyoshi Koizumi |
- Forgot to commit this patch. Sorry.
|
Revision tags: php-5.3.2, php-5.2.13, php-5.3.2RC3, php-5.3.2RC2, php-5.2.13RC2, php-5.2.13RC1 |
|
#
9ba1e816 |
| 03-Jan-2010 |
Sebastian Bergmann |
sed -i "s#1997-2009#1997-2010#g" **/*.c **/*.h **/*.php
|
Revision tags: php-5.3.2RC1 |
|
#
7d9a7dba |
| 22-Dec-2009 |
Moriyoshi Koizumi |
- Fix bug #46478 (htmlentities() uses obsolete mapping table for character entity references)
|
Revision tags: php-5.2.12, php-5.2.12RC4 |
|
#
413196c5 |
| 07-Dec-2009 |
Moriyoshi Koizumi |
- Take account of surrogate pairs.
|
Revision tags: php-5.2.12RC3, php-5.2.12RC2, php-5.3.1, php-5.3.1RC4, php-5.2.12RC1, php-5.3.1RC3, php-5.3.1RC2 |
|
#
20737bac |
| 13-Oct-2009 |
Moriyoshi Koizumi |
- Bug #49785: take 5. What the hell happened to me...
|
#
884cf3f1 |
| 12-Oct-2009 |
Moriyoshi Koizumi |
- Bug #49785: take 4 - typo. this flaw is unharmful since the return value of get_next_char() is only used when UTF-8 is specified to the third argument.
|
#
1835a63d |
| 11-Oct-2009 |
Moriyoshi Koizumi |
- A couple more fix for my previous fix. (one of the fix by Arnaud Le Blanc. Thanks!)
|
#
9d198664 |
| 09-Oct-2009 |
Moriyoshi Koizumi |
- Fixed bug #49785 (insufficient input string validation of htmlspecialchars()).
|
Revision tags: oci8-1.4.0, php-5.2.11, php-5.2.11RC3, php-5.3.1RC1, php-5.2.11RC2, php-5.2.11RC1, php-5.3.0, php-5.3.0RC4, php-5.2.10, php-5.2.10RC2, php-5.3.0RC3, php-5.2.10RC1, php-5.3.0RC2, php-5.3.0RC1, RELEASE_1_3_5, php-5.2.9, php-5.2.9RC3, php-5.2.9RC2, php-5.2.9RC1, php-5.3.0beta1 |
|
#
08659c2d |
| 31-Dec-2008 |
Sebastian Bergmann |
MFH: Bump copyright year, 3 of 3.
|
Revision tags: NEWS, php-5.2.8, BEFORE_HEAD_NS_CHANGES_MERGE, php-5.3.0alpha3, php-5.3.0alpha2, php-5.2.7, php-5.2.7RC5 |
|
#
18794add |
| 26-Nov-2008 |
Arnaud Le Blanc |
MFH: Added ENT_IGNORE as a compatibility flag for htmlentities() and htmlspecialchars() to skip multibyte sequences intead of returning an empty string (as iconv's //IGNORE). These functions
MFH: Added ENT_IGNORE as a compatibility flag for htmlentities() and htmlspecialchars() to skip multibyte sequences intead of returning an empty string (as iconv's //IGNORE). These functions will still never return an invalid or incomplete multibyte sequence. Fixes #43896
show more ...
|
#
a05edaf2 |
| 26-Nov-2008 |
Arnaud Le Blanc |
MFB 5.2
|
Revision tags: php-5.2.7RC4, BEFORE_HEAD_NS_CHANGE, BEFORE_NS_RULES_CHANGE, php-5.2.7RC3, php-5.2.7RC2 |
|
#
d69dfa4b |
| 21-Oct-2008 |
Arnaud Le Blanc |
MFH: initialize optional vars
|
Revision tags: php-5.2.7RC1 |
|
#
06998948 |
| 18-Aug-2008 |
Moriyoshi Koizumi |
- MFH: beware of signedness
|
#
71e50de4 |
| 10-Aug-2008 |
Arnaud Le Blanc |
MFH: Fixed bug #45581 (htmlspecialchars() double encoding &#x hex items)
|