develooper Front page | perl.perl5.changes | Postings from January 2019

[perl.git] branch smoke-me/khw-turkish created.v5.29.7-45-gc7964c061e

From:
Karl Williamson
Date:
January 31, 2019 00:21
Subject:
[perl.git] branch smoke-me/khw-turkish created.v5.29.7-45-gc7964c061e
Message ID:
E1gp06j-0007Zj-MI@git.dc.perl.space
In perl.git, the branch smoke-me/khw-turkish has been created

<https://perl5.git.perl.org/perl.git/commitdiff/c7964c061e78c59f5f4fec9f57d173dda2943c26?hp=0000000000000000000000000000000000000000>

        at  c7964c061e78c59f5f4fec9f57d173dda2943c26 (commit)

- Log -----------------------------------------------------------------
commit c7964c061e78c59f5f4fec9f57d173dda2943c26
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 17:21:16 2019 -0700

    smoke

commit e5f51d143c5e1c175740f58246f46b1d87537b37
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 15:00:30 2019 -0700

    regcomp.c: Fix recent optimization of [...] bug
    
    This bug was introduced in b2296192536090829ba6d2cb367456f4e346dcc6
    n 5.29.7.  Using /il should not result in looking for a [:posix:] class
    that matches the code points given.

commit c1ec7af9297157670794d4415333938aa6859fb8
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 11:24:12 2019 -0700

    pp.c: Don't assume worst case memory needs
    
    Since 5.28, there has been a function that will calculate the expansion
    of a string when converted into UTF-8, using per-word operations.  This
    means it runs 8 times faster than doing this count previously would have
    taken.
    
    I've come to believe it is better to calculate how much memory we need
    than to overallocate based on worst-case scenarios.  This is because in
    very large strings, over allocating can lead to unnecessary inefficient
    processing.
    
    This commit changes several instances in pp.c where a string needs to be
    converted to UTF-8 to not assume the worst case, but instead calculate
    what's needed using the faster function.

commit 7f560c7a5c93dfa0f910fa6e9e7bbc30eba6cb1d
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 11:09:01 2019 -0700

    pp.c: Don't use function call for easy copy
    
    Like the previous commit, this code is adding the UTF-8 for a Greek
    character to a string.  It previously used Copy, but this character is
    representable as two bytes in both ASCII and EBCDIC UTF-8, the only
    character sets that Perl will ever supports, so we can use the
    specialized code that is used most everywhere else for two byte UTF-8
    characters, avoiding the function overhead, and having to treat this
    character as particularly special.

commit cae464ba1f9b89313f8f28aa4f0492af646b242b
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 10:52:41 2019 -0700

    pp.c: Don't use function call for easy copy
    
    This code is adding the UTF-8 for a Greek character to a string.  It
    previously used Copy, but this character is representable as two bytes
    in both ASCII and EBCDIC UTF-8, the only character sets that Perl will
    ever supports, so we can use the specialized code that is used most
    everywhere else for two byte UTF-8 characters, avoiding the function
    overhead, and having to treat this character as particularly special.

commit 95a61adcff6e0d31bf00149c02a796b62c3288f4
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 10:35:21 2019 -0700

    pp.c: pp_fc(): Simplify
    
    The function being called does everything that the code being eliminated
    here did.  We just pass the function the final destination instead of a
    temporary.

commit f5e87e8df25e7e29aedd9b1628d9ef07954c94d0
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 10:27:17 2019 -0700

    pp.c:  White-space, comments only

commit 37b53739d739f5609259bc949bc034d54e53986d
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 10:02:35 2019 -0700

    pp.c: Reorder && order in 'if'
    
    This makes the test most likely to fail be first, and adding an
    UNLIKELY() to it, thus saving a conditional in most instances.

commit 49cdf26e59b81773d81ca79006908728c03d2955
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 09:56:15 2019 -0700

    XXX move to /locale ws.  locale.c white space only

commit 01da0ec65d5b40c728db81ede3e7147b9d4c13f7
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 30 09:08:13 2019 -0700

    handy.h: Add comment

commit 2f18184efc45188d1aadb405b6eea5824b1035b5
Author: Karl Williamson <khw@cpan.org>
Date:   Tue Jan 29 22:25:03 2019 -0700

    t/re/fold_grind.pl: White-space only
    
    Just align some logical or clauses for readability.

commit 2f6a4bb7b66adbc6a9d9949367a7c7635c2cc585
Author: Karl Williamson <khw@cpan.org>
Date:   Tue Jan 29 22:02:59 2019 -0700

    pp.c: Use faster method to convert to UTF-8
    
    There is a special inline function that's used when converting a single
    byte to UTF-8, that is faster than the more general one used prior to
    this commit.

commit 02472c60ba605c46d031a1a16bcd5b66e922aa77
Author: Karl Williamson <khw@cpan.org>
Date:   Tue Jan 29 22:01:18 2019 -0700

    pp.c: Add missing assert
    
    The comments say there is an assert, but it wasn't there.

commit 149c0eb7421489764c82a208e27e816d3a8b0627
Author: Karl Williamson <khw@cpan.org>
Date:   Fri Jan 25 09:55:58 2019 -0700

    handy.h: White-space only
    
    Vertically align the ternary colon with the question mark above it.

commit a0a5ce2cd9198eaeb5e044e2016352e61ebab736
Author: Karl Williamson <khw@cpan.org>
Date:   Wed Jan 23 15:42:35 2019 -0700

    handy.h: Add void * casts to memEQ, memNE
    
    This change is to allow these macros to be called without having to do
    casting in the call.

commit 5a0e4ae3280cd1eb035406f683f46fb7855bc516
Author: Karl Williamson <khw@cpan.org>
Date:   Mon Jan 21 09:46:00 2019 -0700

    mktables: Make Turkic 'I' chars problematic
    
    In a Turkic locale, these are problematic because their mappings
    cross the 255/256 boundary.
    
    This change has the side effect of causing U+307 to be added to the
    problematic list, and it normally really isn't problematic, because in
    those locales where U+130 and U+131 are problematic, U+307 isn't used.
    But applications could switch in and out of Turkic locales, so it's best
    to leave it be considered problematic.  The consequences of making this
    mark problematic are simply slightly less optimized regex pattern code.

-----------------------------------------------------------------------

-- 
Perl5 Master Repository



nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About