Regular expressions are a powerful tool for using patterns to search and modify text, and are vital in many programs, programming languages, databases, and spreadsheets.Starting in 1999, UTS #18: Unicode Regular Expressions has supplied guidelines and conformance levels for supporting Unicode in regular expressions. The new version 21 broadens the scope of properties for regular expressions (regex) to allow for properties of strings (such as for emoji sequences). For example, the following matches all emoji flags except the French flag:
/[\p{RGI_Emoji_Flag_Sequence}--\q{🇫🇷}]/
Among the improvements are:- Provides a new Annex D: Resolving Character Classes with Strings for handling negations of sets of strings.
- Updates the full property list to include the latest UCD properties, plus Emoji properties and UTS #39 properties.
- Removes obsolete text passages, and makes editorial changes for clarity.
Over 140,000 characters are available for adoption to help the Unicode Consortium’s work on digitally disadvantaged languages
We are pleased to announce the following leadership additions at the Unicode Consortium. “Each of these individuals brings deep expertise in their field,” said Mark Davis, president of the Consortium. “They have already made significant improvements in their new roles.”
The Unicode 13.0 core specification is now available in paperback book form with a new, original cover design by Huijun Shan. This edition consists of a pair of modestly priced print-on-demand volumes containing the complete text of the core specification of Version 13.0 of the Unicode Standard.![[badge]](https://bestbuyvn.site/index.php/https://www.unicode.org/announcements/ynh-1fab4-potted-plant.png)
The Unicode Consortium has posted a new issue for public review and comment.