charindex: Bold matching words, prefer matches in order with fewer intervening words, do not permute entries that have too much punctuation (#1433) * A more compact charindex * meow * Save another 433 kiB * Document the build process for charindex.html * meow * This is getting out of hand! Now there are two of them! * Link back from β, document * spots * Link to About from the top left * async/await, and a new loop * bold matching words * prefer more matches in the tail * Wrong set * Rank by ordered matches * Drop the logs * Do not permute punctuated entries nor split a parenthetical * Poorly wrapped comments * meow * meow * Intervening words * A bug and another sorting level * tail before head
This is a collection of tools by and for Unicode Character Database (UCD) maintainers for the production and vetting of data files for the UCD and other Unicode specs such as UCA, emoji, idna, and security.
Do not use the Unicode data files in this repo for production. Do use the data files posted publicly on unicode.org
There is some documentation for these tools in this repo, in the docs folder.
Some of the documentation still refers to the previous Subversion repository. This GitHub repo reflects the svn repo up to r1566, plus a few snapshots up to r1830. (Don’t ask.)
This repository includes the source for the tooling at https://util.unicode.org - see /UnicodeJsps
For feedback on the Unicode Standard and bug reports against the Unicode Character Database, use the Unicode Contact Form: https://www.unicode.org/reporting.html
Do not use the GitHub Issues feature in this repo for those. The tools maintainers use GH issues for issues with the code in this repo.
Copyright © 2001-2025 Unicode, Inc. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the United States and other countries.
A CLA is required to contribute to this project - please refer to the CONTRIBUTING.md file (or start a Pull Request) for more information.
The contents of this repository are governed by the Unicode Terms of Use and are released under LICENSE.