Library inspired by https://github.com/mattn/go-runewidth and https://github.com/rivo/uniseg to handle the width and segmenting of Unicode runes and graphemes
Go to file
Vivianne 50f8bb891d
Add string->grapheme-list and char->grapheme
- string->grapheme-list gets only final graphemes without intermediate ones
- char->grapheme just works for single-chars that are valid graphemes.
2024-03-10 23:50:05 -04:00
build-aux Delete hall and its nonsense 2024-03-06 12:42:03 -05:00
doc Update doc stubs to remove hall and references to reflow 2024-03-06 12:43:47 -05:00
scripts Yet another reorganization, and solve Christine's 'rude problem' 2024-03-05 11:46:32 -05:00
tests Yet another reorganization, and solve Christine's 'rude problem' 2024-03-05 11:46:32 -05:00
uniseg Add string->grapheme-list and char->grapheme 2024-03-10 23:50:05 -04:00
.gitignore Add pre-inst-env 2024-03-06 12:58:37 -05:00
COPYING Initial commit 2024-02-29 13:03:58 -05:00
HACKING Update doc stubs to remove hall and references to reflow 2024-03-06 12:43:47 -05:00
Makefile.am Delete hall and its nonsense 2024-03-06 12:42:03 -05:00
README.org Renaming to uniseg 2024-03-03 13:13:22 -05:00
configure.ac Delete hall and its nonsense 2024-03-06 12:42:03 -05:00
guix.scm Update guix to get rid of hall cruft 2024-03-06 12:48:57 -05:00
pre-inst-env.in Add pre-inst-env 2024-03-06 12:58:37 -05:00
uniseg.scm Yet another reorganization, and solve Christine's 'rude problem' 2024-03-05 11:46:32 -05:00

README.org

README for Guile Uniseg

guile-uniseg is a simple Guile library that provides several new Guile character sets for categorizing East Asian characters and emoji. This can be useful when determining the width of characters.

This library is heavily inspired by the golang libraries uniseg and uniseg.

Scripts

There are two scripts which can be used to regenerate the character set files. These are optional to run, since the script's output is stored in the repository at uniseg/eastasian.scm and uniseg/emoji.scm.

However, should the source change as the unicode spec changes, these can be refreshed.