r12a

Apps and information related to language, writing systems, and Unicode. Made for myself in my spare time, but free for all to use if it helps.

2025-06-17

UNIKEMET DATA in the EGYPTIAN HIEROGLYPH PICKER

When you bring up detailed notes about a character it now shows Unikemet data.

The Unikemet data is also available in UniView.

Try it at r12a.github.io/pickers/egyp/

2025-06-16

EGYPTIAN HIEROGLYPH PICKER
r12a.github.io/pickers/egyp/?s

Rewritten to fix bugs, update, & add new features:

- new term list with 3033 entries (though most still need to have hieroglyphs, etc, added)
- MdC to WikiHiero converter displays results in a popup
- mousing over shape selectors shows category name
- all Unicode format controls can be found under MORE CONTROLS
- mousing over a character reveals its description as well as the code point, and its MdC notation

Extended-A is not yet supported.

2025-05-20

UNIVIEW NOW SUPPORTS UNICODE17 beta

r12a.github.io/uniview/

Orthography notes, plus term lists, & pickers, are also available for 3 of the new scripts:

- Beria Erfe r12a.github.io/scripts/berf/za
- Tolong Siki r12a.github.io/scripts/tols/kr
- Tai Yo r12a.github.io/scripts/tayo/ty

2025-05-20

Chakma orthography notes updated.
r12a.github.io/scripts/cakm/cc

There is still much more to understand about how the Chakma script works, but the notes have been updated to talk about simplified vs traditional orthographies, and some clarifications around onsets, finals, and consonant clusters.

There may also be a need for a new character.

Changes also incorporate general revisions that are slowly being rolled out across all pages.

2025-04-26

INDIC orthography notes sections on vowels, consonants, encoding choices, and shaping have been largely rewritten for Hindi, Malayalam, Gujarati, Kashmiri (deva), Tamil, Bengali, Odia, Punjabi, Telugu, and Sinhala.

r12a.github.io/scripts/index.h

Most term lists for these orthographies have also seen a significant increase in the number of terms and transcriptions.

2025-04-19

GREEK term list now has 19,074 entries (up from around 250). You may need to be a little patient for it to parse all the data when loading or searching.

r12a.github.io/scripts/grek/el

There were also a few small changes to the orthography notes.

r12a.github.io/scripts/grek/el

2025-04-17

@rajeesh Thanks! Fixed.

2025-04-16

MALAYALAM orthography notes have received significant updates through most of the character related sections.

The term list has also been increased to almost 9,000 items.

r12a.github.io/scripts/mlym/ml
r12a.github.io/scripts/mlym/ml

r12a boosted:
W3C Internationalization, i18nwebi18n@w3c.social
2025-04-16

WEB I18N GAP FIXED: Greek case conversion

WebKit now supports the following Greek specific behaviours for text-transform: uppercase:
- words that are all uppercase lose tonos
- for diphthongs with tonos on the first character the tonos is removed but a dialytika is added above the second character in the diphthong
- a disjunctive eta in a Greek sentence keeps its tonos diacritic

This behaviour is now interoperable across WebKit, Gecko, and Blink browsers.

More details: w3.org/TR/grek-gap/#issue15_tr

Greek text: ήσουν ή εγώ ή εσύ ΗΣΟΥΝ Ή ΕΓΩ Ή ΕΣΥ
2025-04-16

NEW PICKER FEATURE: In-page notes

Highlight some text, click on List Characters, then click on a character name in the popup. This will display any available notes about that character in the right half of the screen.

The notes can also be accessed via the Analyse Text control. When the data appears, click on the Notes to see the notes.

Try it: r12a.github.io/pickers/mlym/?s

A screenshot of a picker tool for the Malayalam script. The main text area contains a paragraph written in Malayalam, with a specific word "തന്റെ" highlighted. A small popup appears at the bottom right of the window, showing the individual characters that compose it, along with their Unicode values and transliterations. Above that is a panel containing information about one of the characters in the list.A screenshot of a picker tool for the Malayalam script. The main text area contains a paragraph written in Malayalam, with a specific word "തന്റെ" highlighted. A list of information about the individual characters that compose it appears below the text area. Clicking on the Notes button alongside one of those items opens a panel containing information about the character.
2025-04-16

NEW PICKER FEATURE: Copy list items to clipboard

When you select List Characters, each line in the list has 2 icons to the right. The first copies just the character; the second copies just the Unicode name.

A screenshot of a picker tool for the Malayalam script. The main text area contains a paragraph written in Malayalam, with a specific word "തന്റെ" highlighted. An arrow points to a detailed breakdown of this word on the right side of the image, showing the individual characters that compose it, along with their Unicode values and transliterations. Alongside each item are two icons that copy information to the clipboard.
2025-04-08

NEW PICKER FEATURE: New font size slider

An extra font size slider has been added just below the bottom right of the text area. The autofocus control that used to be here can now be found as an icon to the right.

Tip: In Blink/Gecko browsers this is just 1 tab key press away from the text area. Two tabs takes you to the control to input characters as escapes/code points.

View of a picker indicating the location of the new font-size control and the Autofocus control.
2025-04-04

WEB I18N GAP PROGRESS: Support for extra spacing between Chinese/Japanese and Western text

Safari 18.4 supports text-autospace:ideograph-alpha ideograph-numeric;

This is only a start, since other browsers don't support this CSS yet, and Safari still doesn't support the 'replace' value. However, this is the kind of property that you can apply to all text now, and as browsers improve it will make the text look better.

More details: w3.org/TR/clreq-gap/#issue401_

r12a boosted:
W3C Internationalization, i18nwebi18n@w3c.social
2025-04-02

WEB I18N GAP FIXED: “Writing-modes sideways values not implemented”

With the release of Safari 18.4 all major desktop browsers now interoperably support the sideways-lr and sideways-rl values of CSS writing-mode.

This means it's now possible to properly display horizontal scripts in vertical arrangements, such as table headings.

This is a significant step forward, and turns many cells green in the language matrix! See w3.org/TR/latn-gap/#issue11_wr

2025-03-31

BENGALI VOWEL LIGATURES

The Bengali orthography notes section r12a.github.io/scripts/beng/bn now shows which Noto/System fonts support ligation for ru, rū, etc. And which browsers support associated ZWJ and ZWNJ behaviours. (It's very complicated!)

2025-03-16

NEW PICKER FEATURE: Collapse spaces but keep lines

Pickers make it easy to rearrange/add/delete/change combining marks by clicking on an icon to the right of the text area which adds space(s) between each character. Clicking another icon removes the spaces once the edits are complete.

However, if the text area includes several lines of text, that icon removes the line breaks, too.

The new icon collapses spaces while retaining the line breaks.

2025-03-14

NEW PICKER FEATURE: Render HTML

Pickers will now do basic rendering of simple HTML snippets. (Especially useful for testing RTL behaviours.)

Hint: Set the direction to LTR override to manage bidi with HTML. Type the HTML. Select Render HTML from the pulldown menu.

Try it: r12a.github.io/pickers/arab-ar

2025-03-10

Thanks to Vinodh Rajan, the webfont used for these pages now uses a font that supports the Kashmiri vowel signs (and a lot more conjunct forms).

2025-03-05

The input field that allows you to add characters to the text area in a picker by typing code points or escapes has been moved just below the text area, for convenience.

Just type one or more code points or escapes and hit return to add the character(s) to the text area.

r12a.github.io/pickers/

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst