According to the comment in this file, ICU's Chinese word segmentation data is very outdated (20+ years old?).
Newer words from CC-CEDICT were purged because of a license issue.
The Ideographic Research Group (IRG) is responsible for preparing and reviewing sets of CJK unified ideographs to be included in the Unicode Standard.
Current and future IRG source prefixes used to be listed in the main IRG homepage, but are now available in a separate dedicated page:
Another weekend, another couple of steps fixing the input method mess I (partly) started:
https://gitlab.freedesktop.org/wayland/wayland-protocols/-/merge_requests/408
Keyboard support and moving popups.
https://codeberg.org/dcz/stiwri/src/branch/popup
On the client support side, I'm kinda bummed that #iced looks like it's starved for maintainers. I don't want to switch contributions to something else because it's stalled.
Back to #Wayland and input methods.
What's your favorite input method?
Cause for me it was the one on old #Nokia phones. I could type with my hands in my pockets! Handy in winter.
https://codeberg.org/dcz/stiwri
Tested with #cosmic and #GTK.
This is just the beginning. Actually useful stuff is coming in the future. #Mobile keyboards and #Chinese input are what it's for:
https://gitlab.freedesktop.org/wayland/wayland-protocols/-/merge_requests/396
The Ideographic Research Group (IRG) is responsible for preparing and reviewing sets of CJK unified ideographs to be included in the Unicode Standard.
The IRG homepage now includes comprehensive lists of current and future IRG source prefixes...
Hello, I'm an open source software engineer in my late 30s living in #Seoul, #Korea, and an avid advocate of #FLOSS and the #fediverse.
I'm the creator of @fedify, an #ActivityPub server framework in #TypeScript, @hollo, an ActivityPub-enabled microblogging software for single users, and @botkit, a simple ActivityPub bot framework.
I'm also very interested in East Asian languages (so-called #CJK) and #Unicode. On the fediverse, I write Korean in mixed Hangul-Hanja script (國漢文混用體). Feel free to talk to me in #English, #Korean (#한국어), or #Japanese (#日本語), or even in Literary Chinese (#文言文, #漢文)!
The Japanese #script consists of #Kanji, #Hiragana, #Katakana and #Romulans. Romulans are small Latin #letters that are written beneath the #CJK characters in #Japan, because the Japanese themselves do not understand what they are writing.
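Blurred and clotted, forming "ink blots"; hazy and indistinct, as if carrying the scent of flowers.
This is a Song (Songti) typeface that does not aim for clarity and is not concerned with elegance. Its strokes are extremely gentle, the character faces are square, the center of gravity sits low, and the structure feels fresh precisely because it is retro.
An attempt to bring back a kind of beauty that really did exist back then.
#花宋 #typedesign #kanji #chinesecharacters #宋体 #漢字 #hanzidesign #cjk #flower #hanzi #typeface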
Adapted my first source for the #9front #troff instead of #OpenBSD's Heirloom one... and now I can also insert .ps pictures, which oddly were not embedded under BSD. The next step could be to add a #CJK font and try to input #Japanese under 9front. Any hints? Is it possible to insert PDF marks under 9front using troff? Are they supported?
Video "LaTeX – Multilingual Unicode strings", explaining how to deal with them without explicit markup. It combines #Hindi, #Japanese and #Arabic. With TeXLive fonts and also Noto fonts.
#multilingual #texlatex #localization #lualatex #devanagari #cjk
https://www.youtube.com/watch?v=jWGmYZsNiYA
In the open-source application `Unicopedia Sinica`, both data files used for the `CJK Components` and the `CJK Related` utilities are now in a consistent JSON format under the MIT license: `cjk-ids.json` and `cjk-related.json` respectively.
@F2erron I previously used #emacs, but for some reason in my system a #multilingual document with several scripts (e.g., #Arabic, #Hebrew, #Devanagari, #CJK and #Tibetan) became deadly sluggish. As the maintainer of #babel, I found myself unable to work. I currently use #Notepad++, based on #Scintilla and programmable with #Lua, which I prefer to #Lisp. Add #TeXLive and #Git.
#texlatex
New utility in Unicopedia Sinica v11.0.0:
- CJK Strokes