#Unicode

Laurent Cheylus lcheylus@bsd.network
2025-12-19

Full Unicode Search at 50× ICU (C/C++ Library for Unicode) Speed using StringZilla with AVX‑512 (SIMD, SWAR, and CUDA-accelerated String Algorithms) - Detailed Article and Benchmarks by Ash Vardanian #Programming #Unicode ashvardanian.com/posts/search-

Hype for the Future 48G: Future Videos from novaTopFlex

In the near future, expect additional novaTopFlex content that uses the Unicode character set and its various code charts and blocks. Each video must mention a specific Unicode character, code block, snippet, or other piece of the character puzzle, and while pronunciations may not be exact, approximate pronunciations are the aim in the upcoming video streams. The first video by the novaTop Identity should include, but not be limited to, the Basic Latin and […]

novatopflex.wordpress.com/2025

2025-12-18

Tiny note-to-self blog:

How I made #unicode work between my #slackware laptop and my #openbsd server.

Also featuring guest appearances by #xterm and #emacs.

kindness.city/blog/2025-12-18-

2025-12-17

Why is there no sad/empathetic/sympathetic smile emoji!?!? It's one of the ones I need the most, and after 10 years of Unicode emoji, with tons of additions through the years, no one has yet added this most useful emoji!? Why!?

#sympathy #unicode #emoji

N-gated Hacker News ngate
2025-12-16

🎉 30 minutes of your life you'll never get back! 🚀 Why settle for regular searches when you can dive into a 6,244-word on AVX-512? 🤦‍♂️ , you're so , but hey, at least you're thorough! 😏
ashvardanian.com/posts/search-

2025-12-16

Unicode, emoji, and inclusion. The emoji keyboard on smartphones has almost everything: wheelchairs, robotic arms, white canes, guide dogs, various pieces of medical equipment.
So I wouldn't ask for a condom in there; I recognize it would be explicit, questionable, and open to misunderstanding for too many users around the world.
But LGBT people have their own flag and disability has most of the symbols associated with it, while the HIV red ribbon is absent, and that has one meaning: INVISIBILITY.
I wonder how to go and talk to the Unicode Consortium. How could a red ribbon harm anyone? With the US administration making every unrepresented group invisible, we should raise our hands from where we are.
An emoji is nothing. Even the font of official documents could be called "nothing", but let's remember that, for better or worse, big battles start from small things.
#emoji #hiv #poz #RedRibbon #stigma #unicode

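GripNews GripNews
2025-12-15

🌗 Unscii: a bitmap Unicode font designed for pixel graphics
➤ Reviving the classics, embracing Unicode: how the Unscii font bridges retro graphics and modern computing
viznut.fi/unscii/
Unscii is a set of bitmap Unicode fonts based on classic system fonts, designed to fully support character-cell art while also being suitable for terminals and programming. It provides 8x8 and 8x16 pixel character sizes and comes in several variants, including a "full" version that supports more Unicode characters. The release of Unscii 2.0 is mainly due to the "legacy computing" graphics characters added in Unicode 13.0; the author gave these characters their proper Unicode mappings, fixed errors in some characters, and improved legibility.
+ This font is great! It reminds me of playing old games, and I never expected it to support Unicode.
+ I've been looking for a font suited to terminal use, and the 8x16 version of Unscii looks like a great fit for my code.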

apfeltalk :verified: apfeltalk@creators.social
2025-12-14

New emojis for iPhone and Android: what's coming to your devices in 2026
With iOS 26, Apple shipped no new emojis in September. Eight new symbols are now expected to appear on iPhones and Android devices in early 2026.

Eight new emojis confirmed with Unicode 17.0
In September, the Unicode Consortium […]
apfeltalk.de/magazin/news/neue
#News #Tellerrand #Android #Emojis #IOS26 #iPhone #Unicode #Unicode170

Michel Mariani mikaeru
2025-12-13

RE: mastodon.social/@mikaeru/11558

Generally, new CJK Ideographs proposed by members of the IRG (Ideographic Research Group) go through several rounds of exchanges/discussions until they get approved or possibly postponed or rejected.

For instance, here is the page dedicated to UK-20538 ⿰㐅也 (with images as "pieces of evidence"), which eventually made its way to Unicode 17.0, encoded as U+323BF 𲎿 :

🔗 hc.jsecs.org/irg/ws2021/app/?i

DoctorG ♀️🏳️‍🌈 DoctorG_1@vivaldi.net
2025-12-13

Filtering Unicode "Emoji" and other symbols is a hell with PCRE.

#Perl #PCRE #Unicode #Emojis

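GripNews GripNews
2025-12-13

🌕 GNU Unifont character set: comprehensive Unicode coverage and open-source licensing
➤ Explore the latest GNU Unifont release and learn about its character coverage, license terms, and technical details
unifoundry.com/unifont/index.h
GNU Unifont is an open-source font project that aims to provide all printable characters in the Unicode standard. This page introduces the latest version of GNU Unifont, which covers all characters in the Unicode Basic Multilingual Plane (BMP) and extends support to the Supplementary Multilingual Plane (SMP) and the ConScript Unicode Registry (CSUR). The font is dual-licensed under GNU GPLv2+ (with the font embedding exception) and SIL OFL 1.1, allowing commercial use but requiring derivative fonts to be open source as well. The article details the font's license terms, the download options in different formats, and installation and usage recommendations for different operating systems; it also points out the font's limitations in handling complex scripts and encourages contributions of new character glyphs.
+ This font is so useful! The coverage is broad and the license is friendly, so it can be used directly in commercial projects. A real blessing.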
+ 感

Aaron “インフルエンサーA型” Madlon-Kay amake
2025-12-13

I used the new Unicode script matchers in Orgro (orgro.org) to improve text reflow for Japanese and Chinese text.

Previously all text would reflow like the Latin text above—with a space where line breaks were. Now I remove the space when appropriate based on the script of the abutting non-whitespace characters.
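A rough sketch of that reflow rule, in Python and not taken from Orgro's own code. The names `is_cjk` and `unwrap` are made up for illustration. Python's standard library does not expose the Unicode Script property, so character-name prefixes stand in for a real script check here, and the rule "drop the space only when both neighbours are CJK" is an assumption about what "when appropriate" means:

```python
import unicodedata

# Assumption: name prefixes approximate the Han, Hiragana and Katakana scripts.
# A real implementation would consult the Unicode Script property instead.
_CJK_NAME_PREFIXES = (
    "CJK UNIFIED IDEOGRAPH",
    "CJK COMPATIBILITY IDEOGRAPH",
    "HIRAGANA",
    "KATAKANA",
)


def is_cjk(ch: str) -> bool:
    """Rough test for a Japanese or Chinese character (one code point)."""
    return unicodedata.name(ch, "").startswith(_CJK_NAME_PREFIXES)


def unwrap(lines: list[str]) -> str:
    """Join hard-wrapped lines into a single paragraph.

    A space replaces each line break unless the characters on both
    sides of the break are CJK, in which case the lines are joined directly.
    """
    out = ""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip empty lines
        if out and not (is_cjk(out[-1]) and is_cjk(line[0])):
            out += " "
        out += line
    return out
```

With this rule, `unwrap(["hello", "world"])` still yields `hello world`, while `unwrap(["日本語の", "文章"])` yields `日本語の文章`. Punctuation such as 。 has script Common and is not treated specially in this sketch.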

Aaron “インフルエンサーA型” Madlon-Kay amake
2025-12-13

A good character to test with is 𠮟 (U+20B9F)

- It's outside the BMP

- It's similar enough to a commonly used BMP character (叱 U+53F1) that people will accidentally use it

- It's entirely reasonable to expect that your software should handle it correctly

If your software handles 叱 but not 𠮟, it is broken!
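To see why this character is a good test, here is a small Python sketch comparing the two characters. The helper names `utf16_code_units` and `surrogate_pair` are hypothetical, written only for this illustration:

```python
def utf16_code_units(text: str) -> int:
    """Number of 16-bit code units needed to store text as UTF-16."""
    return len(text.encode("utf-16-le")) // 2


def surrogate_pair(ch: str) -> tuple[int, int]:
    """Split one supplementary-plane character into its UTF-16 surrogates."""
    offset = ord(ch) - 0x10000
    if offset < 0:
        raise ValueError("character is inside the BMP and has no surrogate pair")
    return 0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)


inside_bmp = "\u53F1"       # 叱
outside_bmp = "\U00020B9F"  # 𠮟

for ch in (inside_bmp, outside_bmp):
    print(
        f"U+{ord(ch):04X}: {len(ch)} code point, "
        f"{utf16_code_units(ch)} UTF-16 unit(s), "
        f"{len(ch.encode('utf-8'))} UTF-8 byte(s)"
    )
```

Both strings are a single code point, but 𠮟 needs two UTF-16 code units (the surrogates U+D842 and U+DF9F) and four UTF-8 bytes. Code that indexes, truncates, or counts by UTF-16 code unit can split that pair in half.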

Aaron “インフルエンサーA型” Madlon-Kay amake
2025-12-13

I see various and libraries offering functions for detecting kanji characters, but they almost always do this in a limited way that misses a huge number of characters, i.e. nothing beyond the BMP, or even missing ranges in the BMP.

The only way to do this right is to

1. Work with codepoints, not UTF-16 code units

2. Look at the Unicode script property, which should be `Han` for kanji/hanzi
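A minimal sketch of the difference, with a caveat: Python's standard library does not expose the Script property, so `looks_like_han` below only approximates it through character names and will miss other Han-script characters such as radicals or 々. An engine with real script support (for example `\p{Script=Han}` in the third-party `regex` module) would be the more faithful choice. Both function names are hypothetical:

```python
import unicodedata

# Assumption: ideograph name prefixes stand in for Script=Han.
_HAN_NAME_PREFIXES = ("CJK UNIFIED IDEOGRAPH", "CJK COMPATIBILITY IDEOGRAPH")


def naive_is_kanji(ch: str) -> bool:
    """The limited check: only the main CJK Unified Ideographs block."""
    return 0x4E00 <= ord(ch) <= 0x9FFF


def looks_like_han(ch: str) -> bool:
    """Code-point-based check that also covers the extension blocks."""
    return unicodedata.name(ch, "").startswith(_HAN_NAME_PREFIXES)


def contains_han(text: str) -> bool:
    """Python iterates str by code point, so no surrogate handling is needed."""
    return any(looks_like_han(ch) for ch in text)


samples = ["\u53F1", "\u3405", "\U00020B9F", "\u3042", "A"]  # 叱 㐅 𠮟 あ A
for ch in samples:
    print(f"U+{ord(ch):04X} naive={naive_is_kanji(ch)} name_based={looks_like_han(ch)}")
```

The naive range check accepts 叱 but rejects both 㐅 (Extension A, inside the BMP) and 𠮟 (Extension B, outside it), which are the two kinds of miss described above.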

N-gated Hacker News ngate
2025-12-12

Oh joy, yet another font for nerds to drool over. 😴 is here with galore for everyone who spends their weekends arguing about planes—because that's what all the cool kids are doing, right? 😂
unifoundry.com/unifont/index.h

2025-12-11

I pretty much completely stopped using #emoticons like :) when #emojis became available.

I think because I find emojis to be a much more elegant and complete solution to the same problem. Elegant in that emojis have a dedicated, unambiguous #encoding; complete in that there are loads and loads of emojis to choose from.

#emoji #Unicode #emoticon

2025-12-10

Number ➎ came out last year and is surely the most accomplished installment of the series. It is also the one that sold the least, but that is typical as a series goes on. Once again, the issues are all independent and can be read in any order.

Here we discover the "new" Egyptian hieroglyphs that have just been released, and we talk about North Africa, sign language, and even bank checks.

The little piece of news is that number ➏ is currently being written!

#ad #unicode #design

black and coffee-colored cover of number 5 with its stickers
new Egyptian hieroglyphs
Map of the Amazigh communities of North Africa
2025-12-10

The encoding of writing systems is a political and cultural issue of emancipation, and in each issue I try to highlight a minority or endangered writing system, as in this number ➍ with the history of Mongolia and its writing systems. We also talk about flags, Chinese noodles, and playing cards.

#ad #design #unicode #cadeau2025

green and blue cover of number 5 with its stickers
interior detail of the partially encoded tarot cards
detail of a Chinese culinary sinogram
