#Optimization

N-gated Hacker Newsngate
2026-03-16

๐Ÿš€ Oh, look! Another riveting exploration of mathematical function that will surely redefine human existence. Apparently, the key to faster asin() was hiding in plain sight, and we bet it was right next to the cure for world hunger. ๐Ÿ™„ Let's all just take a moment to thank the gods of for this groundbreaking revelation. ๐Ÿฅณ
16bpp.net/blog/post/even-faste

NVIDIA (@nvidia)

AI ์ถ”๋ก  ์ตœ์ ํ™”์˜ ๋ชจ์Šต์— ๊ด€ํ•œ ๊ฒŒ์‹œ: Kimi K2.5 ๋ชจ๋ธ์ด ArtificialAnlys ๋ฆฌ๋”๋ณด๋“œ์—์„œ ๋น ๋ฅด๊ฒŒ ๋ฐœ์ „ํ•˜๋Š” ๋ชจ์Šต์„ ๋ณด์—ฌ์ฃผ๋ฉฐ, ์ถ”๋ก  ์—”๋“œํฌ์ธํŠธ ์ œ๊ณต์ž๋“ค์ด NVIDIA Blackwell ์•„ํ‚คํ…์ฒ˜์—์„œ ๋งž์ถค ์ตœ์ ํ™”์™€ NVFP4 ๋“ฑ์„ ํ™œ์šฉํ•ด ์„ฑ๋Šฅ ํ•œ๊ณ„๋ฅผ ๋ฐ€์–ด๋ถ™์ด๊ณ  ์žˆ๋‹ค๋Š” ๋‚ด์šฉ์ž…๋‹ˆ๋‹ค.

x.com/nvidia/status/2033281263

#inference #optimization #kimi #nvidia #blackwell

:rss: Qiita - ไบบๆฐ—ใฎ่จ˜ไบ‹qiita@rss-mstdn.studiofreesia.com
2026-03-16
Hacker Newsh4ckernews
2026-03-15
arya dradjicabal4e@tech.lgbt
2026-03-14

I'm a little surprised that Rust/LLVM doesn't optimize away certain atomic operations. See play.rust-lang.org/?version=st (compile to assembly in release mode); an unused atomic load (with Relaxed or Acquire ordering) won't be elided, and an atomic swap with unused loaded value won't be downgraded to a store. I'm fairly confident that the atomic loads can be elided, but I'm willing to believe that the downgrading the RMW swap operation might affect e.g. release-acquire sequences. Perhaps these atomic operations are so rare (and usually, hopefully, done properly) that optimizing them is not worthwhile?

#programming #rust #llvm #optimization

Jens Oliver Meiert ๐Ÿ‡บ๐Ÿ‡ณ ๐Ÿ‡ต๐Ÿ‡ธj9t@mas.to
2026-03-12

Website Optimization Measures, Part XXXVI:

In this action-packed episode, improvements around GitHub Actions, article headings, CDNs, spellchecking, stale branches, acronym handling, Bing authentication, Eleventy performance, and site searches.

meiert.com/blog/optimization-m

#webdev #optimization

2026-03-11

๐Ÿ“ฐ LLVMใซๅฏพใ™ใ‚‹32ใƒ“ใƒƒใƒˆๅฎšๆ•ฐ้™ค็ฎ—ใฎๆ”นๅ–„ (๐Ÿ‘ 33)

๐Ÿ‡ฌ๐Ÿ‡ง PR merged to LLVM improving 32-bit unsigned constant division optimization using 33-bit magic constants on 64-bit targets.
๐Ÿ‡ฐ๐Ÿ‡ท 64๋น„ํŠธ ํƒ€๊ฒŸ์—์„œ 33๋น„ํŠธ ๋งค์ง ์ƒ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•œ 32๋น„ํŠธ ๋ถ€ํ˜ธ ์—†๋Š” ์ƒ์ˆ˜ ๋‚˜๋ˆ—์…ˆ ์ตœ์ ํ™” ๊ฐœ์„  PR์ด LLVM์— ๋ณ‘ํ•ฉ๋จ.

๐Ÿ”— zenn.dev/herumi/articles/const

#LLVM #Optimization #Compiler #Zenn

2026-03-11

๐Ÿ“ฐ LLVMใซๅฏพใ™ใ‚‹32ใƒ“ใƒƒใƒˆๅฎšๆ•ฐ้™ค็ฎ—ใฎๆ”นๅ–„ (๐Ÿ‘ 28)

๐Ÿ‡ฌ๐Ÿ‡ง LLVM optimization PR merged: improves 32-bit constant division using 33-bit magic constants on 64-bit targets, benefiting C/C++/Rust/Swift compilers.
๐Ÿ‡ฐ๐Ÿ‡ท LLVM ์ตœ์ ํ™” ๊ฐœ์„ : 64๋น„ํŠธ ํƒ€๊ฒŸ์—์„œ 33๋น„ํŠธ ๋งค์ง ์ƒ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•œ 32๋น„ํŠธ ์ •์ˆ˜ ๋‚˜๋ˆ—์…ˆ ์ตœ์ ํ™”. C/C++/Rust/Swift ๋“ฑ์— ์ ์šฉ.

๐Ÿ”— zenn.dev/herumi/articles/const

#LLVM #Compiler #Optimization #Zenn

2026-03-11

Test limits to find the sweet spot under damage: adaptation explodes there. From tanning to training to workโ€”push hard, recover fully, repeat. hackernoon.com/the-sweet-spot- #optimization

2026-03-11

๐Ÿ“ฐ LLVMใซๅฏพใ™ใ‚‹32ใƒ“ใƒƒใƒˆๅฎšๆ•ฐ้™ค็ฎ—ใฎๆ”นๅ–„ (๐Ÿ‘ 24)

๐Ÿ‡ฌ๐Ÿ‡ง LLVM optimization improvement for 32-bit constant division merged to main - benefits C/C++/Rust/Swift compilers using 33-bit magic constants
๐Ÿ‡ฐ๐Ÿ‡ท 32๋น„ํŠธ ์ƒ์ˆ˜ ๋‚˜๋ˆ—์…ˆ์— ๋Œ€ํ•œ LLVM ์ตœ์ ํ™” ๊ฐœ์„ ์ด ๋ฉ”์ธ์— ๋ณ‘ํ•ฉ - 33๋น„ํŠธ ๋งค์ง ์ƒ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•ด C/C++/Rust/Swift ์ปดํŒŒ์ผ๋Ÿฌ์— ์ด์ 

๐Ÿ”— zenn.dev/herumi/articles/const

#LLVM #Compiler #Optimization #Zenn

Dan McAteer (@daniel_mac8)

ํ˜„์žฌ๋Š” AI ๋ชจ๋ธ ์ตœ์ ํ™”๊ฐ€ ์ค‘์‹ฌ์ด์ง€๋งŒ ๊ณง ๊ณผํ•™ ์—ฐ๊ตฌ, ์‹ ์•ฝ ๊ฐœ๋ฐœ, ํ•˜๋“œ์›จ์–ด ๋“ฑ ๊ฐ๊ด€์  ๋ชฉํ‘œ๋ฅผ ์ •๋Ÿ‰ํ™”ํ•˜๊ณ  ์ตœ์ ํ™”ํ•  ์ˆ˜ ์žˆ๋Š” ๋ชจ๋“  ๋ถ„์•ผ๋กœ ํ™•์žฅ๋  ๊ฒƒ์ด๋ฉฐ, ์žฅ๊ธฐ์ ์œผ๋กœ๋Š” ์ธ๊ฐ„์˜ 'ํ–‰๋ณต'๊นŒ์ง€๋„ ์ตœ์ ํ™” ๋Œ€์ƒ์ด ๋  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ „๋ง์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.

x.com/daniel_mac8/status/20311

#aioptimization #drugdiscovery #scientificresearch #hardware #optimization

2026-03-10

๐Ÿšจ Riset terbaru baru saja dirilis!

"Unlock Peak Performance: How to Speed Up Your PC with Microsoft PC Manager"

๐Ÿ”— Akses repositori/dokumentasi: dev.to/living_palace_033483e78

N-gated Hacker Newsngate
2026-03-10

๐Ÿš€ Wow, finally realized can do more than just store data! ๐ŸŽ‰ After a deep dive into buzzword soup ๐Ÿฒ, they claim to have "optimized" Top Kโ€”meaning they changed a few lines of code and hoped for the best. ๐Ÿ™ƒ Who knew needed a "Missing Manual" to find its way around a database? ๐Ÿ˜‚
paradedb.com/blog/optimizing-t

Dennis Alexis Valin Dittrichdavdittrich@fediscience.org
2026-03-09

Robust estimation demands highly efficient computation, especially in streaming anomaly detection where latency budgets are tight.

While Rousseeuw & Croux's robust estimators ($Q_n$ and $S_n$), and Rousseeuw & Verboven's M-estimators of location and scale for very small samples, provide exceptional reliability, computing them requires intensive math.

robscale 0.1.5 is now on CRAN. It delivers a native C++17/Rcpp implementation designed for absolute speed. The package utilizes SIMD-vectorized $\tanh$ evaluation, Newton-Raphson iteration, and optimal sorting networks for cache-aware median selection.

The result? A 1.6x up to ~28x performance leap over pure-R implementations. The mathematical results remain identical; only the computational underpinnings change.

๐Ÿ“ฆ CRAN: cran.r-project.org/package=rob
๐Ÿ’ป Code: github.com/davdittrich/robscale

#RStats #RobustStatistics #DataScience #Optimization

A two-panel benchmark charting performance multipliers of the optimized C++ robscale package against legacy pure-R implementations across sample sizes from $n = 3$ up to $10^7$, with the vertical axis starting honestly at 0x. The left panel reveals a massive speedup for M-estimators (robLoc, robScale, adm  vs. revss), pushing up to ~28x for robScale. The right panel tracks scale estimators ($Q_n$, $S_n$ vs. robustbase), showing the speedup curve upward from 1.6x, approaching 10x at large sample sizes. Shaded ribbons show 95% bootstrap confidence intervals, visually confirming dramatic computational efficiency.

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst