#MLsec

2026-03-13

Just for the record, this is not really #MLsec...this is using #ML for security ops. Which means...whatever. yawnzies.

codewall.ai/blog/how-we-hacked

2026-03-12

What is "beigification" in AI, and is it good or bad?

#AI #ML #MLsec

berryvilleiml.com/2026/03/12/o

2026-03-12

NEW BIML Bibliography entry

arxiv.org/abs/2310.08754

Tokenizer Choice For LLM Training: Negligible or Crucial?

Mehdi Ali, et al

Often ignored, this kind of work is at the foundation of ML. Using languages to experiment. Straightforward but not profound work.

#MLsec #Representation #Tokenization

berryvilleiml.com/references/

BIML logo
2026-03-12

This work is apparently being commercialized. That is a tell regarding the nascent state of #MLsec.

2026-03-12

NEW BIML Bibliography entry

arxiv.org/abs/2601.09923

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

Hanna Foerster, et al (Shumailov and Papernot)

This paper tries so hard to be good but shows what happens when security engineering plays second fiddle to agentic AI (through Computer Use Agents using a GUI). Results are thin and repeated. Pretends that your PC is somehow “isolated.”

#MLsec #Agents #Engineering

berryvilleiml.com/references/

BIML logo
2026-03-12

You owe your soul to the company store. Company scrip is back, but not in the coal mines ...in the AI software mines.

#ML #AI #MLsec

businessinsider.com/ai-compute

2026-03-11

NEW BIML Bibliography entry

arxiv.org/abs/2503.03150

Position: Model Collapse Does Not Mean What You Think

Rylan Schaeffer, Joshua Kazdan, Alvan Caleb Arulandu, Sanmi Koyejo

We think recursive pollution is a better term than model collapse. Weak terminology leads to misunderstanding of impact. See figure 4. This is a very good paper.

#TOPPAPER #MLsec #RecursivePollution #DataPoisoning

berryvilleiml.com/references/

BIML cow
2026-03-11

NEW BIML Bibliography entry

arxiv.org/abs/2404.05090

How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse

Mohamed El Amine Seddik, et al

This treatment fails because the models being studied are TOY models too simple to be interesting.

#MLsec #RecursivePollution #SyntheticData

berryvilleiml.com/references/

BIML logo
2026-03-11

NEW BIML Bibliography entry

arxiv.org/abs/2502.18865

A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

Shi Fu, Yingjie Wang, Yuzhu Chen, Xinmei Tian, Dacheng Tao

Published at ICLR 2025. A bit overfocused on the real vs synthetic data problem, this paper covers the depletion of real data available for training ML. STLs are getting very close indeed to recursive pollution, so the math here is relevant.

#MLsec #RecursivePollution

berryvilleiml.com/references/

BIML voe
2026-03-11

NEW BIML Bibliography entry

arxiv.org/abs/2410.04840

Strong Model Collapse

Elvis Dohmatob, Yunzhen Feng, Arjun Subramonian, Julia Kempe
(NYU and META)

Recursive pollution leads to model collapse. This view of strong model collapse describes what happens in the case of recursive data poison.
#TOPPAPER #MLsec #Data #RecursivePollution

berryvilleiml.com/references/

BIML logo
2026-03-11

NEW BIML Bibliography entry

arxiv.org/abs/2509.16499

A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective

Lianghe Shi, et al

A very nice set of references to work in model collapse. Collapsed model == lookup table (that is, no generalization). Discussion of recursive pollution as causing variance shrinkage or distribution shift.

#TOPPAPER #MLsec #Data #RecursivePollution

berryvilleiml.com/references/

BIML cow
2026-03-10

Openclaw in China "raises a lobster"

Once again, the weakest link in security is the people #MLsec #ML #AI

wsj.com/tech/ai/chinas-opencla

2026-03-09
2026-03-09

It is both rewarding and daunting to be mentioned in this work along with Ken Thompson and Ross Anderson. Lots of ideas expressed in this essay are right on the money.

Have a read. Pass it on.

#MLsec #ML #swsec #appsec #security #infosec

medium.com/@maconstantino/trus

2026-03-06

Maybe the answer is "building security in" instead of "penetrate and patch," huh @gadi ?

#swsec #MLsec #appsec

wsj.com/tech/ai/send-us-more-a

2026-03-04

#ML and #AI deeply impacting time to exploit. The zero day clock shows this.

This is #security impacted by ML...not #MLsec

Guess we should have learned those lessons from #swsec 25 years ago

zerodayclock.com/

2026-03-04

But that's not all. We also ran into Carl Hurd from Starseer at [un]prompted. Starseer's work on getting inside the network is the future of #MLsec engineering.

Katie McMahon and Harold Figueroa in the house!

2026-03-04

BIML is in San Francisco at [un]prompted, where we've met two of our heroes in #MLsec...Carlini and Shumailov. Two authors of the very best work in our emerging field.

See berryvilleiml.com/bibliography/

Katie McMahon and Harold Figueroa in the house!

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst