#memorization

Aubreader Masto (@Aubreader@mas.to)
2026-01-12

AI's Memorization Crisis - The Atlantic

theatlantic.com/technology/202

> Large language models don’t “learn”—they copy. And that could change everything for the tech industry.

#AI #LLM #memorization

Don Curren 🇨🇦🇺🇦 (@dbcurren.bsky.social@bsky.brid.gy)
2026-01-12

Aharon Azulay (@AharonAzulay)

The author says his own observations agree, pointing out that these systems can recall even the numerical details of little-known arXiv papers. From a research and verification standpoint, the comment speaks to the models' memorization (recall) capabilities and their behavior around data provenance.

x.com/AharonAzulay/status/2009
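
A minimal sketch of how one might check this kind of recall, assuming a generic chat API; the helper names (query_model, probe_numeric_recall) and the example values are hypothetical illustrations, not anything from the linked thread:

```python
# Hypothetical memorization probe: ask a model for a specific numeric detail
# from a paper and check the reply against the known value.
# `query_model` is a placeholder for whatever LLM API you actually use.

def query_model(prompt: str) -> str:
    """Placeholder: call your chat/completions endpoint and return its text reply."""
    raise NotImplementedError

def probe_numeric_recall(paper_title: str, question: str, true_value: str) -> bool:
    """Return True if the model's answer contains the exact figure from the paper."""
    prompt = (
        "Without searching the web, answer from memory.\n"
        f"Paper: {paper_title}\n"
        f"Question: {question}"
    )
    answer = query_model(prompt)
    return true_value in answer

# Example usage (values are illustrative, not from any real paper):
# hit = probe_numeric_recall(
#     "Some Obscure arXiv Paper (2021)",
#     "What accuracy did the authors report on their main benchmark?",
#     "87.3",
# )
```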

#research #memorization #arxiv #llm

2025-12-30

A quotation from Bill Watterson

CALVIN: As you can see, I have memorized this utterly useless piece of information long enough to pass a test question. I now intend to forget it forever. You’ve taught me nothing except how to cynically manipulate the system. Congratulations.

Bill Watterson (b. 1958) American cartoonist
Calvin and Hobbes (1994-01-27)

More about this quote: wist.info/watterson-bill/81087…

#quote #quotes #quotation #qotd #billwatterson #calvinandhobbes #cramming #cynicism #education #learning #lesson #memorization #rotememorization #school #teaching #test

Calvin writing to his teacher on his test paper.

N-gated Hacker News (@ngate)
2025-12-24

Ah, the age-old quest for the perfect hack! 🤔 Well, here comes the article, promising to revolutionize your brain with a sprinkle of spaced repetition. ✨ Just remember, if this method were truly foolproof, the author would be running Apple by now, not blogging about it. 😂
gwern.net/spaced-repetition

2025-12-04

As always: #OpenData persistently available at:
Du, K. (2025). Reconstructing Shuffled Text (Derived Text Formats) [Data set]. Zenodo. doi.org/10.5281/zenodo.17198425
#CLS #CCLS25 #DTF #LiteraryComputing #LLM #Memorization

Arie van Deursen 🇪🇺🇳🇱 (@avandeursen@mastodon.acm.org)
2025-11-11

In our own work, we researched memorization in language models for code and ways to make them regurgitate their training data:

> From the training data that was identified to be potentially extractable we were able to extract 47% from a CodeGen-Mono-16B code completion model.

> We also observe that models memorise more as their parameter count grows, and that their pre-training data are also vulnerable to attack.

dl.acm.org/doi/abs/10.1145/359
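
A minimal sketch of the general prefix-continuation idea behind such extraction checks, assuming the Hugging Face transformers API and a smaller CodeGen checkpoint (Salesforce/codegen-350M-mono) as a stand-in for the 16B model; the 50-token prefix/suffix split is an illustrative choice, not the paper's exact protocol:

```python
# Prompt a code model with the start of a suspected training sample and test
# whether greedy decoding reproduces the true continuation verbatim.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "Salesforce/codegen-350M-mono"  # smaller stand-in for CodeGen-Mono-16B

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

def is_memorized(sample: str, prefix_tokens: int = 50, suffix_tokens: int = 50) -> bool:
    """Greedy-decode a continuation of the prefix and compare it to the real suffix."""
    ids = tokenizer(sample, return_tensors="pt").input_ids[0]
    prefix = ids[:prefix_tokens]
    suffix = ids[prefix_tokens:prefix_tokens + suffix_tokens]
    out = model.generate(
        prefix.unsqueeze(0),
        max_new_tokens=suffix_tokens,
        do_sample=False,  # greedy decoding: memorized text tends to come back verbatim
    )
    generated = out[0][prefix_tokens:prefix_tokens + suffix_tokens]
    return tokenizer.decode(generated) == tokenizer.decode(suffix)
```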

#memorization #atemlos

Arie van Deursen 🇪🇺🇳🇱 (@avandeursen@mastodon.acm.org)
2025-11-11

Ruling in GEMA v. OpenAI:

> Both the memorization within the language models and the reproduction of the song lyrics in the chatbot's outputs constitute infringements of the copyright exploitation rights.

justiz.bayern.de/gerichte-und-

#atemlos #openai #copyright #memorization #gema #chatgpt

N-gated Hacker News (@ngate)
2025-06-13

The New York Times thinks a turtle poem will "win your heart" 🐢💔—because nothing screams "captivating" like slow-moving reptiles and deep dives into poetic gravity. 🎼✨ Meanwhile, they offer a tool to help you memorize it, as if anyone is clamoring to recite turtle verses at parties. 🎉📜
nytimes.com/interactive/2025/0

Erik Jonker (@ErikJonker)
2025-06-07

Interesting, "GPT-style models have a fixed memorization capacity of approximately 3.6 bits per parameter."
venturebeat.com/ai/how-much-in

2025-06-06

How much information do LLMs really memorize? Now we know, thanks to Meta, Google, Nvidia and Cornell https://venturebeat.com/ai/how-much-information-do-llms-really-memorize-now-we-know-thanks-to-meta-google-nvidia-and-cornell/ #AI #memorization #copyright

Text Shot: Jack Morris, the lead author, explained via the social network X that “training on more data will force models to memorize less per-sample.”

These findings may help ease concerns around large models memorizing copyrighted or sensitive content.

If memorization is limited and diluted across many examples, the likelihood of reproducing any one specific training example decreases. In essence, more training data leads to safer generalization behavior, not increased risk.
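
A back-of-envelope sketch of what that figure implies, assuming the ~3.6 bits/parameter estimate quoted above; the 8-billion-parameter model size and the dataset sizes below are made-up illustrative numbers, not values from the study:

```python
# Fixed capacity spread over more training samples means less memorized per sample.
BITS_PER_PARAM = 3.6  # approximate capacity estimate quoted in the article

def total_capacity_bits(n_params: float) -> float:
    """Approximate total memorization capacity in bits."""
    return BITS_PER_PARAM * n_params

def avg_bits_per_sample(n_params: float, n_training_samples: float) -> float:
    """Average memorization budget per training sample."""
    return total_capacity_bits(n_params) / n_training_samples

# Example: an 8-billion-parameter model (assumed size)
capacity = total_capacity_bits(8e9)    # ~2.88e10 bits, i.e. roughly 3.6 GB
few = avg_bits_per_sample(8e9, 1e6)    # ~28,800 bits per sample with 1M samples
many = avg_bits_per_sample(8e9, 1e9)   # ~28.8 bits per sample with 1B samples
print(f"capacity ≈ {capacity / 8 / 1e9:.1f} GB; "
      f"per-sample: {few:.0f} bits vs {many:.1f} bits")
```
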
WIST Quotations Has Moved! (@wist@my-place.social)
2025-04-16

A quotation from Montaigne

I gladly return to the subject of the ineptitude of our education. Its goal has been to make us not good or wise, but learned; it has attained this goal. It has not taught us to follow and embrace virtue and wisdom, but has imprinted in us their derivation and etymology. We know how to decline virtue, if we cannot love it. If we do not know what wisdom is by practice and experience, we know it by jargon and by rote.
 
[Je retombe volontiers sur ce discours de l’ineptie de nostre institution : Elle a eu pour sa fin, de nous faire, non bons & sages, mais sçavans : elle y est arrivée. Elle ne nous a pas appris de suyvre & embrasser la vertu & la prudence : mais elle nous en a imprimé la derivation & l’etymologie. Nous sçavons decliner vertu, si nous ne sçavons l’aymer. Si nous ne sçavons que c’est que prudence par effect, & par experience, nous le sçavons par jargon & par cœur.]

Michel de Montaigne (1533-1592) French essayist
Essay (1578), “Of Presumption [De la Presomption],” Essays, Book 2, ch. 17 (2.17) (1595) [tr. Frame (1943)]

Sourcing, notes, alternate translations: wist.info/montaigne-michel-de/…

#quote #quotes #quotation #qotd #montaigne #education #learning #meaning #memorization #morality #rote #school #understanding #virtue #wisdom

WIST Quotations Has Moved! (@wist@my-place.social)
2025-03-12

A quotation from Montaigne

We readily inquire, “Does he know Greek or Latin?” “Can he write poetry and prose?” But what matters most is what we put last: “Has he become better and wiser?” We ought to find out not who understands most but who understands best. We work merely to fill the memory, leaving the understanding and the sense of right and wrong empty.
 
[Nous enquerons volontiers, Sçait-il du Grec ou du Latin ? escrit-il en vers ou en prose ? mais, s’il est devenu meilleur ou plus advisé, c’estoit le principal, & c’est ce qui demeure derriere. Il falloit s’enquerir qui est mieux sçavant, non qui est plus sçavant. Nous ne travaillons qu’à remplir la memoire, & laissons l’entendement & la conscience vuide.]

Michel de Montaigne (1533-1592) French essayist
Essay (1572-1578), “Of Pedantry [Du pedantisme],” Essays, Book 1, ch. 24 (1.24) (1595) [tr. Screech (1987), ch. 25]

Sourcing, notes, alternate translations: wist.info/montaigne-michel-de/…

#quote #quotes #quotation #Montaigne #comprehension #education #evaluation #improvement #learning #memorization #rubric #school #student #teaching #understanding #wisdom

WIST Quotations Has Moved! (@wist@my-place.social)
2025-03-11

A quotation from William Feather

An education isn’t how much you have committed to memory, or even how much you know. It’s being able to differentiate between what you do know and what you don’t. It’s knowing where to go to find out what you need to know, and it’s knowing how to use the information once you get it.

William Feather (1889-1981) American publisher, author
(Attributed)

Sourcing, notes: wist.info/feather-william/1479…

#quote #quotes #quotation #application #competence #education #ignorance #knowledge #memorization #research

Andrew Shields (@AndrewShields@mas.to)
2025-03-07

Counting to high numbers and reciting poems to suppress evil thoughts in Charles Dickens’s “Hard Times” (1854). #111Words #CharlesDickens #HardTimes #Poetry #Counting #Recitation #Memorization andrewjshields.blogspot.com/20
