#phacking

Felix Schönbrodtnicebread@scicomm.xyz
2025-05-15

#phacking #LLM performance:

"The researchers found that big players like Meta, Google, OpenAI, and Amazon are given special privileges to privately test multiple versions of their models and only publish the best results. This hidden practice allows them to inflate their rankings by cherry picking data, making their models appear stronger than they actually are."

arxiv.org/abs/2504.20879?utm_s

via Sabine Hossenfelder's newsletter

Dr Mircea Zloteanu 🌼🐝mzloteanu
2025-04-25

#330 Encourage Playing with Data and Discourage Questionable Reporting Practices

Thoughts: What are and aren't "Questionable Research Practices"? Where is the "grey area"? Interesting opinion piece.

link.springer.com/article/10.1

2025-02-02

> 2011: Joseph Simmons, Leif Nelson, and Uri Simonsohn publish a paper, “False-positive psychology,” in Psychological Science introducing the useful term “researcher degrees of freedom.” Later they come up with the term p-hacking, and Eric Loken and I speak of the garden of forking paths to describe the processes by which researcher degrees of freedom are employed to attain statistical significance.
statmodeling.stat.columbia.edu
#PHacking #DegreesOfFreedom
@bsmall2@mstdn.jp

2025-01-16

🧵
> ... unethical behaviour during the report of results is.. P hacking... frequent in research.. [of a] clinical nature... two main reasons.. First, scientists are often evaluated by the number and quality of publications, and sometimes this pressure to get sig­nificant results makes some scientists cherry-pick their results. Second (and more frequent), some inexperienced analysts are unaware of the importance of #MultipleTesting and think this is
OK. But it is not! #PHacking
@bsmall2@fedibird.com

Dr Mircea Zloteanu 🌼🐝mzloteanu
2024-08-28

#168 Large P-Values Cannot be Explained by Power Analysis

Thoughts: "Researchers cannot “aim” for p = .05, not even with a careful, perfectly accurate, power analysis."

quentinandre.net/post/large-pv

Dr Mircea Zloteanu 🌼🐝mzloteanu
2024-07-12

#135 {p-checker} The one-for-all p-value analyzer

Thoughts: Why not try your hand at some tools for detecting publication bias (mileage may vary). Useful teaching demo tools.

shinyapps.org/apps/p-checker/

2024-04-26

Fun class class survey of other undergrads, current N=55. I'm doing #irresponsible #DataAnalysis bc not actual #research.

Still #WTF?

ghost_recv_log = How many times have you been ghosted? (log-transformed)

mosi = Misperception of others' sexual interest

bjw = Belief in a just world

swls = Satisfaction with life

csei = College self-efficacy

High self-esteem assholery? IDK.

OH WAIT. Gender!

Shit. We didn't ask.

#NotResearch #Datafishing #phacking but it's #OK I'm a #professional #oops

Lea 💚🧵🛠️📊🥧🇺🇦🍉leamusi@mendeddrum.org
2024-04-24

@ALLEA No time to read the full framework, but is there any mention of enforcing best scientific practices and ensuring scientific integrity?
#scientists #replicability #scientificmisconduct #reproducibilityofresearch #reproducibilitea #publishorperish #phacking

Bjørn Sætreviksatrevik@fediscience.org
2024-01-17

Table 1 of the "False-positive psychology" paper (Simmons, Nelson & Simonsohn 2011, papers.ssrn.com/sol3/papers.cf) estimate the false-positive rate of some questionable #ResearchPractices, both alone and in combination (see attached figure).

I remember reading somewhere that the authors later stated that the numbers were inaccurate and should have been somewhat higher. Does anyone have a reference for this? #FalsePositivePsychology #QRP #phacking #HARKing #ReproducibiliTea

A visual representation of Table 1 from the "False-positive psychology" paper (Simmons, Nelson & Simonsohn 2011). 

Researcher degrees of freedom	p < .05
Situation A: two dependent variables (r = .50) 	10 %
Situation B: option to add 10 more observations per cell	8 %
Situation C: controlling for gender or interaction of gender	13 %
Situation D: dropping (or not) one of three conditions	13 %
	
Combine Situations A and B	14 %
Combine Situations A, B, and C	31 %
Combine Situations A, B, C, and D	61 %
Karthik Srinivasanskarthik@neuromatch.social
2023-10-11

A nice hatchet job from the New Yorker!

newyorker.com/magazine/2023/10

The story of morally and intellectually bankrupt superstar researchers who elevated what is/was/has always been a somewhat semi-decent, hardly scientific enterprise called behavioral economics (within social psychology) into a juggernaut of "just-so stories" that swayed nations, industries and academias to form absolutely horrendous worldviews and make decisions likewise.

The field: The poor need to be nudged (read, extra inconvenience) to make rational decisions with marginally any value to them. Meanwhile, dear corporations, here are some tax write-offs and subsidies; you can also engage in open thievery. That's our way to nudge you to do the right thing, i.e., fudging with the accounts. This is how we social engineer (All this, based on absolutely moronic and morally dubious "studies". I am reminded of the evo-psych psychopaths).

As if economics as it stands isn't dismal and unscientific enough, whether you legitimize with a couple Nobels or not, this is like "polishing a turd".

Of course not much is likely to change in academia across fields as well. It will continue to seek and hire more such people who will bring the money, fame, and storytelling at the expense of hard work and reality. Knowledge building and intellectual work... what's that?

#Dishonesty #Academia #BehavioralEconomics #Management #BusinessSchools #Economics #Nudge #pHacking #SocialPsychology

Research Network Digi-Oek.chDigiOekCH@social.tchncs.de
2023-06-24

[en] Cheating in Science: Harvard "Honesty Scholar" May Have Been Caught in Dishonesty

"... dishonesty can lead to creativity" - an interesting and somewhat amusing read.

The New York Times: "Questions about a widely cited paper are the latest to be raised about methods used in #behavioral research."

datacolada.org/111

#ResearchHighlights #honesty #dishonesty #phacking #harking #dredging #gino #harvard #fraud #cheating #academic #datacolada

2023-05-11

“It’s a bit like seeing a rabbit shape in the clouds and then testing whether all clouds look like rabbits… using the same cloud. I hope you appreciate that you’re going to need some new clouds to test your theory.

Any datapoint you use to inspire a theory or question can’t be used to test that same theory.”

#phacking #statistics #rstats

towardsdatascience.com/the-mos

@sociology @communicationscholars @rstats

Anita Graser 🇪🇺🇺🇦🇬🇪underdarkGIS@fosstodon.org
2023-03-28

"Do not chase better metrics. Chase better models."

Excellent lighting talk on how spatial sampling affects the quality of data driven models by Luísa Lucchese

youtu.be/q9dy2dQuTak

Chasing better #metrics is like #phacking

#GeoAI #gischat #datascience #machinelearning

2023-03-10

If you’re interested in #statistics, #Metascience, #openScience, #pHacking, etc., this is quite a good read.

joebakcoleman.com/blog/2023/pc

2023-03-09

In 2012, Noam Chomsky mentioned a Lisp-language guy, #PatWinston. Winston said that, in the AI^1 field, people were directed away from "original questions." That sent me to #Chomsky's #PowersAndProspects(1996). _Prospects_ reminded me of Alex Carey's _Taking the Risk Out of Democracy_ and I now see #Mayo's #HawthorneStudies as 1927-1932 #PHacking. #GeorgeEltonMayo! in textbooks.
^1 AI, short for SALAMI: Systematic Approaches to Learning Algorithms and Machine Inferences #AISalami

2023-02-22

> Psychologists who specialize in exercise music have quantified.. [that] listening to songs with high tempos motivates us to run faster, and the swifter we move, the quicker we prefer our music. Likewise, when drivers hear loud, fast music, they unconsciously step a bit harder on the gas pedal. Walking at our own pace creates an unadulterated feedback loop between the rhythm of our bodies and our mental state that we cannot experience as easily.. during any other.. locomotion.
#PHacking?

2023-02-21

> OUTRIGHT FAKERY IS CLEARLY more common in.. sciences than we’d like to believe. But it may not be the biggest threat to their credibility... #MichaelKinsley once said of wrongdoing in Washington, so too in the lab: “The scandal is what’s legal.” The kind of manipulation that went into the “When I’m Sixty-Four” paper, for instance, is “nearly universally common,” #Simonsohn says. It is called “p-hacking,” or, more colorfully, “torturing the data until it confesses.”
#PHacking #OnBullshit?

2023-01-07

No more #PHacking or vague #HealthClaims for products to obscure the fact that they lack solid evidence from #RCTs for such claims. So says the FTC. We say that's definitely a good call.
conscienhealth.org/2023/01/ftc

2022-12-16

just a little engineering mishap I guess, low probability to happen, but HUGE effect size 🤣🤣🤣
#Science #pHacking

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst