The Mermaid's Tale: Non-replicability in science: your antelope for the day

Friday, May 18, 2012

Non-replicability in science: your antelope for the day

A piece in the May 17 Nature supports one of Ken's favorite observations, something he says while wearing his Anthropologist's hat -- "Journal articles are just an academic's antelope for the day." We're still just hunter/gatherers -- our published papers are, more often than not, nothing more than the way we feed ourselves. Our basket of berries -- eaten today, droppings tomorrow.

Blackbuck male, females; Photo from Wikimedia, Mr Raja Purohi

Ed Yong, in "Replication studies: Bad copy," reports that most published studies can't be replicated. This is something we often talk about with respect to genetic studies, and there are many reasons for this that are specific to genetic data, but apparently it's even more rampant in psychology, for reasons also specific to the field.

And there is the notorious problem that 'negative' results are not published very often. They're not glamorous and won't get you tenure--even if some of the most important findings in science are 'negative' if they steer work towards valid rather than dreamt-of theory or hypothesis. Clinical trials are a major example, but less noticed are ephemeral natural selection stories about evolution.

A paper published last year claiming support for extrasensory perception, or psi, for example, produced a major kerfuffle (we blogged about it at the time). The aftermath has been no less interesting, and informative about the world of publishing, as researchers who tried to replicate the findings but failed also failed to find publishers for their results. This lead to a lot of discussion about the implications of negative results not being published, a discussion that has flared up frequently in academia, as well it should, although we're no closer to resolving it than ever.

There are some experiments that everyone knows don't replicate, but this knowledge doesn't get into the literature,” says [Eric-Jan] Wagenmakers [mathematical psychologist at the University of Amsterdam]. The publication barrier can be chilling, he adds. “I've seen students spending their entire PhD period trying to replicate a phenomenon, failing, and quitting academia because they had nothing to show for their time.”

But we'll leave that issue for another time.

The question of why studies so often aren't replicable is a different, if related one. And one that The Reproducibility Project, a large scale collection of scientists from around the world, is addressing head on, as they attempt to replicate every study published in three major psychology journals in 2008, as described last month in the Chronicle of Higher Education.

For decades, literally, there has been talk about whether what makes it into the pages of psychology journals—or the journals of other disciplines, for that matter—is actually, you know, true. Researchers anxious for novel, significant, career-making findings have an incentive to publish their successes while neglecting to mention their failures. It’s what the psychologist Robert Rosenthal named “the file drawer effect.” So if an experiment is run ten times but pans out only once you trumpet the exception rather than the rule. Or perhaps a researcher is unconsciously biasing a study somehow. Or maybe he or she is flat-out faking results, which is not unheard of.

According to Yong, the culture in psychology is such that experimental designs that "practically guarantee positive results" are perfectly acceptable. This is one of the downsides of peer review -- when all your peers are doing it, good scientific practice or not, you can get away with it, too.

And once positive results are published, few researchers replicate the experiment exactly, instead carrying out 'conceptual replications' that test similar hypotheses using different methods. This practice, say critics, builds a house of cards on potentially shaky foundations.

So, if a study isn't replicated exactly (or however exactly it can be), it's possibly because the methods were not described in enough detail for the study to be replicated. Or, and this is a problem certainly not confined to psychology, the effect was small and significant by chance, as epidemiologist John Ionnides suggested in a paper published in 2005 that garnered a lot of attention for saying most Big-Splash studies are false. He explained this in statistical terms, having to do with bias in significance levels of studies of new hypotheses and similar issues.

As the Chronicle story says about non-replicability:

The researchers point out, fairly, that it’s not just social psychology that has to deal with this issue. Recently, a scientist named C. Glenn Begley attempted to replicate 53 cancer studies he deemed landmark publications. He could only replicate six. Six! Last December I interviewed Christopher Chabris about his paper titled “Most Reported Genetic Associations with General Intelligence Are Probably False Positives.” Most!

So, psychology is under attack. We blogged not long ago about an op/ed piece in the New York Times by two social scientists calling for an end to the insistence that the social sciences follow any scientific method. Enough with the physics envy, they said, we don't do physics. Thinking deeply is the answer. But, would giving these guys free rein to completely make stuff up really be the solution? Well, it might just be, if their peers agree. But, let's not just pick on psychology. The problem is rampant throughout the sciences.

Meanwhile, the motto seems to be: Haste makes....nutrition for scientists!

8 comments:

James GoetzMay 18, 2012 at 2:49 PM
Great post Anne.

Lately, I am refocusing on the definition of the scientific method. For example, if we interpret unrepeatable observations including unrepeatable experiments, then at best we are doing scientific speculation, history, or philosophy. We of course can make scientific hypotheses and theories about past. In these cases, the scientists are not replicating the past, but making inference from, for example, comparative genetics, comparative anatomy, geology, and the redshift in the galaxies. These theories of the past have their limits while there is no need to doubt, for example, the common genetic ancestry of all animals. However, if interpretations of unrepeatable medical or psychological experiments are called a scientific hypothesis instead of mere speculation, then the researcher has crossed the line into pseudoscience.

I will also clarify that I fully appreciate the need for speculation and philosophy, but the problem is when the speculations and philosophy are improperly labeled a scientific hypothesis.

Per "boring" negative results, perhaps open access journals is a convenient place to publish them.

I hope to have more time to criticize such problems in science, and you guys are great source of information for this : -)
ReplyDelete
Replies
James GoetzMay 18, 2012 at 5:56 PM
I wholeheartedly agree that "non-replicable doesn't necessarily mean wrong" because historical method of the non-replicable realm can be powerful and accurate. But regardless of the amount of empirical observation used with historical method, all historical theories and hypotheses are not scientific theories or hypotheses. Perhaps your example of different alleles causing the same disease was discovered by historical method with a lot of empirical observation.
ReplyDelete
Replies