på¤ãããã³ã°ã«ã¤ãã¦ã®è«æãèªãã
PLOS Biology: The Extent and Consequences of P-Hacking in Scienceãèªãã ã
ä¸ã®ä¸ã«ã¯på¤ãå°ãã(ã¤ã¾ãçµ±è¨çã«ææ)ãªãã¼ã¿ãå°ã°ããå¾åããããããããã¨çºè¡¨ãããçµæã¯ææãªãã®ã°ããã ããæªããã°è©ç§°ãããããªããééã£ãçµæãéãã¦ãã¾ãã¨ã¡ã¿è§£æãã¦ããã¤ã¢ã¹ãæ®ããããã§p-hackã®å¯è½æ§ãæ¤å®ããæ¹æ³ãæ±ã£ãè«æã
ã®ã¯ããªãã ãã©ãç¥èããªãããããæ¬å½ã«ããã§ãããã®ãã¨ããçåãæ®ã£ãã以䏿¦è¦ã
1. p-hackingã¨ã¯?
ç ç©¶è ããããã¡ãªãã¤ã¢ã¹ã¨ãã¦selection biasãinflation biasããããselection biasã¯ææã§ãªãå®é¨çµæãä¸ã«åºãªããã¨ãinflation biasã¯ããããp-hackingã§ãã广éããå°ããã®ã«ãµã³ãã«ãµã¤ãºã大ããããããé½åã®è¯ãæ¤å®ææ³ã§çºè¡¨ãã¦ãã¾ã£ãããããã¨ã
2. èãæ¹
på¤ãããã³ã°ãè¦æããªãã¨ã¡ã¿è§£æããã¤ã¢ã¹ãåããã®ã§ãã®æ¹æ³ã¨ãã¦på¤ã®åå¸(p-curve)ã使ãããã¨ããã®ããã®è«æã®ææ¡ã以ä¸ã§ã¯på¤ã<0.05ã§ææã¨ããå ´åã«ã¤ãã¦èããã

广éã0ãªå®é¨ããã®på¤ã¯ä¸æ§åå¸ã«ãªããããã«selection biasããããã¨p>0.05ã§ã°ã©ããä¸é£ç¶ã«ä½ä¸ããã

广éã0ã§ãªãå®é¨ããã¯å°ããpå¤ãåºã確çãé«ãã®ã§p-curveã¯å³ä¸ããã«ãªããselection biasããããã¨p>0.05ã§ã°ã©ããä¸é£ç¶ã«ä½ä¸ããã

på¤ãããã³ã°ã§ã¯p>0.05ãªã®ãç¡çãã0.05ãä¸åããã使¥ãããã®ã§ã0.05ã®å·¦å´ã§ä¸èªç¶ãªå³ä¸ãããç¾ããã

p-hackingãããã¨ãããã0.5ãä¸åããªãpå¤ãç¡çããå°ããããã®ã§ãã0.5-2α<p<0.5-αãã«å ¥ãpå¤ãã0.5-α<p<0.5ãã«å ¥ãpå¤ãããå°ãªããªããããã§å¸°ç¡ä»®èª¬ããpå¤ãããããã®ãã³ã«å ¥ã確çã¯0.5ãã¨ãã¦ããã¼ã¿ããåã£ãçµæãåºã確çãé¾å¤ããå°ãããã°ã帰ç¡ä»®èª¬ãæ£å´ããã(çå´2é æ¤å®)
ä¾ãã°26åã®på¤ããã£ã¦ã10ãã0.5-α<p<0.5ãã16ãã0.5-2α<p<0.5-αãã«å ¥ã£ãæã
> sum(dbinom(x = 0:10,size = 26,prob = 0.5))
0.1634698
> binom.test(c(10,16),alternative = "less")
Exact binomial test
data: c(10, 16)
number of successes = 10, number of trials = 26,
p-value = 0.1635
alternative hypothesis: true probability of success is less than 0.5
95 percent confidence interval:
0.0000000 0.5643373
sample estimates:
probability of success
0.3846154
ææ³
ä¸ã®æ¹æ³ã使ã£ã¦æªããçµæãæ¶ããã¨ã§ãæ£ãããçµæãå¾ãããã¨ã®ãã¨ã ã£ãããã®ææ³ã§èªåãéåæãæãã¦ããã®ã¯ä»¥ä¸ã®ãããªã±ã¼ã¹ã

0.05ãã大ããã¨ããã«ã¯ãã¨ãã¨på¤ããã¾ããªãå ´åã§p-hackãè¦ããããããªã¨ã(䏿£ããæå³ãªãã...)ãã¤ã¾ãpå¤ã®å¹³åã0.05ããå°ããæã«ã0.05å¨è¾ºã®ãã¼ã¿ãç¡è¦ããã¨ææãªçµæã°ããæãåºããè§£æã«ãªããªãã®ãã¨ãããã¨ãæ°ã«ãªã£ã¦ãããã¡ã¿è§£æã®ä»æ¹ã¨ã广éã®åå¼·ãããã°çè§£ã§ããã®ããª...?