Book Image

Clojure for Data Science

By : Henry Garner
Book Image

Clojure for Data Science

By: Henry Garner

Overview of this book

Table of Contents (18 chapters)
Clojure for Data Science
Credits
About the Author
Acknowledgments
About the Reviewer
www.PacktPub.com
Preface
Index

Effect size


In this chapter, we focused on statistical significance—the methods employed by statisticians to ensure a difference is discovered, which cannot be easily explained as chance variation. We must always remember that finding a significant effect isn't the same as finding a large effect. With very large samples, even a tiny difference in sample means will count as significant. To get a better sense of whether our discovery is both significant and important, we should state the effect size as well.

Cohen's d

Cohen's d is an adjustment that can be applied to see whether the difference we have observed is not just statistically significant, but actually large. Like the Bonferroni correction, the adjustment is a straightforward one:

Here, Sab is the pooled standard deviation (not the pooled standard error) of the samples. It can be calculated in a way similar to the pooled standard error:

(defn pooled-standard-deviation [a b]
  (i/sqrt (+ (i/sq (standard-deviation a))
             (i/sq...