R: Is a bootstrap of mean differences statistically significant

Last year, I wrote a blog post about bootstrapping (two-sample) the mean in R. Recently, someone contacted me with the question of how to get the corresponding percentile for a specific value of those bootstrapped differences. It’s something you need to answer the question if the difference is statistically significant from 0. Let’s do it in this blog post.

In the previous blog post, I stored my 2500 bootstrapped differences in a vector. Let’s call it mean_diffs. If we want to get a percentile for a specific value, we need to produce a cumulative distribution first. We can do this using the ecdf function, which produces a cumulative distribution function from the values we provide it.

We provide ecdf() with the vector and we plot the resulting function.

cdf <- ecdf(mean_diffs)
plot(cdf)

The object cdf is a function. We can provide it with a value and a corresponding percentile will be returned. For example, if we’d like to check if our mean differences are statistically significant from zero, you can provide the function a zero.

cdf(0)

In our example, the corresponding percentile is 0. However, if the difference between both samples is less obvious, it won’t be zero.

Finally, not only can you plot the function or have it return a value, you can also ask for a summary.

summary(cdf)

Like this:

In this blog post you learned how to determine if the bootstrapped difference is statistically significant from zero.

By the way, if you’re having trouble understanding some of the code and concepts, I can highly recommend “An Introduction to Statistical Learning: with Applications in R”, which is the must-have data science bible. If you simply need an introduction into R, and less into the Data Science part, I can absolutely recommend this book by Richard Cotton. Hope it helps!

Say thanks, ask questions or give feedback

Technologies get updated, syntax changes and honestly… I make mistakes too. If something is incorrect, incomplete or doesn’t work, let me know in the comments below and help thousands of visitors.

2 thoughts on “R: Is a bootstrap of mean differences statistically significant”

20bet September 14, 2023 at 4:10 pm

Your article gave me a lot of inspiration, I hope you can explain your point of view in more detail, because I have some doubts, thank you.

Pendaftaran Binance February 23, 2024 at 1:33 pm

Your article helped me a lot, is there any more related content? Thanks! https://www.binance.com/id/register?ref=DB40ITMB

R: Is a bootstrap of mean differences statistically significant

Say thanks, ask questions or give feedback

2 thoughts on “R: Is a bootstrap of mean differences statistically significant”

Leave a Reply Cancel reply

Related Posts

Starting a remote Selenium server in R

How to set the package directory in R

Counting, adding or subtracting business days in R