Data completeness via a server-side Mixpanel implementation

The future is server-side. The rise of ad blockers and other privacy extensions hasn’t made product analytics implementation easier. On the other hand, good ol’ Google Analytics has gained quite a lot of competition in recent years from vendors like Heap and Mixpanel. A friend of mine asked my opinion… 

Solve “invalid table reference” when streaming into BigQuery

  • by
  • 2 min read

Recently I decided to start streaming clickstream data (from this blog, indeed) into BigQuery using the insertAll method. It is a relatively cheap solution that requires almost no technical configuration. Until I bumped into an error. Let’s tackle it in this blog post. The error I encountered is: “google.api_core.exceptions.BadRequest: 400… 

Pandas: Solve ‘You are trying to merge on object and float64 columns’

  • by
  • 2 min read

Pandas can be somewhat puzzling, sometimes. It has a ton load of functionalities, but that can make the syntax and methods obscure. Simply judging from the method name, the ‘join’ and ‘merge’ method could be the same thing. However, they aren’t. Here’s an error that I used to run into… 

Solving “Permission ‘cloudfunctions.functions.get’ denied on resource”

  • by
  • 2 min read

Your first steps in setting up Cloud Build for deploying Google Cloud Functions but running into yet another error? You’re not alone. With this blog post, I hope to put you on your way again. Have you run into the following error? “Permission ‘cloudfunctions.functions.get’ denied on resource […]” Here’s what’s… 

R: Is a bootstrap of mean differences statistically significant

  • by
  • 2 min read

Last year, I wrote a blog post about bootstrapping (two-sample) the mean in R. Recently, someone contacted me with the question of how to get the corresponding percentile for a specific value of those bootstrapped differences. It’s something you need to answer the question if the difference is statistically significant…