Using ‘PARTITION BY’ in BigQuery to calculate the share per group

  • by
  • 3 min read

In this blog post we’ll calculate the share of a row per group. There are multiple ways to do this: with aggregate functions and with analytic functions. We’ll go over both and demonstrate how analytic functions are far superior in terms of readability. First, let’s create a table to demonstrate… 

A primer on iOS 14.5’s impact on Facebook audience building (and what to do about it)

  • by
  • 6 min read

I’ve read a lot of bullshit regarding the impact of Apple’s notorious iOS14.5 update. It’s a fairly technical & complex matter, yet most articles on the subject are clickbait or just content for the sake of being content, often written by a junior content marketer. Consequently, I couldn’t resist finding… 

How to deploy a scraping script and Selenium in Google Cloud Run

In this blog post, you’ll learn how you can run scraping jobs using Selenium from Google Cloud Run, a fully managed solution that eliminates the burden of configuring and managing a Kubernetes cluster. It simplifies your production pipeline and fully automates your scraping process. Take notice, I’m not an expert… 

Set environment variables in a Python venv (Windows)

  • by
  • 1 min read

This is a brief overview of the possibilities for setting environment variables in a virtual environment (venv). Set environment variables in Python code Environment variables can be set inside your Python script. All child processes will inherit the environment variables and their values. Set environment variables in activate.bat or activate.ps1… 

Solve Docker error: “unable to delete (must be forced) – image is referenced in multiple repositories”

  • by
  • 2 min read

I’ve had to brush up my Docker skills again due to some side projects I’m working on. So I ran into some basic issues I wanted to elaborate on. In this blog post: “image is referenced in multiple repositories.” I had to remove a Docker image from my machine using…