Skip to content

Creating a GCP Service Account and linking it to your system

When you haven’t gone through the Google Cloud Platform authentication process in your development environment, you might run into the following error message. In this blog post, I explain how you can solve it. google.auth.exceptions.DefaultCredentialsError: Could not automatically determine credentials. Please set GOOGLE_APPLICATION_CREDENTIALS or explicitly create credentials and re-run the… 

Error in .jcall(“RJavaTools”…) when importing large xlsx files in R

Recently I had to read in a folder full of large Excel (.xlsx) files. I did as I usually did and use the xlsx library. However, reading in the largest files produced an error: “Error in .jcall(“RJavaTools”, “Ljava/lang/Object;”, “invokeMethod”, cl, : java.lang.OutOfMemoryError: Java heap space” I don’t know what cause… 

Date conversion from Excel to R

  • by
  • 1 min read

It happened too many times before I write this blog post. Oftentimes, when I read Excel (xls or xlsx) files into R, I encounter this strange phenomenon where dates are converted to a 5 number integer. Here is how to fix it. For example, 02/01/2017 (dd/MM/yyyy) would be converted to… 

Over racistische machines

  • by
  • 7 min read

In het boek Outnumbered gaat auteur David Sumpter op zoek naar de beperkingen van de algoritme-hype. In het hoofdstuk Impossibly unbiased beschrijft hij hoe algoritmes fouten kunnen maken. Hij kwam terecht bij justitie, in de VS. Zo publiceerde ProPublica in 2016 een artikel dat een gevoelige snaar raakte bij data… 

Optimizing the number of clusters using Tibshirani’s gap statistic

  • by
  • 5 min read

When you are clustering, what you are actually trying to do is to find groups of objects so that they are similar to one another, and different from the object of other groups. In other words, you want to minimize the intra-cluster distance and maximize the inter-cluster distance. Clustering algorithms… 

An almost complete overview of Apple WebKit’s Intelligent Tracking Prevention

  • by
  • 10 min read

Historically, browsers have had a great deal of control over the online experience of end users. Since their genesis, several different browsers have competed for dominant market share. With the arrival of Intelligent Tracking Prevention (ITP), it seems that big tech is now using browser standards to target each other’s… 

“Hit Refresh”: Or how one man’s personal life helped Microsoft rediscover its soul

  • by
  • 4 min read

Another management book? Not just a management book, but the story of Satya Nadella. Who? The son of a marxist economist and a drama professor but mostly known as Microsoft’s less famous, less rich, but current CEO. A brief review. Nadella’s book ‘Hit Refresh’ is a triptych. The first part…