how to

Solve “Message: ‘chromedriver’ executable needs to be in PATH”

  • 2 min read

All of us data hoarders get to a point where we need to circumvent the rules put in place to prevent us from scraping the web. Enter Selenium, the world’s favorite tool for automating tasks in a browser. Selenium uses ChromeDriver, which was initially created for automated testing but is also a splendid scraping tool. In… 

Dataform: Create if incremental table doesn’t exist (JS)

  • 2 min read

One of Google’s latest acquisitions, the ELT data orchestration tool Dataform, supports JavaScript. This is a great way to chain Dataform’s actions. In this blog post we explore how to automatically create a destination table in your data warehouse before an incremental definition is run. If you are interested in a… 

BigQuery: JSON_EXTRACT vs. JSON_QUERY

  • 2 min read

In this blog post, I discuss the difference between several BigQuery functions that appear to do the same thing: JSON_EXTRACT and JSON_QUERY, and their value counterparts JSON_EXTRACT_SCALAR and JSON_VALUE. MPP data warehouses are increasingly becoming data lakes. In this light, extracting strings/values from semistructured data (stored as a JSON STRING)… 

Solve “PermissionError: [Errno 1] Operation not permitted” when deploying to DigitalOcean

I’m a data scientist. Deploying is not one of my core skills, but it is something I need to be able to do when working on hobby projects. I recently embraced DigitalOcean to deploy my Django (Python) project and want to elaborate on a problem I ran into. The error that appeared… 

How to install cURL in Windows

  • 3 min read

Lots of product documentation refers to and generates cURL commands because they are a convenient way to test API endpoints. For data analysts/scientists with a not-so-tech-savvy background, it can be hard to grasp what cURL is, how to install it, and how to use it. In this blog post: how…