data science

How to do a SUMIF in PySpark

  • by
  • 2 min read

One of the most frequently used Excel functions is probably SUMIF, along with its SUMIFS variant. In this article, you’ll learn how to do exactly the same in PySpark. What is the SUMIF function? In Excel, the SUMIF function is an aggregation function for summing values from a column, but only…

World Cup 2018: The hopelessness of being the runner-up in the group phase

  • by
  • 4 min read

Yesterday, Iceland gained a point against Argentina by putting up an icy wall in front of their goal. Nothing adds more spice to a football game than an underdog holding out against a football superpower. Sometimes details are responsible for second-tier teams winning their group phase and for…