Skip to content
Home » pandas » Page 4

pandas

Solve DtypeWarning: Columns have mixed types. Specify dtype option on import or set low_memory=False in Pandas

The title of this blog post says it all: let’s solve a warning. It’s one that occurs when you’re using Pandas’ read_csv. Let’s start by just doing that. Let’s say you run into the following error: Solve DtypeWarning: Columns (X,X) have mixed types. Specify dtype option on import or set… 

Pandas: Solve “You are trying to merge on datetime64[ns] and Object columnS”

Without a doubt, Pandas is one of the most popular Python packages within Data Science. However, coming from an R background, discovering the difference between the join and merge method was mind-boggling. Especially if it generates confusing errors that urge you to use pd.concat, which you shouldn’t. This is the… 

Pandas’ pivot_table vs. pivot

  • by
  • 4 min read

When you’re an R poweruser, pivoting tables in pandas feels unnecessarily complex. Why are there two pivot functions? Why does it return an index when you wanted a column? Why does it generate multi index columns? Those are the questions I tackle in this blog post. 💥 This blog post… 

Working with NaN’s (nulls/NA’s) in pandas: per column, per row and per group

  • by
  • 2 min read

Getting a firm understanding of NaNs in your dataset ensures you don’t draw wrong conclusions from an incomplete dataset. In this blog post I show how you can list the amount of NaNs per column, per row, and per group. First, let’s create some dummy data, and add some NaNs.… 

Replacing multiple values in a pandas DataFrame column

Without going into detail, here’s something I truly hate in R: replacing multiple values. In Python’s pandas, it’s really easy. In this blog post I try several methods: list comprehension, apply(), replace() and map(). First, let’s create some dummy data. First, let’s try with list comprehension. The get() function tries… 

Pandas: Solve ‘You are trying to merge on object and int64 columns’

Pandas is the go-to package for anything data science in Python. However, if you’re used to R and the convenience of dplyr or data.table, pandas can be confusing, now and then. For example, the following error is a real newb issue. ValueError: You are trying to merge on object and…