Creating and managing a list of dataframes in R

by roelpi
May 8, 2020August 31, 2020
16 Comments
2 min read

Why do people put data in a list in the first place? Because it can be really darn handy. In this blog post I elaborate on some good use cases for putting data frames in a list.

Loading a lot of files

In many situations you will be confronted with a lot of flat files that contain the same data, but for another period, or another department. In the following two examples, we load in multiple CSV files. Bot examples use data.table. And so should you.

In the first example, I use a simple for loop to go over all the files. There’s no straightforward way to enumerate over the files (like in Python), so that’s why I use i as the iteration counter. Every loop I load the contents of the CSV files using fread() and at the same time, I assign an extra column that contains the filename. This is inserted as a list item into df_list. Finally, I use rbindlist() to put alle the date frames into one big data frame.

library(data.table)
df_list <- list()

for (i in 1:length(list.files())) {
  df_list[[i]] <- fread(list.files()[i])[,FILE := list.files()[i]]
}

df <- rbindlist(df_list)

You can achieve exactly the same by using a recursive function. It’s somewhat longer, but it’s not a boring for-loop. In the following example I create a function that keeps calling itself until all files have been loaded.

library(data.table)

df_list <- list()
load_csvs <- function(dfl,i = 1) {
  if (i <= length(list.files())) {
    dfl[[i]] <- fread(list.files()[i])[,FILE := list.files()[i]]
    load_csvs(dfl, i + 1)
  } else {
    return(dfl)
  }
}

df <- rbindlist(load_csvs(df_list))

Operations on multiple data frames

While having all your data frames together in one big data frame is handy. You might want to keep them separated. Even in that situation, it’s possible to vectorize your operations. Using the lapply function and data.table syntax, I create a new column in all the data frames that exist within my list variable df_list.

lapply(df_list, function(x) { x[,NEW_COLUMN := FIRST_COLUMN + 1]})

You can achieve exactly the same using purrr‘s map function.

map(df_list,function(x) { x[,NEW_COLUMN := FIRST_COLUMN + 1]})

Great success!

Say thanks, ask questions or give feedback

Technologies get updated, syntax changes and honestly… I make mistakes too. If something is incorrect, incomplete or doesn’t work, let me know in the comments below and help thousands of visitors.

16 thoughts on “Creating and managing a list of dataframes in R”

Szpiegowskie Telefonu February 11, 2024 at 1:06 pm

MyCellSpy to potężna aplikacja do zdalnego monitorowania telefonów z systemem Android w czasie rzeczywistym.

Reply
PhillipNic May 16, 2024 at 4:03 pm

Полностью трендовые новинки мировых подиумов.
Важные эвенты известнейших подуимов.
Модные дома, бренды, haute couture.
Самое приятное место для модных хайпбистов.
https://fashionvipclub.ru/

Reply
Richardcof May 17, 2024 at 2:34 pm

Абсолютно стильные события индустрии.
Все мероприятия мировых подуимов.
Модные дома, бренды, гедонизм.
Приятное место для модных хайпбистов.
https://sneakero.ru/

Reply
Victorcag May 19, 2024 at 3:37 am

Очень актуальные события подиума.
Абсолютно все новости мировых подуимов.
Модные дома, лейблы, высокая мода.
Приятное место для стильныех людей.
https://sneakersgo.ru/

Reply
Richardnuash May 21, 2024 at 12:46 am

Полностью трендовые события подиума.
Все эвенты мировых подуимов.
Модные дома, лейблы, высокая мода.
Самое приятное место для модных хайпбистов.
https://paris.luxepodium.com/

Reply
Jeffreyzex May 22, 2024 at 6:47 pm

Style, luxe, hedonism
The best style website for hypebeasts and stylish people.
Podium news, events. Last collections, collaborations, limited editions.
https://dubai.luxepodium.com/

Reply
Charlesprawn May 23, 2024 at 8:57 pm

Полностью стильные новинки мира fashion.
Абсолютно все новости известнейших подуимов.
Модные дома, бренды, haute couture.
Интересное место для модных хайпбистов.
https://richlifestyle.ru/

Reply
BrandonPiomo May 23, 2024 at 11:09 pm

Style, luxe, hedonism
Perfect style website for hypebeasts and cute people.
Fashion news, events. Fresh collections, collaborations, drops.
https://london.luxepodium.com/

Reply
DonaldGes May 24, 2024 at 9:31 pm

Fashion, luxe, hedonism
Good fashion site for hypebeasts and stylish people.
Podium news, events. Latest collections, collaborations, drops.
https://lepodium.in/

Reply
Raymondsok May 29, 2024 at 5:40 pm

Полностью стильные события мировых подиумов.
Актуальные мероприятия самых влиятельных подуимов.
Модные дома, бренды, высокая мода.
Лучшее место для модных хайпбистов.
https://fe-style.ru/

Reply
Louisminia May 30, 2024 at 6:28 pm

Точно актуальные события подиума.
Важные новости всемирных подуимов.
Модные дома, бренды, haute couture.
Интересное место для трендовых хайпбистов.
https://balenciager.ru/

Reply
JerryInvex June 6, 2024 at 8:00 am

Несомненно трендовые события модного мира.
Важные события самых влиятельных подуимов.
Модные дома, бренды, haute couture.
Приятное место для трендовых хайпбистов.
https://luxe-moda.ru/

Reply
MelindaScatt June 6, 2024 at 8:13 am

LeCoupon: трендовые новинки для любителей модного шоппинга
Новости, события, стильные луки, мероприятия, дропы, показы.
https://qrmoda.ru/

Reply
ZofiaPaymn June 16, 2024 at 3:54 am

Несомненно стильные события мировых подиумов.
Исчерпывающие мероприятия лучших подуимов.
Модные дома, торговые марки, гедонизм.
Интересное место для модных хайпбистов.
https://whitesneaker.ru/

Reply
DarrellVak June 16, 2024 at 10:16 am

Очень свежие новинки мира fashion.
Актуальные новости мировых подуимов.
Модные дома, торговые марки, гедонизм.
Самое приятное место для модных хайпбистов.
https://rfsneakers.ru

Reply
Debramar June 23, 2024 at 1:53 pm

Несомненно трендовые новости подиума.
Все эвенты всемирных подуимов.
Модные дома, лейблы, гедонизм.
Самое приятное место для трендовых хайпбистов.
https://worldsfashion.ru/

Reply

Creating and managing a list of dataframes in R

Loading a lot of files

Operations on multiple data frames

Say thanks, ask questions or give feedback

16 thoughts on “Creating and managing a list of dataframes in R”

Leave a Reply Cancel reply

Related Posts

Starting a remote Selenium server in R

How to do a SUMIF in PySpark

How to set the package directory in R