You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When combining .csv files of uniform column number and data type, the order of the files seems to matter when one of the files has only a single row if pooling is on.
Whereas reversing the order of the files i.e. Multiple Row Data First works in all cases
julia> DataFrame(CSV.File(["filepath/multiple_row_data.csv","filepath/single_row_data.csv"]))
4×2 DataFrame
Row │ Name Age
│ String7 Int64
─────┼────────────────
1 │ Bob 25
2 │ Charlie 28
3 │ David 22
4 │ Alice 30
julia> CSV.read(["filepath/multiple_row_data.csv","filepath/single_row_data.csv"],DataFrame)
4×2 DataFrame
Row │ Name Age
│ String7 Int64
─────┼────────────────
1 │ Bob 25
2 │ Charlie 28
3 │ David 22
4 │ Alice 30
Hi!
When combining .csv files of uniform column number and data type, the order of the files seems to matter when one of the files has only a single row if pooling is on.
for example
If I try to combine the CSV files into a single DataFrame with the Single Row Data first I receive the following errors.
Similarly
and
both return the following error
Whereas reversing the order of the files i.e. Multiple Row Data First works in all cases
multiple_row_data.csv
single_row_data.csv
as pointed out by @nilshg here this comes from pooled arrays and turning pooling off by setting
pool = false
fixes the problem.The text was updated successfully, but these errors were encountered: