We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
On the dataset defined here, a group operation is extremely slow on JuliaDB compared to DataFrames.
The following benchmark was done on dataset of size N=1e8 from the link above.
Grouping by one column and calculating sum along another
On JuliaDB
@btime groupby(sum, df, :id1, select=:v1); 6.908 s (1710 allocations: 1.68 GiB)
On DataFrames
@btime combine(groupby(df, :id1), :v1=>sum) 743.827 ms (222 allocations: 762.96 MiB)
This was on Julia 1.4, DataFrames v0.21.0, and JuliaDB v0.13.0
The text was updated successfully, but these errors were encountered:
No branches or pull requests
On the dataset defined here, a group operation is extremely slow on JuliaDB compared to DataFrames.
The following benchmark was done on dataset of size N=1e8 from the link above.
Grouping by one column and calculating sum along another
On JuliaDB
On DataFrames
This was on Julia 1.4, DataFrames v0.21.0, and JuliaDB v0.13.0
The text was updated successfully, but these errors were encountered: