You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, after a discussion with @piever , he suggested I open an issue with StructArrays, but I am also posting it here to help facilitate communication between the projects. A recent update to CategoricalArrays is suspected to have created an extreme slowdown of the IndexedTables/JuliaDB table() function. The full details of that conversation can be found on this open issue posted on JuliaDB.
The short version is (with a MWE):
using JuliaDB, BenchmarkTools, CategoricalArrays
functiontabletest()
names =fill.(["red","blue","green"], 150000) |> Base.Iterators.flatten |> collect
loci ="loc".*string.(collect(1:1500))
loci =fill.(loci, 300) |> Base.Iterators.flatten |> collect
genotypes =fill((1,2), 450000)
returntable((names = names, loci = loci, genotypes = genotypes), pkey =:names)
endfunctiontabletest_cat()
names =fill.(["red","blue","green"], 150000) |> Base.Iterators.flatten |> collect
loci ="loc".*string.(collect(1:1500))
loci =fill.(loci, 300) |> Base.Iterators.flatten |> collect
genotypes =fill((1,2), 450000)
returntable(
(names =categorical(names, compress =true),
loci =categorical(loci, compress =true),
genotypes = genotypes),
pkey =:names
)
end
julia> @benchmark tabletest()
BenchmarkTools.Trial:
memory estimate: 79.81 MiB
allocs estimate: 900109
--------------
minimum time: 246.040 ms (7.45% GC)
median time: 270.271 ms (12.39% GC)
mean time: 280.085 ms (15.80% GC)
maximum time: 436.054 ms (44.68% GC)
--------------
samples: 18
evals/sample: 1
Howerver, doing @benchmark tabletest_cat() has been running for several hours and hadn't finished by the time I killed the job, which is probably not a good sign.
The text was updated successfully, but these errors were encountered:
Hello, after a discussion with @piever , he suggested I open an issue with StructArrays, but I am also posting it here to help facilitate communication between the projects. A recent update to CategoricalArrays is suspected to have created an extreme slowdown of the IndexedTables/JuliaDB
table()
function. The full details of that conversation can be found on this open issue posted on JuliaDB.The short version is (with a MWE):
Howerver, doing
@benchmark tabletest_cat()
has been running for several hours and hadn't finished by the time I killed the job, which is probably not a good sign.The text was updated successfully, but these errors were encountered: