Open
Labels: P1 (important tasks that we should complete soon), bug (something isn't working), hybrid-execution
Description
The PandasQueryCompiler doesn't implement reset_index() for all scenarios. Per the docs, it will default to pandas "when names or allow_duplicates is non-default." This is important because switching backends may require the index to be reset to preserve metadata. Defaulting to pandas would cause performance degradation every time a switch involving the Ray backend occurs.
Example:
import modin.pandas as pd
from modin.config import Backend

Backend.put("Ray")
index = pd.Index(["A", "B", "C"], name="index")
df = pd.DataFrame([[1, 2], [3, 4], [5, 6]], index=index)
df.reset_index(names="index")  # names is not None, so this emits a UserWarning and defaults to pandas
Note:
Logs UserWarning: reset_index is not currently supported by PandasOnRay, defaulting to pandas implementation.
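One way to check for this fallback programmatically is to record warnings around the call and look for the "defaulting to pandas" text quoted above. The sketch below is only an illustration: the helper name emits_pandas_fallback and the stand-in function fake_reset_index are hypothetical and not part of Modin, and the stand-in just re-emits the message from this report so the snippet runs without Modin or Ray installed.

```python
import warnings


def emits_pandas_fallback(operation):
    """Run operation() and report whether a 'defaulting to pandas' UserWarning was emitted.

    Hypothetical helper for illustration; it is not a Modin API.
    """
    with warnings.catch_warnings(record=True) as caught:
        # Record every warning, even ones that were already shown once.
        warnings.simplefilter("always")
        operation()
    return any(
        issubclass(w.category, UserWarning)
        and "defaulting to pandas" in str(w.message)
        for w in caught
    )


def fake_reset_index():
    # Stand-in that emits the same message as the one quoted in this issue.
    warnings.warn(
        "reset_index is not currently supported by PandasOnRay, "
        "defaulting to pandas implementation.",
        UserWarning,
    )


print(emits_pandas_fallback(fake_reset_index))
print(emits_pandas_fallback(lambda: None))
```

With the reproduction above, the same helper could presumably be called as emits_pandas_fallback(lambda: df.reset_index(names="index")), assuming the warning text matches the one logged here.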