Group by - Mode of categorical variable is sometimes completely off #6948

Closed
wvdvegte opened this issue Dec 5, 2024 · 2 comments · Fixed by #6958
Labels: bug (A bug confirmed by the core team)

Comments

wvdvegte commented Dec 5, 2024

What's wrong?
When grouping by a categorical variable, selecting the mode of another categorical variable sometimes produces an incorrect value.

How can we reproduce the problem?
In the attached workflow, the variable "group" is grouped by cluster (apologies for the confusing variable name). In the result for C2, for instance, the concatenation shows that DOS is the most frequent value of "group", but the mode column shows HCD, which isn't even present in C2.
Group by - mode bug.zip

What's your environment?

  • Operating system: macOS Sequoia
  • Orange version: 3.38.0
  • How you installed Orange: upgraded from DMG

wvdvegte added the "bug report" label (Bug is reported by user, not yet confirmed by the core team) on Dec 5, 2024
janezd self-assigned this on Dec 6, 2024
janezd added the "bug" label (A bug confirmed by the core team) and removed the "bug report" label on Dec 6, 2024

janezd (Contributor) commented Dec 6, 2024

This could be caused by #6906.

janezd assigned VesnaT and unassigned janezd on Dec 6, 2024

janezd (Contributor) commented Dec 13, 2024

Note: The bug occurs in released Orange, which packages pandas 1.5.3. It seems that it cannot be reproduced with newer pandas, e.g. 2.2.3.
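
Since the result appears to depend on the pandas version, one way to sanity-check a suspicious mode is to recompute it without pandas at all. The following is only a minimal sketch using the Python standard library, not how Orange's Group by widget computes the mode. The function name `reference_modes` and the toy rows are hypothetical; only the labels C2, DOS and HCD come from the report above.

```python
from collections import Counter, defaultdict

# Hypothetical toy rows of (cluster, group). Only the labels C2, DOS and HCD
# appear in the report; the remaining values are made up for illustration.
rows = [
    ("C1", "HCD"), ("C1", "HCD"), ("C1", "DOS"),
    ("C2", "DOS"), ("C2", "DOS"), ("C2", "PROBE"), ("C2", None),
]


def reference_modes(rows):
    """Map each cluster to its most frequent non-missing group value.

    Clusters that contain only missing values are left out of the result.
    """
    counts = defaultdict(Counter)
    for cluster, value in rows:
        if value is not None:  # skip missing values
            counts[cluster][value] += 1
    # most_common(1) returns a one-element list of (value, count) pairs.
    return {c: counter.most_common(1)[0][0] for c, counter in counts.items()}


modes = reference_modes(rows)
print(modes)
```

By construction, a mode computed this way is always a value that actually occurs in its cluster, so a result like HCD for C2 could not come out of it. When two values are tied, this sketch returns the one that was seen first, which may differ from the tie-breaking used elsewhere.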
