Sorting on Chocolatey Community Repository should not rely on high download counts alone #287

Open
2 tasks done
davidem00 opened this issue Nov 17, 2023 · 1 comment
Labels
  • 0 - _Triaging: New tickets that need to be hashed out a bit more before they hit the backlog.
  • CommunityRepository
  • Priority_LOW: Represents tickets that are of lower priority and can be taken care of whenever.

Comments

davidem00 commented Nov 17, 2023

Checklist

  • I have verified this is the correct repository for opening this issue.
  • I have verified no other issues exist related to my request.

Is Your Feature Request Related To A Problem? Please describe.

(In a similar spirit to #286.) In many Community Repository searches, especially broad ones, the top Popular results overrepresent packages that are not worth installing directly. They appear Popular because their download counts are inflated by installs triggered as dependencies of other packages, rather than by installs that users invoked directly.

The first page of the default "All Results" search is telling. It doesn't look like what one imagines a "Store" to look like, in terms of what is of interest or worth perusing. Besides items that should not be installed at all (per #286: Flash, ActiveX, Win8 KB patches, etc.), the Most Popular results overrepresent "support library" dependencies, things that by and large are not worth installing directly themselves: chocolatey extensions, various KB packs, vcrt, adobe air, dotnetfx rt, etc.

Describe The Solution. Why is it needed?

The Popularity metric should be reworked to subtract out the count of downloads initiated because it is a dependency of an actual end-user package (whose counts are recorded, themselves).
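
To make the proposed arithmetic concrete, here is a minimal sketch. It assumes a per-package count of dependency-initiated downloads were available, which is purely hypothetical; nothing in this issue indicates the Community Repository records one. The names `PackageStats`, `dependency_downloads`, `adjusted_popularity`, and `sort_by_popularity`, as well as the package names and numbers, are invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PackageStats:
    name: str
    total_downloads: int        # raw download count, as shown today
    dependency_downloads: int   # HYPOTHETICAL: downloads triggered as a dependency


def adjusted_popularity(pkg: PackageStats) -> int:
    """Raw downloads minus dependency-initiated downloads, floored at zero."""
    return max(pkg.total_downloads - pkg.dependency_downloads, 0)


def sort_by_popularity(packages: list[PackageStats]) -> list[PackageStats]:
    """Order packages from most to least popular under the adjusted metric."""
    return sorted(packages, key=adjusted_popularity, reverse=True)


# Made-up numbers: a support library with many indirect installs
# versus an application that users mostly install directly.
packages = [
    PackageStats("support-lib", total_downloads=1000, dependency_downloads=950),
    PackageStats("editor-app", total_downloads=400, dependency_downloads=10),
]
ranked_names = [p.name for p in sort_by_popularity(packages)]
```

Under these made-up numbers the application ranks above the support library even though its raw download count is lower. The raw count stays on each record, so it could still be displayed alongside the adjusted value.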

Additional Context

The raw download count of a package lineage would still be available, but Popularity should evolve to take on a distinct, more meaningful definition.

Otherwise, the Community Repo browser is much less useful as a "store" worth exploring, as a place to discover new/alternative software.

Related Issues

#286

Issue is synchronized with this GitLab issue by Unito

@pauby pauby added CommunityRepository 0 - _Triaging New tickets that need to be hashed out a bit more before they hit the backlog. Priority_LOW Represents tickets that are of lower priority and can be taken care of whenever. labels Nov 17, 2023
@pauby pauby added 0 - Waiting on User Waiting on a response from either a commenter or ticket creator. and removed 0 - Waiting on User Waiting on a response from either a commenter or ticket creator. labels Dec 23, 2023
@pauby pauby changed the title downweight package dependencies in "Popularity" sorting of Community Repository Sorting on Chocolatey Community Repository should not rely on high download counts alone Dec 23, 2023
@sync-by-unito sync-by-unito bot removed the 0 - _Triaging New tickets that need to be hashed out a bit more before they hit the backlog. label Jan 18, 2024
@sync-by-unito sync-by-unito bot added the 0 - _Triaging New tickets that need to be hashed out a bit more before they hit the backlog. label Feb 28, 2024
pauby (Member) commented Aug 9, 2024

Just to add some additional information on this.

The Popularity metric should be reworked to subtract out the count of downloads initiated because it is a dependency of an actual end-user package (whose counts are recorded, themselves).

(emphasis mine).

This isn't possible, and it isn't something we will be adding functionality to support. When a package is installed, it is recorded as being downloaded. A package can be both a standalone package and a dependency of others; which one applies depends on context and the end user's use case.


2 participants