Structure Refactor and Duplicate Trait Support #75

theelderbeever · 2022-10-07T15:21:01Z

This PR has two main goals...

Refactor the library structure with lessons learned to help with future extensibility and clarity
Refactor the underlying data structures used to allow for duplicate attribute names in metadata

Additionally, a breaking change that moves the "entrypoint" of the library from a scorer interface to a collection has been made.

impreso · 2022-10-07T18:41:47Z

.pre-commit-config.yaml

-    hooks:
-      - id: flake8
-        additional_dependencies: [flake8-bugbear, pep8-naming]
+  # - repo: https://gitlab.com/pycqa/flake8


you need to fix it in github actions as well , otherwise the build fails.

impreso · 2022-10-07T18:59:08Z

open_rarity/metrics/ic.py

+) -> list[AttributeStatistic]:
+    return [
+        {
+            **attr,


let's not use ** syntax , i think it's hard to read

impreso · 2022-10-07T18:59:27Z

open_rarity/metrics/ic.py

+from open_rarity.models.collections import AttributeCounted, AttributeStatistic
+
+
+def information_content(


documentation

impreso · 2022-10-07T19:00:26Z

open_rarity/models/collections/_utils.py

+    """
+    return list(
+        chain(
+            *[


let's not use * , and ** i always find them unintuitive in the code.

impreso · 2022-10-07T19:01:09Z

open_rarity/models/collections/_utils.py

+        _description_
+    """
+    d = defaultdict(int)
+    for key, count in groupapply(tokens, extract_token_name_key, "count").items():


Not in favor of additional dependencies - can we have a list of dependencies and discuss what's needed and what's not?

impreso · 2022-10-07T19:01:42Z

open_rarity/metrics/utc.py

+from open_rarity.models.tokens import TokenAttributeStatistic
+
+
+def unique_trait_count(


Naming utc clashes with utc timestamp - better naming will help us perhaps?

impreso · 2022-10-07T19:07:10Z

open_rarity/scorers/scorer.py


    def __init__(self) -> None:
        # OpenRarity uses InformationContent as the scoring algorithm of choice.
-        self.handler = InformationContentScoringHandler()
+        self.handler = IC()


I think we should be more explicit here - i like more InformationContent

theelderbeever added 3 commits October 6, 2022 09:19

WIP: restructure

5298b60

WIP: Working attribute statistics

e17c070

WIP update typing and add metrics submodule.

0d93677

impreso reviewed Oct 7, 2022

View reviewed changes

open_rarity/metrics/ic.py

) -> list[AttributeStatistic]:

return [

{

**attr,

Copy link

Contributor

impreso Oct 7, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let's not use ** syntax , i think it's hard to read

impreso reviewed Oct 7, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Structure Refactor and Duplicate Trait Support #75

Structure Refactor and Duplicate Trait Support #75

theelderbeever commented Oct 7, 2022

impreso Oct 7, 2022

impreso Oct 7, 2022

impreso Oct 7, 2022

impreso Oct 7, 2022

impreso Oct 7, 2022

impreso Oct 7, 2022

impreso Oct 7, 2022

		from open_rarity.models.collections import AttributeCounted, AttributeStatistic


		def information_content(

		from open_rarity.models.tokens import TokenAttributeStatistic


		def unique_trait_count(

Structure Refactor and Duplicate Trait Support #75

Are you sure you want to change the base?

Structure Refactor and Duplicate Trait Support #75

Conversation

theelderbeever commented Oct 7, 2022

impreso Oct 7, 2022

Choose a reason for hiding this comment

impreso Oct 7, 2022

Choose a reason for hiding this comment

impreso Oct 7, 2022

Choose a reason for hiding this comment

impreso Oct 7, 2022

Choose a reason for hiding this comment

impreso Oct 7, 2022

Choose a reason for hiding this comment

impreso Oct 7, 2022

Choose a reason for hiding this comment

impreso Oct 7, 2022

Choose a reason for hiding this comment