forta-network · RCantu92 · Jun 24, 2024 · Jun 24, 2024 · Jun 24, 2024 · Jun 24, 2024
diff --git a/scam-detector-py/README.md b/scam-detector-py/README.md
@@ -51,8 +51,6 @@ The Scam Detector is opinionated, and consumes evidence of and issues judgment a
    - **Scammer Deployed Contracts**: When an EOA is labeled a scammer, the Scam Detector queries for all contracts created by the EOA at the time the EOA was labeled. This includes direct contract creations (e.g. scammer deploys a token) as well as indirect contract creations (e.g. scammer adds liquidity to a pool that may result in the pool creation). To capture future contract creations, the Scam Detector also monitors for new contract creations by known scammers and emits a new ‘scammer’ label.
 
    - **Scammer Association**: When an EOA is labeled a scammer, label propagation will label associated scammers (e.g. addresses that are a destination for stolen assets) as scammers. [A graph-based approach](https://forta.org/blog/discovering-scammer-networks-with-machine-learning/) is utilized. The threat category of these labels is ‘scammer-association’.
-
-   - **Similar to Scammer**: For all contracts labeled scam by the Scam Detector, a contract similarity bot will identify and emit labels about contracts that resemble known scammer contacts. The threat category of these labels is ‘similar-contract’.
 
 **False Positive Mitigation**
 
@@ -161,11 +159,6 @@ The complete list of scammer label threat-categories, and conditions under which
     <td>Passthrough Label</td>
     <td>At times, the specific threat category can not be identified, but there is confidence in the address being associated with a scam. In those cases, the threat category is set to unknown.</td>
   </tr>
-  <tr>
-    <td>similar-contract</td>
-    <td>Propagation Label</td>
-    <td>Emitted to identify a newly deployed contract that is similar to a known scammer contract</td>
-  </tr>
   <tr>
     <td>scammer-deployed-contract</td>
     <td>Propagation Label</td>
@@ -241,11 +234,11 @@ For reference, each field is described below:
   </tr>
   <tr>
     <td>base_bot_alert_ids</td>
-    <td>When the label is emitted via passthrough, ML, similar contract, or an association alert, this field will contain the alert ids of the base bot alerts utilized to derive the label.</td>
+    <td>When the label is emitted via passthrough, ML, or an association alert, this field will contain the alert ids of the base bot alerts utilized to derive the label.</td>
   </tr>
   <tr>
     <td>base_bot_alert_hashes</td>
-    <td>When the label is emitted via passthrough, ML, similar contract, or an association alert, this field will contain the alert hashes of the base bot alerts utilized to derive the label.</td>
+    <td>When the label is emitted via passthrough, ML, or an association alert, this field will contain the alert hashes of the base bot alerts utilized to derive the label.</td>
   </tr>
   <tr>
     <td>deployer_info</td>
@@ -327,7 +320,7 @@ Address poisoners are the initiator of the address poisoning activity. A simple
 
 Native ice phishing are straight transfers of native assets to the scammer. Matching the to address of the transaction against Forta threat intelligence yield transactions for this type of scam.
 
-### soft-rug-pull, hard-rug-pull, rake-token, impersonating-token, similar-contract, scammer-deployed-contract
+### soft-rug-pull, hard-rug-pull, rake-token, impersonating-token, scammer-deployed-contract
 
 These threat categories all point to contracts that a user should not be interacting with. A check of the to address or the transaction trace data against Forta threat intelligence yields transactions where this may be the case. 
 

diff --git a/scam-detector-py/base_bots_additions.md b/scam-detector-py/base_bots_additions.md
@@ -19,13 +19,22 @@ Once added, the new bot/alert id should be handled by the Scam Detector. It is h
 
 ## Contract Similarity Bot
 
-Currently, we only have one contract similarity bot configured. A new similarity bot could be configured by addition to the `CONTRACT_SIMILARITY_BOTS` with the `CONTRACT_SIMILARITY_BOT_THRESHOLDS` to be utilized. A contract similarity bot would need to emit the following fields in the metadata:
-    - new_scammer_contract_address
-    - new_scammer_eoa
-    - scammer_contract_address
-    - scammer_eoa
-    - similarity_hash
-    - similarity_score
+Currently, we do not have a contract similarity bot configured. A new similarity bot could be configured by the addition of:
+- `CONTRACT_SIMILARITY_BOTS`
+- `CONTRACT_SIMILARITY_BOT_THRESHOLDS` to be utilized
+- new entries to `BASE_BOTS` and `CONFIDENCE_MAPPINGS`
+- `ALERTED_ENTITIES_SIMILAR_CONTRACT_KEY` and `ALERTED_ENTITIES_SIMILAR_CONTRACT_QUEUE_SIZE` for persistence
+
+in `constants.py`. A contract similarity bot would need to emit the following fields in the metadata (and the necessary logic can be added to `findings.py`):
+
+- new_scammer_contract_address
+- new_scammer_eoa
+- scammer_contract_address
+- scammer_eoa
+- similarity_hash
+- similarity_score
+
+The Scam Detector's logic itself would then need to be updated for the newly added items listed above. A previous implementation of such logic can be found at this commit, [420ce3cced8ee7acb7e1ddb23ccf4e27019da8dc](https://github.com/forta-network/starter-kits/tree/420ce3cced8ee7acb7e1ddb23ccf4e27019da8dc/scam-detector-py), to more easily reintroduce.
 
 In addition, the deployer of the new contract needs to be extracted. This should be configured in the `basebot_parsing_config.csv`.
 

diff --git a/scam-detector-py/package-lock.json b/scam-detector-py/package-lock.json