Skip to content

Commit

Permalink
chore: update hostgraph configuration for cc-main-2024-jun-jul-aug
Browse files Browse the repository at this point in the history
  • Loading branch information
thunderpoot committed Aug 18, 2024
1 parent 90b8f08 commit 8a1a3e5
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions src/script/hostgraph/hostgraph_config.sh
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
### saved as tuples <from_host, to_host>

# crawls to be processed
CRAWLS=("CC-MAIN-2024-22" "CC-MAIN-2024-26" "CC-MAIN-2024-30")
CRAWLS=("CC-MAIN-2024-26" "CC-MAIN-2024-30" "CC-MAIN-2024-33")

INPUT_BASE_URL="s3://commoncrawl/"

Expand Down Expand Up @@ -42,7 +42,7 @@ S3A_OUTPUT_PREFIX=s3a://commoncrawl-webgraph
################################################################################
# construct a merged graph of multiple monthly crawls

MERGE_NAME=cc-main-2024-may-jun-jul
MERGE_NAME=cc-main-2024-jun-jul-aug

# Naming convention should be the three months' crawls that are
# used to generate this graph release. In the event of multiple months
Expand Down

0 comments on commit 8a1a3e5

Please sign in to comment.