Commit cb514ec

rename: dataforge -> swe-forge
- Rename package, binary, repository URL in Cargo.toml
- Rename all 194 source references (dataforge -> swe_forge)
- Rename all docs/deploy/config references (dataforge -> swe-forge)
- Update CI artifact names and binary paths
- Rename GitHub repo via gh CLI (CortexLM/dataforge -> CortexLM/swe-forge)
- 0 clippy warnings, 1251 tests passing
1 parent f33cc6a commit cb514ec
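
The commit message uses two spellings on purpose: `swe-forge` for the package, binary, and deploy names, and `swe_forge` for source references. Hyphens are not valid in Rust identifiers, so Cargo derives the default library crate name from the package name by replacing `-` with `_`. A minimal sketch of that mapping follows; `crate_ident` is a hypothetical helper written for illustration and is not part of swe-forge or Cargo's API.

```rust
/// Map a Cargo package name to the identifier used in Rust source
/// (for example in `use` paths). Hypothetical helper for illustration.
///
/// Cargo applies the same substitution when it derives the default
/// library crate name, because `-` cannot appear in a Rust identifier.
fn crate_ident(package: &str) -> String {
    package.replace('-', "_")
}
```

Under this mapping, `crate_ident("swe-forge")` yields `swe_forge`, which is why source files reference `swe_forge` while the binary built under `target/release/` keeps the hyphenated name.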

101 files changed: +379 -379 lines changed

.github/workflows/ci.yml

Lines changed: 2 additions & 2 deletions
@@ -85,8 +85,8 @@ jobs:
       - name: Upload binary
         uses: actions/upload-artifact@v4
         with:
-          name: dataforge-linux-x86_64
-          path: target/release/dataforge
+          name: swe-forge-linux-x86_64
+          path: target/release/swe-forge
           retention-days: 7

   # ──────────────────────────────────────────────

.github/workflows/nightly.yml

Lines changed: 3 additions & 3 deletions
@@ -95,13 +95,13 @@ jobs:
           RUSTFLAGS: "-C target-cpu=native -C link-arg=-fuse-ld=mold"

       - name: Binary size
-        run: ls -lh target/release/dataforge
+        run: ls -lh target/release/swe-forge

       - name: Upload release binary
         uses: actions/upload-artifact@v4
         with:
-          name: dataforge-nightly-linux-x86_64
-          path: target/release/dataforge
+          name: swe-forge-nightly-linux-x86_64
+          path: target/release/swe-forge
           retention-days: 14

   # ──────────────────────────────────────────────

.github/workflows/release.yml

Lines changed: 3 additions & 3 deletions
@@ -47,11 +47,11 @@ jobs:
       - name: Package
         run: |
           mkdir -p dist
-          cp target/release/dataforge dist/
-          cd dist && tar czf ../dataforge-${{ github.ref_name }}-linux-x86_64.tar.gz dataforge
+          cp target/release/swe-forge dist/
+          cd dist && tar czf ../swe-forge-${{ github.ref_name }}-linux-x86_64.tar.gz swe-forge

       - name: Create GitHub Release
         uses: softprops/action-gh-release@v2
         with:
-          files: dataforge-${{ github.ref_name }}-linux-x86_64.tar.gz
+          files: swe-forge-${{ github.ref_name }}-linux-x86_64.tar.gz
           generate_release_notes: true

Cargo.lock

Lines changed: 44 additions & 44 deletions
Generated file; diff not rendered by default.

Cargo.toml

Lines changed: 5 additions & 5 deletions
@@ -1,15 +1,15 @@
 [package]
-name = "dataforge"
+name = "swe-forge"
 version = "0.1.0"
 edition = "2021"
-description = "Data pipeline orchestration framework for LLM evaluation with multi-agent validation"
+description = "SWE-bench dataset forge: mine, generate and evaluate real software engineering tasks"
 license = "MIT"
-repository = "https://github.com/dataforge/dataforge"
+repository = "https://github.com/CortexLM/swe-forge"
 readme = "README.md"
-keywords = ["dataforge", "pipeline", "llm", "evaluation", "agents"]
+keywords = ["swe-bench", "pipeline", "llm", "evaluation", "agents"]

 [[bin]]
-name = "dataforge"
+name = "swe-forge"
 path = "src/main.rs"

 [dependencies]

README.md

Lines changed: 9 additions & 9 deletions
@@ -1,4 +1,4 @@
-# dataforge
+# swe-forge

 [![Rust](https://img.shields.io/badge/rust-1.70%2B-orange.svg)](https://www.rust-lang.org/)
 [![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](LICENSE)
@@ -9,7 +9,7 @@ Built on top of [SweInfinite](https://github.com/unconst/SweInfinite) by [@uncon

 ## What it does

-dataforge connects to [GH Archive](https://www.gharchive.org/) to discover recently merged pull requests, enriches them via the GitHub API, classifies their difficulty using an LLM, generates test specifications via an agentic loop, and exports SWE-bench-compatible task instances. It also includes a full evaluation harness to run external coding agents on generated tasks and verify their solutions.
+swe-forge connects to [GH Archive](https://www.gharchive.org/) to discover recently merged pull requests, enriches them via the GitHub API, classifies their difficulty using an LLM, generates test specifications via an agentic loop, and exports SWE-bench-compatible task instances. It also includes a full evaluation harness to run external coding agents on generated tasks and verify their solutions.

 ## Key features

@@ -77,8 +77,8 @@ graph TB
 ### Build

 ```bash
-git clone https://github.com/CortexLM/dataforge.git
-cd dataforge
+git clone https://github.com/CortexLM/swe-forge.git
+cd swe-forge
 cargo build --release
 ```

@@ -313,7 +313,7 @@ Pre-classification uses only the PR title and body (~100 tokens, ~0.5s). Full cl
 ### `swe mine`

 ```
-dataforge swe mine [OPTIONS]
+swe-forge swe mine [OPTIONS]

 Options:
   -o, --output <DIR> Output directory [default: ./swe-datasets]
@@ -330,7 +330,7 @@ Options:
 ### `swe harness`

 ```
-dataforge swe harness [OPTIONS] --agent-dir <AGENT_DIR>
+swe-forge swe harness [OPTIONS] --agent-dir <AGENT_DIR>

 Options:
   -i, --input <INPUT> Directory containing SWE workspaces [default: ./generated-swe]
@@ -347,7 +347,7 @@ Options:
 ### `swe validate`

 ```
-dataforge swe validate [OPTIONS]
+swe-forge swe validate [OPTIONS]

 Options:
   -i, --input <DIR> Input directory with SWE workspaces [default: ./generated-swe]
@@ -358,7 +358,7 @@ Options:
 ### `swe export`

 ```
-dataforge swe export [OPTIONS]
+swe-forge swe export [OPTIONS]

 Options:
   -i, --input <DIR> Input directory [default: ./generated-swe]
@@ -427,7 +427,7 @@ RUST_LOG=debug cargo run -- swe mine --max-tasks 1 --once # Debug run

 ## Credits

-Built on top of [SweInfinite](https://github.com/unconst/SweInfinite) by [@unconst](https://github.com/unconst). The original architecture for mining GitHub PRs and generating SWE-bench-style datasets was designed by the SweInfinite team. dataforge extends it with:
+Built on top of [SweInfinite](https://github.com/unconst/SweInfinite) by [@unconst](https://github.com/unconst). The original architecture for mining GitHub PRs and generating SWE-bench-style datasets was designed by the SweInfinite team. swe-forge extends it with:

 - Difficulty-based pre-classification and filtering
 - Agentic test generation (Codex-style multi-turn loop with shell access)

deploy/docker-compose.prod.yml

Lines changed: 9 additions & 9 deletions
@@ -1,15 +1,15 @@
 version: '3.8'

 services:
-  dataforge:
-    image: dataforge:latest
+  swe-forge:
+    image: swe-forge:latest
     build:
       context: ..
       dockerfile: deploy/Dockerfile
     environment:
       - LITELLM_API_BASE=https://openrouter.ai/api/v1
       - LITELLM_API_KEY=${OPENROUTER_API_KEY}
-      - DATABASE_URL=postgres://synth:${POSTGRES_PASSWORD}@postgres:5432/dataforge
+      - DATABASE_URL=postgres://synth:${POSTGRES_PASSWORD}@postgres:5432/swe-forge
       - REDIS_URL=redis://redis:6379
     depends_on:
       - postgres
@@ -26,12 +26,12 @@ services:
     restart: unless-stopped

   worker:
-    image: dataforge:latest
-    command: ["dataforge", "worker", "--queue", "tasks"]
+    image: swe-forge:latest
+    command: ["swe-forge", "worker", "--queue", "tasks"]
     environment:
       - LITELLM_API_BASE=https://openrouter.ai/api/v1
       - LITELLM_API_KEY=${OPENROUTER_API_KEY}
-      - DATABASE_URL=postgres://synth:${POSTGRES_PASSWORD}@postgres:5432/dataforge
+      - DATABASE_URL=postgres://synth:${POSTGRES_PASSWORD}@postgres:5432/swe-forge
       - REDIS_URL=redis://redis:6379
     depends_on:
       - postgres
@@ -49,13 +49,13 @@ services:
   postgres:
     image: postgres:15-alpine
     environment:
-      - POSTGRES_USER=dataforge
+      - POSTGRES_USER=swe-forge
       - POSTGRES_PASSWORD=${POSTGRES_PASSWORD}
-      - POSTGRES_DB=dataforge
+      - POSTGRES_DB=swe-forge
     volumes:
       - postgres_data:/var/lib/postgresql/data
     healthcheck:
-      test: ["CMD-SHELL", "pg_isready -U dataforge"]
+      test: ["CMD-SHELL", "pg_isready -U swe-forge"]
       interval: 10s
       timeout: 5s
       retries: 5
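
In the hunks above, `DATABASE_URL` connects as user `synth`, while the `postgres` service sets `POSTGRES_USER=swe-forge` (previously `dataforge`). That difference predates the rename, but it may be worth confirming after one. The following is a minimal, std-only sketch of how the user and database name could be pulled out of such a URL for comparison. `user_and_db` is a hypothetical helper, not part of swe-forge, and the password `secret` is a placeholder.

```rust
/// Split a `postgres://user:password@host:port/dbname` URL into
/// `(user, dbname)`. Hypothetical helper for illustration only.
/// It does no percent-decoding and ignores query parameters.
fn user_and_db(url: &str) -> Option<(&str, &str)> {
    let rest = url.strip_prefix("postgres://")?;
    // Split at the last '@' so a password containing '@' does not
    // get mistaken for the start of the host part.
    let (creds, location) = rest.rsplit_once('@')?;
    // The user is everything before the first ':' in the credentials.
    let user = creds.split(':').next()?;
    // The database name is everything after the first '/' in host:port/db.
    let (_, db) = location.split_once('/')?;
    Some((user, db))
}
```

Applied to the renamed URL with a placeholder password, the helper returns `synth` as the user and `swe-forge` as the database, so the database matches `POSTGRES_DB` while the user does not match `POSTGRES_USER`.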

deploy/kubernetes/configmap.yaml

Lines changed: 3 additions & 3 deletions
@@ -1,10 +1,10 @@
 apiVersion: v1
 kind: ConfigMap
 metadata:
-  name: dataforge-config
-  namespace: dataforge
+  name: swe-forge-config
+  namespace: swe-forge
   labels:
-    app.kubernetes.io/name: dataforge
+    app.kubernetes.io/name: swe-forge
     app.kubernetes.io/component: config
 data:
   LITELLM_API_BASE: "https://openrouter.ai/api/v1"

deploy/kubernetes/deployment.yaml

Lines changed: 14 additions & 14 deletions
@@ -1,35 +1,35 @@
 apiVersion: apps/v1
 kind: Deployment
 metadata:
-  name: dataforge
-  namespace: dataforge
+  name: swe-forge
+  namespace: swe-forge
   labels:
-    app.kubernetes.io/name: dataforge
+    app.kubernetes.io/name: swe-forge
     app.kubernetes.io/component: api
 spec:
   replicas: 1
   selector:
     matchLabels:
-      app: dataforge
+      app: swe-forge
   template:
     metadata:
       labels:
-        app: dataforge
-        app.kubernetes.io/name: dataforge
+        app: swe-forge
+        app.kubernetes.io/name: swe-forge
         app.kubernetes.io/component: api
     spec:
       containers:
-        - name: dataforge
-          image: dataforge:latest
+        - name: swe-forge
+          image: swe-forge:latest
           imagePullPolicy: IfNotPresent
           ports:
             - containerPort: 8080
               name: http
           envFrom:
             - configMapRef:
-                name: dataforge-config
+                name: swe-forge-config
             - secretRef:
-                name: dataforge-secrets
+                name: swe-forge-secrets
           resources:
             requests:
               cpu: "500m"
@@ -64,15 +64,15 @@ spec:
             path: /var/run/docker.sock
         - name: data
           persistentVolumeClaim:
-            claimName: dataforge-data
+            claimName: swe-forge-data
 ---
 apiVersion: v1
 kind: PersistentVolumeClaim
 metadata:
-  name: dataforge-data
-  namespace: dataforge
+  name: swe-forge-data
+  namespace: swe-forge
   labels:
-    app.kubernetes.io/name: dataforge
+    app.kubernetes.io/name: swe-forge
     app.kubernetes.io/component: storage
 spec:
   accessModes:

deploy/kubernetes/namespace.yaml

Lines changed: 2 additions & 2 deletions
@@ -1,7 +1,7 @@
 apiVersion: v1
 kind: Namespace
 metadata:
-  name: dataforge
+  name: swe-forge
   labels:
-    app.kubernetes.io/name: dataforge
+    app.kubernetes.io/name: swe-forge
     app.kubernetes.io/component: namespace
