Skip to content

Actions: huggingface/datatrove

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,278 workflow runs
1,278 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add RayPipelineExecutor
Test & Check Code Quality #393: Pull request #331 opened by nelson-liu
January 27, 2025 00:21 Action required nelson-liu:ray_executor
January 27, 2025 00:21 Action required
Allow custom parquet schema
Test & Check Code Quality #392: Pull request #330 synchronize by BramVanroy
January 26, 2025 15:43 Action required BramVanroy:main
January 26, 2025 15:43 Action required
Allow custom parquet schema
Test & Check Code Quality #391: Pull request #330 opened by BramVanroy
January 26, 2025 13:51 2m 35s BramVanroy:main
January 26, 2025 13:51 2m 35s
fixes stopwors implementation
Test & Check Code Quality #390: Pull request #329 opened by guipenedo
January 26, 2025 12:10 2m 58s stopwords_set
January 26, 2025 12:10 2m 58s
Add customization for fetching SLURM job id (#320)
Secret Leaks #208: Commit 0c3df50 pushed by guipenedo
January 24, 2025 13:06 21s main
January 24, 2025 13:06 21s
Add customization for fetching SLURM job id (#320)
Test & Check Code Quality #389: Commit 0c3df50 pushed by guipenedo
January 24, 2025 13:06 2m 50s main
January 24, 2025 13:06 2m 50s
Fix issues with URL Deduplication when using the Index (#327)
Test & Check Code Quality #388: Commit c0f3c38 pushed by guipenedo
January 24, 2025 13:05 2m 42s main
January 24, 2025 13:05 2m 42s
Fix issues with URL Deduplication when using the Index (#327)
Secret Leaks #207: Commit c0f3c38 pushed by guipenedo
January 24, 2025 13:05 19s main
January 24, 2025 13:05 19s
Add customization for fetching SLURM job id
Test & Check Code Quality #387: Pull request #320 synchronize by BramVanroy
January 24, 2025 09:25 2m 46s BramVanroy:main
January 24, 2025 09:25 2m 46s
Fix issues with URL Deduplication when using the Index
Test & Check Code Quality #386: Pull request #327 synchronize by muzzynine
January 23, 2025 15:01 2m 37s muzzynine:fix_url_dedup
January 23, 2025 15:01 2m 37s
Fix issues with URL Deduplication when using the Index
Test & Check Code Quality #385: Pull request #327 synchronize by muzzynine
January 22, 2025 12:26 3m 17s muzzynine:fix_url_dedup
January 22, 2025 12:26 3m 17s
Update README.md (#323)
Test & Check Code Quality #384: Commit 8063aed pushed by guipenedo
January 22, 2025 11:15 2m 49s main
January 22, 2025 11:15 2m 49s
Update README.md (#323)
Secret Leaks #206: Commit 8063aed pushed by guipenedo
January 22, 2025 11:15 18s main
January 22, 2025 11:15 18s
Fix issues with URL Deduplication when using the Index
Test & Check Code Quality #383: Pull request #327 opened by muzzynine
January 22, 2025 09:17 Action required muzzynine:fix_url_dedup
January 22, 2025 09:17 Action required
fixes stopwors implementation...
Secret Leaks #205: Commit f8e78f5 pushed by guipenedo
January 20, 2025 15:49 18s stopwords_set
January 20, 2025 15:49 18s
Add customization for fetching SLURM job id
Test & Check Code Quality #381: Pull request #320 synchronize by BramVanroy
January 10, 2025 15:38 3m 11s BramVanroy:main
January 10, 2025 15:38 3m 11s
fix(utils): Enhance the dependencies check to include pip distributio…
Secret Leaks #204: Commit 2260603 pushed by guipenedo
January 9, 2025 18:31 17s main
January 9, 2025 18:31 17s
fix(utils): Enhance the dependencies check to include pip distributio…
Test & Check Code Quality #379: Commit 2260603 pushed by guipenedo
January 9, 2025 18:31 22s main
January 9, 2025 18:31 22s
fix(utils): Enhance the dependencies check to include pip distribution
Test & Check Code Quality #378: Pull request #317 synchronize by guipenedo
January 9, 2025 18:24 19s aiqwe:main
January 9, 2025 18:24 19s
Add glob pattern for hash index (#313)
Secret Leaks #203: Commit cd61018 pushed by guipenedo
January 9, 2025 12:47 22s main
January 9, 2025 12:47 22s
Add glob pattern for hash index (#313)
Test & Check Code Quality #377: Commit cd61018 pushed by guipenedo
January 9, 2025 12:47 2m 32s main
January 9, 2025 12:47 2m 32s
style fix
Secret Leaks #202: Commit b9b24cf pushed by guipenedo
January 9, 2025 12:47 22s decont-glob
January 9, 2025 12:47 22s
nit
Secret Leaks #201: Commit 3168cf5 pushed by guipenedo
January 9, 2025 12:39 17s main
January 9, 2025 12:39 17s
nit
Test & Check Code Quality #375: Commit 3168cf5 pushed by guipenedo
January 9, 2025 12:39 3m 18s main
January 9, 2025 12:39 3m 18s
clean up PipelineStepWithTokenizer
Secret Leaks #200: Commit 66221c8 pushed by guipenedo
January 9, 2025 12:38 18s main
January 9, 2025 12:38 18s