Skip to content

Issues: modelscope/data-juicer

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Can the cleaning statistics be viewed after creating the config file and performing the cleaning? question Further information is requested
#499 opened Nov 27, 2024 by Tendo33
3 tasks done
Guidance on Monitoring Task Execution with Ray Executor in Data Juicer dj:dist issues/PRs about distributed data processing question Further information is requested
#496 opened Nov 24, 2024 by Fatima-0SA
3 tasks done
AttributeError: 'FusedFilter' object has no attribute '_name' bug Something isn't working dj:op issues/PRs about some specific OPs
#495 opened Nov 24, 2024 by xunmenglt
Merge local and API LLM calling enhancement New feature or request
#490 opened Nov 15, 2024 by BeachWang
2 tasks done
sharegpt format support dj:multimodal issues/PRs about multimodal data processing question Further information is requested
#488 opened Nov 14, 2024 by IvanDeng0
3 tasks done
Checkpointer support for Ray-Mode enhancement New feature or request
#487 opened Nov 12, 2024 by yxdyc
2 tasks done
Distributed processing
编译安装时报错 question Further information is requested
#486 opened Nov 12, 2024 by charonkk
3 tasks done
Anyone tried DJ on multimodal datasets of more than 20M samples? question Further information is requested
#482 opened Nov 11, 2024 by serser
3 tasks done
windows系统支持 question Further information is requested
#477 opened Nov 6, 2024 by zytcharming
3 tasks done
Update of Jupyter Notebooks bug Something isn't working documentation Improvements or additions to documentation
#476 opened Nov 6, 2024 by HYLcool
[Bug]: perplexity_filter 算子内存OOM bug Something isn't working
#474 opened Nov 5, 2024 by weiaicunzai
3 tasks done
How to calculate the image_text_similarity scores for both Chinese and English? dj:multimodal issues/PRs about multimodal data processing dj:op issues/PRs about some specific OPs question Further information is requested
#473 opened Nov 5, 2024 by weiaicunzai
LLM造数据时需要try_num参数 enhancement New feature or request
#470 opened Nov 4, 2024 by BeachWang
2 tasks done
如何获取tool_quality_classifier模块中[chinese,code,gtp3]这3个模型的权重? question Further information is requested
#467 opened Oct 30, 2024 by yaun248
3 tasks done
How to use 'chinese_convert_mapper' ? question Further information is requested
#458 opened Oct 22, 2024 by abchbx
3 tasks done
How to use ‘hf_model’ question Further information is requested
#457 opened Oct 22, 2024 by abchbx
3 tasks done
[Bug]: librosa use lazy_loader which depend on python version bug Something isn't working
#453 opened Oct 17, 2024 by BeachWang
3 tasks done
[Feat]: Unified LLM Calling Management enhancement New feature or request
#451 opened Oct 16, 2024 by drcege
2 tasks done
[Feat]: Automatic Version Matching During Installation enhancement New feature or request
#450 opened Oct 16, 2024 by drcege
2 tasks done
[Bug]: test_adapter 兼容性 bug Something isn't working
#441 opened Sep 29, 2024 by FailedNamed
3 tasks done
[Bug]: KeyError: 'resource' bug Something isn't working
#440 opened Sep 29, 2024 by luckystar1992
3 tasks done
ProTip! no:milestone will show everything without a milestone.