Pull requests: huggingface/transformers
don't zero out the attention_mask when using sliding window with flash attention
#31670
opened Jun 27, 2024 by
winglian
5 tasks
Add ignore_errors=True to trainer.py rmtree in _inner_training_loop
#31668
opened Jun 27, 2024 by
njbrake
3 tasks
feat(ci): set fetch-depth: 0 in trufflehog checkout step
#31663
opened Jun 27, 2024 by
McPatate
Add warning message for beta and gamma parameters
#31654
opened Jun 27, 2024 by
OmarManzoor
3 tasks done
Enhancing SFT Training Efficiency Using Packing and FlashAttention2 with Position IDs
#31629
opened Jun 26, 2024 by
RhuiDih
5 tasks
Allow infer_framework_load_model to use the originally specified config.
#31580
opened Jun 24, 2024 by
inf3rnus
[WIP] Add implementation of _extract_fbank_features_batch
Audio
#31579
opened Jun 24, 2024 by
ravenouse
1 of 5 tasks
add warning when using gradient_checkpointing with FSDP full shard
#31578
opened Jun 24, 2024 by
yundai424
2 of 5 tasks
transformers.fx.symbolic_trace supports inputs_embeds
#31574
opened Jun 24, 2024 by
fxmarty
Add language to word timestamps for Whisper
Audio
#31572
opened Jun 24, 2024 by
robinderat
2 of 5 tasks
Add torch_empty_cache_steps to TrainingArguments
#31546
opened Jun 22, 2024 by
aliencaocao
2 of 5 tasks