-
Notifications
You must be signed in to change notification settings - Fork 25.2k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
SwinLayer / DonutSwinLayer / ClapAudioLayer attention mask creation always happens on CPU
#31294
opened Jun 6, 2024 by
gorodnitskiy
2 of 4 tasks
merge_and_unload
for a quantized model ruins its quality
Quantization
#31293
opened Jun 6, 2024 by
Aktsvigun
2 of 4 tasks
Having a function to verify if checkpoint is valid
Feature request
Request for a new feature
#31283
opened Jun 6, 2024 by
Bfault
Constraints in constrained beam search can be satisfied by the inputs.
Generation
#31281
opened Jun 6, 2024 by
zawedcvg
2 of 4 tasks
Stuck on Initializing Transformers Model with FSDP (Fully Sharded Data Parallel) using meta device
#31278
opened Jun 6, 2024 by
jiangjiadi
2 of 4 tasks
While using the integration of bitsandbytes, Error shows: name 'torch' is not defined
#31273
opened Jun 6, 2024 by
46319943
2 of 4 tasks
'FastSpeech2ConformerConfig' object has no attribute 'model_config'
Audio
#31270
opened Jun 6, 2024 by
spencerchubb
1 of 4 tasks
bf16 is more unstable than fp16, when looking at the difference of generation logprobs and forward logprobs
#31267
opened Jun 5, 2024 by
vwxyzjn
2 of 4 tasks
Flaky test - tests/models/mobilenet_v1/test_modeling_mobilenet_v1.py::MobileNetV1ModelTest::test_batching_equivalence
#31257
opened Jun 5, 2024 by
amyeroberts
4 tasks
Adaptive Decoding Support
Feature request
Request for a new feature
Generation
#31250
opened Jun 5, 2024 by
zwhong714
Intel/dpt-swinv2-tiny-256: TypeError: unsupported operand type(s) for //: 'NoneType' and 'NoneType'
Vision
#31249
opened Jun 5, 2024 by
yurithefury
2 of 4 tasks
Add support for non-CUDA architectures at the same time Bitsandbytes is doing it
Feature request
Request for a new feature
#31248
opened Jun 4, 2024 by
sealad886
We Need Compile Support For Mamba!
Compilation
Issues related to torchdynamo and torchinductor
Feature request
Request for a new feature
#31246
opened Jun 4, 2024 by
zhenglongjiepheonix
🐛
attn_implementation="sdpa"
slower than BetterTransformer.transform
?
#31245
opened Jun 4, 2024 by
vibhas-singh
2 of 4 tasks
PEFT + ZeRO Phase 2 + Transformers doesn't output pytorch_model.bin
#31234
opened Jun 4, 2024 by
cdoern
2 of 4 tasks
FlashAttention2 issue with Mistral/Mixtral related to max length and RotaryEmbedding
#31228
opened Jun 4, 2024 by
psinger
Batch size schedulers
Feature request
Request for a new feature
trainer
#31222
opened Jun 4, 2024 by
BramVanroy
Loading XGLM with Tensorflow and apply resize_token_embeddings() raises an error.
TensorFlow
Anything TensorFlow
#31219
opened Jun 4, 2024 by
CHLEE-Leo
PreTrainedModel.from_pretrained(path, from_flax=True) fails for sharded Flax checkpoints
Flax
#31210
opened Jun 3, 2024 by
KathyHaem
1 of 4 tasks
Speed up image processors - cast to array before BatchFeature
Feature request
Request for a new feature
Good Second Issue
Issues that are more difficult to do than "Good First" issues - give it a try if you want!
#31205
opened Jun 3, 2024 by
amyeroberts
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-05-06.