Pull requests: quic/efficient-transformers
Updated reduce sum calculation to use einsum for gpt_oss
#754
opened Jan 23, 2026 by
asmigosw
['QEff.finetuning'] Changing some params from training config to model config
#747
opened Jan 21, 2026 by
tchawada
HOTFIX : Added support for repeat kv heads aligned Bias scaling for AWQ and FP8 models. (#735)
#744
opened Jan 20, 2026 by
quic-dhirajku
[Qeff.finetuning] Adding Full document for hf_based finetuning stack
#732
opened Jan 16, 2026 by
tchawada
[QEff. Finetuning] Adding finetune_experiemental.py and related files
#731
opened Jan 16, 2026 by
quic-swatia
Adding support of QEFFAutoModelForSequenceClassification
#729
opened Jan 16, 2026 by
quic-amitraj
Adding the support of dense models distilled from moe models with the same architecture
#728
opened Jan 16, 2026 by
vjanfaza
Added changes to load and export Llama model in bfloat16/float16 precision
#707
opened Jan 7, 2026 by
quic-dhirajku
Subfunction fix: changed invalid_index to INT32MAX Always
#700
opened Jan 5, 2026 by
abhishek-singh591