quic / efficient-transformers Public

Notifications You must be signed in to change notification settings
Fork 64
Star 85

Code
Issues 2
Pull requests 50
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: quic/efficient-transformers

Labels 26 Milestones 0

New pull request New

50 Open 692 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Adding support for multi_vision Specialization in qwen2_5_vl

#755 opened Jan 23, 2026 by mohiso22 • Draft

Updated reduce sum calculation to use einsum for gpt_oss

#754 opened Jan 23, 2026 by asmigosw

Loading…

Cherry pick 733 to release v1.21.0

#753 opened Jan 23, 2026 by vbaddi

Loading…

Mainline version update

#752 opened Jan 22, 2026 by quic-rishinr

Loading…

CI test optimization

#751 opened Jan 22, 2026 by quic-rishinr

Loading…

["QEff.finetuning"] Inference script for HF_trainer

#749 opened Jan 21, 2026 by tchawada

Loading…

['QEff.finetuning'] Changing some params from training config to model config

#747 opened Jan 21, 2026 by tchawada

Loading…

Adding blocked kv and skip softmax for gpt oss

#745 opened Jan 20, 2026 by kdulla • Draft

HOTFIX : Added support for repeat kv heads aligned Bias scaling for AWQ and FP8 models. (#735)

#744 opened Jan 20, 2026 by quic-dhirajku

Loading…

Support to skip export, compile if qpc exist

#743 opened Jan 20, 2026 by tv-karthikeya • Draft

Fixing SW issue in Gemma3

#740 opened Jan 19, 2026 by qcdipankar

Loading…

Documents update for 1.21.0

#739 opened Jan 19, 2026 by quic-amitraj • Draft

Enabling Fsdp-zero DP

#737 opened Jan 19, 2026 by quic-akuruvil

Loading…

Wan support to skip compilation

#734 opened Jan 18, 2026 by tv-karthikeya

Loading…

[Qeff.finetuning] Adding Full document for hf_based finetuning stack

#732 opened Jan 16, 2026 by tchawada

Loading…

[QEff. Finetuning] Adding finetune_experiemental.py and related files

#731 opened Jan 16, 2026 by quic-swatia

Loading…

Adding support of QEFFAutoModelForSequenceClassification

#729 opened Jan 16, 2026 by quic-amitraj

Loading…

Adding the support of dense models distilled from moe models with the same architecture

#728 opened Jan 16, 2026 by vjanfaza

Loading…

gemma sliding window cache fixed

#719 opened Jan 12, 2026 by ochougul • Draft

Added changes to load and export Llama model in bfloat16/float16 precision

#707 opened Jan 7, 2026 by quic-dhirajku

Loading…

Flux rotary embedding changes

#705 opened Jan 6, 2026 by quic-amitraj • Draft

Loading HF models partially to save testing compute

#704 opened Jan 6, 2026 by quic-swatia

Loading…

Updated compile from qaic-exec to qaic-compile

#703 opened Jan 6, 2026 by asmigosw

Loading…

Wan I2V Support

#701 opened Jan 5, 2026 by tv-karthikeya • Draft

Subfunction fix: changed invalid_index to INT32MAX Always

#700 opened Jan 5, 2026 by abhishek-singh591

Loading…

Previous 1 2 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!