Adding support for dense models distilled from MoE models with the same architecture #728
In this PR, we add support for meta-llama/Llama-Guard-4-12B, a dense model distilled from the Llama 4 Scout MoE model. The changes in the pytorch_transforms.py file can be applied to any dense model distilled from an MoE model whose architecture is already supported in QEfficient.
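For context, the sketch below illustrates the general idea behind such a transform: a distilled dense checkpoint may still be shipped with the MoE module layout, so the transform walks the model and swaps effectively-single-expert MoE blocks for the plain dense MLP the rest of the (dense-only) pipeline expects. All names here (`MoEBlock`, `DenseMLP`, `DenseDistilledTransform`, the single-expert check) are illustrative assumptions for this example, not the actual QEfficient classes or the code in this PR.

```python
# Minimal, self-contained sketch of a "dense model distilled from an MoE"
# transform. Class names and the single-expert heuristic are hypothetical,
# not the real QEfficient pytorch_transforms.py implementation.
from typing import Tuple

import torch
import torch.nn as nn


class MoEBlock(nn.Module):
    """Stand-in for an MoE feed-forward block in the original architecture."""

    def __init__(self, hidden: int, num_experts: int):
        super().__init__()
        self.num_experts = num_experts
        self.router = nn.Linear(hidden, num_experts)
        self.experts = nn.ModuleList(nn.Linear(hidden, hidden) for _ in range(num_experts))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Top-1 routing: each token is processed by its highest-scoring expert.
        idx = self.router(x).argmax(dim=-1)
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = idx == e
            out[mask] = expert(x[mask])
        return out


class DenseMLP(nn.Module):
    """Stand-in for the dense feed-forward module the distilled model uses."""

    def __init__(self, hidden: int):
        super().__init__()
        self.proj = nn.Linear(hidden, hidden)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.proj(x)


class DenseDistilledTransform:
    """Replace single-expert MoE blocks with plain dense MLPs in place."""

    @classmethod
    def apply(cls, model: nn.Module) -> Tuple[nn.Module, bool]:
        transformed = False
        # Snapshot children so we can safely replace modules while iterating.
        for name, child in list(model.named_children()):
            if isinstance(child, MoEBlock) and child.num_experts == 1:
                dense = DenseMLP(child.experts[0].in_features)
                # Copy the lone expert's weights into the dense replacement.
                dense.proj.load_state_dict(child.experts[0].state_dict())
                setattr(model, name, dense)
                transformed = True
            else:
                _, changed = cls.apply(child)
                transformed |= changed
        return model, transformed


# Usage: a toy "distilled" model whose MoE block holds a single expert.
model = nn.Sequential(MoEBlock(hidden=16, num_experts=1))
model, changed = DenseDistilledTransform.apply(model)
print(changed)  # True: the MoE block was replaced by DenseMLP
```

Because the transform keys off the module structure rather than a specific checkpoint, the same mechanism should generalize to other dense models distilled from a supported MoE architecture, which is the point made above about the pytorch_transforms.py changes.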