quic / efficient-transformers Public

Notifications You must be signed in to change notification settings
Fork 63
Star 85

Code
Issues 1
Pull requests 37
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: quic/efficient-transformers

Labels 26 Milestones 0

New pull request New

37 Open 622 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

test: Verify ONNX subfunction usage through model inspection instead of hash comparison

#670 opened Dec 16, 2025 by vbaddi

Loading…

Adding WAN Lightning support 1.21.0 ready for review

#669 opened Dec 16, 2025 by tv-karthikeya

Loading…

Onboarding Qwen Image

#664 opened Dec 11, 2025 by qcdipankar • Draft

Add automatic CCL list generation for prefill and decode when user does not provide lists

#663 opened Dec 10, 2025 by vjanfaza

Loading…

HOTFIX: Testing the Finetune base CI failure by installing pytorch2.9…

#661 opened Dec 10, 2025 by quic-dhirajku

Loading…

[QEff.Finetuning] Added support for SFTTrainer class along with tests

#660 opened Dec 9, 2025 by quic-dhirajku

Loading…

[toml]: urllib package removed

#659 opened Dec 9, 2025 by abukhoy

Loading…

[QEff. Finetune]: Adding base class and HF class

#658 opened Dec 9, 2025 by quic-swatia

Loading…

Added default NPI file 1.21.0

#657 opened Dec 9, 2025 by quic-akuruvil

Loading…

Added support of subfunction for VLMs

#653 opened Dec 5, 2025 by abhishek-singh591 • Draft

TF upgrade to v5.0.0rc0

#651 opened Dec 3, 2025 by quic-hemagnih • Draft

Adding dependencies for Wan2.2-Lightning

#648 opened Dec 2, 2025 by tv-karthikeya • Draft

CB support for mllama

#643 opened Nov 26, 2025 by asmigosw • Draft

[QEff Finetune]: Made a separate list of dependencies for Finetuning.

#636 opened Nov 25, 2025 by quic-meetkuma • Draft

Dynamo-Enabled ONNX Export onnx.dynamo

#632 opened Nov 24, 2025 by smedhe • Draft

Created ReplicateKVHeadTransform to integrate KV-heads replication module within Qefficient library.

#625 opened Nov 19, 2025 by quic-dhirajku

Loading…

Add Support for Guided Decoding to On Device Sampling

#624 opened Nov 19, 2025 by quic-sanising

Loading…

[Proxy]: Adding support for exporting proxy Model

#620 opened Nov 17, 2025 by abukhoy • Draft

Add Glm4MoeForCausalLM Support

#619 opened Nov 14, 2025 by quic-shagun

Loading…

Remove transformers dependencies from cache_utils and restructure cache classes

#616 opened Nov 13, 2025 by quic-mamta • Draft

[Exp]: Python 3.12 upgrade

#607 opened Nov 5, 2025 by abukhoy

Loading…

Extend on-device sampling support for dual QPC VLMs enhancement

New feature or request

#597 opened Oct 24, 2025 by quic-xiyushi

Loading…

[Test]: Models test configs in a single Config file

#596 opened Oct 22, 2025 by abukhoy

Loading…

Modified qwen_2.5 modelling file to allow replicate_kv_script to work for custom num_kv_heads.

#595 opened Oct 18, 2025 by quic-dhirajku

Loading…

[WIP]: Add early support for KV replication in VLMs

#594 opened Oct 18, 2025 by vbaddi

Loading…

Previous 1 2 Next

Previous Next

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!