-
Notifications
You must be signed in to change notification settings - Fork 54
Pull requests: quic/efficient-transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Updated KV_pytorch and ORT inference for VLMs to incorporate image_idx
#557
opened Sep 10, 2025 by
quic-dhirajku
Loading…
Logger Module For Efficient Transformers
1.21.0
wip
Work in progress
#555
opened Sep 10, 2025 by
quic-hemagnih
•
Draft
Extend On-Device Sampling Support to more Causal Language Models
#553
opened Sep 4, 2025 by
quic-sanising
•
Draft
TF ver 4.55.0, pytorch 2.7.1, hf hub 0.34.0 and diffusers 0.31.0
#551
opened Sep 3, 2025 by
quic-hemagnih
•
Draft
Optimized ONNX Transform via Class Merging and Thread Pooling
#546
opened Aug 23, 2025 by
abhishek-singh591
Loading…
[QEff]: Add OpenAI Oss Models (gpt_oss)
enhancement
New feature or request
#534
opened Aug 6, 2025 by
vbaddi
Loading…
Update PyTorch to 2.7.1+cpu, Torchvision to 0.22.1+cpu, and Python Requirement to >=3.9
#524
opened Jul 28, 2025 by
abukhoy
Loading…
2 tasks done
Add Support for Frequency Penalties in On Device Sampling
#523
opened Jul 24, 2025 by
quic-sanising
•
Draft
[Olmo2]: Add Support for Olmo2 CausalLM Model in QEff
1.21.0
enhancement
New feature or request
#509
opened Jul 9, 2025 by
vbaddi
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-09-08.