Feature request: Support for Phi-3 mini #210

raymond-infinitecode · 2024-07-14T07:50:06Z

Prerequisites

Before submitting your issue, please ensure the following:

I am running the latest version of PowerInfer. Development is rapid, and as of now, there are no tagged versions.
I have carefully read and followed the instructions in the README.md.
I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).

Feature Description

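Phi-3 mini is arguably the most powerful small language model (SLM) available right now. Could it be converted to a ReLU-activated variant so that PowerInfer can exploit the resulting activation sparsity, fast enough for a single Xeon server to serve hundreds of concurrent users?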

Motivation

Please provide a detailed written description of reasons why this feature is necessary and how it is useful to PowerInfer users.

Possible Implementation

Convert the Phi-3 model to a ReLU-activated model.
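
To illustrate why the activation swap matters, here is a minimal sketch in plain Python. It assumes Phi-3 mini's feed-forward block is a SiLU-gated MLP; the function names, dimensions, and random weights are all hypothetical and not taken from PowerInfer or the Phi-3 code. It only shows that ReLU produces exact zeros in the intermediate activations while SiLU does not. A real conversion would most likely also need fine-tuning to recover model quality.

```python
import math
import random

def silu(x):
    # SiLU (swish): x * sigmoid(x). Never exactly zero for nonzero x.
    return x / (1.0 + math.exp(-x))

def relu(x):
    # ReLU: every negative input becomes an exact zero.
    return max(0.0, x)

def gated_mlp_hidden(x, w_gate, w_up, act):
    """Intermediate activations of a gated MLP: act(x . w_gate) * (x . w_up)."""
    hidden = []
    for gate_row, up_row in zip(w_gate, w_up):
        gate = sum(xi * wi for xi, wi in zip(x, gate_row))
        up = sum(xi * wi for xi, wi in zip(x, up_row))
        hidden.append(act(gate) * up)
    return hidden

def sparsity(values):
    # Fraction of neurons whose activation is exactly zero.
    return sum(1 for v in values if v == 0.0) / len(values)

# Toy sizes and random weights (hypothetical, for illustration only).
random.seed(0)
dim, hidden_dim = 16, 64
x = [random.gauss(0, 1) for _ in range(dim)]
w_gate = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(hidden_dim)]
w_up = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(hidden_dim)]

silu_sparsity = sparsity(gated_mlp_hidden(x, w_gate, w_up, silu))
relu_sparsity = sparsity(gated_mlp_hidden(x, w_gate, w_up, relu))
print(f"SiLU sparsity: {silu_sparsity:.2f}")
print(f"ReLU sparsity: {relu_sparsity:.2f}")
```

Neurons that are exactly zero contribute nothing to the down projection, so their weights can be skipped at inference time. That skipped work is the speedup this request is asking for.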

raymond-infinitecode added the "enhancement" (New feature or request) label on Jul 14, 2024
