Skip to content

Conversation

mcremon-meta
Copy link
Contributor

Summary:
As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves ~10k cyles from WW stage 1 model.

Differential Revision: D66208417

Copy link

pytorch-bot bot commented Nov 20, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6976

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 75f1476 with merge base cd306d3 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 20, 2024
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

@mcremon-meta
Copy link
Contributor Author

@pytorchbot label "topic: not user facing"

facebook-github-bot pushed a commit that referenced this pull request Nov 25, 2024
…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

mcremon-meta added a commit that referenced this pull request Nov 25, 2024
…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

mcremon-meta added a commit that referenced this pull request Nov 27, 2024
…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

mcremon-meta added a commit that referenced this pull request Dec 1, 2024
…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

facebook-github-bot pushed a commit that referenced this pull request Dec 3, 2024
…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

@zonglinpeng
Copy link
Contributor

unrelated lint error, please rebase and try

mcremon-meta added a commit that referenced this pull request Dec 4, 2024
…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Reviewed By: zonglinpeng

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

facebook-github-bot pushed a commit that referenced this pull request Dec 4, 2024
…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Reviewed By: zonglinpeng

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

mcremon-meta added a commit that referenced this pull request Dec 4, 2024
…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Reviewed By: zonglinpeng

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

…add quantized fully connected per_tensor (#6976)

Summary:

As titled. This allows removing the outer loop unrolling in cases where the input to linear is a vector.
Shaves 8.5k cyles from WW stage 1 model.

Reviewed By: zonglinpeng

Differential Revision: D66208417
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D66208417

@facebook-github-bot facebook-github-bot merged commit 27ed99f into main Dec 5, 2024
41 of 43 checks passed
@facebook-github-bot facebook-github-bot deleted the export-D66208417 branch December 5, 2024 22:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported topic: not user facing

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants