Commit 9573b03
metal: optimise GGML_OP_SUM (ggml-org#16559)
* optimise GGML_OP_SUM
* add non-contiguous tests by permuting the input
* change tests to require full contiguity of OP_SUM
* cuda : add check GGML_OP_SUM
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
1 parent 85ecad6 commit 9573b03
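The commit message mentions permuting the input to produce non-contiguous test cases and requiring full contiguity for OP_SUM. The following is a minimal sketch of that idea, not the code from this commit: `struct view`, `view_new`, `view_permute`, `view_is_contiguous`, and `view_sum` are hypothetical names, and the permutation convention used here (new axis `i` takes old axis `axes[i]`) is an assumption for illustration and may differ from ggml's.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define MAX_DIMS 4

/* Hypothetical tensor view: ne[] holds element counts, nb[] holds byte strides. */
struct view {
    int64_t ne[MAX_DIMS];
    size_t  nb[MAX_DIMS];
};

/* Build a densely packed view of float elements (axis 0 varies fastest). */
static struct view view_new(int64_t n0, int64_t n1, int64_t n2, int64_t n3) {
    struct view v = { { n0, n1, n2, n3 }, { 0 } };
    v.nb[0] = sizeof(float);
    for (int i = 1; i < MAX_DIMS; i++) {
        v.nb[i] = v.nb[i - 1] * (size_t) v.ne[i - 1];
    }
    return v;
}

/* Reorder axes without moving data: new axis i takes old axis axes[i]. */
static struct view view_permute(struct view v, const int axes[MAX_DIMS]) {
    struct view r;
    for (int i = 0; i < MAX_DIMS; i++) {
        r.ne[i] = v.ne[axes[i]];
        r.nb[i] = v.nb[axes[i]];
    }
    return r;
}

/* True when the strides match a dense packing of the shape. */
static bool view_is_contiguous(struct view v) {
    size_t expected = sizeof(float);
    for (int i = 0; i < MAX_DIMS; i++) {
        /* The stride of a size-1 axis is never used, so it is not checked. */
        if (v.ne[i] != 1 && v.nb[i] != expected) {
            return false;
        }
        expected *= (size_t) v.ne[i];
    }
    return true;
}

/* Sum all elements with one flat loop; only valid for contiguous views. */
static float view_sum(struct view v, const float *data) {
    assert(view_is_contiguous(v));
    const int64_t n = v.ne[0] * v.ne[1] * v.ne[2] * v.ne[3];
    float sum = 0.0f;
    for (int64_t i = 0; i < n; i++) {
        sum += data[i];
    }
    return sum;
}
```

In this sketch a freshly built view passes the contiguity check, while swapping its first two axes leaves the data in place but makes the strides non-dense, so a flat-loop sum would have to reject it or fall back to a strided path.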
File tree
5 files changed, +71 -11 lines changed
- ggml/src
- ggml-cuda
- ggml-metal
- tests