DirectMLを使用する #81

Oyaki122 · 2022-02-22T09:59:11Z

内容

DirectMLを用いた推論を可能ができるようにし、Windows-cpu版dllをdirectml版に変更します。

DirectMLはDirectX12を用いて推論を行うことができ、これによってAMD製GPUやIntel内蔵グラフィックスでの推論が可能になります。

DirectML版のcore.dllでは DirectML.dll を同じディレクトリに配置し、initializeのuse_gpuをtrueにすることでDirectMLを用いて推論を行います

DirectMLに対応したonnxruntimeはそれ以外のディレクトリ構造と異なり、nugetパッケージ用のものになっているため、cmakeやconfigure.pyの実行時にオプションをつける必要があります

--追記--
動作検証を行う際はREADMEの「コアライブラリのビルド」に従って自前でビルドして検証していただくか、configure.py実行時にオプションをつけていただくことで可能です
configure.pyを用いる場合、

python configure.py --use_directml --voicevox_download_link https://github.com/Oyaki122/voicevox_core/releases/download/dml-test20220223/core.zip

の様にし、またexample/pythonを実行する際は

python run.py --text=<テキスト> --speaker_id=<話者ID> --use_gpu --root_dir_path="../../model"

としてください。
今回のプルリクエストは開発版のcoreであるため、Releaseにある実際のモデルを使用することはできません。 modelフォルダにあるモックのみ可能です

その他

Radeonでの推論が検証できていないため、お持ちの方がいらっしゃいましたら検証よろしくおねがいします

Oyaki122 · 2022-02-22T12:47:53Z

比較

日本国憲法50字を用い、exampleのforwarder.forward()を5回実行するのにかかった平均時間を検証する。単位は秒

PC1

Corei7 8700
内蔵グラフィックス(Intel HD Graphics 630)
Geforce GTX1060 6GB

	平均	標準偏差
DirectML(内蔵)	33.93	0.58
DirectML(グラボ)	2.124	0.437
cpu	10.86	1.07
cuda	1.710	0.006

PC2

Corei5 7200U
内蔵グラフィックス(Intel HD Graphics 620)

	平均	標準偏差
DirectML	46.32	0.44
cpu	38.41	1.07

この様に、グラフィックボードを搭載したPCではcudaの1.2倍程度の時間で合成できる。
一方内蔵グラフィックスではcpuよりも遅くなる事がわかった

Hiroshiba

おおお、すごい！！
結構範囲の広い変更なので見るのに時間がかかっちゃうかもですが、これくらいの量なら１つのプルリクエストで完結したほうが楽かなと感じました。

かなり素人っぽい質問ですが１つだけコメントしてみました。

@Yosshi999 さん、またレビューお願いしてもよろしいでしょうか 👀

DetermineTargetArchitecture.cmake

Patchethium · 2022-02-23T12:43:05Z

横から失礼します。

日本国憲法50字を用い、exampleのforwarder.forward()を5回実行するのにかかった平均時間を検証する。単位は秒

Radeonでの推論が検証できていないため、お持ちの方がいらっしゃいましたら検証よろしくおねがいします

I have a Radeon GPU and ran a similar^[1] test on it, here's the result:

Environment:
cpu: Ryzen 3600
gpu: Radeon RX 570
(cpu_num_threads=8)

	avg	var
cpu	4.314	0.0005
gpu (DirectML)	1.089	0.009

The acceleration is pretty decent, good job!

It's also worth noting that I experienced some noticeable lag on the whole computer while running the test. A resource regulator like cpu_num_threads may be needed in order to avoid that.

[1] Google won't tell me what 日本国憲法50字 is so I went here and simply took the first paragraph, this could make some difference.

o108minmin · 2022-02-23T13:13:19Z

突然すいません。
別の方も検証されていますがRadeon RX 6700 XTで --use_directmlを使い example/python の動作を確認しました(特にエラーなく合成できるところまで)

以下実施ログ

https://gist.github.com/o108minmin/10216558ce22669e33097020d2b24438

[edit: 実施手順が間違っていたようなので、 @Oyaki122 さんに教えていただきました。無事ダミー音声も生成できました。ありがとうございます！ ]

Yosshi999 · 2022-02-26T16:24:21Z

このPRに含める必要は無いですが、configure.pyの実行オプションが変わった時にダウンロードしてくるonnxruntimeが変わるので、既に展開済みのものがあったらダウンロードのスキップをするという処理は変更したほうがよさそうですね

Yosshi999

LICENSEだけちょっと追加しました

DetermineTargetArchitecture.cmake

Co-authored-by: Yosshi999 <Yosshi999@users.noreply.github.com>

Yosshi999

LGTM!

Hiroshiba · 2022-02-26T18:50:13Z

こちらの機能は次の次の大きめのアップデート（0.12）の目玉機能とさせてください･･･！
検証の時間を長めにとったほうが良さそうなのと、次のアップデートは今月中に行いたいというのがコンフリクトしているためです。
プレビュー版を作って１～２週間ほどプレリリースし、いろんな方に試してもらうフェーズを用意しましょう･･･！

Oyaki122 · 2022-02-26T19:02:43Z

@Hiroshiba
承知しました
引き続きよろしくおねがいします

Oyaki122 · 2022-02-26T19:03:26Z

WindowsでCPU版のVOICEVOXは必要なくなるかもしれませんが、coreには需要があるのではないかと思ったのでcpu版coreを復活させました

Hiroshiba

LGTM！！

製品版をなるべく早くビルドしてみたいと思います。
（ちょっとしばらく忙しいのですが、忘れてそうだったらリプライ頂けると嬉しいです🙇）

こちらのissueで試していければなと思います。

DirectML版の動作チェック #90

Oyaki122 added 30 commits February 10, 2022 05:16

動かないけどとりあえずコミット

9059af2

Yosshiさんのモデルで動作した

1ca9a60

コメント追加・dml,cuda分岐の作成

d3a839c

ちょっと変更

21f2499

突然initで失敗するようになったが、dllをすべて読み込む(PR#49をなくす)と動いた

7a9cb7c

DirectML.dllのみ明示的に読み込むようにした

d38c3a6

検証用のゴミを削除

5758ef8

workflowを変更

88d2078

dml_urlを追加

35bf378

エラー修正

3c4a3bb

cacheのpathを追加

ea4b9f6

スペル修正

59833b2

cmakeがonnxruntimeを見つけられないのでデバッグ

d15151d

fix

3ce40a0

fix

2d8dc90

directmlはビルド時にいらないようなので削除

8e86006

Merge branch 'main' into dml

d990b31

dll周り以外動くようにした

61c8f22

ctypesで動いた

41b3289

lib作成呼び出しの削除他

9962df7

ちょこちょこ変更

ba29158

Merge branch 'windows_config' into dml

460acf9

pip install できるようにした

ed65259

Merge branch 'windows_config' into dml

b60ac56

dmlをcmakeのオプション扱いに、__init__.pyをdml専用に

2cf1404

configure_directmlを作成

c7b43c9

setup.pyからcythonを完全排除

6bf5fc4

readmeを大幅改変

fec5551

リドミ微編、buildからinstall requirement.txtを削除

7b04b37

Merge branch 'windows_config' into dml

a265269

readmeを改良

ef65d7b

Oyaki122 added 5 commits February 22, 2022 21:57

dml版onnxruntimeの存在判定対象を変更

0720254

検証用のゴミを削除

f7cbdec

fix

fad5be8

fix

15e3dd4

fix

f994d93

Oyaki122 marked this pull request as ready for review February 22, 2022 14:07

Oyaki122 marked this pull request as draft February 22, 2022 14:07

dml-onnxruntime判定を変更

f226073

Oyaki122 marked this pull request as ready for review February 22, 2022 14:14

Hiroshiba reviewed Feb 22, 2022

View reviewed changes

DetermineTargetArchitecture.cmake Show resolved Hide resolved

core_cpu->core_gpu

ec9deb5

Yosshi999 requested changes Feb 26, 2022

View reviewed changes

DetermineTargetArchitecture.cmake Show resolved Hide resolved

Update DetermineTargetArchitecture.cmake

4e2aeea

Co-authored-by: Yosshi999 <Yosshi999@users.noreply.github.com>

Yosshi999 approved these changes Feb 26, 2022

View reviewed changes

onnxruntimeがあったときに、上書きするか残すか聞くようにする

cd08c59

Oyaki122 requested a review from Hiroshiba February 26, 2022 17:58

cpu版もビルドするようにする

cd52468

Hiroshiba mentioned this pull request Mar 10, 2022

DirectML版の動作チェック #90

Closed

5 tasks

Hiroshiba approved these changes Mar 10, 2022

View reviewed changes

Hiroshiba merged commit 8c607b4 into VOICEVOX:main Mar 10, 2022

Oyaki122 deleted the dml branch March 10, 2022 16:02

y-chan mentioned this pull request Mar 12, 2022

DirectML対応 VOICEVOX/voicevox_engine#363

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DirectMLを使用する #81

DirectMLを使用する #81

Oyaki122 commented Feb 22, 2022 •

edited

Loading

Oyaki122 commented Feb 22, 2022 •

edited

Loading

Hiroshiba left a comment

Patchethium commented Feb 23, 2022

o108minmin commented Feb 23, 2022 •

edited

Loading

Yosshi999 commented Feb 26, 2022

Yosshi999 left a comment

Yosshi999 left a comment

Hiroshiba commented Feb 26, 2022

Oyaki122 commented Feb 26, 2022

Oyaki122 commented Feb 26, 2022

Hiroshiba left a comment

DirectMLを使用する #81

DirectMLを使用する #81

Conversation

Oyaki122 commented Feb 22, 2022 • edited Loading

内容

関連 Issue

その他

Oyaki122 commented Feb 22, 2022 • edited Loading

比較

PC1

PC2

Hiroshiba left a comment

Choose a reason for hiding this comment

Patchethium commented Feb 23, 2022

o108minmin commented Feb 23, 2022 • edited Loading

Yosshi999 commented Feb 26, 2022

Yosshi999 left a comment

Choose a reason for hiding this comment

Yosshi999 left a comment

Choose a reason for hiding this comment

Hiroshiba commented Feb 26, 2022

Oyaki122 commented Feb 26, 2022

Oyaki122 commented Feb 26, 2022

Hiroshiba left a comment

Choose a reason for hiding this comment

Oyaki122 commented Feb 22, 2022 •

edited

Loading

Oyaki122 commented Feb 22, 2022 •

edited

Loading

o108minmin commented Feb 23, 2022 •

edited

Loading