Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

producer-consumer への変更での並列実行の高速化 #204

Merged

Conversation

r-terada
Copy link
Collaborator

issue: #190

parallel での実行を mp.pool:map から producer-consumer パターンに変更することで高速化しました.
macos, 4 cores Intel Core i5, 8並列で約20万行のファイルを解析すると 26min. -> 20min. とおよそ 20% ほどの高速化ができました.

use producer-consumer pattern with mp.Process and mp.Queue.
@r-terada r-terada changed the title Feature/optimize parallel process and queue producer-consumer への変更での並列実行の高速化 Oct 29, 2021
r-terada and others added 23 commits November 7, 2021 15:34
…ocess_and_queue_matsuda

debug multi processing and refactor analyzer.py
it depends on sudachi-style normalization
TODO: FIX test case
…f_spacy_v3.2

 use morph features of spacy v3.2 (revised)
@r-terada r-terada marked this pull request as ready for review November 20, 2021 11:48
@hiroshi-matsuda-rit
Copy link
Contributor

All test cases succeeded.

@hiroshi-matsuda-rit hiroshi-matsuda-rit merged commit 75bbe38 into develop Nov 20, 2021
@hiroshi-matsuda-rit hiroshi-matsuda-rit deleted the feature/optimize_parallel_process_and_queue branch November 20, 2021 14:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants