Kedro tries to register the same input dataset twice when that dataset is used by multiple nodes and the pipeline is run with ThreadRunner.
Context
I'm trying to use dataset factories with ThreadRunner to run independent nodes in parallel and reduce execution time. With SequentialRunner the pipeline works; with ThreadRunner a DatasetAlreadyExistsError is raised.
## Your Environment
* Kedro version used (`pip show kedro` or `kedro -V`): 0.19.6
* Python version used (`python -V`): 3.10.12
* Operating system and version: macOS, Darwin Kernel Version 23.5.0
Steps to Reproduce
Minimal code to reproduce:
catalog.yml
pipeline.py
Run the project:
kedro run -r ThreadRunner
Expected Result
The same input dataset can be read by different nodes.
Actual Result
The pipeline fails with a DatasetAlreadyExistsError. Error log:
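The symptom reported here (works sequentially, fails with "already exists" under threads) is consistent with a check-then-register race, although this report does not confirm that as the cause. The sketch below is a toy model only, not Kedro's code: `ToyCatalog`, `resolve_unsafe`, `resolve_locked` and `run_in_threads` are hypothetical names, and the local `DatasetAlreadyExistsError` is a stand-in for the Kedro exception. A `threading.Barrier` forces both threads past the existence check before either registers, so the race is reproduced deterministically.

```python
import threading


class DatasetAlreadyExistsError(Exception):
    """Toy stand-in for the Kedro exception of the same name."""


class ToyCatalog:
    """Hypothetical catalog that registers a dataset lazily on first use."""

    def __init__(self):
        self._datasets = {}
        self._add_lock = threading.Lock()
        self._resolve_lock = threading.Lock()

    def add(self, name, dataset):
        # Registration itself is atomic, but it refuses duplicates.
        with self._add_lock:
            if name in self._datasets:
                raise DatasetAlreadyExistsError(name)
            self._datasets[name] = dataset

    def resolve_unsafe(self, name, barrier):
        # Check-then-act without a lock: two threads can both see
        # "not registered" and then both try to register.
        if name not in self._datasets:
            barrier.wait()  # make both threads pass the check first
            self.add(name, object())
        return self._datasets[name]

    def resolve_locked(self, name):
        # The check and the registration happen under one lock,
        # so only the first thread registers the dataset.
        with self._resolve_lock:
            if name not in self._datasets:
                self.add(name, object())
            return self._datasets[name]


def run_in_threads(worker, n=2):
    """Run `worker` in n threads and collect DatasetAlreadyExistsError."""
    errors = []

    def target():
        try:
            worker()
        except DatasetAlreadyExistsError as exc:
            errors.append(exc)

    threads = [threading.Thread(target=target) for _ in range(n)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return errors


if __name__ == "__main__":
    unsafe = ToyCatalog()
    barrier = threading.Barrier(2)
    print(len(run_in_threads(lambda: unsafe.resolve_unsafe("shared", barrier))))

    safe = ToyCatalog()
    print(len(run_in_threads(lambda: safe.resolve_locked("shared"))))
```

In this toy model, the unsafe path fails in exactly one of the two threads, while the locked path lets both threads read the same shared entry. Resolving every dataset before any worker thread starts would avoid the race in the same way.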