[SDK] Setup Logging Verbose Level in TrainingClient #1946

andreyvelich · 2023-11-16T14:23:32Z

To followup on this thread, I created a new issue: #1944 (comment).

I agree, that we should have appropriate logger for the SDK (e.g. logging.getLogger(__name__)) to not use root logger, but I still think that we should use logging to log some data for the user.

The problem with print is that user can't identify from which library the output was generated, but with logging we can configure it.

What do you think about this:

We can introduce a new parameter to TrainingClient() called verbose, and we can configure this parameter for various levels of logging: https://docs.python.org/3/library/logging.html#levels. We can start with (INFO, WARNING, and ERROR).
Depends on this parameter we can configure our logger in the TrainingClient constructor. For example, for verbose=20

self.logger = logging.getLogger(__name__)
if verbose == 20:
  self.logger.setLevel(logging.INFO)

And then we are going to use self.logger.info() or self.logger.warning() or self.logger.error() when it is required to print some data.
If user doesn't want to see any logs, they can always override it as follows:

logger = logging.getLogger("kubeflow.training.api.training_client")
logger.setLevel(logging.NOTSET)

Or provide verbose=0 in the TrainingClient parameter:

client = TrainingClient(verbose=0)

WDYT @droctothorpe @johnugeorge @kuizhiqing ?

The text was updated successfully, but these errors were encountered:

droctothorpe · 2023-11-16T15:16:34Z

Take this with a grain of salt, but my understanding is that in Python the expectation is that this kind of log level configuration is typically handled out of band. Take a look at httpx's logging documentation, for example:

https://github.com/encode/httpx/blob/master/docs/logging.md#logging

They log a TON of data with info. Library consumers can adjust the level at will, so they don't need to provide a wrapper or alternative interface for that functionality.

andreyvelich · 2023-11-24T17:52:31Z

Sorry for the late reply @droctothorpe.
Do you see any user limitations if we define the specific logger for the SDK (e.g. logging.getLogger(__name__)) and other libraries are going to re-use our SDK package ?

I just want to avoid complexity for users who just use our SDK for the first time and can't see information that we want to log for them (e.g. Experiment has been created) until they modify the log level manually.

johnugeorge · 2023-11-24T18:12:24Z

@andreyvelich Is it really needed? Typically in SDK, logging is not generally required unless explictly enabled

andreyvelich · 2023-11-24T18:46:20Z

@andreyvelich Is it really needed? Typically in SDK, logging is not generally required unless explictly enabled

In that case, should we convert all of our logs to DEBUG messages ?
E.g. when we print logs for Job pods: https://github.com/kubeflow/training-operator/blob/master/sdk/python/kubeflow/training/api/training_client.py#L856.

Kubernetes Python client logs some messages in debug mode: https://github.com/kubernetes-client/python/blob/master/kubernetes/client/rest.py#L235

andreyvelich · 2023-12-04T18:29:07Z

@droctothorpe @johnugeorge Any comments on the above suggestion ?

johnugeorge · 2023-12-04T19:02:15Z

So, when follow is true, how is it planned to print logs ? Do you mean that get_job_logs works only in DEBUG mode ?

andreyvelich · 2023-12-04T21:49:17Z

So, when follow is true, how is it planned to print logs ? Do you mean that get_job_logs works only in DEBUG mode ?

Yes, that's right. So for users to see this they need to run the following:

logger = logging.getLogger("kubeflow.training.api.training_client")
logger.setLevel(logging.DEBUG)

Or, we can configure the default logger to be in DEBUG mode:

logger = logging.getLogger(__name__)
logger.setLevel(logging.DEBUG)

johnugeorge · 2023-12-06T09:04:06Z

Isn't it a bad UX? The user needs to get the result whenever get_job_logs is called. But if we set DEBUG by default, it will create a flood of log messages. I agree that the current solution is not ideal but we need to find out a better solution.

andreyvelich · 2023-12-06T15:09:51Z

In that case, we can set the default logger to INFO and use logger.info() here to print pod logs..
Other prints we will keep as logger.debug() when it is not necessary to log any data.
What do you think @johnugeorge ?

andreyvelich added the area/sdk label Nov 16, 2023

andreyvelich mentioned this issue Jan 3, 2024

[SDK] Add information about TrainingClient logging #1973

Merged

google-oss-prow bot closed this as completed in #1973 Jan 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SDK] Setup Logging Verbose Level in TrainingClient #1946

[SDK] Setup Logging Verbose Level in TrainingClient #1946

andreyvelich commented Nov 16, 2023

droctothorpe commented Nov 16, 2023

andreyvelich commented Nov 24, 2023

johnugeorge commented Nov 24, 2023

andreyvelich commented Nov 24, 2023

andreyvelich commented Dec 4, 2023

johnugeorge commented Dec 4, 2023

andreyvelich commented Dec 4, 2023

johnugeorge commented Dec 6, 2023

andreyvelich commented Dec 6, 2023

[SDK] Setup Logging Verbose Level in TrainingClient #1946

[SDK] Setup Logging Verbose Level in TrainingClient #1946

Comments

andreyvelich commented Nov 16, 2023

droctothorpe commented Nov 16, 2023

andreyvelich commented Nov 24, 2023

johnugeorge commented Nov 24, 2023

andreyvelich commented Nov 24, 2023

andreyvelich commented Dec 4, 2023

johnugeorge commented Dec 4, 2023

andreyvelich commented Dec 4, 2023

johnugeorge commented Dec 6, 2023

andreyvelich commented Dec 6, 2023