Avoid modifying log level globally #1944

droctothorpe · 2023-11-08T22:06:13Z

What this PR does / why we need it:
This PR eliminates the code that was setting the log level to INFO globally, as documented in this issue.

The get_job_logs implementation needs to be addressed before merging, but I figured it would be easier to talk about with a proper PR @johnugeorge.

Do you want it to not print to stdout by default and just return a data structure that looks like this?

{
    pod1: logs,
    pod2: logs,
    pod3: logs
}

Which issue(s) this PR fixes
Fixes #1942

coveralls · 2023-11-09T03:23:00Z

Pull Request Test Coverage Report for Build 6804550532

0 of 0 changed or added relevant lines in 0 files are covered.
4 unchanged lines in 1 file lost coverage.
Overall coverage decreased (-0.03%) to 42.859%

Files with Coverage Reduction	New Missed Lines	%
pkg/controller.v1/mpi/mpijob_controller.go	4	81.05%

Totals
Change from base Build 6723816999:	-0.03%
Covered Lines:	3751
Relevant Lines:	8752

💛 - Coveralls

johnugeorge · 2023-11-09T03:23:07Z

Sounds good to me.

droctothorpe · 2023-11-09T15:18:37Z

@johnugeorge I updated get_job_logs to return Dict[str, str].

There's one corner case: when follow is set to True. It seems to be meant to function like kubectl logs <pod> -f, where emitting to stdout is the whole point, so I use a print statement in that branch of the logic only.
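
For illustration, the behavior described in this comment could be sketched roughly as follows. This is not the SDK's code: collect_pod_logs, read_lines, and the fake_logs data are hypothetical names, and whether lines are also collected into the returned dict while following is an assumption.

```python
from typing import Callable, Dict, Iterable, List


def collect_pod_logs(
    pods: List[str],
    read_lines: Callable[[str], Iterable[str]],
    follow: bool = False,
) -> Dict[str, str]:
    """Return {pod_name: logs}; echo lines to stdout only when follow=True."""
    logs: Dict[str, str] = {}
    for pod in pods:
        collected = []
        for line in read_lines(pod):
            if follow:
                # Streaming mode mirrors `kubectl logs <pod> -f`,
                # where writing to stdout is the whole point.
                print(f"[Pod {pod}]: {line}")
            collected.append(line)
        logs[pod] = "\n".join(collected)
    return logs


# Hypothetical stand-in for the Kubernetes call that reads a pod's log.
fake_logs = {"pod1": ["step 1", "step 2"], "pod2": ["step 1"]}
result = collect_pod_logs(list(fake_logs), lambda pod: fake_logs[pod])
```

With follow left at False nothing is printed, and the caller decides what to do with the returned dict.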

coveralls · 2023-11-09T20:02:11Z

Pull Request Test Coverage Report for Build 6851207814

0 of 0 changed or added relevant lines in 0 files are covered.
2 unchanged lines in 1 file lost coverage.
Overall coverage decreased (-0.01%) to 42.882%

Files with Coverage Reduction	New Missed Lines	%
pkg/controller.v1/mpi/mpijob_controller.go	2	81.24%

Totals
Change from base Build 6723816999:	-0.01%
Covered Lines:	3753
Relevant Lines:	8752

💛 - Coveralls

johnugeorge · 2023-11-10T18:32:41Z

@droctothorpe Tests failed

kuizhiqing · 2023-11-13

LGTM

BTW, for the format part, we can do something like this

logger = logging.getLogger(__name__)
# logger.setLevel(logging.INFO)
formatter = logging.Formatter(
    fmt='%(name)s %(levelname)s %(asctime)s %(message)s')
ch = logging.StreamHandler()
ch.setFormatter(formatter)
logger.addHandler(ch)

droctothorpe · 2023-11-13T14:19:27Z

@kuizhiqing that's neat! Previously, the format was actually stripping all of the metadata:

logging.basicConfig(format="%(message)s")

I didn't implement the original code, but I suspect it was implemented to make logging function more like print (i.e. no structured log metadata). This PR eliminates that pattern, so modifying the logging format should no longer be necessary, but it's good to know that this is an option.
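
To make the difference between the two formats concrete, here is a small sketch. The render helper and the format_demo logger name are made up for this example, and %(asctime)s is left out so the output is deterministic.

```python
import io
import logging


def render(fmt: str, message: str) -> str:
    """Emit one INFO record formatted with `fmt` and return the text."""
    stream = io.StringIO()
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter(fmt=fmt))
    logger = logging.getLogger("format_demo")  # hypothetical logger name
    logger.setLevel(logging.INFO)
    logger.propagate = False  # keep the demo away from the root logger
    logger.addHandler(handler)
    try:
        logger.info(message)
    finally:
        logger.removeHandler(handler)
    return stream.getvalue()


# Message-only format: reads like print, all metadata is dropped.
bare = render("%(message)s", "job created")
# Format with metadata: logger name and level are kept.
rich = render("%(name)s %(levelname)s %(message)s", "job created")
```

Here bare is "job created\n", while rich is "format_demo INFO job created\n".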

droctothorpe · 2023-11-13T14:20:16Z

/retest

droctothorpe · 2023-11-13T16:21:30Z

@johnugeorge I don't have retest privileges apparently 🙃 .

johnugeorge · 2023-11-13T19:06:46Z

Thanks @kuizhiqing for review.
Thanks @droctothorpe for adding the fix.

/lgtm
/approve

google-oss-prow · 2023-11-13T19:07:02Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: droctothorpe, johnugeorge, kuizhiqing

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~sdk/python/OWNERS~~ [johnugeorge]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

andreyvelich · 2023-11-13T19:20:39Z

sdk/python/kubeflow/training/api/training_client.py

@@ -849,7 +853,7 @@ def get_job_logs(
                            if logline is None:
                                finished[index] = True
                                break
-                            logging.info("[Pod %s]: %s", pods[index], logline)
+                            print(f"[Pod {pods[index]}]: {logline}")


Thank you for updating the logging.
@droctothorpe Do we want to use print here instead of logging ?
Maybe we should keep our logging in SDK consistent ?
For example, KFP uses logging.info also: https://github.com/kubeflow/pipelines/blob/master/sdk/python/kfp/client/client.py#L470
And they also setup logging here: https://github.com/kubeflow/pipelines/blob/a9279843946183429f6572516acee6523de36e53/sdk/python/kfp/cli/__main__.py#L23

droctothorpe · 2023-11-13

Thanks, @andreyvelich! I'm still wrapping my head around some of these nuances, so take these answers with a grain of salt (Python logging is surprisingly complex).

Do we want to use print here instead of logging ?

print is more performant, and the Python docs recommend it for displaying console output for ordinary usage of a command-line script or program.

Source: https://docs.python.org/3/howto/logging.html#when-to-use-logging

And they also setup logging here:
https://github.com/kubeflow/pipelines/blob/a9279843946183429f6572516acee6523de36e53/sdk/python/kfp/cli/__main__.py#L23

Note that they only do this in the CLI. That's because the CLI is a user-facing API so to speak, i.e. it's not meant to be consumed by other libraries that have their own opinions on how logging should be configured. From the Python docs:

It is strongly advised that you do not add any handlers other than NullHandler to your library’s loggers. This is because the configuration of handlers is the prerogative of the application developer who uses your library. The application developer knows their target audience and what handlers are most appropriate for their application: if you add handlers ‘under the hood’, you might well interfere with their ability to carry out unit tests and deliver logs which suit their requirements.

Source: https://docs.python.org/3/howto/logging.html#configuring-logging-for-a-library
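
A minimal sketch of the division of labor that the docs passage describes, assuming a hypothetical library package named mylib: the library attaches only a NullHandler, and the application decides where records go and how they look.

```python
import io
import logging

# --- library side (hypothetical package "mylib") ---
lib_logger = logging.getLogger("mylib")
lib_logger.addHandler(logging.NullHandler())  # the only handler the library adds


def do_work() -> None:
    lib_logger.info("work done")


# Before the application configures anything, this call emits nothing
# to the application's stream.
do_work()

# --- application side ---
stream = io.StringIO()  # stands in for stderr, a file, etc.
app_handler = logging.StreamHandler(stream)
app_handler.setFormatter(logging.Formatter("%(name)s %(levelname)s %(message)s"))
app_logger = logging.getLogger("mylib")
app_logger.setLevel(logging.INFO)
app_logger.addHandler(app_handler)

do_work()
captured = stream.getvalue()
```

Only the second call reaches the application's handler, so captured holds a single line.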

andreyvelich · 2023-11-13

That makes sense, thanks for sharing these links!
I agree that we should have an appropriate logger for the SDK (e.g. logging.getLogger(__name__)) instead of using the root logger, but I still think we should use logging to report some data to the user.

The problem with print is that the user can't identify which library generated the output, whereas with logging we can configure that.

What do you think about this:

We can introduce a new parameter to TrainingClient() called verbose, and we can configure this parameter for various levels of logging: https://docs.python.org/3/library/logging.html#levels. We can start with (INFO, WARNING, and ERROR).

Depending on this parameter, we can configure our logger in the constructor. For example, for verbose=20:

self.logger = logging.getLogger(__name__)
if verbose == 20:
    self.logger.setLevel(logging.INFO)

Then we use self.logger.info(), self.logger.warning(), or self.logger.error() whenever we need to report some data.

If the user doesn't want to see any logs, they can always override it as follows:

logger = logging.getLogger("kubeflow.training.api.training_client")
logger.setLevel(logging.NOTSET)

Or provide verbose=0 as a TrainingClient() parameter:

client = TrainingClient(verbose=0)

WDYT @droctothorpe @johnugeorge @kuizhiqing ?
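
A rough sketch of how the proposed verbose parameter might look, under stated assumptions: DemoClient and the demo_sdk.client logger name are hypothetical stand-ins, not the TrainingClient API. One detail worth noting is that logging.NOTSET on a non-root logger delegates the level decision to its ancestors rather than silencing output, so this sketch maps verbose=0 to a threshold above CRITICAL instead.

```python
import logging

# Hypothetical module-level logger; a real SDK would use logging.getLogger(__name__).
_logger = logging.getLogger("demo_sdk.client")
_logger.addHandler(logging.NullHandler())


class DemoClient:
    """Hypothetical client; only the logging setup is shown."""

    def __init__(self, verbose: int = logging.WARNING) -> None:
        self.logger = _logger
        if verbose == logging.NOTSET:
            # Assumption: verbose=0 should silence the SDK entirely, so raise
            # the threshold above CRITICAL instead of passing NOTSET through.
            self.logger.setLevel(logging.CRITICAL + 1)
        elif verbose in (logging.INFO, logging.WARNING, logging.ERROR):
            self.logger.setLevel(verbose)
        else:
            raise ValueError(f"unsupported verbose level: {verbose}")


client = DemoClient(verbose=logging.INFO)
info_enabled = client.logger.isEnabledFor(logging.INFO)

quiet = DemoClient(verbose=0)
critical_enabled = quiet.logger.isEnabledFor(logging.CRITICAL)
```

Because both instances share one named logger, the most recently constructed client determines the level, which is a trade-off of configuring a module-level logger from a constructor.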

Avoid modifying log level globally

9fe1450

google-oss-prow bot added the size/S label Nov 8, 2023

google-oss-prow bot requested review from jinchihe and kuizhiqing November 8, 2023 22:06

Address get_job_logs

1a46c85

kuizhiqing approved these changes Nov 13, 2023

Fix integration tests

afd218e

google-oss-prow bot assigned johnugeorge Nov 13, 2023

google-oss-prow bot added the lgtm label Nov 13, 2023

google-oss-prow bot added the approved label Nov 13, 2023

google-oss-prow bot merged commit 230bfb4 into kubeflow:master Nov 13, 2023
32 checks passed

andreyvelich reviewed Nov 13, 2023

droctothorpe deleted the log-level branch November 13, 2023 20:15

andreyvelich mentioned this pull request Nov 16, 2023

[SDK] Setup Logging Verbose Level in TrainingClient #1946

Closed
