Default logging is noisy and not too useful #323

aigarius · 2016-08-03T10:46:01Z

Here is an example of how the default logging configuration looks in action:

sh.command: INFO: <Command '/usr/bin/git add tes...(6 more)' call_args {'in': None, 'no_out...(509 more)>: starting process

You will notice that both the executed command and arguments are cut off and unusable. At the same time the message is logged at INFO level.

First the application or library should not make assumptions about what is a good logging line size - people log to files, to logging servers and even to log aggregators that process megabytes of logs per second effortlessly. There should not be any ellipsing of the log data.

IMHO there is too much data for the INFO level. There is no need for the "<Command " or the call_args there. The INFO level message (either should not be there at all or) should just say "sh.command: INFO: '/usr/bin/git add testfile' started, pid 1232" and then have a DEBUG level message directly after that with full call_args data dumped into the line.

And if there is a start message, there should also be a stop message at the same log level. That is why you need the pid to link together via logs multiple instances of the same command starting and stopping from multiple threads.

The text was updated successfully, but these errors were encountered:

jf--- · 2016-09-09T18:26:47Z

True, its not particularly useful...
Also, I havent seen a flag to turn off logging?

aigarius · 2016-09-09T21:23:30Z

You can control logging via default Python logging module settings. Like this:
logging.getLogger("sh.command").setLevel(logging.ERROR)

no truncating output end process logging message pids in info logs

amoffat · 2016-10-06T07:27:48Z

I cleaned up the logging a bit more. Now the logging output is not truncated, and there is a ending logging message for processes that complete. Messages also include the pid, but exclude the call args.

I am leaving the default logging level to be INFO however, and the format of the info messages more or less the same. You may not see a use for it, @aigarius, but when child loggers are used and debugging logging is turned on, the <Command> and <Process> delimiters make the logs easier to read. Example:

INFO:sh.command:<Command '/bin/ls'>: starting process
DEBUG:sh.command.process:<Command '/bin/ls'>.<Process 14278 ['/bin/ls']>: started process
DEBUG:sh.command.process.streamwriter:<Command '/bin/ls'>.<Process 14278 ['/bin/ls']>.stdin: parsed stdin as a queue
DEBUG:sh.command.process:<Command '/bin/ls'>.<Process 14278 ['/bin/ls']>: <sh.StreamWriter object at 0x7fba62d6f490> ready for more input
INFO:sh.command:<Command '/bin/ls', pid 14278>: process started
DEBUG:sh.command.process:<Command '/bin/ls'>.<Process 14278 ['/bin/ls']>: acquiring wait lock to wait for completion
DEBUG:sh.command.process:<Command '/bin/ls'>.<Process 14278 ['/bin/ls']>: got wait lock
DEBUG:sh.command.process:<Command '/bin/ls'>.<Process 14278 ['/bin/ls']>: exit code not set, waiting on pid
DEBUG:sh.command.process:<Command '/bin/ls'>.<Process 14278 ['/bin/ls']>: <sh.StreamReader object at 0x7fba62d6f650> ready to be read from
DEBUG:sh.command.process.streamreader:<Command '/bin/ls'>.<Process 14278 ['/bin/ls']>.stdout: got chunk size 720: 'AUTHORS.md\t      MANIFEST\t\t   '
DEBUG:sh.stream_bufferer:acquiring buffering lock to process chunk (buffering: 1)
DEBUG:sh.stream_bufferer:got buffering lock to process chunk (buffering: 1)
DEBUG:sh.stream_bufferer:released buffering lock for processing chunk (buffering: 1)
...

These changes are on the release-1.2 branch and will ship when that ships

aigarius · 2016-10-06T07:36:29Z

That is ok, but looking at the example you provided it feels like having a DEBUG level message with all the args and parameters (like cwd) would be very useful in this context. Unless I misunderstand and "/bin/ls" here is actually the full command line with all arguments. In which case the DEBUG level messages could get a bit noise in the Process header level, but that is fine for DEBUG.

Edit: Having a separate DEBUG level message in sh.command with a list of args could still useful, so that it can be enabled while sh.command.process DEBUG messages can be silenced.

amoffat · 2016-10-06T07:42:33Z

Fair points...I honestly only ever use DEBUG when I am debugging the internals of sh, and I have not yet needed more info. I think the changes you suggested at first are good enough for now, and when we need more, we can make more changes 👍

aigarius · 2016-10-06T07:52:14Z

Cheers mate, thanks for the work! 👍

* added `_out` and `_out_bufsize` validator [#346](amoffat/sh#346) * bugfix for internal stdout thread running when it shouldn't [#346](amoffat/sh#346) * regression bugfix on timeout [#344](amoffat/sh#344) * regression bugfix on `_ok_code=None` * further improvements on cpu usage * regression in cpu usage [#339](amoffat/sh#339) * fd leak regression and fix for flawed fd leak detection test [#337](amoffat/sh#337) * support for `io.StringIO` in python2 * added support for using raw file descriptors for `_in`, `_out`, and `_err` * removed `.close()`ing `_out` handler if FIFO detected * composed commands no longer propagate `_bg` * better support for using `sys.stdin` and `sys.stdout` for `_in` and `_out` * bugfix where `which()` would not stop searching at the first valid executable found in PATH * added `_long_prefix` for programs whose long arguments start with something other than `--` [#278](amoffat/sh#278) * added `_log_msg` for advanced configuration of log message [#311](amoffat/sh#311) * added `sh.contrib.sudo` * added `_arg_preprocess` for advanced command wrapping * alter callable `_in` arguments to signify completion with falsy chunk * bugfix where pipes passed into `_out` or `_err` were not flushed on process end [#252](amoffat/sh#252) * deprecated `with sh.args(**kwargs)` in favor of `sh2 = sh(**kwargs)` * made `sh.pushd` thread safe * added `.kill_group()` and `.signal_group()` methods for better process control [#237](amoffat/sh#237) * added `new_session` special keyword argument for controlling spawned process session [#266](amoffat/sh#266) * bugfix better handling for EINTR on system calls [#292](amoffat/sh#292) * bugfix where with-contexts were not threadsafe [#247](amoffat/sh#195) * `_uid` new special keyword param for specifying the user id of the process [#133](amoffat/sh#133) * bugfix where exceptions were swallowed by processes that weren't waited on [#309](amoffat/sh#309) * bugfix where processes that dupd their stdout/stderr to a long running child process would cause sh to hang [#310](amoffat/sh#310) * improved logging output [#323](amoffat/sh#323) * bugfix for python3+ where binary data was passed into a process's stdin [#325](amoffat/sh#325) * Introduced execution contexts which allow baking of common special keyword arguments into all commands [#269](amoffat/sh#269) * `Command` and `which` now can take an optional `paths` parameter which specifies the search paths [#226](amoffat/sh#226) * `_preexec_fn` option for executing a function after the child process forks but before it execs [#260](amoffat/sh#260) * `_fg` reintroduced, with limited functionality. hurrah! [#92](amoffat/sh#92) * bugfix where a command would block if passed a fd for stdin that wasn't yet ready to read [#253](amoffat/sh#253) * `_long_sep` can now take `None` which splits the long form arguments into individual arguments [#258](amoffat/sh#258) * making `_piped` perform "direct" piping by default (linking fds together). this fixes memory problems [#270](amoffat/sh#270) * bugfix where calling `next()` on an iterable process that has raised `StopIteration`, hangs [#273](amoffat/sh#273) * `sh.cd` called with no arguments no changes into the user's home directory, like native `cd` [#275](amoffat/sh#275) * `sh.glob` removed entirely. the rationale is correctness over hand-holding. [#279](amoffat/sh#279) * added `_truncate_exc`, defaulting to `True`, which tells our exceptions to truncate output. * bugfix for exceptions whose messages contained unicode * `_done` callback no longer assumes you want your command put in the background. * `_done` callback is now called asynchronously in a separate thread. * `_done` callback is called regardless of exception, which is necessary in order to release held resources, for example a process pool

samjewell · 2021-07-16T14:31:20Z

Now the logging output is not truncated

@amoffat Is this still true? I see truncate_cap = 750 in the code still.

I share @aigarius opinion that output to STDOUT and STDERR should not be truncated at all, even to as many as 750 chars. It means that you can't use sh out of the box ("batteries included") and see what errors are actually occurring in the shell commands that you're calling, because they fall outside the 750 char limit, and you're forced to go back and add exception handling around the call to the sh command.

Would you consider removing that 750 char limit?

ecederstrand · 2021-07-21T08:45:23Z

There are pros and cons to logging the full stdout/stderr output.

I see your point about getting the full context without having to go back and wrap your code in a try/except and explicitly capture the full output.

On the other hand, I think it's a good principle that logging statements shouldn't be able to single-handedly break your code. That's why for example logging.error('%s %s', 'foo') prints a stack trace but does not raise an exception. If you don't put any limit on the logging message, you could potentially emit a multi-TB log message and risk:
* Exhausting disk space, effectively killing your server
* Running out of memory, also effectively killing your server
* Breaking downstream log processors
And all of the above would happen without any obvious indication about the reason, because you're still not done logging the full message.

If you're fine with those risks, then sh still allows a simple way of disabling the limit:

import sh
sh.ErrorReturnCode.truncate_cap = 10**1000

# Your code using 'sh' here

amoffat pushed a commit that referenced this issue Oct 6, 2016

better logging, closes #323

0ee6ceb

no truncating output end process logging message pids in info logs

amoffat added the 1.2 label Oct 6, 2016

amoffat closed this as completed Oct 6, 2016

amoffat mentioned this issue Oct 6, 2016

Cleanup logging #255

Closed

amoffat modified the milestone: 1.2 Oct 25, 2016

amoffat removed the 1.2 label Oct 25, 2016

This was referenced Dec 16, 2016

Update sh to 1.12.8 Dallinger/psiTurk#49

Closed

Update sh to 1.12.8 jayfk/cookiecutter-saas#135

Closed

This was referenced Jan 4, 2017

Update sh to 1.12.9 Dallinger/psiTurk#54

Open

Update sh to 1.12.9 jayfk/cookiecutter-saas#154

Closed

pyup-bot mentioned this issue Feb 13, 2017

Update sh to 1.12.9 dimagi/commcare-hq#14923

Closed

pyup-bot mentioned this issue Mar 2, 2017

Update sh to 1.12.10 jayfk/cookiecutter-saas#187

Closed

pyup-bot mentioned this issue Mar 14, 2017

Update sh to 1.12.11 jayfk/cookiecutter-saas#198

Closed

This was referenced Mar 30, 2017

Update sh to 1.12.12 jayfk/cookiecutter-saas#204

Closed

Update sh to 1.12.13 jayfk/cookiecutter-saas#205

Closed

friendly-test-bot mentioned this issue Apr 5, 2017

Update sh to 1.12.7 jayfk/cookiecutter-saas#215

Closed

pyup-bot mentioned this issue Jun 7, 2017

Update sh to 1.12.14 jayfk/cookiecutter-saas#257

Open

pyup-bot mentioned this issue Jun 28, 2017

Update sh to 1.12.14 abkfenris/gage-beaglebone#38

Open

pyup-bot mentioned this issue Jul 16, 2017

Initial Update LuisAlejandro/candyshop#8

Closed

pyup-bot mentioned this issue Jul 26, 2017

Initial Update NdagiStanley/room-allocation#6

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Default logging is noisy and not too useful #323

Default logging is noisy and not too useful #323

aigarius commented Aug 3, 2016

jf--- commented Sep 9, 2016

aigarius commented Sep 9, 2016

amoffat commented Oct 6, 2016 •

edited

Loading

aigarius commented Oct 6, 2016 •

edited

Loading

amoffat commented Oct 6, 2016

aigarius commented Oct 6, 2016

samjewell commented Jul 16, 2021 •

edited

Loading

ecederstrand commented Jul 21, 2021

Default logging is noisy and not too useful #323

Default logging is noisy and not too useful #323

Comments

aigarius commented Aug 3, 2016

jf--- commented Sep 9, 2016

aigarius commented Sep 9, 2016

amoffat commented Oct 6, 2016 • edited Loading

aigarius commented Oct 6, 2016 • edited Loading

amoffat commented Oct 6, 2016

aigarius commented Oct 6, 2016

samjewell commented Jul 16, 2021 • edited Loading

ecederstrand commented Jul 21, 2021

amoffat commented Oct 6, 2016 •

edited

Loading

aigarius commented Oct 6, 2016 •

edited

Loading

samjewell commented Jul 16, 2021 •

edited

Loading