client_idle_timeout does not work #2166

datasage · 2018-08-12T23:42:27Z

What happened:
I upgraded my cluster to use 2.7.3 (from 2.4.7 using the expected upgrade path) so that I could enable idle timeout. In my testing, no matter what setting i set for client_idle_timeout, It would always disable it.

I am currently using the community version of teleport. Configuration is currently set up as one instance serving as auth and proxy server. Storage is set up to use file directory.

From what I could tell, it seems that teleport auth service does not use config values after initially initialized. I dug through the cache files and cluster_configuration always had the client_idle_timeout set to 0, regardless of what the config has.

What you expected to happen:
Client should disconnect when idle timeout period is reached.

How to reproduce it (as minimally and precisely as possible):

I started at version 2.4.7 and upgraded following the upgrade procedure to 2.7.3.
I set the client timeout to 1m.
I connected to the cluster and waited 1m
Client did not disconnect.

Environment:

Teleport version (use teleport version): 2.7.3
Tsh version (use tsh version): 2.7.3
OS (e.g. from /etc/os-release): Amazon Linux AMI 2018.03

Relevant Debug Logs If Applicable
I ran with debug and never saw the debug output from the client idle checks. It would appear that the config is getting read as 0.

The text was updated successfully, but these errors were encountered:

klizhentas · 2018-08-12T23:44:10Z

if you change the settings in teleport.yaml config file, you have to restart or reload the server in order for the changes to take effect.

datasage · 2018-08-12T23:45:54Z

@klizhentas I have done that plenty of times. I've also disabled it, and run it manually with --debug on to see if I could get more information.

klizhentas · 2018-08-12T23:47:47Z

ok, we will take a look. thanks for your bug report

klizhentas · 2018-08-14T00:13:56Z

@datasage I've looked into it and could not reproduce. Can you paste your configuration here (removing all specifics of course) and the places where you are looking at?

datasage · 2018-08-14T02:13:08Z

Starting with the config:

teleport:
    nodename: bastion.mydomain.com
    data_dir: /var/lib/teleport
    advertise_ip: 10.0.0.1
    connection_limits:
        max_connections: 1000
        max_users: 250
    log:
        output: stderr
        severity: ERROR
    storage:
        region: us-east-1
        audit_sessions_uri: s3://my-session-bucket/
    ciphers:
      - aes128-ctr
      - aes192-ctr
      - aes256-ctr
      - aes128-gcm@openssh.com
    kex_algos:
      - curve25519-sha256@libssh.org
      - ecdh-sha2-nistp256
      - ecdh-sha2-nistp384
      - ecdh-sha2-nistp521
    mac_algos:
      - hmac-sha2-256-etm@openssh.com
      - hmac-sha2-256

auth_service:
    enabled: yes
    client_idle_timeout: 15m
    disconnect_expired_cert: yes
    authentication:
        type: local
        second_factor: otp
    listen_addr: 0.0.0.0:3025
    tokens:
        - "node:xxxxxxxxxxxxxxxxxxxx"
    
    cluster_name: "production"

ssh_service:
    enabled: yes
    listen_addr: 0.0.0.0:3022
    labels:
        role: my-role
        type: my-type
    commands:
    - name: awsid
      command: [curl, "http://169.254.169.254/latest/meta-data/instance-id"]
      period: 1h0m0s
    - name: version
      command: [/usr/local/bin/teleport, "version"]
      period: 1h0m0s

proxy_service:
    enabled: yes
    listen_addr: 0.0.0.0:3023
    tunnel_listen_addr: 0.0.0.0:3024
    web_listen_addr: 0.0.0.0:3080
    https_key_file: /etc/teleport/teleport.key
    https_cert_file: /etc/teleport/teleport.crt

I updated the session storage setting to s3 and that worked right away. Are auth service settings initialized once and then stored in the cluster state? The code I looked at seem to indicate that.

Cluster config from cache file (cache/auth/cluster_configuration):

{"kind":"cluster_config","version":"v3","metadata":{"name":"cluster-config"},"spec":{"session_recording":"node","cluster_id":"cluster-uuid","proxy_checks_host_keys":"yes","audit":{"region":"us-east-1","audit_sessions_uri":"s3://my-session-bucket/"},"client_idle_timeout":"0s","disconnect_expired_cert":false}}

datasage · 2018-08-14T03:12:10Z

I set up the cluster originally with 2.3.x so it uses the boltdb backend by default. I was able to find a way to read that db. This does show the correct values.

{"kind":"cluster_config","version":"v3","metadata":{"name":"cluster-config"},"spec":{"session_recording":"node","cluster_id":"cluster-uuid","proxy_checks_host_keys":"yes","audit":{"region":"us-east-1","audit_sessions_uri":"s3://my-session-bucket/"},"client_idle_timeout":"15m0s","disconnect_expired_cert":true}}

I am not sure why cache file is showing a different value or which value is used by the system to make a determination for terminating the idle session. I've never seen any if the idle session entries in the debug log so I would assume the value being used on a given connection is 0.

klizhentas · 2018-08-14T05:55:23Z

cache/node or cache/proxy will be using this setting, not cache/auth, can you take a look there as well? Also you may want to try to set it to 30seconds and quickly see if it works.

datasage · 2018-08-14T15:13:30Z

The state is the same for both node and proxy. I have tried a low timeout, 60 seconds in my case, and it did not disconnect the client.

I recently changed the s3 storage location and that updated in the cache, but the idle timeout and client expiration settings did not.

This commit fixes #2166

This PR fixes #2166, adds suite tests.

This commit fixes #2166

elg0ch0 · 2018-10-22T15:40:02Z

Hi @datasage, is your issue solved?

I'm having issues with client_idle_timeout too but the behavior is a little bit different (I'm using TSH), since it's an issue with idle_timeout might them be related?:

If client_idle_timeout < 5m -> timeout works as expected
if client_idle_timeout > 5m -> the shell becomes unresponsive after 5m idle and by the time when I type anything it gets disconnected a few seconds later.

Regards,

datasage · 2018-10-22T17:55:13Z

My issues have been solved, but i primarily use the Web UI.

This sounds like an idle connection issue to me. A router, or firewall is dropping idle connections after 5 minutes.

elg0ch0 · 2018-10-22T18:08:42Z

I just found that it might be related with tsh version, I tried v2.5.6 and it worked properly but using tsh v3.0.1 didn't.
I'll use it as a workaround (Teleport server running v3.0.1 and tsh v2.5.6)

Thank you anyway!

datasage changed the title ~~client_idle_timeout does not seem to work.~~ client_idle_timeout does not work Aug 12, 2018

klizhentas added a commit that referenced this issue Aug 14, 2018

Fix non-renewing cache.

c55d0bc

This commit fixes #2166

klizhentas mentioned this issue Aug 14, 2018

Fix non-renewing cache. #2169

Merged

klizhentas added a commit that referenced this issue Aug 14, 2018

Fix non-expiring cache for cluster config

49b9ce4

This PR fixes #2166, adds suite tests.

klizhentas added a commit that referenced this issue Aug 14, 2018

Fix non-renewing cache.

1066a16

This commit fixes #2166

klizhentas closed this as completed in 310918f Aug 14, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

client_idle_timeout does not work #2166

client_idle_timeout does not work #2166

datasage commented Aug 12, 2018

klizhentas commented Aug 12, 2018 •

edited

Loading

datasage commented Aug 12, 2018

klizhentas commented Aug 12, 2018

klizhentas commented Aug 14, 2018

datasage commented Aug 14, 2018 •

edited

Loading

datasage commented Aug 14, 2018

klizhentas commented Aug 14, 2018

datasage commented Aug 14, 2018

elg0ch0 commented Oct 22, 2018

datasage commented Oct 22, 2018

elg0ch0 commented Oct 22, 2018

client_idle_timeout does not work #2166

client_idle_timeout does not work #2166

Comments

datasage commented Aug 12, 2018

klizhentas commented Aug 12, 2018 • edited Loading

datasage commented Aug 12, 2018

klizhentas commented Aug 12, 2018

klizhentas commented Aug 14, 2018

datasage commented Aug 14, 2018 • edited Loading

datasage commented Aug 14, 2018

klizhentas commented Aug 14, 2018

datasage commented Aug 14, 2018

elg0ch0 commented Oct 22, 2018

datasage commented Oct 22, 2018

elg0ch0 commented Oct 22, 2018

klizhentas commented Aug 12, 2018 •

edited

Loading

datasage commented Aug 14, 2018 •

edited

Loading