[marathon] Marathon plugin slows down agent when marathon has many apps running #1861
Comments
Hi @Zarkantho, sorry to hear you are having trouble with our Marathon integration. Could you reach out to support AT datadoghq.com with those details and a 'flare' archive, please? That is valuable feedback that helps us understand your needs and improve the check. Thank you.
Hi @yannmh, sorry about this, but we have already replaced the marathon check with a custom check to work around the problem, so the logs would probably not be relevant at this point. All we did was comment out this line: https://github.com/DataDog/dd-agent/blob/5.4.4/checks.d/marathon.py#L53, since we don't care about the "versions" of each application in Marathon. That made the problem go away. Sorry again that we can't use the standard tool here, but I hope this description is at least somewhat helpful for diagnosing the problem.
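The workaround above can be sketched as a check that computes its gauges from a single `/v2/apps` response and never issues the per-app versions requests. This is an illustrative sketch, not the dd-agent check API: the function name, metric names, and payload fields are assumptions based on what Marathon's `/v2/apps` endpoint typically returns.

```python
# Hypothetical sketch of the workaround: derive all metrics from one
# /v2/apps payload, skipping the costly per-app /v2/apps/<id>/versions
# requests. Names and fields here are illustrative assumptions.

def marathon_metrics(apps_payload):
    """Extract (metric, value[, tags]) tuples from a /v2/apps JSON payload."""
    metrics = []
    apps = apps_payload.get("apps", [])
    # One gauge for the total app count.
    metrics.append(("marathon.apps", len(apps)))
    # Per-app gauges, tagged by app id; no extra HTTP requests needed.
    for app in apps:
        tags = ["app_id:%s" % app["id"]]
        for field in ("instances", "cpus", "mem", "tasksRunning", "tasksStaged"):
            if field in app:
                metrics.append(("marathon." + field, app[field], tags))
    return metrics
```

Because everything comes from the one list response the agent already fetches, the check's cost stays constant regardless of how many apps the framework runs.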
Happy to hear you found a solution to this issue. Still, let's keep the ticket open so we can assess it for the 5.6.0 agent release based on your feedback. Thanks!
This metric is not really useful but it’s really costly to collect. Let’s remove it. Fix #1861
We are monitoring a Marathon framework with over 150 apps using Datadog, and the marathon check appears to slow down the entire Datadog process.
After investigating what the plugin actually does, the problem seems to be this loop: https://github.com/DataDog/dd-agent/blob/5.4.4/checks.d/marathon.py#L46. The agent hits the API sequentially, once per app, so with 150 apps it stops reporting metrics long enough to trigger some of our other alerts.
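A back-of-the-envelope sketch shows why the loop above stalls the collector: with one versions request per app issued sequentially, total check time grows linearly with the app count. The 200 ms per-request latency below is an assumption for illustration only.

```python
# Rough cost model of the sequential collection loop described above.
# The per-request latency is an assumed figure, not a measured one.

def sequential_collection_time(n_apps, latency_per_request_s):
    # 1 request for the app list, plus 1 versions request per app.
    return (1 + n_apps) * latency_per_request_s

# With 150 apps and a modest 200 ms per request, a single check run
# takes on the order of 30 seconds, during which the agent is blocked.
total = sequential_collection_time(150, 0.2)
```

At that scale, one check run can exceed the agent's collection interval, which matches the reported symptom of other alerts firing while the marathon check runs.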