[SPARK-3562]Periodic cleanup event logs #2391
|
Can one of the admins verify this patch? |
|
Can you explain this patch? What problem does it solve and why? There is no JIRA here either. |
|
@srowen If we run Spark applications frequently, they write many event logs into spark.eventLog.dir. Over time, the directory fills up with event logs that we no longer care about, so we need to delete the logs that exceed a time limit. |
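The age-based cleanup described in the comment above can be illustrated with a minimal sketch. It is not the code from this patch: the function names `find_expired_logs` and `delete_expired_logs` and the `max_age_seconds` parameter are hypothetical, and it assumes each event log is a plain file on a local filesystem whose modification time marks its age.

```python
import os
import time


def find_expired_logs(log_dir, max_age_seconds, now=None):
    """Return paths in log_dir last modified more than max_age_seconds ago."""
    now = time.time() if now is None else now
    expired = []
    for entry in os.scandir(log_dir):
        # Age is measured from the last modification time of the entry.
        if now - entry.stat().st_mtime > max_age_seconds:
            expired.append(entry.path)
    return sorted(expired)


def delete_expired_logs(log_dir, max_age_seconds, now=None):
    """Delete expired plain files and return the paths that were removed."""
    removed = []
    for path in find_expired_logs(log_dir, max_age_seconds, now):
        # Assumption: only plain files are removed; directories are left alone.
        if os.path.isfile(path):
            os.remove(path)
            removed.append(path)
    return removed
```

Splitting the scan from the delete makes it possible to log or inspect which files would be removed before anything is actually deleted.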
|
Hey @viper-kun can you file a JIRA and put it in the title of this PR? See how other Spark PRs are formatted. |
|
ok to test |
|
QA tests have started for PR 2391 at commit
|
|
QA tests have finished for PR 2391 at commit
|
Pretty sure you don't want to clean logs on initialization. Otherwise your history server will always display nothing!
Yes, you are right; there is no need to clean the logs on initialization. I cannot change it here because the source repository shows as unknown, so I will submit another commit.
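One way to avoid cleaning on initialization is to run the cleanup on a timer whose first run happens only after a full interval has elapsed. The sketch below illustrates that idea under stated assumptions: the `PeriodicCleaner` class and its parameters are hypothetical names, not part of Spark or of this patch.

```python
import threading


class PeriodicCleaner:
    """Run `task` every `interval_seconds`; the first run is one interval after start()."""

    def __init__(self, task, interval_seconds):
        self._task = task
        self._interval = interval_seconds
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._loop, daemon=True)

    def start(self):
        self._thread.start()

    def stop(self):
        # Wake the loop immediately and wait for the worker thread to exit.
        self._stop.set()
        self._thread.join()

    def _loop(self):
        # Event.wait returns False on timeout and True once stop() has been called,
        # so the task never runs at startup, only after each full interval.
        while not self._stop.wait(self._interval):
            self._task()
```

Because the loop waits before calling the task, a freshly started server would still see every existing log until the first interval has passed.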
|
@viper-kun I'm not sure if it's the HistoryServer's responsibility to delete the application logs. The HistoryServer is intended to be a serving daemon; it should not modify application output, since it did not create any output in the first place. If I'm not wrong, Hadoop's JobHistoryServer has the same semantics (i.e. it doesn't randomly delete your logs). Perhaps @srowen can confirm this. |
|
@andrewor14 I have checked Hadoop's JobHistoryServer. It is the JobHistoryServer's responsibility to delete the application logs. |
nit: cleans, logs
|
Looks ok to me, mostly a few nits. Could you also update |
|
@vanzin, @andrewor14 Thanks for your feedback. Because I deleted the source branch, I cannot change it in this PR. I have submitted another PR [#2471] and changed the source according to your comments. |
|
@viper-kun Since there is now a new PR, would you mind closing this one? It's confusing to have two PRs that do the same thing. |
No description provided.