-
Notifications
You must be signed in to change notification settings - Fork 40.6k
fluentd doesn't receive the signal TERM #68064
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
/assing @coffeepac |
/retest |
/sign instrumentation |
/remove-sig cluster-lifecycle |
/approve |
@gianrubio please add a release note, this is something people have asked about in the past. |
done @coffeepac |
/test pull-kubernetes-e2e-gke |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: coffeepac, gianrubio The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@gianrubio we missed the 1.12 code freeze by a day, my bad it was today. I had a busy weekend. code freeze runs until Sept 18th so this will get merged then. |
/test pull-kubernetes-integration |
/retest Review the full test history for this PR. Silence the bot with an |
2 similar comments
/retest Review the full test history for this PR. Silence the bot with an |
/retest Review the full test history for this PR. Silence the bot with an |
/retest |
/retest Review the full test history for this PR. Silence the bot with an |
3 similar comments
/retest Review the full test history for this PR. Silence the bot with an |
/retest Review the full test history for this PR. Silence the bot with an |
/retest Review the full test history for this PR. Silence the bot with an |
@coffeepac any idea how to fix this flaky test? |
/retest |
@gianrubio looks like it passed. unfortunately, fetja-bot is gonna be running the tests every 72 hours until this PR gets merged. |
/retest Review the full test history for this PR. Silence the bot with an |
3 similar comments
/retest Review the full test history for this PR. Silence the bot with an |
/retest Review the full test history for this PR. Silence the bot with an |
/retest Review the full test history for this PR. Silence the bot with an |
Sometimes when my fluentd daemonset restarts, it sends all the logs again, even logs already sent.
My first assumption was the pos file were misconfigured but after a long time working on this I found the issue. When fluentd is stopped by kubernetes it does not receive the TERM signal, "broking" the position file. The cause of this issue is the CMD spec without "[" uses a shell form and doesn't use exec to start a process.
To reproduce the error you could delete the fluentd pod and you will never see a message saying fluentd has been killed
(ex: ...fluentd main process get SIGTERM).
After I applied the patch, fluentd started receiving the SIGTERM
Fluentd logs after the patch
Release note: