Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add event exporter deployment to the fluentd-gcp addon #46700

Conversation

crassirostris
Copy link

@crassirostris crassirostris commented May 31, 2017

Introduce event exporter deployment to the fluentd-gcp addon so that by default if logging to Stackdriver is enabled, events will be available there also.

In this release, event exporter is a non-critical pod in BestEffort QoS class to avoid preempting actual workload in tightly loaded clusters. It will become critical in one of the future releases.

Stackdriver cluster logging now deploys a new component to export Kubernetes events.

@crassirostris crassirostris added the sig/instrumentation Categorizes an issue or PR as relevant to SIG Instrumentation. label May 31, 2017
@crassirostris crassirostris added this to the v1.7 milestone May 31, 2017
@crassirostris crassirostris requested a review from piosz May 31, 2017 10:24
@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label May 31, 2017
@k8s-github-robot k8s-github-robot added approved Indicates a PR has been approved by an approver from all required OWNERS files. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. release-note Denotes a PR that will be considered when it comes time to generate release notes. labels May 31, 2017
command:
- '/event-exporter'
resources:
limits:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Don't we want to autoscale this?

cc @gmarek

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't know - you tell me how many events per second it can handle with this settings.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, we do, I was just about to add it

I was checking the throughput to decide whether cpu has to be scaled. Manual test with 100 events/sec showed that cpu is not a bottleneck, but memory is

@gmarek Is it safe to assume that 50MB + 1MB/node covers the size of the events db in typical cluster?

Copy link
Contributor

@gmarek gmarek May 31, 2017

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You mean db as in etcd? For 500 Nodes it's <200MB, so this is very safe estimate. You probably can do .5MB/node. But that's etcd - I'm not sure how this related to your needs though.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gmarek Great, thanks. I've observed that the db size when event exporter crashed was ~ the same size as the limit

@piosz
Copy link
Member

piosz commented May 31, 2017

@roberthbailey could you please approve or delegate?

@crassirostris
Copy link
Author

@k8s-bot pull-kubernetes-unit test this
@k8s-bot pull-kubernetes-e2e-gce-etcd3 test this

@crassirostris crassirostris force-pushed the add-event-exporter-deployment branch 2 times, most recently from 8c68fd9 to f95024a Compare May 31, 2017 14:50
@k8s-github-robot k8s-github-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels May 31, 2017
@crassirostris
Copy link
Author

@piosz PTAL at the nanny config

@crassirostris crassirostris force-pushed the add-event-exporter-deployment branch from f95024a to 07902ea Compare May 31, 2017 17:02
@crassirostris
Copy link
Author

Test failures are flakes due to missing quota in the test project: #46713

@krzyzacy
Copy link
Member

@k8s-bot pull-kubernetes-kubemark-e2e-gce test this

@crassirostris
Copy link
Author

@k8s-bot pull-kubernetes-e2e-gce-etcd3 test this
@k8s-bot pull-kubernetes-e2e-kops-aws test this

kubernetes.io/cluster-service: "true"
addonmanager.kubernetes.io/mode: Reconcile
---
apiVersion: rbac.authorization.k8s.io/v1alpha1
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please use rbac.authorization.k8s.io/v1beta1 instead.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for noticing! Done

@crassirostris crassirostris force-pushed the add-event-exporter-deployment branch from 07902ea to 238db08 Compare May 31, 2017 20:28
@roberthbailey roberthbailey requested a review from j3ffml May 31, 2017 21:22
@roberthbailey
Copy link
Contributor

/approve

fieldRef:
fieldPath: metadata.namespace
command:
- /pod_nanny
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The number of pod_nannies is getting larger and larger.
Can we file an issue to make it possible to use a single pod nanny for many different pods?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done: #46763

@crassirostris crassirostris force-pushed the add-event-exporter-deployment branch 2 times, most recently from b82a482 to d88ea16 Compare June 1, 2017 20:10
@k8s-github-robot k8s-github-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Jun 1, 2017
@fejta
Copy link
Contributor

fejta commented Jun 1, 2017

@k8s-bot pull-kubernetes-federation-e2e-gce test this
ref kubernetes/test-infra#2931

@crassirostris
Copy link
Author

@k8s-bot pull-kubernetes-e2e-kops-aws test this

@crassirostris
Copy link
Author

@piosz I removed resources to make them BestEffort

@crassirostris crassirostris force-pushed the add-event-exporter-deployment branch from d88ea16 to 88d9e55 Compare June 2, 2017 13:41
@crassirostris
Copy link
Author

@k8s-bot pull-kubernetes-e2e-kops-aws test this
@k8s-bot pull-kubernetes-verify test this

@crassirostris crassirostris force-pushed the add-event-exporter-deployment branch from 88d9e55 to 527206c Compare June 2, 2017 15:00
@justinsb
Copy link
Member

justinsb commented Jun 2, 2017

@k8s-bot pull-kubernetes-e2e-kops-aws test this

@piosz
Copy link
Member

piosz commented Jun 2, 2017

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jun 2, 2017
@justinsb
Copy link
Member

justinsb commented Jun 2, 2017

@k8s-bot pull-kubernetes-e2e-kops-aws test this

@wojtek-t
Copy link
Member

wojtek-t commented Jun 4, 2017

@k8s-bot pull-kubernetes-verify test this

@crassirostris
Copy link
Author

@k8s-bot pull-kubernetes-unit test this

@crassirostris
Copy link
Author

@k8s-bot pull-kubernetes-federation-e2e-gce test this

@k8s-github-robot k8s-github-robot removed the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 5, 2017
@crassirostris
Copy link
Author

/approve no-issue

2 similar comments
@crassirostris
Copy link
Author

/approve no-issue

@piosz
Copy link
Member

piosz commented Jun 5, 2017

/approve no-issue

@k8s-github-robot
Copy link

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: crassirostris, piosz, roberthbailey

Associated issue requirement bypassed by: piosz

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

@k8s-github-robot k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 5, 2017
@k8s-ci-robot
Copy link
Contributor

k8s-ci-robot commented Jun 6, 2017

@crassirostris: The following test failed, say /retest to rerun them all:

Test name Commit Details Rerun command
pull-kubernetes-federation-e2e-gce 527206c link @k8s-bot pull-kubernetes-federation-e2e-gce test this

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@crassirostris
Copy link
Author

@k8s-bot pull-kubernetes-e2e-gce-etcd3 test this
@k8s-bot pull-kubernetes-federation-e2e-gce test this

@k8s-github-robot
Copy link

Automatic merge from submit-queue

@k8s-github-robot k8s-github-robot merged commit 8df56da into kubernetes:master Jun 6, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. release-note Denotes a PR that will be considered when it comes time to generate release notes. sig/instrumentation Categorizes an issue or PR as relevant to SIG Instrumentation. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.