Description
We have apparently been accidentally deleting the `-t nat -A KUBE-MARK-DROP -j MARK --set-xmark 0xXXXX` rule for a few weeks (#85527), and no one noticed. I suspect this is because `KUBE-MARK-DROP` is really only needed if the host accepts all incoming packets by default; if you have any sort of plausible firewall, then `KUBE-MARK-DROP` is redundant, and so the e2e tests that might otherwise catch `KUBE-MARK-DROP` failures don't actually catch them.
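For context, the mark-then-drop mechanism works in two stages: the nat-table rule above only tags the packet with a fwmark, and a separate filter-table rule drops marked packets later. A sketch of the usual rules (the `0x8000` mark value is the default and is configurable, so it may differ on a given cluster):

```
# nat table: KUBE-MARK-DROP only tags the packet with a fwmark
# (this is the rule that was being deleted)
iptables -t nat -A KUBE-MARK-DROP -j MARK --set-xmark 0x8000/0x8000

# filter table: kubelet's KUBE-FIREWALL chain drops anything carrying that mark
iptables -t filter -A KUBE-FIREWALL -m mark --mark 0x8000/0x8000 -j DROP
```

This is why deleting the MARK rule makes `KUBE-MARK-DROP` a silent no-op: the chain still exists and is still jumped to, but nothing gets marked, so the filter-table DROP never matches.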
The iptables proxier uses `KUBE-MARK-DROP` in two cases, both on cloud platforms where we create iptables rules for LoadBalancer IPs (eg, GCE but not AWS), when a service has a load balancer IP and endpoints, and a packet arrives on the node with the load-balancer IP as its destination:
- If the service specifies `spec.loadBalancerSourceRanges`, and the packet's source IP is not in the source ranges, then we call `KUBE-MARK-DROP` on the packet to drop it later.
  - This is theoretically tested by "It should only allow access from service loadbalancer source ranges". However, if the `KUBE-MARK-DROP` rule becomes a no-op, then the pod-to-LoadBalancer-IP connection will fall through the firewall chain, never hit the XLB chain, and eventually just get masqueraded and delivered to the LoadBalancer IP like it would for any other cluster-external IP. Since the cloud load balancer is also programmed with the source ranges, and the source range in this test is a single pod IP, the load balancer will then reject the packet (since it has the node's IP as its source at this point).
  - I think we can fix this test to fail in the absence of the drop rule by adding the node's IP to the source range. Then the expected-to-fail connection would (erroneously) not get dropped by the node, get passed to the cloud load balancer, which would accept it, and then get passed back to the service, causing the test case to fail.
- If the service has `ServiceExternalTrafficPolicyTypeLocal` and no local endpoints, then we call `KUBE-MARK-DROP` on the packet to drop it later.
  - This does not get tested by "It should only target nodes with endpoints", because if the load balancers are working correctly then they won't send any traffic to the nodes that are creating the drop rules anyway.
  - It also does not get tested by "It should work from pods", because a pod-to-LoadBalancer-IP connection will be rewritten to be pod-to-ClusterIP before the only-local check and will bypass the drop rule.
  - I think it should be possible to test this by trying to connect to an only-local LoadBalancer service from a `hostNetwork` pod on a node that has no endpoints for the service. The drop rule ought to cause that connection to fail, but if the drop rule were missing then it would make a connection directly to the LoadBalancer and then succeed.
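To make the first case concrete, the nat-table rules the iptables proxier builds for a LoadBalancer IP with source ranges look roughly like this (a simplified sketch; the chain hashes, addresses, and port are made up for illustration):

```
# Packets addressed to the LB IP are sent to a per-service firewall chain
-A KUBE-SERVICES -d 203.0.113.10/32 -p tcp --dport 80 -j KUBE-FW-ABCDEF0123456789
# Sources inside spec.loadBalancerSourceRanges proceed to the service chain
-A KUBE-FW-ABCDEF0123456789 -s 10.96.0.0/12 -j KUBE-SVC-ABCDEF0123456789
# Everything else is marked for dropping; if KUBE-MARK-DROP is a no-op,
# these packets instead fall through and are treated like any external IP
-A KUBE-FW-ABCDEF0123456789 -j KUBE-MARK-DROP
```

The trailing `KUBE-MARK-DROP` jump is the only thing standing between an out-of-range source and normal delivery, which is why its failure mode is invisible unless a test is specifically shaped to notice it.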
The ipvs proxier refers to the `KUBE-MARK-DROP` chain, but I think it doesn't actually use it...
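As a quick manual check, one way to tell whether a node is currently affected (assuming root and `iptables-save` on the node): if the first command prints the chain declaration but the second prints no MARK rule, `KUBE-MARK-DROP` exists but is a no-op.

```
iptables-save -t nat | grep ':KUBE-MARK-DROP'
iptables-save -t nat | grep -- '-A KUBE-MARK-DROP'
```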
/sig network
/priority important-soon