
Handle same hostname for multiple pods in a headless service #123

Closed
guillaumerose wants to merge 1 commit

Conversation

guillaumerose

This code improves the handling of headless services pointing at pods sharing
the same hostname.

  • For each hostname, the A records should point at all of the matching pods' IPs. Currently, only the last pod gets an A record.
  • For the service name, an A record should be created for each pod.

Fixes #116

cc @dgageot
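
[Editor's note: as an illustration of the behavior the description asks for, here is a minimal sketch in Go. It is not the PR's actual code; the type and function names are hypothetical. It simply groups headless-service endpoints by hostname so that a shared hostname resolves to every matching pod IP, and the service name resolves to all pod IPs.]

    package main

    import "fmt"

    // endpoint is a simplified stand-in for a headless-service endpoint;
    // the real kube-dns types differ.
    type endpoint struct {
        hostname string
        ip       string
    }

    // buildARecords groups endpoint IPs by hostname: each hostname maps to
    // all matching pod IPs, and the bare service name maps to every pod IP.
    func buildARecords(service string, endpoints []endpoint) map[string][]string {
        records := map[string][]string{}
        for _, ep := range endpoints {
            // Append instead of overwriting, so two pods that share the
            // hostname "app-2" both show up under app-2.<service>.
            name := ep.hostname + "." + service
            records[name] = append(records[name], ep.ip)
            records[service] = append(records[service], ep.ip)
        }
        return records
    }

    func main() {
        eps := []endpoint{
            {"app-1", "10.0.0.1"},
            {"app-2", "10.0.0.2"},
            {"app-2", "10.0.0.3"}, // same hostname, different pod
        }
        fmt.Println(buildARecords("svc.default.svc.cluster.local", eps))
    }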

@k8s-ci-robot
Contributor

Thanks for your pull request. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please follow instructions at https://github.com/kubernetes/kubernetes/wiki/CLA-FAQ to sign the CLA.

It may take a couple minutes for the CLA signature to be fully registered; after that, please reply here with a new comment and we'll verify. Thanks.


  • If you've already signed a CLA, it's possible we don't have your GitHub username or you're using a different email address. Check your existing CLA data and verify that your email is set on your git commits.
  • If you signed the CLA as a corporation, please sign in with your organization's credentials at https://identity.linuxfoundation.org/projects/cncf to be authorized.
  • If you have done the above and are still having issues with the CLA being reported as unsigned, please email the CNCF helpdesk: [email protected]

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@k8s-ci-robot added the cncf-cla: no label (indicates the PR's author has not signed the CNCF CLA) on Jul 7, 2017
@thockin
Member

thockin commented Jul 7, 2017

I know the spec says it should return multiple A records in this case, but I just want to think it through. Is this the correct behavior? It's less surprising than dropping all-but-one, though I could argue that using the same hostname twice is a bug on the user's end.

So is the spec wrong, or is the code wrong? If we implement this, what else might break? Are there apps that actually expect to use DNS for this kind of "phantom elimination"? I feel the current behavior is too under-specified to rely on for that, but the implementation is reasonably deterministic.

@bowei @johnbelamaric @smarterclayton @erictune @kubernetes/sig-apps-misc

@erictune
Member

erictune commented Jul 7, 2017

@kow3ns

@erictune
Member

erictune commented Jul 7, 2017

Using the same hostname is not a bug.

Spreading may be soft rather than hard.

Someone may want to bring up a size-3 etcd in their 2-node cluster as part of a test. They don't need HA, they just want to reuse the config. That should work as similarly as possible to the "normal" case where the pods are properly spread.

@thockin
Member

thockin commented Jul 7, 2017

@erictune I am not clear on your reply. Let me provide an example and you tell me which behavior is considered correct.

T0: User somehow creates a set of pods that have hostnames "app-1", "app-2", "app-3". DNS lookups of "service" return 3 distinct A records. DNS lookups of "app-[123].service" each return one distinct A record.

T1: Some management event happens and, through cut-paste or similar, the app-3 pod is replaced with a new pod that has hostname "app-2". DNS lookups of "service" return 3 distinct A records. Question: should a DNS lookup of "app-2" return a single A record (one of the 2 pods, chosen somehow) or should it return 2 A records?

Today it returns 1 record.

@thockin
Member

thockin commented Jul 7, 2017

Another note: if we decide the spec is wrong, and that we should only serve one A record, we have a bug. When you hit this overlap, the IP of the last pod created "wins". If that pod goes away, we do not update the PTR to the other pod with the same hostname.
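
[Editor's note: to make that failure mode concrete, here is a hedged Go sketch (hypothetical names, not the actual kube-dns code) of the "last pod wins" behavior: with a single record per hostname, removing the winning pod's record leaves nothing pointing at the surviving pod.]

    package main

    import "fmt"

    func main() {
        // With single-record semantics, the later pod silently overwrites
        // the earlier one under the shared hostname.
        byHostname := map[string]string{}
        byHostname["app-2"] = "10.0.0.2" // first pod claiming "app-2"
        byHostname["app-2"] = "10.0.0.3" // second pod "wins"

        // If the winning pod goes away and its record is simply removed,
        // nothing re-points "app-2" at the surviving 10.0.0.2 pod.
        delete(byHostname, "app-2")
        fmt.Printf("app-2 -> %q\n", byHostname["app-2"]) // empty: surviving pod is unreachable by name
    }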

@erictune
Member

erictune commented Jul 7, 2017 via email

@dgageot

dgageot commented Jul 10, 2017

Thanks @thockin for the feedback.

When we prepared this PR, we tried hard not to break existing behavior, and we took the view that this "phantom elimination" couldn't be considered a feature given what the spec says. But I understand that it might nonetheless be a breaking change.

What do you think we can do to mitigate that risk?

@johnbelamaric
Member

@chrisohaver I believe in CoreDNS we include all IPs. This is consistent with ordinary DNS behavior: it is legal to have multiple A records with the same name and different IPs.
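
[Editor's note: for reference, a single name resolving to several addresses is everyday DNS; Go's standard resolver, for example, returns a slice of addresses for one lookup. A trivial sketch follows; the hostname queried is only an example.]

    package main

    import (
        "fmt"
        "net"
    )

    func main() {
        // A single name may legitimately have several A records;
        // LookupHost returns all of the resolved addresses.
        addrs, err := net.LookupHost("kubernetes.io")
        if err != nil {
            fmt.Println("lookup failed:", err)
            return
        }
        for _, a := range addrs {
            fmt.Println(a)
        }
    }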

@k8s-ci-robot added the cncf-cla: yes label (indicates the PR's author has signed the CNCF CLA) and removed the cncf-cla: no label on Aug 3, 2017
@guillaumerose
Author

We fixed the CLA check. Any chance of getting this merged? Should we do anything else?
Thanks!

@dgageot

dgageot commented Aug 3, 2017

ping @thockin

@k8s-ci-robot added the size/M label (denotes a PR that changes 30-99 lines, ignoring generated files) on Sep 20, 2017
@guillaumerose
Author

Any update on this?

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label (denotes an issue or PR that has remained open with no activity and has become stale) on Jan 6, 2018
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten
/remove-lifecycle stale

@k8s-ci-robot added the lifecycle/rotten label (denotes an issue or PR that has aged beyond stale and will be auto-closed) and removed the lifecycle/stale label on Mar 15, 2018