Add unit tests for github auth provider #149

vit-zikmund · 2024-03-04T17:26:53Z

This PR add 100% (well 99% due to typing clutter) coverage unit tests for the github auth provider.

It fixes a handful of mostly cosmetic things in the original code, that popped up during the work.

One considerably more significant change is the introduction of the authorization "proxy" cache, which ensures that the core giftless code will always get a valid is_authorized() response for the user calling giftless, even if the usual auth proxy is configured too low to withstand some harsh hammering conditions.

Closes: #148

Setting any threshold for the lowest value for the auth cache size might still not guarantee (in some rough timing conditions), the entry won't disappear before it's read for the first time by the core logic. The change adds a short "hot/proxy" cache for new entries, that are not yet ever read by the core logic via is_authorized(). When that's done, only then the entry is moved to the regular TTL/LRU cache.

Also provide getter for tests

vit-zikmund · 2024-03-04T17:43:52Z

Hey, @athornton long time no see :) Sorry to keep you busy, but since you proved so much helpful on my previous endeavors... I hope it's pretty much on a sliver plate. All tests clean and tidy. The unit tests aren't hardcore exhaustive, but provide full line&branch coverage.

athornton · 2024-03-05T00:06:10Z

giftless/auth/github.py

 from operator import attrgetter, itemgetter
-from threading import Condition, Lock, RLock
-from typing import Any, cast, overload
+from threading import Lock, RLock, _RLock


Ew, but if you need it you need it. Honestly it looks like a bug in threading that something public exposes _RLock.

Indeed 🤮😞 It's mostly a shutup-treat for mypy which for some reason rejects type(RLock()) as a type. This is the only workaround I found for a dynamic low-level type coming from a factory function RLock(). I'll try to dig a bit more, but my hopes are low.

Yissss 🐍 I fixed this with a little protocol instead.

Cool! Now I've actually seen Protocol in the wild, and it makes sense.

athornton · 2024-03-05T00:13:30Z

giftless/auth/github.py

@@ -280,17 +278,28 @@ def __eq__(self, other: object) -> bool:
    def __hash__(self) -> int:
        return hash((self.login, self.id))

-    def permissions(self, org: str, repo: str) -> set[Permission] | None:
+    def permissions(


Not entirely sure I understand the purpose of "authoritative".

This is related to caching and thread safety, in the boundaries of the current design.

The "problem" is in the way how this whole authentication is performed in BaseView._is_authorized, which first creates the authenticator, __call__s it to get the Identity and only then it verifies the right permissions with Identity.is_authorized.

These two consecutive steps have no locks, so they can get mixed up with other threads simultaneously doing the same thing. Because I'm authorizing the user via GitHub API calls, my goal is to prevent those from repeating not only by caching that for some TTL, but also by blocking concurrent identical auth requests from different threads, so only a single thread gets to do the API calls and other just sit blocking and get only the result.

Because I wanted to do all the API calls within the GithubAuthenticator.__call__ method not to drag the requests Session object around (which is otherwise helpful in reusing a single TLS connection the to API), I'm storing only the resulting permissions on the GithubIdentity object, which can only simply expire.

in a different design with a shared requests.Session, the GithubIdentity would know itself how to get the user's permissions (and refresh its permission cache), but the Session should have a limited lifetime and as such, I'd have to manage it externally, which seemed even more complicated.

With all this, and the fact all the caches not only have configurable TTL, but also a rather low maximum size, I was obsessed about a scenario where (likely) an attacker would DoS the permission cache by doing concurrent calls for the same user, but different org/repo. In that scenario, the cached entry that got stored in the GithubAuthenticator.__call__ gets evicted sooner than it's being finally accessed via Identity.is_authorized.

Therefore I couldn't eventually resist to introduce a tiny in-between cache to remember the permissions for a user obtained in __call__ till it's being successfully read in is_authorized 😮‍💨

This permissions method is called in two contexts:

in the __call__ to peek if a certain user's permissions are cached

from is_authorized, which isn't used internally, but mean the obtained permissions have been read by the core logic for good and that I'm free to finally forget it.

The authoritative flag is just for that latter case, telling the logic the permissions have been properly read and now can sit and expire in the regular cache.

I admit this design is covering a really niche scenario, but once I realized it's there, my "OCD" couldn't let it stay. If you think there's a chance to considerably simplify the design, I'm all ears, but since this works and it's already done... 😇

That sounds good to me. My approach would just (if I'd noticed it at all, which I probably wouldn't because I wouldn't have tested high concurrency) have been to throw a mutex around the critical section and accept the performance hit. This seems much more amenable to many callers at once.

athornton

This looks great. I wish we didn't have to import a private symbol from threading, but that feels like a problem in threading, not with your code.

vit-zikmund · 2024-03-08T13:31:30Z

@athornton, is it merge-ready now? ;)

athornton · 2024-03-11T16:30:54Z

Hey @vit-zikmund post-merge I'm now getting a bunch of failures in the github auth tests.

They're of the form:

    def __post_init__(self, request: flask.Request) -> None:
        org_repo_getter = itemgetter("organization", "repo")
>       self.org, self.repo = org_repo_getter(request.view_args or {})
E       KeyError: 'organization'

I'm starting in on this but I suspect you'll be able to fix it trivially.

vit-zikmund · 2024-03-11T16:45:27Z

Oh yeah, lemme fix that real quick!

vit-zikmund · 2024-03-11T17:03:55Z

fixed in #155

vit-zikmund added 6 commits February 29, 2024 17:45

chore(github): add a couple decorator tests

03f9c78

chore(github): simplify caching decorators finish decorator tests

cbde637

chore(github): move config validation to schema

5b875f0

Also provide getter for tests

chore(github): add auth provider tests

368eef8

chore(github): fix incorrect comment

125de7c

athornton reviewed Mar 5, 2024

View reviewed changes

athornton approved these changes Mar 5, 2024

View reviewed changes

vit-zikmund added 2 commits March 5, 2024 12:53

chore(github): improve lock typing

f6a7cbc

chore(github): use Flask means to extract org & repo

d622b91

athornton merged commit a9a799d into datopian:main Mar 11, 2024
7 checks passed

vit-zikmund deleted the github-test branch March 11, 2024 16:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add unit tests for github auth provider #149

Add unit tests for github auth provider #149

vit-zikmund commented Mar 4, 2024 •

edited

Loading

vit-zikmund commented Mar 4, 2024

athornton Mar 5, 2024 •

edited

Loading

vit-zikmund Mar 5, 2024

vit-zikmund Mar 5, 2024

athornton Mar 5, 2024

athornton Mar 5, 2024

vit-zikmund Mar 5, 2024

athornton Mar 5, 2024

athornton left a comment

vit-zikmund commented Mar 8, 2024

athornton commented Mar 11, 2024

vit-zikmund commented Mar 11, 2024

vit-zikmund commented Mar 11, 2024

Add unit tests for github auth provider #149

Add unit tests for github auth provider #149

Conversation

vit-zikmund commented Mar 4, 2024 • edited Loading

vit-zikmund commented Mar 4, 2024

athornton Mar 5, 2024 • edited Loading

Choose a reason for hiding this comment

vit-zikmund Mar 5, 2024

Choose a reason for hiding this comment

vit-zikmund Mar 5, 2024

Choose a reason for hiding this comment

athornton Mar 5, 2024

Choose a reason for hiding this comment

athornton Mar 5, 2024

Choose a reason for hiding this comment

vit-zikmund Mar 5, 2024

Choose a reason for hiding this comment

athornton Mar 5, 2024

Choose a reason for hiding this comment

athornton left a comment

Choose a reason for hiding this comment

vit-zikmund commented Mar 8, 2024

athornton commented Mar 11, 2024

vit-zikmund commented Mar 11, 2024

vit-zikmund commented Mar 11, 2024

vit-zikmund commented Mar 4, 2024 •

edited

Loading

athornton Mar 5, 2024 •

edited

Loading