Integrating native-proxy #501

jwindgassen · 2024-09-10T20:17:25Z

Hey everyone.

I was recently made aware, of the desire to be able to create standalone proxies, similar to how it is done in jhsingle-native-proxy (see #1).
This would be immensely advantageous for us, so I started porting the code here recently.

There is still a lot to do and I needed to remove/comment out a few of the original features, but it is already fundamentally working as is. I am opening this PR to let you know of this and get an opinion on a few bits here and there. I will continue to improve on it in the next weeks.

In the meantime, any comments and ideas are welcome :)

ryanlovett · 2024-09-10T20:51:09Z

Hi @jwindgassen , thanks for this contribution!

Is the idea to replace jhsingle-native-proxy (so that code can be shared), or will that project continue on? If the latter then could there still be duplication of code? (will this need to periodically get resynced with jhsingle-native-proxy?) Or should shared code be factored out into something separate for each project to use?

If there is documentation on using this, could that be integrated as well?

jwindgassen · 2024-09-10T21:19:10Z

That is a good question. I do not know about the current state of the original project. The newest commit was already made a few months ago, but I do not think it is abandoned. I will contact the original developer soon and ask him about the current status and his opinion on this.

But the proxies used by it are almost identical to the proxies here, and (I think) they originally were a copy of an old version of the ones in this package.
IMHO, the best would either be completely merging it into here, or making JSP a dependency of jhsingle-native-proxy and importing the Proxies there. But that is, of course, not my decision to make but one of the original developer.
Furthermore, jhsingle-native-proxy does not support Unix Sockets, which we would love to use.
But since both projects have a lot of similarity, keeping them completely separate from one another is a lot of redundancy.

The Documentation for it is currently in the ReadMe, but could be added to the docs here, given that we decide to add this feature.

aktech · 2024-09-18T09:04:44Z

This sounds like a great idea. I would suggest to create some kind of checklist of the features that are ported from jhsingle-native-proxy and what's not during the course of this pull request (or a series of pull requests).

We're using jhsingle-native-proxy heavily in jhub-apps would love to move to just using jupyter-server-proxy for everything we can, and having a feature parity checklist will help us understand, what we'll be having/missing with this port.

jwindgassen · 2024-09-18T12:48:40Z

@ryanlovett I talked to the original developer. He welcomes the merging of his feature into Jupyter Server Proxy. There are currently no further plans for jhsingle-native-proxy, but it will persist (for now), as there are a few custom additions, which I currently do not plan to integrate here.

@aktech Sure. Here is a list with the features I have currently ported or plan to do so:

Launching standalone proxies from CLI
Automatically enable JupyterHub Authentication when we are spawned by one
Send Activity Notifications to JupyterHub
Ensure usability with Unix-Sockets
Ensure authenticated Access via JupyterHub
Allow customization of environment and mappath
Progressive Proxies (Responsive Proxy for other Stream Types #502)
Docs & Tests (maybe even with JupyterHub?)
(If desired) Re-add pulling of Git Repository and venv/conda activation?

That's at least everything I can think of now. If you have other ideas or require something more, let me know and I will add it to the list :)

ryanlovett · 2024-09-18T17:26:47Z

@jwindgassen Thanks very much for asking! It sounds like this has the potential to consolidate development in the future.

jwindgassen · 2024-10-18T14:07:17Z

@ryanlovett @aktech I have been working on the feature over the last few weeks, and I am now happy to announce, that the code is working 🎉
I tested it with my own TLJH instance, and it also works with our production JupyterHub Setup. I also verified that authentication with JupyterHub works.
There are a few small things I will probably improve in the future, but I am quite happy with the state of the code as it is now.

I'm welcoming anyone to test the feature on their JupyterHub instance for testing. Please let me know about any problems or errors you encounter when doing so 🙂

How to use

For testing, I like to use voila. The command to execute might look like this:
jupyter standaloneproxy --debug -- voila --no-browser --port={port} /path/to/notebook.ipynb"

What else do you need from my side, besides the code, to be satisfied with this PR? I am currently writing a page for the docs, which I will commit soon.
I was also planning to write a few tests for the feature, but I will have to look into that later.

ryanlovett · 2024-10-23T05:21:28Z

@jwindgassen Thanks for the update! I'll try to test this week locally.

How might a hub administrator typically configure use of this feature? For example would they set every user to launch voila via standalone proxy from c.Spawner.cmd instead of jupyterhub-singleuser?

Is the intent of standalone to essentially re-use the hub's auth, spawner, user storage, etc. but limit what apps users can invoke because it specifies just the one? (since jupyter server + jupyter-server-proxy enables users to launch an arbitrary number of proxied apps)

jwindgassen · 2024-10-24T14:52:55Z

In essence, yes.
With the standalone proxy feature, admins/developers of the JupyterHub can give the users of the Hub access to other applications instead of just Notebook/JupyterLab.
While we can achieve the same with the current jupyter-server-proxy via the button in the Launcher, this is often an indirection when you only want to access e.g. RStudio.
By using the SuperviseAndProxyHandler directly, without attaching it to a jupyter-server, we can (re)use the Authentication, User Model, Proxy, etc. JupyterHub provides. Secondly, it is also probably easier to integrate new web apps into an already existing cloud environment this way, than to manually setup everything yourself.

But since you need to overwrite c.Spawner.cmd or modify the behavior of the Spawner in a subclass, new Apps can only be added by the administrators. This gives users access to different apps on the JupyterHub landing page, similar to Open OnDemand. The Spawner then needs to switch between the launch commands accordingly.
In our case, we are planning to provide RStudio, XpraHTML (Remote Desktop), MATLAB, NEST-Desktop, and more to come in the future, using this feature.

ryanlovett · 2024-10-24T16:52:46Z

Fascinating, thanks!

More of an aside, but how are you customizing the spawner to launch the different applications?

Edit: oh, is it jhub-apps as mentioned earlier?

jwindgassen · 2024-10-24T18:46:06Z

No. We have created our own custom Spawner. But is similar to this. We have overwritten the start method, which will submit a new Job to our cluster. And inside the start script, we start jupyterhub singleuser at the end.

But now you mention it, jhub-apps might synergyze quite well with the standalone feature.

manics

The command line arguments are quite complicated, e.g. having to parse maps.

The alternative is more work, but if we were to refactor ServerProcess to be a Traitlets Configurable

jupyter-server-proxy/jupyter_server_proxy/config.py

Lines 31 to 49 in 41e588a

    
           ServerProcess = namedtuple( 
        
               "ServerProcess", 
        
               [ 
        
                   "name", 
        
                   "command", 
        
                   "environment", 
        
                   "timeout", 
        
                   "absolute_url", 
        
                   "port", 
        
                   "unix_socket", 
        
                   "mappath", 
        
                   "launcher_entry", 
        
                   "new_browser_tab", 
        
                   "request_headers_override", 
        
                   "rewrite_response", 
        
                   "update_last_activity", 
        
                   "raw_socket_proxy", 
        
               ], 
        
           )

I think we could take advantage of Traitlets ability to automatically create all CLI args, and we'd have the benefit of being able to configure jupyter-standaloneproxy using an arbitrarily complex file.

What do you think? I'm happy to investigate converting ServerProxy to Traitlets.

jupyter_server_proxy/standalone/__init__.py

jwindgassen · 2024-10-24T20:35:26Z

Thanks for the suggestion, I really like that idea. This should also solve the issue with keeping CLI Arguments up with any new options added to the proxies. I will look into it :)

manics · 2024-10-24T22:09:59Z

I've made a start in #507

jwindgassen · 2024-10-28T21:01:40Z

@manics

So I used your branch to create the CLI via traitlets. I'm no expert with traitlets, but I managed to get it working: jwindgassen@2d9eb5b

I had to play around with aliases and the extra_args a bit, but I wanted to keep the CLI reasonably unchanged to before.
Getting the command, which was previously a positional argument, was a bit tricky, since I couldn't find a proper way to add a positional argument to the Parser traitlets create. It was possible by overwriting _create_loader(), but not really pretty. Using extra_args instead works, but we now lack a proper explanation of command when using --help.

If you have a better idea or otherwise comments on my changes over there, let me know! :)

manics · 2024-11-06T23:29:05Z

Sorry for the delay. Since this is a new addition to Jupyter server proxy I think we should prioritise long term maintenance over just replicating the previous CLI- if there's a better way to do things we can use it as an opportunity to refactor.

We also don't need to do everything in one go, for example it's fine to initially focus on creating a functional standalone proxy along that only supports standard traitlets configuration, and add additional flags in a follow-up PR.

jwindgassen · 2024-11-11T10:43:37Z

I'm fine with how the CLI looks right now, so I'm happy to switch to this once your PR has been accepted.

I am also almost done with Tests and Docs, they should be ready by the end of the week.

aktech · 2024-11-12T14:51:33Z

Hey @jwindgassen Thanks a lot for working on this and for the ping. I'll play with it this week to provide some feedback.

jwindgassen · 2024-11-15T14:02:53Z

Ok, I have now also added Docs and Tests for the new feature.

Writing proper tests for Login and Activity is a bit more difficult, since I would need to spawn a JupyterHub instance to gain full access to its API. For now, I limited it to only testing our code, since the classes I import from JupyterHub are tested over there.

I also added a section to the docs, mostly targeted to developers, where I explain the different sections of the code and what features I needed to implement to make this work smoothly.

If you think we need more tests for specific cases or want something to be explained in the docs in more detail, let me know.

jhgoebbert · 2024-11-30T11:52:49Z

@jwindgassen So from your perspective this is ready to be merged. Great and Thank you!

( As soon as #507 and #508 is merged this native-proxy can be updated afterwards. But for now this PR here is implemented to be independent of them. True? )

jwindgassen · 2024-11-30T14:57:08Z

Yes. This is currently independent of #507 and functions without it. But in the future, once that has been merged, I would update the standalone feature to use traitlets.Configurable. It would be a lot tidier and future-proof using that approach.

@aktech Have you been able to get it running and did it work in your setup? I would highly appreciate any feedback or comments on this 🙂

@ryanlovett How do we continue for this PR? Is there anyone specific responsible for reviewing it? Is there still more you need here? Sorry for being a bit impatient, but we would like to get this feature running soon on our JupyterHub instance.

ryanlovett

I've left a few minor comments and suggested changes. What do you think?

docs/source/standalone.md

jupyter_server_proxy/standalone/__init__.py

jupyter_server_proxy/standalone/activity.py

ryanlovett · 2024-12-01T05:08:49Z

In terms of next steps, I'm fine with merging if @manics thinks its okay to go ahead and then address the related PRs later.

aktech

@jwindgassen Thanks a lot for working on this. This is really very useful. I tried replacing jhsingle-native-proxy with this branch and I was able to get it working partially (minus conda env and repo) for panel and voila apps.

To be able to completely use it, we would need ability to specify conda env and pull from repo, but I think this PR is a great start in that direction. These features can be contributed in later PRs and this PR looks complete enough to get merged IMO.

I tested for panel and voila. For panel I ran this:

jupyter standaloneproxy --debug --skip-authentication -- python -m bokeh_root_cmd.main ~/path/to/jhub-apps/jhub_apps/examples/panel_basic.py --port={port} --debug --allow-websocket-origin=127.0.0.1:8888 --server=panel

jupyter_server_proxy/standalone/__init__.py

manics · 2024-12-04T15:00:50Z

@ryanlovett If you're generally happy with this would you mind merging #507 first, then we can rebase this PR, and it'll make this PR smaller, and it'll be a lot clearer what we're adding.

@aktech I don't think we should completely reproduce jhsingle-native-proxy since we have to maintain this long term- cloning git repos and setting up conda envs doesn't feel like it's in scope. However I'd hope the move to traitlets makes it much easier to extend this in a separate app, or perhaps it's as easy as wrapping it in another script?

jwindgassen · 2024-12-04T15:08:24Z

@aktech Very nice to hear that it worked for you out of the box. I needed to find and fix some bugs when I installed it on our system to make it working, so I am relieved it worked without much effort for you now. You currently have the --skip-authentication flag enabled, does it work without that?

Regarding the conda/env activation and git puller, I would suggest seeing how desired this feature becomes in the future. I think it's probably quite niche, but maybe I am wrong and many people would like to use it. But for now I would consider it out of scope for this PR.

jwindgassen · 2024-12-04T20:16:01Z

@ryanlovett @manics Now that #507 is merged, should I rebase and tidy up the commits here? Or should I merge main into here and then append the required changes to the CLI?

manics · 2024-12-04T21:39:41Z

Up to you! I usually rebase my PRs.

…ation-path`

…tion and check_origin

…act address from `JUPYTERHUB_SERVICE_URL`

The `oauth_callback` requests were handled by the ProxyHandler, effectively causing the request to ping-pong between JupyterHub Login and `/oauth_callback`

…ocess

jwindgassen · 2024-12-07T22:35:40Z

Ok. So I rebased on top of the new #507 Merge.

I had to refactor the ServerProcess Configurable a bit more to reuse it for the standalone. However, when done like this, it should reasonably well future-proof for any new attributes and functionality that might be added.

There are still 2 minor changes I am thinking about implementing:

Instead of using traitlets.config.Application, we could use jupyter_core.application.JupyterApp. This adds the --config argument, meaning existing JupyterHub setups could reuse their config.py file. A config.py might also be a bit nicer when setting up the standalone proxy for complicated setups
As I mentioned at the end of Make config.ServerProcess into a Configurable #507, I think we should be able to make ServerProcess and LauncherEntry a HasTraits instead of a Configurable.

Any comments, on the changes or these ideas, are very welcome :) @manics @ryanlovett

jhgoebbert · 2025-01-08T14:44:10Z

This looks great to me. Can it be merged?

jwindgassen mentioned this pull request Sep 16, 2024

Reunion with jupyter-server-proxy ideonate/jhsingle-native-proxy#28

Closed

manics reviewed Oct 24, 2024

View reviewed changes

jupyter_server_proxy/standalone/__init__.py Outdated Show resolved Hide resolved

jupyter_server_proxy/standalone/__init__.py Outdated Show resolved Hide resolved

manics mentioned this pull request Oct 24, 2024

Make config.ServerProcess into a Configurable #507

Merged

jwindgassen force-pushed the native-proxy branch from 1f8855b to e4741d0 Compare November 15, 2024 13:46

jwindgassen mentioned this pull request Nov 25, 2024

Enable progressive proxy via flag #512

Draft

ryanlovett requested changes Dec 1, 2024

View reviewed changes

aktech suggested changes Dec 4, 2024

View reviewed changes

jupyter_server_proxy/standalone/__init__.py Outdated Show resolved Hide resolved

jwindgassen added 17 commits December 6, 2024 13:44

Merge jhsingle-native-proxy.

d0be294

Rename to standalone

26059b0

Remove forward-user-info, query-user-info and remains of `present…

fae833b

…ation-path`

Create StandaloneHubProxyHandler and timeout Argument. Fix authentica…

36c8e17

…tion and check_origin

Send Activity Notifications to JupyterHub

087c11d

Fix Authentication with JupyterHub

0d4e701

Remove jupyter-server Authentication

fa8621f

Set env & mappath via CLI, add Args for Unix Sockets

5cae144

Merge StandaloneProxyHandler into StandaloneHubProxyHandler, extr…

6b9336d

…act address from `JUPYTERHUB_SERVICE_URL`

Fix SSL Configuration, Add SlashHandler to Application

51ee31c

Add Slash to Prefix

f547e5f

Fix error generation

2a83343

Fixed ordering in handlers

325217b

The `oauth_callback` requests were handled by the ProxyHandler, effectively causing the request to ping-pong between JupyterHub Login and `/oauth_callback`

Defer xsrf checking to proxied app

7f85c9f

Add Documentation for standalone

87efc56

Add Tests for the StandaloneProxy

7b33d45

Fix Typos and minor cleanups

941356f

jwindgassen force-pushed the native-proxy branch from d7681f1 to 941356f Compare December 6, 2024 12:47

jwindgassen added 3 commits December 6, 2024 15:59

Switch from argparse to traitlets.Application and use config.ServerPr…

0228e86

…ocess

Refactor and reuse Proxy generation in standalone

7ed8974

Update standalone tests

1ff6051

jwindgassen requested review from ryanlovett and manics January 6, 2025 08:57

Allow configuration via traitlets

e07a615

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrating native-proxy #501

Integrating native-proxy #501

jwindgassen commented Sep 10, 2024

ryanlovett commented Sep 10, 2024

jwindgassen commented Sep 10, 2024

aktech commented Sep 18, 2024

jwindgassen commented Sep 18, 2024 •

edited

Loading

ryanlovett commented Sep 18, 2024

jwindgassen commented Oct 18, 2024

ryanlovett commented Oct 23, 2024

jwindgassen commented Oct 24, 2024 •

edited

Loading

ryanlovett commented Oct 24, 2024 •

edited

Loading

jwindgassen commented Oct 24, 2024

manics left a comment

jwindgassen commented Oct 24, 2024

manics commented Oct 24, 2024

jwindgassen commented Oct 28, 2024

manics commented Nov 6, 2024

jwindgassen commented Nov 11, 2024

aktech commented Nov 12, 2024

jwindgassen commented Nov 15, 2024

jhgoebbert commented Nov 30, 2024

jwindgassen commented Nov 30, 2024

ryanlovett left a comment

ryanlovett commented Dec 1, 2024

aktech left a comment

manics commented Dec 4, 2024

jwindgassen commented Dec 4, 2024 •

edited

Loading

jwindgassen commented Dec 4, 2024

manics commented Dec 4, 2024

jwindgassen commented Dec 7, 2024

jhgoebbert commented Jan 8, 2025

	ServerProcess = namedtuple(
	"ServerProcess",
	[
	"name",
	"command",
	"environment",
	"timeout",
	"absolute_url",
	"port",
	"unix_socket",
	"mappath",
	"launcher_entry",
	"new_browser_tab",
	"request_headers_override",
	"rewrite_response",
	"update_last_activity",
	"raw_socket_proxy",
	],
	)

Integrating native-proxy #501

Are you sure you want to change the base?

Integrating native-proxy #501

Conversation

jwindgassen commented Sep 10, 2024

ryanlovett commented Sep 10, 2024

jwindgassen commented Sep 10, 2024

aktech commented Sep 18, 2024

jwindgassen commented Sep 18, 2024 • edited Loading

ryanlovett commented Sep 18, 2024

jwindgassen commented Oct 18, 2024

How to use

ryanlovett commented Oct 23, 2024

jwindgassen commented Oct 24, 2024 • edited Loading

ryanlovett commented Oct 24, 2024 • edited Loading

jwindgassen commented Oct 24, 2024

manics left a comment

Choose a reason for hiding this comment

jwindgassen commented Oct 24, 2024

manics commented Oct 24, 2024

jwindgassen commented Oct 28, 2024

manics commented Nov 6, 2024

jwindgassen commented Nov 11, 2024

aktech commented Nov 12, 2024

jwindgassen commented Nov 15, 2024

jhgoebbert commented Nov 30, 2024

jwindgassen commented Nov 30, 2024

ryanlovett left a comment

Choose a reason for hiding this comment

ryanlovett commented Dec 1, 2024

aktech left a comment

Choose a reason for hiding this comment

manics commented Dec 4, 2024

jwindgassen commented Dec 4, 2024 • edited Loading

jwindgassen commented Dec 4, 2024

manics commented Dec 4, 2024

jwindgassen commented Dec 7, 2024

jhgoebbert commented Jan 8, 2025

jwindgassen commented Sep 18, 2024 •

edited

Loading

jwindgassen commented Oct 24, 2024 •

edited

Loading

ryanlovett commented Oct 24, 2024 •

edited

Loading

jwindgassen commented Dec 4, 2024 •

edited

Loading