mount remote filesystem with sshfs #255
clean debug code: e583142 to 0617b50
I think this generally LGTM. As I understand it, it will listen for all 4 possible events at all times. The online events will only do anything if we set a flag in the database. Then it will process any non-cluster variables with the data mounted through sshfs. Once the data is migrated to Maxwell (and possibly corrected), the same variables will be reevaluated with data in the usual location, plus any
Yep, you got it right. I know there's some duplication; we could e.g. ignore offline processing (at least for raw data) if online is set. But I think for the use case we have this should not cause issues. There are also other ideas in mind for the long term: for example, Philipp suggested that file-based migration might be there soon, so that could be an alternative that removes this special case. We could also special-case the calibration pipeline to run as soon as the data is available on Maxwell. But that's another conversation.
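The behaviour agreed on in the two comments above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in this PR: the `Event` class, the `"online"`/`"offline"` labels, the `online_enabled` flag name and the `variables` mapping are all hypothetical stand-ins for whatever the listener and database actually use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """Hypothetical event record; field names are assumptions."""
    source: str  # "online" or "offline"
    run: int


def should_process(event, online_enabled):
    """Online events are acted on only when the database flag is set.

    Offline events are always processed, as before.
    """
    if event.source == "online":
        return online_enabled
    return True


def online_variables(variables):
    """Pick the variables evaluated on online (sshfs-mounted) data.

    `variables` maps a variable name to True if it needs the cluster;
    only the non-cluster ones are evaluated for online events.
    """
    return [name for name, needs_cluster in variables.items() if not needs_cluster]
```

With this split the listener can stay subscribed to every event and the flag alone decides whether online events have any effect, which matches the "listen at all times, act only if enabled" description.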
I've now also prevented the GUI from starting the backend if it's running on the online cluster.
Other than that, LGTM
run as slurm job, proxy through 10G node
Are we still doing this? I thought ITDM wasn't a fan?
This is not an option for the long run, but it is fine as a temporary solution until IT sets up what we need online.
I'll test this a bit more, and will merge at the end of this week or early next week, unless there are objections.
Main change since the last review: it's adapted to run on the solaris cluster, tunneling data mounts through the display nodes.
…solute and break with using the mount point
Attempt to run the context file on online data without having 2 separate databases.
The current solution is very simple: it mounts the remote directory with sshfs, and the execution runs on Maxwell.
I think ideally, when we have a centralized backend, we may want to have a permanent NFS mount but that's for a longer term solution.
There's a meta variable to set in order to run the context on online data.
I've loosely tested it and couldn't see a noticeable slowdown due to the remote data for the kind of data we work with (at HED).
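For readers unfamiliar with the approach, here is a minimal sketch of what mounting a remote directory with sshfs from Python can look like. It is not the implementation in this PR: the function names, host name, and paths are hypothetical, and the `ro` and `reconnect` mount options are an assumption about what a read-only data mount would reasonably use.

```python
import subprocess
from pathlib import Path


def sshfs_command(host, remote_dir, mount_point):
    """Return the argv for a read-only sshfs mount of host:remote_dir."""
    return [
        "sshfs",
        f"{host}:{remote_dir}",
        str(mount_point),
        "-o", "ro",         # mount read-only (assumed choice)
        "-o", "reconnect",  # reconnect if the ssh connection drops (assumed choice)
    ]


def mount_remote(host, remote_dir, mount_point):
    """Create the mount point if needed, then mount the remote directory."""
    mount_point = Path(mount_point)
    mount_point.mkdir(parents=True, exist_ok=True)
    subprocess.run(sshfs_command(host, remote_dir, mount_point), check=True)


def unmount_remote(mount_point):
    """Unmount a FUSE mount such as one created by sshfs."""
    subprocess.run(["fusermount", "-u", str(mount_point)], check=True)
```

Once mounted, the code evaluating the context file only sees a local path, so it does not need to know that the data lives on another cluster.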
Some limitations: