More complete HDFS / WASB style path structure #153

isaacabraham · 2015-11-21T09:17:02Z

There's a need to have the ability to specify the account as part of the path e.g.

"customerAccount@container/folder/folder/file.txt"
|> CloudFlow.ofFileByLine

etc. etc.

I've raised this as its own issue as it's probably an enabler for a number of scenarios.

eiriktsarpalis · 2015-11-21T13:50:44Z

MBrace.Core and by extension MBrace.Flow do not on themselves perform any type of parsing on the paths. This job is delegated to the ICloudFileStore abstraction that the current runtime happens to be using.

So I think this really is an MBrace.Azure issue: we should consider whether the concrete implementation of ICloudFileStore, BlobStore should support multiple storage accounts and recognise WASB-style paths.

If we decide to go for this approach, there are a few ramifications that might be worth considering:

How will the cluster be handling key management? By design, the current implementation will never encapsulate connection strings in serialized storage objects; rather it is expected that connection strings are specified at the configuration level of each node. This happens in order to avoid inadvertent leaks of connection strings to exported serializations of object graphs, which is very easy to occur. Should the user decide to introduce a new connection string from the client side, how will that key be distributed across the cluster without worrying that leaks might happen?
Issues of cluster identity: at the moment every MBrace cluster is uniquely identified by the pair of storage and service bus accounts that it uses. How could we design frictionless introduction of secondary keys without potentially blurring this identity? And how can we be sure that those secondary keys are recoverable in cases where all worker instances have died?

eiriktsarpalis · 2015-11-21T13:56:37Z

There are quite a few ways we could address these concerns: One would be maintain an "accounts" table in the master storage account which would contain all secondary connection strings. I do feel though that this may violate security expectations users may have.

eiriktsarpalis · 2015-11-21T14:04:02Z

Another would be to use the service bus to broadcast additional auth data to workers.

dsyme · 2017-06-08T16:36:02Z

See mbraceproject/MBrace.Azure#161 which I think covers this enough for these purposes (an MBrace.Core PR may follow out of that)

isaacabraham mentioned this issue Nov 21, 2015

Attach multiple FileStores to a cluster. #154

Open

eiriktsarpalis added the MBrace.Azure label Nov 21, 2015

dsyme closed this as completed Jun 8, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More complete HDFS / WASB style path structure #153

More complete HDFS / WASB style path structure #153

isaacabraham commented Nov 21, 2015

eiriktsarpalis commented Nov 21, 2015

eiriktsarpalis commented Nov 21, 2015

eiriktsarpalis commented Nov 21, 2015

dsyme commented Jun 8, 2017

More complete HDFS / WASB style path structure #153

More complete HDFS / WASB style path structure #153

Comments

isaacabraham commented Nov 21, 2015

eiriktsarpalis commented Nov 21, 2015

eiriktsarpalis commented Nov 21, 2015

eiriktsarpalis commented Nov 21, 2015

dsyme commented Jun 8, 2017