You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Best approach to support reading data from http via a pathlib-like class, i.e. httppathlib?
In the pangeo / xarray community we do a lot of reading of remote scientific data (particularly netCDF and Zarr). We generally want to treat 3 cases the same way: local filesystems, cloud storage, and http urls. The latter is important partly because a lot of archival scientific data is still only available from servers via http (e.g. via openDAP urls), and we often want to pull it out and deposit it onto cloud storage (e.g. using pangeo-forge).
CloudPathlib looks really nice, especially the strict typing and clear interface. (I'm in awe of the AnyPath virtual superclass trick too - and with #347 would be even cooler!) The Path abstraction also just seems more like the minimally-useful one, rather than trying to emulate a whole filesystem.
Rather than trying to support every filesystem under the sun as fsspec does, I'm wondering if we could just use pathlib, cloudpathlib, and some new httppathlib?
Best approach to support reading data from http via a
pathlib-like class, i.e.httppathlib?In the pangeo / xarray community we do a lot of reading of remote scientific data (particularly netCDF and Zarr). We generally want to treat 3 cases the same way: local filesystems, cloud storage, and http urls. The latter is important partly because a lot of archival scientific data is still only available from servers via http (e.g. via openDAP urls), and we often want to pull it out and deposit it onto cloud storage (e.g. using pangeo-forge).
We currently use
fsspecto abstract over these different filesystems, but despite much engagement upstream we have unfortunately experienced chronic reliability issues stemming from ill-defined interfaces.CloudPathlib looks really nice, especially the strict typing and clear interface. (I'm in awe of the
AnyPathvirtual superclass trick too - and with #347 would be even cooler!) ThePathabstraction also just seems more like the minimally-useful one, rather than trying to emulate a whole filesystem.Rather than trying to support every filesystem under the sun as
fsspecdoes, I'm wondering if we could just usepathlib,cloudpathlib, and some newhttppathlib?Do you have any thoughts on:
httppathlibto conform to thepathlibinterface?cloudpathlibor in a separate repository?