Basic Authentication#

Many OPeNDAP servers require authentication. PyDAP uses Python’s requests library to fetch data across the web, and it also handle two general forms of authentication:

  1. username/password.

  2. tokens

Requirement#

  • A .netrc file on your local system, to hold your authentication credentials(on Window machines, it is called _netrc).

Note

Below we will demonstrate how to create a .netrc file in your local system.

We strongly recommend having a static, local ~/.netrc file that holds all authentication credentials, since requests can handle authentication from a .netrc file for you, without you the user having to do any extra step. This is, pydap can “discover” your OPeNDAP authentication (username/password) credentials efficiently, if these are properly defined on a .netrc file (_netrc on Windows machines).

While not all OPeNDAP servers are set up to authenticate via token, the vast majority of NASA’s Hyrax servers are, and the token is created via your Earth Data Login (EDL) account. Betewen the two approaches above, token authentication is generally recommended thoughout the documentation as it can avoid many (but not all) redirects.

Note

For a broader introduction to authentication, check the OPeNDAP documentation on Authentication for DAP Clients, and Client Authentication With EDLTokens.

Let’s access some remote data!#

from pydap.client import open_url
from pydap.net import create_session
dataset_url = "https://opendap.earthdata.nasa.gov/providers/POCLOUD/collections/ECCO%20Ocean%20Temperature%20and%20Salinity%20-%20Monthly%20Mean%200.5%20Degree%20(Version%204%20Release%204)/granules/OCEAN_TEMPERATURE_SALINITY_mon_mean_2017-12_ECCO_V4r4_latlon_0p50deg"

username / password#

Nothing is needed, other than initiate a requests.Session object. pydap.net has a function that can initiate such a session along with some additional parameters. This is:

my_session = create_session() # default requests.session() object. 
print(my_session)
<requests.sessions.Session object at 0x1040ac950>

or if you want to cache the session to avoid repeated downloads:

my_session = create_session(use_cache=True) # default requests_cache.session() object. 
print(my_session)
<CachedSession(cache=<SQLiteCache(name=http_cache)>, settings=CacheSettings(expire_after=86400))>

Note

both requests and requests_cache can recover and handle authentication via .netrc

Now, we open the remote dataset#

Now, all that is needed is:

ds = open_url(dataset_url, session=my_session, protocol="dap4")

The (default) requests session will recover the .netrc credentials, and no additional information is needed.

Tokens#

This is another approach that is supported by various institutions, and will be used throughout the documentation. To enable token authentication, you must first make sure you have a valid (unexpired) token, and if not create a new one. For example, see this EDL resource that is specific to NASA.

To improve the user experience, pydap.net.create_session can take extra session arguments, such as the token to:

  • Create a requests.session with token information on its headers.

  • Disable automatic discovery of the .netrc file.

session_extra = {"token": "YourToken"}

# initialize a requests.session object with the token headers. All handled by pydap.
my_session = create_session(session_kwargs=session_extra)
ds = open_url(dataset_url, session=my_session, protocol='dap4')

Note

At this point only the metadata has been downloaded to generate the pydap dataset, and references to each remove variable have been added