Data Repository

iDASH Data Repositories

Notice: At this time, the funding of the iDASH program is over. Unfortunately, we have to sunset the data within iDASH. All data within any of the communities in iDASH will no longer be accessible after 07/30/2017. If you have any questions, please don't hesitate to contact us at . Data owners who need help finding a new repository for their data or who are interested in a fee-based hosting service can also contact us at

 

An important part of the information infrastructure provided by iDASH is to provide a single, comprehensive set of facilities to explore, navigate, analyze, and combine different forms of information provided by different data sources, within the bounds of privacy restrictions. iDASH is designed to be scalable and extensible so that developers can integrate the heterogeneous data from the national biomedical, clinical, and informatics communities. Developed as an open, community-serving, crowd-sourcing resource, the iDASH team is collaborating with biomedical, behavioral, and quantitative researchers to establish the nation’s most robust data repository for high-quality collections of data. This rich repository of medical data includes images and text accompanied by meta-data.

iDASH Repository Tutorials
 

How do I navigate through the iDASH Repository website?

View video demonstrations of the iDASH Repository here.

 

How do I access files in the repository?

Click here to download the tutorial.
View tutorial videos here.

 

What must I do before contributing NON-PHI Data?

Click here to download the tutorial.

 

What must I do before contributing PHI Data?

Click here to download the tutorial.

 

 

Data Repository FAQ

What is the non-PHI/PII data repository?

This repository is a customized instance of MIDAS, an open-source platform from Kitware. This iDASH data repository is intended to store biomedical data that do not contain personal health information (PHI) or personal identifiable information (PII). Data sets that contain PHI or PII and limited data sets are placed in the iDASH PHI data repository, a separate instance of MIDAS, which is not visible to the public.

 

How can I use the iDASH repository?

Data contributors

You can use the iDASH repository to organize your data and specify different access capabilities based on if your data contain PHI/PII or if you require others to sign a community agreement prior to accessing your data.

Data users

Without a MIDAS account, you can download any data contained in a public folder. Private folders require a MIDAS account and approval from the community owner for access.

 

What is MIDAS?

MIDAS is the open-source platform from Kitware upon which the iDASH repository is based. Customizations have been made to make this instance of MIDAS that are specific for iDASH purposes.

 

How do I upload data into the iDASH repository?

To upload data into the iDASH repository, you first need to obtain a MIDAS account. To obtain a MIDAS account, follow the appropiate DUA procedures or contact idash@ucsd.edu to help you get started. 

If you already have an account, click here for tutorial videos.

 

How do I get a MIDAS account?

To obtain a MIDAS account, follow the appropiate DUA procedures or contact idash@ucsd.edu to help you get started.

 

How do I request data that are in a private folder?

Log into the iDASH non-PHI/PII data repository and send a request to the community owner. If the owner is not specified, send an e-mail to idash@ucsd.edu.

 

How are data stored in MIDAS?

When data are uploaded into MIDAS they are placed in a hash-addressed directory structure, called an asset store, for fast retrieval based on the MD5 checksum of the binary file. This eliminates redundancy by only storing equivalent content in the asset store once while also adding a level of indirection between file names (stored in the database) and file content (stored in the asset store).

 

How are data organized in MIDAS?

Data in iDASH are stored in a MIDAS repository instance. Data in MIDAS are organized around users and communities that are groups of users with a common interest in, for example, a particular data set. Registered users can upload, edit, download, and share data and can create communities. Both users and communities have public and private folders, where data can be shared without restriction (public) or with a user-specified subset of registered users (private). MIDAS also allows anonymous (unregistered) users to view and download any publically shared data without any credentials. All uploaded data can have licenses associated with them. MIDAS supports multiple license options and the selection of the appropriate license is left to the user uploading the data into the repository.

An administrator manages communities—by default the user who created the community. The community administrator can specify community agreements that state the terms of use for participating in the community. The community administrator manages the membership of the community by inviting registered MIDAS users to join the community and by removing users from the community.

 

How are data uploaded into MIDAS?

There are multiple options for uploading data into the MIDAS repository that rely on the MIDAS web API. Since the MIDAS repository uses HTTPS, all data upload and download connections are encrypted by default with industry standard, high-grade encryption (Camellia-256 bit cipher).

 

How can I use the iDASH repository directly with my application?

The MIDAS repository has web-based APIs that can be invoked directly from any application to retrieve data, perform complex processing, and then push the results back into the repository. These transfers are performed via HTTPS to ensure in transit data security. It is also possible to perform light processing, such as data previews or specific summaries by developing a MIDAS plugin.
For more details, see the MIDAS documentation here.

 

Who do I contact if I still have questions?

Please e-mail questions to idash@ucsd.edu.