Physical Access to Data
The fees for physical receipt of Research Identifiable Files (RIF) are posted in the document “CMS Fee List for Physical Research Data Request”, linked at the bottom of this section.
The fees for physical receipt of data are determined by:
- Files Requested
- Number of People Included
- Frequency (annual or quarterly)
- Whether a Finder File is needed
- Preliminary and Updated Files
Files Requested
Each data file has a fee. The CMS fee list includes a list of files available and the associated fee.
Number of People Included
The CMS fee list includes four pricing tiers based on the number of people in a study (also referred to as the cohort). Requesters may already know the number of people in the cohort because they have collected this information themselves, or they may have criteria that they want used to select the people in their cohort.
To estimate the number of people included in a data request, use the CCW cohort creator and cost estimate application, a tool that uses menu-driven steps to estimate the number of people. The tool can also provide an estimated cost based on the cohort size.
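As a rough illustration of how a tiered fee lookup works, the sketch below maps a cohort size to one of four tiers. The tier boundaries here are hypothetical placeholders chosen for the example; the actual boundaries are published in the CMS fee list, and the function name `pricing_tier` is likewise an assumption.

```python
from bisect import bisect_left

# HYPOTHETICAL inclusive upper bounds for tiers 1-3. The real boundaries are
# in the CMS fee list and are not reproduced here.
TIER_UPPER_BOUNDS = [10_000, 100_000, 1_000_000]


def pricing_tier(cohort_size: int) -> int:
    """Return a tier number from 1 to 4 for the given cohort size."""
    if cohort_size < 1:
        raise ValueError("cohort size must be at least 1")
    # bisect_left counts the bounds strictly below cohort_size, so a cohort
    # sitting exactly on a bound stays in the lower tier.
    return bisect_left(TIER_UPPER_BOUNDS, cohort_size) + 1
```

With these placeholder bounds, a cohort of 10,000 people falls in tier 1 and a cohort of 10,001 falls in tier 2.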
Frequency
Most data files are priced per year. The Medicare fee-for-service (FFS) claims and enrollment data are also available on a quarterly basis. The quarterly data extract schedule and explanation of the fees are found in the article, "RIF Medicare Quarterly Data."
Finder File
A finder file is a file that identifies all of the people the requester wants to include. Requesters can submit their own finder file by submitting personal identifiers or they can request a finder file be created for them.
If requesters submit their own finder file, a finder file charge will not apply. For more information about the types of finder files CCW can receive, please see the “Finder File Encryption Policy” on the CCW website.
Requesters can also request to have a finder file created from the Medicare or Medicaid data. For example, a researcher could request to have all diabetics in a particular state pulled for a particular year. The cost of creating the finder file depends on whether the search criteria call for a simple or a complex algorithm. A simple algorithm is a search that requires only a single pass through the data. A complex algorithm is a search that requires a multi-step approach, such as multiple passes through the data. Examples are found in the "Appendix - Simple vs Complex finder files" document found under “Resources” at the bottom of this section.
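The single-pass versus multi-step distinction can be sketched with toy data. The record layouts, field names, and both functions below are hypothetical and only illustrate the shape of the two kinds of search; how CMS classifies any particular request is described in the appendix document, not here.

```python
# HYPOTHETICAL, simplified records; real CMS files have different layouts.
enrollment = [
    {"bene_id": "A", "state": "MN", "diabetes": True},
    {"bene_id": "B", "state": "MN", "diabetes": False},
    {"bene_id": "C", "state": "WI", "diabetes": True},
    {"bene_id": "D", "state": "MN", "diabetes": True},
]
claims = [
    {"bene_id": "A", "type": "inpatient"},
    {"bene_id": "C", "type": "inpatient"},
    {"bene_id": "D", "type": "outpatient"},
]


def simple_finder(records, state):
    """Single pass: one filter applied once over one set of records."""
    return {r["bene_id"] for r in records if r["state"] == state and r["diabetes"]}


def complex_finder(records, claim_rows, state):
    """Multi-step: build the cohort, then scan a second file to narrow it."""
    cohort = simple_finder(records, state)  # pass 1 over enrollment
    hospitalized = {
        c["bene_id"] for c in claim_rows if c["type"] == "inpatient"
    }  # pass 2 over claims
    return cohort & hospitalized
```

Here `simple_finder(enrollment, "MN")` selects people A and D in one pass, while `complex_finder(enrollment, claims, "MN")` needs a second pass over the claims and keeps only person A.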
Preliminary and Updated Files
Some CMS files are available in a preliminary status before the fully mature file becomes available. If preliminary files are purchased at the listed fees, the fully mature files for the same year of data can be ordered at a 50% rate.
Certain CMS files may be available in multiple releases for a given year. If the initial files are updated because of significant improvements to the data quality or volume, an updated release for the affected service year(s) will become available. If you purchase one version of the file at full price, any subsequent release of the same year of data can be ordered at a 50% rate. There are no fees for file updates in the VRDC.
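The discount rule for physical files can be written as a small calculation. This sketch assumes that "a 50% rate" means half of the file's listed fee; the function name and the fee amount in the usage note are hypothetical.

```python
from decimal import Decimal

DISCOUNT_RATE = Decimal("0.5")  # the 50% rate described above


def release_fee(list_fee: Decimal, prior_release_purchased: bool) -> Decimal:
    """Fee for one release of a file for a given year of data.

    ASSUMPTION: the 50% rate is applied to the listed fee, and it applies
    whenever an earlier release (preliminary or initial) of the same year
    was already purchased at full price.
    """
    if prior_release_purchased:
        return list_fee * DISCOUNT_RATE
    return list_fee
```

For a hypothetical listed fee of 1,000, buying the preliminary file and later the mature file for the same year would total `release_fee(Decimal("1000"), False) + release_fee(Decimal("1000"), True)`, which is 1,500 under this assumption.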
If a formal cost estimate is needed for a grant proposal or planning purposes, complete a specifications worksheet and email it to resdac@umn.edu.
Resources:
VRDC (Researcher)
The Virtual Research Data Center (VRDC) is a virtual research environment that provides timelier access to Medicare and Medicaid program data. The fees for accessing Research Identifiable File (RIF) data via the VRDC are posted in the document “CMS Fee List for CCW VRDC Cloud Environment”, linked at the bottom of this section.
The fees associated with accessing data via the VRDC are based on a combination of:
- Seat Access
- Project Fee
- Space/Usage Cost
Seat Access
Researchers who access data in the secure VRDC environment will be charged a standard access fee per user or “seat.” This fee covers CMS onboarding, seat license, and administrative costs. The seat access fee is charged on an annual basis; each seat must be renewed every year in order for the user or “seat holder” to continue working on the study.
Project Fee
The VRDC project fee is an annual fee that covers space allocation, Databricks credits, output review, and the cost of extracting the data needed. There is no charge to add years of data for an existing cohort. However, any changes in the cohort that result in re-extracting data will incur a fee. Existing VRDC seat holders may add projects to their user workspace for an additional project fee.
Space/Usage Cost
An annual space allocation of 2 TB per DUA is included in the project fee. However, researchers may need to pay for additional space in the VRDC depending on the size of their data request. Space is needed for raw data, analytic files, and output. Additional space can be purchased in 1 TB blocks. The cost for continued additional space will be charged during the seat renewal period, if applicable.
Databricks usage is measured in “credits.” An annual usage allocation of 2,000 credits per DUA is included in the project fee under the Full VRDC option. Researchers using an enhanced Databricks cluster should consider purchasing additional Databricks credits. Additional Databricks credits can be purchased at the time of the DUA's project renewal and one additional time within the project year.
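The space arithmetic above (2 TB included, extra space sold in 1 TB blocks) can be sketched as follows. The function name is hypothetical, and rounding a partial terabyte up to a whole block is an assumption that follows from space being sold in whole 1 TB blocks.

```python
import math

INCLUDED_TB = 2  # annual allocation per DUA included in the project fee
BLOCK_TB = 1     # additional space is purchased in 1 TB blocks


def extra_space_blocks(needed_tb: float, included_tb: float = INCLUDED_TB) -> int:
    """Number of additional 1 TB blocks to purchase for a given total need."""
    shortfall = max(0.0, needed_tb - included_tb)
    # ASSUMPTION: any partial terabyte requires a whole additional block.
    return math.ceil(shortfall / BLOCK_TB)
```

A project that needs 3.2 TB in total for raw data, analytic files, and output would need two additional blocks under this sketch, while a project that fits in 2 TB would need none.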
Obtaining a formal cost estimate is the best way to be sure of all associated VRDC fees, including whether or not extra space is required.
Resources:
VRDC (Innovator)
Innovators must use the VRDC, which is a virtual research environment that provides timelier access to Medicare and Medicaid program data. The fees for accessing Research Identifiable File (RIF) data via the VRDC through the Innovator Research Program are posted in the document “CMS Fee List for CCW VRDC Cloud Environment”, linked at the bottom of this section.
The fees associated with accessing data via the VRDC are based on a combination of:
- Seat Access
- Project Fee
- Space/Usage Cost
Seat Access
Researchers who access data in the secure VRDC environment will be charged a standard access fee per user or “seat.” This fee covers CMS onboarding, seat license, training, and administrative costs. The seat access fee is charged on an annual basis; each seat must be renewed every year in order for the user or “seat holder” to continue working on the study.
Project Fee
The VRDC project fee is an annual fee that covers space allocation, Databricks credits, output review, and the cost of extracting the data needed. There is no charge to add years of data for an existing cohort. However, any changes in the cohort that result in re-extracting data will incur a fee. Existing VRDC seat holders may add projects to their user workspace for an additional project fee.
Space/Usage Cost
An annual space allocation of 5 TB per DUA is included in the project fee. However, researchers may need to pay for additional space in the VRDC depending on the size of their data request. Space is needed for raw data, analytic files, and output. Additional space can be purchased in 1 TB blocks. The cost for continued additional space will be charged during the seat renewal period, if applicable.
Databricks usage is measured in “credits.” An annual usage allocation of 4,000 credits per DUA is included in the project fee under the Full VRDC option. Researchers using an enhanced Databricks cluster should consider purchasing additional Databricks credits. Additional Databricks credits can be purchased at the time of the DUA's project renewal and one additional time within the project year.
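To show how the three fee components combine for a single project, the sketch below adds seat fees, one project fee, and any additional space blocks. The class, the function, and all fee amounts in the usage note are hypothetical; real per-unit fees come from the CMS fee list, and the sketch leaves out any additional Databricks credits.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FeeSchedule:
    """Per-unit annual fees; real values must come from the CMS fee list."""
    seat_fee: float
    project_fee: float
    space_block_fee: float  # per additional 1 TB block
    included_tb: float = 5  # Innovator allocation per DUA, per the text above


def annual_estimate(fees: FeeSchedule, seats: int, needed_tb: float) -> float:
    """Rough annual total for ONE project (one DUA).

    ASSUMPTION: partial terabytes round up to a whole 1 TB block, and extra
    Databricks credits are not included in the total.
    """
    extra_blocks = math.ceil(max(0.0, needed_tb - fees.included_tb))
    return (
        seats * fees.seat_fee
        + fees.project_fee
        + extra_blocks * fees.space_block_fee
    )
```

With placeholder fees of 100 per seat, 1,000 per project, and 10 per space block, two seats and a 6.5 TB need give `annual_estimate(FeeSchedule(100, 1000, 10), seats=2, needed_tb=6.5)`, which is 1,220. A rough figure like this does not replace a formal cost estimate.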
Obtaining a formal cost estimate is the best way to be sure of all associated VRDC fees, including whether or not extra space is required.
Resources: