Pipelines to process CMS data: Medicaid and Medicare
See Medicaid master document.
The top-level directories are:
- doc
- src
The doc directory contains documentation.
The src directory contains software source code. See the Software Sources section for details.
The directories under src are:
- cwl
- python
The cwl folder contains reusable workflows, packaged as tools that can and should be used by all NSAPH pipelines.
Each processing step for CMS data is packaged as a standalone tool that can be run on its own, and each tool is documented individually. The tools are combined into a workflow defined in the medicaid.cwl file.
This package contains modules that generate a YAML schema for CMS data from the FTS files provided with the CMS Medicaid and Medicare exports (raw data).
The fts2yaml module is a generic parser of the FTS format for both Medicaid and Medicare.
A File Transfer Summary (FTS) document contains information about a data extract. It is a plain text file that lists the number of columns in the data extract, the number of rows, and the size of the data file. The FTS document also provides the starting position, the length, and the generic format (such as character, numeric, or date) of each column.
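To illustrate how the starting position and length of each column can be used, here is a minimal sketch that slices one fixed-width record into named fields. It is not the code in this repository: the Column class, the split_record function, the column names, the format labels, and the assumption that starting positions are 1-based are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Column:
    name: str
    start: int   # starting position; assumed to be 1-based in this sketch
    length: int  # number of characters occupied by the field
    fmt: str     # generic format label, e.g. "CHAR", "NUM" or "DATE" (hypothetical)


def split_record(line: str, columns: List[Column]) -> Dict[str, str]:
    """Slice one fixed-width record into a dict of field name -> raw text."""
    record = {}
    for col in columns:
        begin = col.start - 1  # convert the 1-based position to a 0-based index
        record[col.name] = line[begin:begin + col.length].strip()
    return record


# Illustrative column definitions, not taken from a real FTS file.
columns = [
    Column("BENE_ID", 1, 8, "CHAR"),
    Column("STATE_CD", 9, 2, "CHAR"),
    Column("BIRTH_DT", 11, 8, "DATE"),
]

row = split_record("A1234567MA19500131", columns)
print(row)
```

Values are kept as raw strings here; converting them according to the generic format (for example, parsing dates) would be a separate step.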
The create_schema_config module generates a YAML schema for CMS Medicaid data by parsing FTS files.
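The general idea of turning column definitions into a YAML schema can be sketched as follows. This is an illustration under stated assumptions, not the actual create_schema_config implementation: the one-line-per-column layout, the format labels, the type mapping, the output structure, and the names parse_columns and to_yaml are all hypothetical. Real FTS files have a richer layout, and the real schema may be organized differently.

```python
import re
from typing import List, Tuple

# Hypothetical layout: column number, name, generic format, start, length.
COLUMN_LINE = re.compile(r"^\s*(\d+)\s+(\w+)\s+(CHAR|NUM|DATE)\s+(\d+)\s+(\d+)\s*$")

# Assumed mapping from generic formats to database column types.
TYPE_MAP = {"CHAR": "VARCHAR", "NUM": "NUMERIC", "DATE": "DATE"}


def parse_columns(text: str) -> List[Tuple[str, str, int]]:
    """Return (name, format, length) for each line that looks like a column definition."""
    columns = []
    for line in text.splitlines():
        match = COLUMN_LINE.match(line)
        if match:  # lines that do not describe a column are skipped
            _, name, fmt, _, length = match.groups()
            columns.append((name, fmt, int(length)))
    return columns


def to_yaml(table: str, columns: List[Tuple[str, str, int]]) -> str:
    """Render the column list as YAML text (written by hand to avoid dependencies)."""
    lines = [f"{table}:", "  columns:"]
    for name, fmt, length in columns:
        sql_type = TYPE_MAP[fmt]
        if fmt == "CHAR":
            sql_type = f"{sql_type}({length})"  # character columns carry their width
        lines.append(f"    - {name}:")
        lines.append(f"        type: {sql_type}")
    return "\n".join(lines) + "\n"


# Illustrative input, not a real FTS file.
sample = """\
Columns in File: 3
1  BENE_ID   CHAR  1   8
2  STATE_CD  CHAR  9   2
3  BIRTH_DT  DATE  11  8
"""

schema = to_yaml("example_table", parse_columns(sample))
print(schema)
```

Keeping parsing and rendering in separate functions mirrors the split described above between a generic FTS parser and a schema generator, although the actual module boundaries may differ.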
nsaph.tools
This package contains code written in an attempt to extract corrupted Medicare data for 2015. The attempt was ultimately unsuccessful.