Disability Analysis File Restricted Access File

Please see the DAF PUF page for details on the public use version of this file.

Disclaimer: The Disability Analysis File is comprised of SSA administrative records. Access to this data is strictly controlled and restricted to SSA employees, contractors, and other agencies and organizations with formal agreements with us. There is currently no routine process in place to give DAF access to other researchers, though we are currently examining whether such access may be possible in the future. Please send inquiries regarding DAF data to ORDES.DAF@ssa.gov.

DAF Overview
Descriptive Statistics from the Disability Analysis File
DAF Files
DAF Data Marts
DAF Extracts
DAF Changes
DAF Main Documentation and File Locations
DAF Code Library
DAF Research Solutions
Tips for How to Efficiently Use the DAF Data Documentation

New to the DAF? Check out our new Quick Start Guide to get started!

DAF Overview

The Disability Analysis File (DAF) is an analytical file consisting of agency administrative data in an easy-to-use format. We create a new version of the file and documentation each year. The file contains historical, longitudinal, and one-time data on all child and adult beneficiaries with disabilities who were below retirement age and who participated in the Supplemental Security Income (SSI) or Social Security Disability Insurance (SSDI) programs at any time between 1996 and the year of the file. Each DAF is an updated version of all prior DAF files, so users should use the most recent file available. All files are stored on the agency mainframe in SAS format.

There are several data marts and extracts available as well. These are generally small enough for use on servers or PC computers.

We build the DAF by extracting and combining data from various administrative data sources including:

  • Supplemental Security Income Record,
  • Master Beneficiary Record,
  • Disability Determination Service Processing File (also known as the 831/832),
  • Master Files of Social Security Number (SSN) Holders and SSN Applications (NUMIDENT) file,
  • Completed Determination Record – Continuing Disability Determinations (also known as the Disability Control File (DCF)),
  • Earnings Recording and Self-Employment Income System (also known as the Master Earnings File (MEF)), and
  • Various payment data files maintained by the agency.

Under the DAF, we combine data from these various administrative sources into a single record per beneficiary.

The DAF includes three main types of variables: one-time variables, "n" variables, and monthly historical variables. One-time variables include data such as SSN, date of birth, or Date of Initial Entitlement (DOEI). These variables reflect the latest information shown in the SSA administrative file used. Many of these variables, such as DOEI, will show dates going back several decades. Since the DAF include all beneficiaries who received benefits in any one month since 1996, many beneficiaries included in the DAF started benefits well before 1996.

Like one-time variables, "n" variables reflect the latest information shown in the SSA administrative file used and can show dates going back several decades, but unlike one-time variables, "n" variables can show multiple occurrences. Assuming Var is the root variable name, Var1 will be the first occurrence, Var2 the second, and so on. Most "n" variables will have a value variable (e.g. status for occurrence n) combined with a corresponding date variable for that occurrence. So, for example, RFCn (reason code for continuance/cessation) can have up to 16 occurrences, one for each continuing disability review, and RFC1 will align with DD1, RFC2 with DD2, etc., with each DDn showing the Medical Continuing Disability Review Date associated with the corresponding "n" decision code.

Monthly historical variables include such items as State of residence, impairment codes, and benefit payment status and amounts and show month-to-month variation for such variables, each ending in yymm to indicate the year and month for the value. Many monthly historical variables in the DAF have data spanning from January 1994 to December of the year of the file. Some variables have shorter time ranges, but none is earlier than January 1994.

You can find the DAF data documentation, descriptions and locations of the data files, and information on data marts and extracts for the most recent version of the file below. If you have used older versions of the DAF documentation, you should still use the most recent version of the documentation provided here. This version provides the most up-to-date information on the DAF and identifies any variables that have changed since we constructed earlier versions of the data. If you have questions about the DAF or need access to older versions of the data or documentation, please contact ^ORDES DAF.

Descriptive Statistics from the Disability Analysis File

The following report provides key descriptive statistics from the 2015 DAF. It examines the work activity, employment expectations and characteristics, employment services, and factors affecting employment of working-age adult Social Security Disability Insurance (DI) beneficiaries and Supplemental Security Income (SSI) recipients. It focuses on longitudinal analyses with timeframes spanning periods before and after disability award.

DI & SSI Program Participants: Characteristics & Employment, 2015

March 2020

DAF Files
The following files comprise the DAF20. We describe the files in more detail in the documentation below.

  • DAF Demographic File. This file contains a snapshot of what each beneficiary's administrative record looks like as of December 2020. It includes demographics such as date of birth and gender; current status, which could be active, suspended, or terminated; as well as summary information such as when the last period of eligibility began.
  • Annual Files for 1994-2020. These files contain monthly benefit and work data from January 1994 through December 2020.
  • DAF Ticket to Work Beneficiary Participation Files. These files contain monthly data on TTW eligibility and participation.
  • DAF Ticket to Work Payment Files. These files contain information on payments to Employment Networks for Ticket participants under the Milestone-Outcome or Outcome-Only payment systems and payments to State Vocational Rehabilitation Agencies under the traditional cost-reimbursement system.
  • DAF Linkable Master Earnings File (MEF). This file contains wage and self-employment earnings data from the Internal Revenue Service. These data are not available to contractors or grantees and so are stored separately from the other DAF files.
    The appendix of Volume 2 in the documentation below contains more information about the MEF linkable file.
  • DAF-RSA 911 Linkable File. This is a mini-DAF for Rehabilitation Services Administration (RSA) participants containing information for records from the DAF that match RSA records. The department of Education implemented significant changes to the RSA-911 file in July of 2017. Because of the change, DAF-RSA 911 Linkable File includes two files: one for those in RSA records through June 2017, and a second file for new records from July 2017 - December 2020. The appendix of Volume 2 in the documentation below contains more information about the RSA linkable file. Use of this data requires authorization from RSA through a simple project approval process. Contact ^ORDES DAF for information on requesting approval to use these data.
  • Local Economic Data from the LAUS and SAIPE. Beginning in the DAF12, SAS formats containing data from the Local Area Unemployment Statistics (LAUS) and Small Area Income and Poverty Estimates (SAIPE) are available for linking to the DAF. These formats contain county-level annual unemployment rates from the LAUS, and county-level monthly poverty rates and median income from the SAIPE. Because data from the LAUS and SAIPE are geography-specific and not person-level, we have stored this information as SAS formats rather than individual-level variables. You can find more information on these economic data and how to access the SAS formats in Volume 2 of the DAF documentation.

DAF Data Marts
The following data marts are available for the DAF20:

  • DAF20 10% Data Mart is a 10 percent random sample of the DAF, including all of the information from the core components described above. You can use the 10% data mart to test programs before running on the complete DAF or for analyses where smaller sample sizes are acceptable. A detailed description of the 10% Data Mart is in Volume 2.
  • The Awardee Data Mart (ADM) supports cohort analyses for the SSDI and SSI disability programs. It contains beneficiaries who received their first SSI or SSDI payment as an adult between 1996 and the end of the last year covered by the current DAF and includes all DAF demographic file variables for those beneficiaries as well as payment- and eligibility-based award variables constructed for the purpose. Researchers interested in assessing trends in beneficiary cohorts or in following outcomes of beneficiaries from first benefit month onward may be interested in this file. A detailed description of the ADM is in Volume 2.

DAF Extracts
The following extracts are available for the DAF20:

  • The National Beneficiary Survey (NBS) Extract is a mini version of the full DAF (Demographic, Annual, Ticket to Work Participation, Ticket to Work Payment, and RSA linkable files) that includes only respondents to one of SSA's six NBS surveys (NBS04, NBS05, NBS06, NBS10, NBS15, NBS17, and NBS19). See Volume 2 for more information about this extract.
  • The Ticket to Work Participant Extract is a mini version of the full DAF (Demographic, Annual, Ticket to Work Participation, Ticket to Work Payment, and RSA linkable files) that includes only participants in SSA's Ticket to Work program in 2006 or later. See Volume 2 for more information about this extract.

DAF Changes
We made a number of improvements to the DAF data and documentation in the DAF20. We have outlined some of the major changes below.

  • We have added child SSI recipients to the Awardee Data Mart. In prior versions of the DAF-ADM, only SSI recipients who had received their first SSI payment as an adult were included. For SSI recipients who have received both child and adult SSI payments, the new ADM now includes information related to the first childhood payment in addition to the first payment as an adult. Many users may be interested in the award date for child SSI recipients, and incorporating information on the first SSI payment as a child allows separate identification of child SSI award dates as distinct from SSI or SSDI benefits as an adult. Volume 2, Section V.B. contains more information on the revised ADM inclusion criteria. We have also added variables to the ADM that are specific to child awardees and we have revised the names of some variables to make it clear that they are based on the first adult SSI award.
  • We have added a race variable, RACE, to the DAF DMG file based on information stored in the NUMIDENT. This measure was previously included on the DAF but had been dropped starting in DAF15 due to limitations from changes in how SSA collected this information over time. Users who are interested in race and ethnicity information should consult Section IX in Volume 3 “Race and Ethnicity Categories in the DAF.”
  • We have incorporated a new policy directive (PD-19-03) from the Department of Education for the RSA-911 data that went into effect on July 1, 2020 and affects the contents of the file from 2020 onward. More information about the RSA-911 file is available in Volumes 2 and 12, with specifics about the new policy directive available in Appendix E of Volume 12.
  • We renamed some variables to improve the user experience and minimize the likelihood of error resulting from potentially misleading names. The list of renamed variables is available in Table I.1 of Volume 4.
  • We have enhanced the documentation related to the standalone CDR file, both in Section VIII of Volume 2 as well as in the variable detail pages in Volume 5. In particular, we offer more information about how to understand the sequential nature of variables drawn from SSA’s Waterfall File.
  • Consistent with a change that SSA made in its published statistics to capture beneficiaries with Attention Deficit Hyperactivity Disorder (ADHD), we have updated our classification of primary impairment groups. This is reflected in Volume 3 where we discuss a potential classification scheme that might be useful for researchers. We have also updated the code library to provides users with code to implement that classification scheme in addition to code for the validation statistics we present in Volume 6.
  • We have added data for respondents to Round 7 of the National Beneficiary Survey (NBS) to the DAF-NBS extract file and have included new variables to identify these participants; more information about these variables is available in Volume 4, with detail pages for each variable in Volume 5.
  • We have added flags in the demonstrations and surveys extract to identify participants in the Ohio Direct Referral demonstration project and Retaining Employment and Talent After Injury/Illness Network (RETAIN) projects. Section VI.C in Volume 2 contains more information about these and other flags in this extract file.

DAF Main Documentation and File Locations

The following files comprise the documentation for the main DAF20 files:

See the tips below for how to efficiently use the DAF data documentation.

File Locations for DAF20 Files, Data Marts, and Extracts. The dataset names (DSNs) of all DAF20 components and linkable files are provided in a file of filenames, a mainframe text file at the location below, or as a downloadable text file of filenames.

OPDR.TG.PRD.ETTW.DAF20.FILES.CURRENT

Some files (noted in the file of filenames) have copies stored on the agency mainframe. The copy name is identical to the original name except that the node DAF20P is DAF20C. Examples are included in the file of filenames.

DAF Code Library
To make the DAF more efficient and easier to use, we have developed SAS code for common analytical tasks run on DAF files. Researchers can use and modify this code as needed. The DAF Users' Code Library currently includes code to complete the following tasks:

  • Determine whether a beneficiary was eligible for SSDI or SSI within a user-specified time period;
  • Categorize impairment codes into the groupings used in our published statistics;
  • Determine whether we have suspended or terminated a beneficiary due to work within a user-specified time period; and
  • Reorder variables suffixed 1-n into a chronological order.

We expect the DAF Users' Code Library to grow over time, so please check back periodically.

DAF Research Solutions

These fact sheets illustrate how the DAF has been used to support research and answer questions about our disability beneficiary population.

DAF Research Solutions 1

This fact sheet describes how the DAF was useful in an analysis by Ben-Shalom and Stapleton (2012), who sought to better understand the long-term program participation and employment patterns of adult SSI recipients following benefit award.

Tips for How to Efficiently Use the DAF Data Documentation

Most users will only use volumes 1-5 of the DAF documentation.

  • Volume 1 and Volume 2 are primarily geared to new users and provide overview material on what the DAF is (Volume 1) and how to use it (Volume 2).
  • Volume 3 provides tips and tricks for using the DAF and also provides detailed information on the many constructed variables that simplify complex program information, such as the monthly composite suspense/termination variables (STW) and the benefits foregone for work variables (BFW).
  • Volume 4 and Volume 5 are reference volumes users will consult on choosing and using the variables in the DAF.

Volumes 6-12 deal primarily with the construction details of the DAF that will be of little use to most users.

In order to...
Refer to...
Get started with a research task

Volume 2, Working with the DAF20, for information about selecting beneficiaries using finder files versus selection criteria

Identify what's changed in the DAF Volume 1, "Getting Started with the DAF20" and Volume 4, "Lists of DAF20 Variables" for the list of new, revised and deleted variables in the current DAF.
View lists of DAF variables Volume 4,"Lists of DAF20 Variables"
Understand individual variable definitions,

specifications, and value ranges

Volume 5, "DAF20 Variable Detail Pages"
Understand the structure of the DAF data files at a high level

Volume 1, "Getting Started with the DAF20"

Identify variables for a specific research task Volume 4 , "Lists of DAF20 Variables," for a list of variables

contained within each DAF file and by analytic category

Understand the beneficiaries for which the DAF does and does not contain data Volume 1 , "Getting Started with the DAF20"
Identify our administrative data sources for the DAF Volume 9 , "DAF20 Source File Descriptions"
Generate ideas for using the DAF more efficiently Volume 1 , "Getting Started with the DAF20" and Volume 2, "Working with the DAF20"
Find suggested ways to identify common research concepts

in the DAF, such as calculating age of retirement, or disability title

Volume 3, "Tips for Conducting Analysis with the DAF20"
Understand what variables have changed in the most recent DAF Volume 4 , "Lists of DAF20 Variables"
Read about how information in the DAF is validated against other sources Volume 6 , "Validating the DAF20 Against Other Sources"