home sort-asc sort-desc sort square-o search plus minus caret-up quotes-left quotes-right angle-left angle-right
skip to main content

English Language Arts/Literacy and Mathematics Smarter Balanced Summative Assessments

Research Files for Smarter Balanced Assessments

CAASPP Research Files

These research files contain results from the administrations of the California Assessment of Student Performance and Progress (CAASPP) Smarter Balanced Assessments and is the same information presented in the Detailed Test Results section of this site. These files are provided to allow for more complex analyses and customized reporting of the data.

In order to protect student confidentiality, no scores are reported (or included in the research files) for any group with fewer than 11 students.

The research files are available in two formats: fixed width and caret delimited. All research files contain the data for entities comprising the level of that file.

  • The "statewide" files include the data for the state, and all counties, districts, and schools. Files can also be downloaded for any single county or district.
  • The "county" files include the data for the selected county and all districts and schools associated with that county.
  • The "district" files include the data for the selected district and schools associated with that district. "School only" files are not available.

Please note independent charter schools (direct funded) are treated as a district. Test results for these schools are included in the numbers for the state, county, and school.

Note: The research files are available in fixed width and comma delimited formats for years prior to 2019–20.

Downloading CAASPP Research Files

Use of these research files requires some expertise in the handling of data and advanced data management skills. Many of the district and county research files are very large (up to 130MB) and may be too large for spreadsheet applications. Database applications such as MS Access, SAS, or SPSS are required to manage these files. The file size is indicated in parentheses and is shown only when the size is greater than one megabyte. Because of the size of the California Statewide Research File for all student groups, both the fixed-width and caret-delimited versions will now have three sets of files: one set for the combined file with both ELA and mathematics, one set for ELA only, and one for mathematics only. When loading the Microsoft Access database, the combined file cannot be used due to its size. Instead, load the ELA file into the Microsoft Access database and load mathematics separately into a second Microsoft Access database.

Instructions for importing caret-delimited research files (PDF)


Report Options

Statewide Files (include data for the state, and all counties, districts, and schools)


Countywide/Districtwide Files

Select a county or a county and district to search and download research files.

Note: Countywide and Districtwide fixed-width research files are only available beginning with the 2023–24 administration.



Entity Files

The following entity files list the County, District, and School entity names and codes for all entities as the existed in the administration year selected. This file must be merged with the research file to join these entity names with the appropriate score data.


Access Database (.mdb) File

A database “shell” is another alternative provided at this site. Once downloaded to the target computer, this application provides a powerful school, district, county, county-district-school (CDS) code, and ZIP code search capability as well as a formatted report containing all the data for the selected entity. This MS Access 2007 shell contains all entity data and is designed to import any of the selected state, county, or district caret/comma delimited files. In order to use the shell, MS Access 2007 must already be installed on your computer.


Research File Formats, Layouts, and Lookup Tables

2022–23 Research File web page – research file layout and lookups
2022–23 Test ID / Test Name table (CSV) – test IDs and test names
2022–23 Student Group ID / Student Group Name table (CSV) – student group IDs and names

Getting Accurate Results from the Research Files

Achieving accurate results when working with these research files requires an understanding of the structure and content of the two primary tables: the entities table and the test data table. The research files have many rows for each entity. There are records for each combination of grades, tests, and student groups. This means that there are hundreds to thousands of records for each entity, with an average of approximately 900 records. In order to correctly work with the data, you must use constraints to limit the data you are reporting. These constraints are discussed below.

Entities Table

This table is composed of the state, all counties, districts, and schools in California. Because there are both school-level and district summary records as well as county and state summary records, it is critical that in any analysis, a "Type ID" record type be selected. This will help avoid the double or triple-counting that will occur when a school count is also counted in the associated district record.

Test Data Table

This table is composed of the school, district, county, and state aggregate CAASPP counts and scores. To accurately analyze and report from these research files, the appropriate constraints must be applied to the following elements:

  • CDS code – The research files contain summary district and county records. A district summary record will have a "school" code of "0000000". When working with the file, be sure to include the county, district, and school codes. Failure to include all three data codes will result in double-counting in any summary calculations.
  • Test type – Identifying the desired test (ELA or mathematics) will help to provide clear query results.
  • Student Group ID – Each student will be included in both the “All Students” student group aggregation and each of the appropriate student group aggregations. Consequently, an individual student group must be selected to avoid duplicate counts.
  • Test ID – In general, each student will take a number of tests. A specific test should be selected to avoid confusion.

Providing accurate and meaningful reports from the research files generally requires the "linking" of the Entities and Test Data tables. Additional efforts might include linking to the "lookup" tables. Working with these tables requires an understanding of "relational" data tables and their manipulation.

California Department of Education 1430 N Street Sacramento, CA 95814