
Better Data Is Needed to Tackle Health Equity
The US federal government is finally updating its standards for reporting data on race and ethnicity – and it’s an urgently needed chance to enable a national overview of crucial data on health inequities
Over the course of the COVID-19 pandemic, policy makers and health care stakeholders have been hamstrung in their pursuit of health equity by data gaps and inconsistencies. Measuring inequities in the uptake of the COVID-19 vaccine, the best defense against severe disease, was hindered by critical limitations, including the lack of standardized procedures and category definitions for race and ethnicity data as well as inadequate capacity to share data across systems. Regional data turned out to be disparate and inconsistent. Federal data is consistent but is coded using antiquated standards that prevent granular analysis and leave a significant proportion of records with a missing race and ethnicity component. To make fair and equitable policy decisions, federal and regional data needs to be comparable without the danger of data artifacts leading to misdirected resources.
As we consider the problem, we know that people of color have borne greater 
In the spring of 2021, we set out to create a daily updated, nationwide map of county-level vaccination coverage in the United States disaggregated by race and ethnicity by scraping publicly available data from state dashboards. It quickly became clear that differences in reporting styles and gaps in data coverage across states would prevent a nationally representative view into county-level disparities. Even among the states where county-level data is available on public-facing dashboards, large gaps, limitations, and inconsistencies inhibit the ability to compare among race and ethnicity both within and across states. Here we describe retrospective findings as of March 2022 from a small subset of 13 states with robust, consistent reporting standards, share the barriers we encountered, and propose recommendations for revamping standardization and data-sharing practices nationally. Comprehensive, granular data on race and ethnicity is fundamental for efforts to advance health equity, not only related to COVID-19 but in public health and health care more broadly.
County-level racial vaccination inequity shows substantial variability within and across states
National and state vaccination aggregates can mask local disparities with unique underlying causes. County-level estimates disaggregated by race and ethnicity, the most granular geographic unit reported at a national scale, reveal large differences in coverage. As of March 29, 2022, we can robustly report county-level first-dose vaccination data among the White and Black populations for 827 counties across 13 states, accounting for 40% of the population over the age of 5 years, 31% of the White population over the age of 5 years, and 40% of the Black population over the age of 5 years in the United States. We omit other racial and ethnic groups for comparability reasons further explored below. The fact that this omission is necessary, however, is precisely the problem that needs to be addressed.
In April 2021, vaccination coverage was 13 percentage points higher among the White population compared with the Black population. Across our dataset as of March 2022, we saw a decline in the vaccination gap, whereby the Black population trailed by 9 percentage points, consistent with 
Data gaps, limitations, and inconsistencies inhibit comparisons across and within states
As we began this data journey, we discovered county-level data by race and ethnicity for people who received at least 1 dose of the vaccine was publicly available on 22 individual state dashboards, but only 16 states showed the fully vaccinated (those that have completed their primary vaccine series [ie, 2 doses of either Pfizer or Moderna or 1 dose of the Johnson & Johnson]). Between states, we encountered 3 main issues in state-reported vaccination data: their racial/ethnic classifications, their frequency of reporting, and their separation in reporting of data administered through federal programs such as the Indian Health Services and the Long-Term Care Partnership Program. We defined 2 categories of states based on how they report race and ethnicity. There were 5 states in our sample (AL, DE, GA, OH, WI) that reported race and ethnicity separately, using a 2-question format (eg, “Black” and “Hispanic or Not Hispanic”), while the remaining 8 states (CA, MA, MI, OR, SC, TX, VA, WA) record them as mutually exclusive, using a 1-question format (eg, “Black, non-Hispanic” or “Hispanic”, but not both) (Exhibit 2). The total population within a state used to calculate vaccination rates differs depending upon the method used for reporting. This, in turn, prevents a proper comparison of vaccination rates between states using different approaches since the underlying total population of, for example, “White” (“Hispanic” and “non-Hispanic” under the 2-question format) is different from solely “White, non-Hispanic” (under the 1-question format).
The federal government has 
The federal government should revise and enforce data standardization protocols both on collection and reporting
Many of the inconsistencies in county-level vaccination stemmed from a dynamically changing pandemic environment where responders needed to adapt rapidly. However, data reporting could have been more consistent through better federal guidance early on. Most stakeholders agree that the Federal OMB standard is outdated, leading to regional decision-makers being 
In the context of public health emergencies like COVID-19, state and local health departments need access to timely data to monitor disparities and adjust response strategies. Immunization Information Systems (IISs), currently controlled individually by all 64 jurisdictions, including US states, are intended to be a centralized data repository of immunization records for each jurisdiction and the source for sending records to the CDC’s COVID-19 Data Clearinghouse. Individual IISs have varying capacities to automate processes and handle large quantities of data. Data quality can also 
In the longer term, an overhaul of current data standards and sharing processes is needed to ensure consistent health equity analyses can be conducted in the broader healthcare ecosystem. First, the federal government should review and update the OMB 1997 Statistical Directive on collecting and presenting federal data on race and ethnicity to more accurately reflect the demographics of the US population and provide flexibility to state and local governments to capture information representing their communities. Disparities often exist within granular racial and ethnic groups like Middle Eastern or North African, and local jurisdictions should be encouraged to track these data. There should be standard language that details why racial data is gathered in a given clinical setting, so that people can choose whether to provide or withhold their racial data. This informed consent will create implicit limits for what the data can be used for and why. These need to be codified and rigorously adhered to, with regular oversight.
Secondly, the 
Lastly, public health and health care systems should assess the feasibility of incorporating the 
Bad data blocks credible progress towards health equity
Health data—indeed, public data of any sort—need to be both locally relevant and nationally comparable. Decision-makers and practitioners within each US county should be armed with health equity data that accurately represents their local situation. Likewise, they should also be able to freely collaborate and share experiences, which will only be enabled by the adoption of national standards on race and ethnicity data and reporting that are applied at the most granular level. We observe that improving trends in vaccination equity at the national level belie significant, and persistent, state- and county-level inequities. These results underscore the need for responses tailored to the local context as researchers investigate barriers faced by specific groups. These calls to action will only be more pressing as we transition to 
For good reason, minorities in the United States are less likely to provide data of any kind for public health reporting. When they take the risk of trusting public health officials with details that might make them vulnerable, it is critical that they and their communities are rewarded with the best possible data analysis, resulting in the most equitable and reasonable policies possible. The only way to restore trust in public health institutions is for these institutions to consistently act in a trustworthy manner. Getting the data gathering and analysis right is certainly not the only thing that public health agencies need to do to restore trust. But it is an important issue that must be “gotten right” before other health equity issues can be discussed clearly.
Newsletter
Stay ahead of policy, cost, and value—subscribe to AJMC for expert insights at the intersection of clinical care and health economics.















































