Data / Critique

The Dataset: Distributional Financial Accounts (DFA) [1] – Share of Aggregate Household Wealth by Education

Information Included in the Dataset

Starting with the general organization of our dataset, the data is grouped by rows of years and quarters. For example, the actual data stretches from Quarter 3 of 1989 to Quarter 3 of 2025. The columns with financial data all correspond to a specific year and quarter.

Moving on to the specific financial data present in each row, the columns organize the data to most notably include:

Net worth
Assets
Loans
Financial Assets
Non-financial assets
Real estate
Home mortgages
Deposits
Corporate equities and mutual fund shares
Household count

Related fields to these financial categories are included in the dataset columns as well; the critique details the most notable ones.

Each of these fields, other than household count, is expressed as a percentage of a category, being:

College
SomeCollege
HS
NoHS

in education levels.

Example

Information, Events, or Phenomena the Dataset Illuminates

A collage-style image featuring a large yellow lightbulb set on a background with white binary code. In the foreground are two smaller, orange-tinted historical photos of people working with early computer technology and experimenting with chemical elements. — Jones, Lucy. Illustration. *Vox*, 13 Dec. 2025

Our dataset illustrates how different financial outcomes in the United States are distributed across different levels of educational attainment from 1990 to 2024. By examining variables such as total assets, real estate, and other forms of equity, the data can reveal patterns of economic inequality in the United States and how education levels play a pivotal role in achieving financial security.

The dataset can also illuminate levels of social mobility and how those with a higher level of education have more access to wealth-building opportunities and overall more economic stability compared to those with a lower level of education.

In addition, the dataset can demonstrate long-term trends, demonstrating how an increase of education attainment in the U.S. from 1990 to 2024 is also correlated with an increase in financial success across the population.

What the Dataset Cannot Reveal

The most significant limitation of our dataset is that it oversimplifies education into a binary metric, completely overlooking the differences in quality. For example, DFA cannot reveal the disparity between a local public University and an Ivy League school.

Furthermore, the data treats wealth as a passive accumulation, completely neglecting the physical and mental effort, grueling work that one had to endure on their way to success. The dataset is also not able to distinguish whether someone’s wealth came from their own job or that it was handed to them by their parents.

How the Data was Generated

The DFA data were generated and collected by the Federal Reserve, the Financial Accounts of the U.S., and the Survey of Consumer Finances (SCF). The Financial Accounts provide quarterly totals for household assets and liabilities across the entire United States economy. The SCF is a detailed household survey conducted every three years that shows how wealth is distributed by characteristics like education, income, and age.

A conceptual image featuring a dark silhouette of the United States map overlaid on a background of binary code (zeros and ones). The map appears as a void within the digital data stream. — Kothe, Ben. Illustration. *The Atlantic*, 11 Dec. 2025

To create the DFA, the Fed takes into account differences between the two datasets, then uses SCF data to estimate wealth shares for different groups. The estimated shares are then forecasted to create seasonal estimates. The shares are then applied to the aggregate totals from the Financial Accounts, producing consistent quarterly distributional wealth data, which is then applied to the measurement of different social structures, like education.

What Information is Left Out & Dataset’s Ontology

The dataset presents all financial variables as percentage shares rather than absolute dollar amounts, which obscures the actual magnitude of wealth held by each education group. The data also lacks demographic breakdowns beyond education level, as variables such as age, income, race, gender, and geographic region are absent. This prevents researchers from investigating how education intersects with other socioeconomic factors in shaping wealth distribution.

A vintage-style illustration of a large hand carefully placing a single purple block containing binary numbers into a gap within a wall made of scrolling rows of purple digital binary numbers. — “The Mistake That Cost Me Years of Work.” Getty Images. *Inc.*, 28 May. 2020

Additionally, the four education categories, from high school to college, do not distinguish between bachelor’s and advanced degrees, nor the fields of study, limiting analysis of how different educational paths affect wealth outcomes. Also, there could be institutional factors, such as financial literacy, access to banking services, or employee benefits, that are not represented, which are part of their either active or passive income.

The spreadsheet’s ontology is constructed around the spreadsheet columns we directly see. Overall, financial measures are highlighted, but other factors, such as stable housing and access to public services, are left out. Additionally, the spreadsheet’s ontology fails to detail the uncertainty of the data presented on it.

The sources underlying the DFA dataset are the Z.1 Financial Accounts and the SCF, both produced by the Federal Reserve. The Z.1 accounts gather data from various administrative and institutional sources, including regulatory filings and reports, to measure aggregate levels and flows of financial assets and liabilities across sectors of the U.S. economy. These accounts are updated each quarter, providing comprehensive quarterly measures for the household sector, but do not include demographic detail.

In contrast, the SCF is a triennial household-level survey that directly collects information on assets, debts, income, and demographic characteristics such as education, offering detailed distributional insight but at a lower frequency. The survey is collected by NORC at the University of Chicago.

The DFA dataset was constructed and funded by the Federal Reserve Board, which is the central banking system of the United States. Because the Federal Reserve is a public institution, the dataset is publicly funded rather than sponsored by private companies.

The DFA project builds on existing Federal Reserve data products, such as the financial accounts and the Survey of Consumer Finances, which are also produced and funded by the Federal Reserve. The research behind the DFA is documented in Federal Reserve publications such as FEDS Notes and working papers. Overall, the dataset reflects a government-funded effort to improve knowledge and understanding of wealth inequality in the United States.

dfa-education-shares-detail Download

Information Included in the Dataset

Example

Information, Events, or Phenomena the Dataset Illuminates

What the Dataset Cannot Reveal

How the Data was Generated

What Information is Left Out & Dataset’s Ontology

What are the Original Sources?+

What Organization Funded the Creation of the Dataset?+

Explore the Education Shares Dataset?+