Social Sector Data Sources
The Rustandy Center for Social Sector Innovation provides researchers with access to data and database administrative support. We also can help Chicago Booth researchers secure new datasets.
Below is a list of nonprofit sector data sources accessible for academic use. Due to license agreements, some data is restricted to the Booth and University of Chicago communities.
Bloomberg’s Environmental, Social & Governance dataset offers ESG metrics and ESG disclosure scores for more than 11,500 companies in over 80 countries. The dataset includes as-reported data and derived rations as well as sector- and country-specific data points. Historical data is available from 2006.
Booth faculty, MBA and PhD students, and alumni may access the data via Bloomberg stations in Harper Center and Gleacher Center computing labs. For questions about accessing the data stations, email Booth’s Information Technology staff.
In addition, the Rustandy Center can support faculty and PhD research by executing customized data pulls. For questions about the data and about customization, contact the Rustandy Center's Salma Nassar.
COVID-19 Cases by Zip Code
The COVID-19 Cases by Zip Code dataset lists the number of COVID-19 cases at the zip code–level. The zip codes where data were made publicly available via government-sponsored sources are included.
COVID-19 Stay At Home Orders
The COVID-19 Stay At Home Orders dataset contains a summary of shelter-in-place, stay-at-home, and other orders at the state, city, and county levels. Additionally, it contains information on school closures, public-health declarations, and other COVID-19-related actions at all levels.
COVID-19 Business Regulations
The COVID-19 Business Regulations dataset contains a summary of business regulations related to COVID-19 at the state and city levels. Data fields captured focus on the rollout of restrictions stipulating which types of businesses are allowed to operate.
Foundation Directory Online by Candid is an online database that profiles grant makers and their grant recipients. Profiles on grant makers include searchable 990s or 990-PFs, their total giving and assets, funding interests, officers and trustees, and the number and size of grants awarded.
The FDO also offers profiles on grant recipients, including their sector, geographic and population focus areas, and the number of grants and grant dollars received. Grant recipient profiles may also include 990 forms. Historical data is available dating back to 2003, and the database is updated on an ongoing basis.
For access information, contact the Rustandy Center's Salma Nassar.
The Entrepreneurship Database program at Emory University works in partnership with the Aspen Network of Development Entrepreneurs as part of the recently formed Global Accelerator Learning Initiative. Its aim is to collaborate with a growing number of accelerator programs around the world to collect and collate comparable longitudinal data that describes early-stage entrepreneurs and their ventures.
GuideStar by Candid is a large source of information on nonprofit organizations. GuideStar gathers and disseminates information about IRS-registered nonprofit organizations, and provides information about each nonprofit’s mission, finances, programs, transparency, governance, and more. GuideStar provides complimentary access to a mix of digitized and nondigitized data. Booth offers access to the following proprietary, digitized datasets:
- Tax Form 990s from 2003 through 2019
- Tax Form 990EZs from 2003 through 2017
- Tax Form 990PFs from 2014 through 2017
This data is available for academic research only by Chicago Booth and University of Chicago faculty, PhD students, and staff. For more information on the database, contact the Rustandy Center's Salma Nassar.
There is also data available from the Internal Revenue Service's Form 990 filed by a sample of 394 501(c)(3) organizations from 2004 to 2010 (unbalanced panel). The nonprofits included were sampled from the set of nonprofits rated by Charity Navigator. These were sampled to obtain variation within the sample with respect to the first rating publication date for each nonprofit. Access is available for Booth faculty and students. Request access by emailing the Fama-Miller Center.
Homebase provides a large, national employment dataset sourced from its free employee-scheduling and time-tracking tool used by 100,000+ local businesses. Participating businesses are primarily owned and operator-managed within the restaurant, food & beverage, retail, and services industries.
In 2020, the Rustandy Center at Chicago Booth, the Harvard Business School Impact Collaboratory, and the Wharton Social Impact Initiative of the University of Pennsylvania’s Wharton School launched the Impact Finance Research Consortium (IFRC), a joint effort to collect comprehensive data on impact investing funds. The IFRC’s key initiative is to build a database on the financial performance, due diligence practices, investor relations, legal governance, strategy, and management of impact investing funds across the world. The resulting Impact Finance Database (IFD) will catalyze groundbreaking research on this young but rapidly expanding field.
National Center for Charitable Statistics is a clearinghouse for data on the nonprofit sector in the United States. Working closely with the IRS, other government agencies, nonprofits, and the scholarly community, NCCS builds compatible national, state, and regional databases, and develops uniform standards for reporting on the activities of charitable organizations.
The Protest Response Orders dataset has a list of recorded orders and, in some cases, press releases on or after the death of George Floyd on May 25, 2020, through August 2020 related to the Black Lives Matter movement, protests, and racial equality and equity. Data are recorded for all 50 states and a limited set of counties and cities.
For more information, contact the Rustandy Center's Salma Nassar.
Loans and Deposits
Since 1997, Ratewatch + S&P Global Market Intelligence have surveyed more than 100,000 financial institution locations in the United States to compile advertised depository interest rates (weekly) and loan interest rates (monthly). The dataset covers a large cross section of all branches and depository institutions, sampling a variety of types and sizes. Institutions include banks, credit unions, savings and loan companies, brokers, trust companies, and others. It includes full-service and limited-service facilities and offices, be it brick-and-mortar offices, cyber offices, or home/phone banking.
Surveyed interest rates include a large number of standardized deposit and loan products such as checking and savings accounts; auto loans; certificates of deposits (CD) of different sizes and maturities; home-equity and mortgages loans of different sizes, characteristics, and maturities; and other commercial lending products. Within each category, details are available for multiple terms and/or dollar tiers.
This data is available for academic research only by Chicago Booth faculty, PhD students, and staff. For more information on the database, please contact Genevieve Hensley, program manager at the Chookaszian Accounting Research Center, or the Rustandy Center's Salma Nassar.
Service Charges and Fees
Since 2000, RateWatch + S&P Global Market Intelligence have compiled service charges and fees from nearly 100,000 financial institution locations across the United States on a weekly basis. The dataset covers a large cross section of all branches and depository institutions, sampling a variety of types and sizes, and as of 2013, the dataset covers more than 50 percent of institutions in the United States. Institutions include banks, credit unions, savings and loan companies, brokers, trust companies, and others. It also includes full-service and limited-service facilities and offices, be it brick-and-mortar offices, cyber offices, or home/phone banking.
The Service Charges and Fees dataset includes retail, cash management, and loan fee information nationwide. The dataset includes over 60 product types across both personal and commercial banking, including interest checking, online cash management, domestic wire transfers, safe deposit boxes, corporate and business interest checking, and others. There are also more than 190 product subcategories, including information on loan terms, bill pay, loan fee, and monthly charges. The dataset also includes details on reporting institutions, including, but not limited to, their certification number, FDIC unique number, Federal Reserve ID, routing number, contact information, MSA, and longitude and latitude for mapping purposes. Fees are reported in percentages and in dollars.
This is a historical dataset that includes data from 2001 through February 2020.
This data is available for academic research only by Chicago Booth faculty, PhD students, and staff. For more information on the database, contact the Rustandy Center's Salma Nassar.
The Stanford Latino Entrepreneurship Initiative (SLEI) is a research and education collaboration between Stanford University and the Latino Business Action Network. The research program explores and expands our knowledge of the Latino entrepreneurial segment of the U.S. economy through research, knowledge dissemination, and facilitated collaboration. SLEI conducts an annual national survey, with a sample of 15,000 business owners in 2021, to assess the current state of the U.S. Latino entrepreneurship and is curating a significant panel of Latino entrepreneurs to enable longitudinal research to understand trends over time.
Sustainalytics is an independent source for environmental social governance (ESG) and corporate governance research and ratings. This dataset contains ESG data for firms worldwide, with more than 4,000 firms included in the dataset since 2013.
Sustainalytics provides overall ESG scores as well as constituent scores at the indicator level, of which there are 70 core and industry-specific indicators. Each company in the dataset is classified into one of 42 industry peer groups. Sustainalytics factors in varying degrees of materiality and exposure to risks in their ratings and assesses firms on the basis of their preparedness, disclosure, and quantitative and qualitative performance.
Booth faculty and PhD students can access the data through WRDS. See Booth’s research computing site for additional access information.
The Trucost and S&P Global ESG Scores database consists of three parts:
1) ESG Scores data
ESG Scores data provides one aggregated score, three dimension-level scores and more than 30 criteria-level scores, produced using the S&P Global Corporate Sustainability Assessment process and other sources.
2) Trucost Environmental data
Trucost Environmental data measures companies’ environmental impact across key dimensions. This data can be used to assess the environmental costs, identify and manage environmental and climate risk, and conduct peer and portfolio analyses from a climate and environmental perspective.
3) Climate Analytics data
Climate Analytics data provides company-level data, which measures companies’ exposure to seven climate-change physical risks, analyzes companies’ exposure to carbon pricing risk under different possible climate change scenarios, and evaluates companies’ alignment with the Paris Agreement goal.
This data is available for academic research only by Chicago Booth and University of Chicago faculty, PhD students, and staff. Researchers can access the data through WRDS. See Booth’s research computing site or the University of Chicago Library Webpage for additional access information.
The University of Chicago Library offers access to a variety of sources for researching nonprofit organizations and charitable giving, including industry overviews, data and statistics, and directories of organizations.
Learn More about Our Data Sources
For more information on social sector data sources, contact the Rustandy Center's Salma Nassar. For more social sector data resources, go to Booth’s research computing site. (Please note: the research computing site is only available to those with access to Booth’s intranet.)