Social Sector Data Sources

Data SourcesUncovering new data and providing access to existing data sources makes inquiry into new fields possible and helps remove an obstacle for faculty and students researching the social sector.

The Rustandy Center for Social Sector Innovation provides Chicago Booth faculty with access to data and database administrative support. Below is a list of social sector data sources accessible for academic use. Due to license agreements, some data is restricted to the Booth and University of Chicago communities.

Bloomberg’s electronic library includes data on almost every publicly traded company in the world and biographies of more than one million people. New to Bloomberg Professional in 2014 is access to environmental, social, and governance research assessments, provided by third-party ESG research provider Sustainalytics.

Sustainalytics’s nearly 800 proprietary indicators provide a macrolevel assessment of how companies are managing their ESG capital. All Bloomberg users will have access to high-level company scores and percentile rankings across the ESG dimensions.

Booth faculty, MBA and PhD students, and alumni may access the data via Bloomberg stations in Harper Center and Gleacher Center computing labs. For questions about accessing the data stations, email Booth’s Information Technology staff.

View the Help Desk user guide for more details on the data »

In addition, the Rustandy Center can support faculty and PhD research by executing customized data pulls. For questions about the data and about customization, contact Salma Nassar.

The Entrepreneurship Database program at Emory University works in partnership with the Aspen Network of Development Entrepreneurs as part of the recently formed Global Accelerator Learning Initiative (GALI). Its aim is to collaborate with a growing number of accelerator programs around the world to collect and collate comparable longitudinal data that describes early-stage entrepreneurs and their ventures.

Access the Entrepreneurship Database »

Foundation Directory Online (FDO) by Candid is an online database that profiles grant makers and their grant recipients. Profiles on grant makers include searchable 990s or 990PFs, their total giving and assets, funding interests, officers and trustees, and the number and size of grants awarded.  

FDO also offers profiles on grant recipients, including their sector, geographic and population focus areas, and the number of grants and grant dollars received. Grant recipient profiles may also include 990 forms. Historical data is available dating back to 2003, and the database is updated on an ongoing basis.  

For access information, reach out to Salma Nassar at salma.nassar@chicagobooth.edu.

GuideStar by Candid is a large source of information on nonprofit organizations. GuideStar gathers and disseminates information about IRS-registered nonprofit organizations, and provides information about each nonprofit’s mission, finances, programs, transparency, governance, and more. GuideStar provides complimentary access to a mix of digitized and non-digitized data. Booth offers access to the following proprietary, digitized datasets:

  • Tax Form 990s from 2003-2017
  • Tax Form 990EZs from 2003-2017
  • Tax Form 990PFs from 2014-2017

This data is available for academic research only by Chicago Booth and University of Chicago faculty, PhD students, and staff. For more information on the database, please contact Salma Nassar, senior associate director of research initiatives at the Rustandy Center.

There is also data available from IRS 990s filed by a sample of 394 501(c)3 organizations from 2004 to 2010 (unbalanced panel). The nonprofits included were sampled from the set of nonprofits rated by Charity Navigator. These were sampled to obtain variation within the sample with respect to the first rating publication date for each nonprofit. Access is available for Booth faculty and students. Request access by emailing the Fama-Miller Center.

National Center for Charitable Statistics is a clearinghouse for data on the nonprofit sector in the United States. Working closely with the IRS, other government agencies, nonprofits, and the scholarly community, NCCS builds compatible national, state, and regional databases, and develops uniform standards for reporting on the activities of charitable organizations. Access NCCS »

Loans and Deposits

Since 1997, Ratewatch + S&P Global Market Intelligence have surveyed over 100,000 financial institution locations in the United States to compile advertised depository interest rates (weekly)  and loan interest rates (monthly). The dataset covers a large cross-section of all branches and depository institutions, sampling a variety of types and sizes. Institutions include banks, credit unions, savings and loan companies, brokers, trust companies, and others. It includes full service and limited service facilities and offices, be it brick-and-mortar offices, cyber offices, or home/phone banking. 

Surveyed interest rates include a large number of standardized deposit and loan products such as checking and saving accounts, auto loans, certificates of deposits (CD) of different sizes and maturities, home equity and mortgages loans of different sizes, characteristics, and maturities, and other commercial lending products. Within each category, details are available for multiple terms and/or dollar tiers.

This is a historical dataset that includes 2001 through the month prior to access. It is updated monthly on the Secure FTP server. 

This data is available for academic research only by Chicago Booth faculty, PhD students, and staff. For more information on the database, please contact Genevieve Hensley, Program Manager at the Accounting Research Center, or Salma Nassar, director of research initiatives at the Rustandy Center.

Service Charges and Fees 

Since 2000, RateWatch + S&P Global Market Intelligence have compiled service charges and fees from nearly 100,000 financial institution locations across the United States on a weekly basis. The dataset covers a large cross-section of all branches and depository institutions, sampling a variety of types and sizes, and as of 2013, the dataset covers over 50% of institutions in the U.S.. Institutions include banks, credit unions, savings and loan companies, brokers, trust companies, and others. It also includes full service and limited service facilities and offices, be it brick-and-mortar offices, cyber offices, or home/phone banking. 

The Service Charges and Fees dataset includes retail, cash management, and loan fee information nationwide. The dataset includes over 60 product types across both personal and commercial banking, including interest checking, online cash management, domestic wire transfers, safe deposit boxes, corporate and business interest checking, among others. There are also over 190 product subcategories, including information on loan terms, bill pay, loan fee, and monthly charges. The dataset also includes details on reporting institutions, including, but not limited to their certification number, FDIC unique number, Federal Reserve ID, routing number, contact information, MSA, and longitude and latitude for mapping purposes. Fees are reported in percentages and in dollars. 

This is a historical dataset that includes data from 2001 through February 2020. 

This data is available for academic research only by Chicago Booth faculty, PhD students, and staff. For more information on the database, please contact Salma Nassar, director of research initiatives at the Rustandy Center.

The Stanford Latino Entrepreneurship Initiative, in collaboration with the Latino Business Action Network (LBAN), created a comprehensive database of Latino-owned businesses in order to stimulate better policymaking, business partnerships, and knowledge development.

The database consists of more than 1.3 million Latino-owned companies in the United States. SLEI also collected data from a panel of 2,385 Latino entrepreneurs and plans to survey companies each year to create longitudinal data. Learn more about the SLEI database and potential collaboration opportunities »

Sustainalytics is an independent source for Environmental Social Governance (ESG) and Corporate Governance research and ratings. This dataset contains ESG data for firms worldwide, with over 4,000 firms included in the dataset since 2013.  

Sustainalytics provides overall ESG scores as well as constituent scores at the indicator level, of which there are 70 core and industry-specific indicators. Each company in the dataset is classified into one of 42 industry peer groups. Sustainalytics factors in varying degrees of materiality and exposure to risks in their ratings and assesses firms based on their preparedness, disclosure, quantitative, and qualitative performance.  

Booth faculty and PhD students can access the data, which covers mid-2009 through October 2019. Booth’s subscription updates ratings on an annual basis to account for any changes due to disclosures or relevant controversies. See Booth’s research computing site for additional access information.

The University of Chicago Library offers access to a variety of sources for researching nonprofit organizations and charitable giving, including industry overviews, data and statistics, and directories of organizations. Access the guide »

Information About Social Sector Data Sources

For more information on social sector data sources, email Salma Nassar. For more social sector data resources, go to Booth’s research computing site. (Please note: the research computing site is only available to those with access to Booth's intranet.)

Leverage Your Social Sector

Share the Rustandy Center’s mission with friends and connections committed to tackling social and environmental problems.

Share This Page