Code on a computer screen

Data Sources

We are happy to provide access for Booth researchers and affiliated faculty to datasets for relevant research projects. Please email the Center if you're interested in learning more about any of the datasets listed below.

LinkedIn Job Listings

Provided by Bright Data, this dataset is comprised of millions of LinkedIn job listings that were scraped from publicly accessible job listings on the LinkedIn website.

Dwellsy Rental Listings

This dataset is comprised of millions of rental listings from Dwellsy, a rental marketplace that boasts to have real, high-quality rental data. They achieve this with a marketplace platform that allows property managers and landlords to list properties for free and is integrated directly into many property management systems. 

IBM MarketScan® 

CAAI partners with the Center for Health and the Social Sciences to offer Booth researchers access to a variety of Marketscan health datasets. You can find more info on their website.

NightingaleOpenScience

Nightingale is preparing to launch its data research platform, offering access to health datasets from systems in the US and abroad, focusing primarily on image data, including x-rays, ECGs, etc.

If you are interested in a dataset you do not see listed, please reach out. We are happy to work with PhD students, post-docs, and Booth faculty to help them acquire interesting and relevant datasets for their research.

View through the glass wall of people sitting in a conference room

Our Research

At the Center, we pursue cutting-edge research that investigates how machine learning and AI are changing the way we do business, science, and social policy.

Our Research
About the Center for Applied Artificial Intelligence

About Us

Learn more about the leadership, faculty, and staff of the Center for Applied Artificial Intelligence.

About Us