Start at General Resources. These sources are the ones that most often hold the data that social science students and researchers at Tufts are looking for. Then look for more specific sources by discipline or subject.
For help at any point in the process, contact Social Science Data Librarian Ari Gofman.
For support analyzing data, doing GIS, learning software, or advanced data visualization, contact the Data Lab.
The first question to ask is: do you need microdata or aggregate data? In other words, do you need data about individuals (microdata) or combined/aggregated data, usually by geographic region (for example, income ranges in a particular town).
To narrow down the data that would answer your research question, consider the following steps:
1. Where do you want to find data from? A particular geography? Don't consider only location though: what unit size do you need: in other words, do you want data for an entire country as a country, or broken down by state, county, etc? This is often the biggest limiter of data.
2. When do you want data from? Do you want a snapshot (ex. population in 1999) or do you want longitudinal data that you can compare over time? Make sure the same question is being asked of the same population.
3. Who do you want data about? The entire population? A particular age, gender, socioeconomic class, race, or other demographic group? Are any of the groups you want to research protected or vulnerable? If so, it will be harder to find data, and you will need to be particularly cognizant of your responsibility as a researcher to do no harm.
4. What variables could answer the research question? This is intentionally after the three previous questions, because often
there are multiple variables that could address your question. If you want to know income, for example, you could look for self-reported income, tax filings, or a proxy like car model owned (which correlates closely with wealth). To find out what the variable actually means, check the codebook, which will describe how that question was gathered and what it actually represents. Also ask what type of data: do you need microdata or aggregate data?
5. Why would someone gather this data? What biases does that introduce? Is Coca Cola funding this research on sugar? If you're having trouble finding data, this can also be used to identify stakeholders who may gather data, which you can then intentionally search for.
6. How is this data gathered, and how does that affect your analysis? If people self-report height and wealth, for example, research demonstrates they are consistently over-estimating.
These data sources often hold the data that social science students and researchers at Tufts are looking for. Start here!