title | author | thanks | date |
---|---|---|---|
Pre-Analysis Plan: 3. Spillovers from Sibling Choices | Adam Altmejd | Stockholm School of Economics, [email protected] | 2017-04-27 |
The purpose of this project is to evaluate how peer experience influences education choices. We will look at how the education of the older sibling affects the younger's behavior. Using the university application data described in the introduction, we aim to answer two questions:
- Imitation: How is a younger sibling's choice of higher education affected by the education of the older sibling?
- Inspiration: How are a younger sibling's grades affected by the older's admission success?
The question of behavioral spillovers to siblings is interesting for many reasons. For one thing, few empirical studies of peer influence on decision making exist, and this study will provide much-needed evidence on how we rely on our siblings. Furthermore, the availability of information about higher education varies widely across socio-economic groups. In a low SES family, an older sibling who is admitted to college could inspire the other children to apply. Evidence of such behavior would be useful for understanding the mechanisms of intergenerational mobility within families. Last, because of the nature of this data, we will be able to get quantitative and heterogeneous measures of these effects. For example, one could imagine that attractive fields influence younger siblings to study the same topic, while less attractive fields have effects in the opposite direction. We will measure not only the direction but also the sizes of these different responses, quantities that could be of importance for education policy.
This document is a pre-analysis plan (PAP), registered in a public repository before the author has been given access to the data set needed for analysis. Without the possibility to explore the data set, it is likely that we will run into many unexpected obstacles. If we need to deviate from this plan for any reason because of such circumstances, the deviation will be clearly stated.
This project is closely related to @Joensen2017_spillovers_educational. They study how younger siblings are affected by their older sibling's choice of high school education when the older sibling experiences an increase in the availability of high school maths. They find that younger siblings are 2-3 percentage points more likely to choose a math/science track in high school if the older sibling was given a quasi-randomly introduced expanded choice margin for science related fields. Another related paper is @Dustan2018_family_networks. He studies sibling spillovers in high school choice in Mexico in a regression discontinuity setup, also exploring the mechanisms potentially driving these effects.
There is a small literature about the effect of role models and social transmission mechanisms. @Kosse2016_formation_prosociality randomly expose both low and high SES children to a prosocial mentor, and show that the observed gap in prosociality between groups of different SES is closed, even 2 years after the treatment. Exposure to the mentor also increases the probability of applying for gymnasium, the German academic track high school. Moreover, @Dahl2014_peer_effects find strong peer effects in parental leave uptake between coworkers and brothers, giving credence to the existence of a social transmission mechanism between siblings. Of course, there is also a large body of research on intergenerational mobility (see @Black2011_recent_developments for an overview and e.g. @Fagereng2015_why_wealthy for some causal evidence), where correlations in both educational attainment and earnings between generations are prevalent.
The existence of an information channel with an impact on student choices has been found in a number of studies. In an institutional environment similar to the Swedish one, @PekkalaKerr2015_postsecondary_education use a large-scale field experiment to show that informing students about labor market outcomes of different fields has a large impact on the choices of the least informed students. Furthermore, @Fricke2016_exposure_academic use random assignment of the subject matter of students' research paper to show that exposure to Economics increases the likelihood of the field being chosen as a major. @Hastings2016_uninformed_college perform a large representative survey of Chilean applicants and find that they systematically overestimate earnings of past graduates. Their respondents list prestige and accreditation as the primary reasons for their degree choice. Between 35% and 47% of applicants do not even know what they will earn after graduation. There are many other papers that study the determinants of degree choice, all stressing the importance of non-pecuniary factors. @ScottClayton2012_information_constraints provides a review of this literature, with many examples of how lack of information affects choice.
There is also evidence that the lack of information about higher education is unequal, and affects low SES applicants more. @Hastings2015_effects_earnings use a randomly administered earnings disclosure policy that induces low SES individuals to apply for fields with higher returns. Their results fit well with the hypothesis that I present below; that earnings disclosure mostly affects low SES individuals could be because they are worse informed about the actual outcomes. @Bowen2009_crossing_finish find that a large proportion of highly qualified but poor applicants did not attend the most selective institution that they were qualified for, even though such institutions often offer superior financial aid.
For reasons outlined below we believe that the interesting behavioral responses will be found in analysis that allows for heterogeneity in behavior. Nonetheless, we will also study the aggregated response to use as a baseline model.
When looking at the aggregate imitation response of younger siblings, we believe that the measured imitation behavior of the younger sibling will be indistinguishable from zero or slightly negative. This is because siblings will imitate when they are inspired, but not when they are disappointed, and will also occasionally want to go their own way and thus avoid the choice of the older sibling.
Non-zero effects, however, will be identified when we look at how behavior varies with how positive the information transmitted from the older to the younger sibling actually is. Having an older sibling that is in medical school gives the individual a lot of information about what such studies entail. More precise estimates of career prospects, as well as data on the difficulty of the program and what is actually taught in class, are all factors that the younger sibling initially probably only had a vague idea about. The behavioral response of the younger sibling to this information will thus depend on his or her prior knowledge about the subject. The further the information is from these prior beliefs, the stronger the reaction. The same is true when the sibling is more uncertain and has flatter beliefs. We thus hypothesize that when looking at the distribution of sibling responses in research question (1), imitation effects will be stronger when the information transmitted is more positive, as these are more likely to cause a stronger positive news shock. When sorting by field quality, the worst fields will cause younger siblings to apply less frequently, and the best will cause the opposite reaction. Further, when comparing siblings across socio-economic status, those with lower SES (flatter priors) will have stronger responses.
With regards to the second research question, we hypothesize that younger siblings of those students that are successfully admitted to their preferred choice will study harder and get better grades. The size of the grade-improvement will depend on the difference in grade requirements between the field that the older sibling was successfully admitted to and their next-best choice. When the difference is large, the younger sibling will be more encouraged to study hard.
These hypotheses will be tested using the statistical tests described below. P-values below
For an individual
We are interested in a model where the younger sibling's behavior is causally affected by an information shock about the quality of choice
For the first research question, we can describe this causal relationship with
$$ \text{Imitation}_{s(i)}(D_i) = \alpha + \beta I_{s(i)} + \varepsilon_{s(i)}. $$
The
For the second question, we have
$$ \text{Inspiration}_{s(i)} = \alpha + \beta I_{s(i)} + \varepsilon_{s(i)}. $$
But here, the information transmission function might be very different. On one hand, inspiration might be correlated with imitation since if the younger sibling has lower grades than the older, an increase in preference for the older sibling's choice should also make the younger sibling work harder to secure an offer. On the other hand, an effect on the younger sibling's score could come from the older's success in itself. That the older sibling is admitted to the program of their dreams might inspire the younger to work harder even if they focus that effort on a completely different field of study.
We will now explain how we plan to identify and test these causal effects, first presenting the identification strategy and then defining all variables that will be used.
It is well known that the correlation between education choices and different outcomes is highly endogenous (going back to @Mincer1958_investment_human). Students sort by ability, choosing different levels of education depending on how skilled they are to start with. Only with random assignment can we get proper causal estimates. As explained in the introduction, this study will exploit two sources of exogenous variation: an admission lottery used to break ties and the discontinuities in admissions that can be found around each grade cutoff. Both these sources of variation affect
We start with a reduced form model. Let
It yields the aggregate intention-to-treat estimate of the effect of the information shock. With
To test the second part of our hypotheses, that responses will vary heterogeneously with the positivity of the news shock we use an interaction effect between choice and shock quality,
and two first stage equations,
and
where
Between a preferred and a less preferred choice many things can vary. The applicant could be randomized between different schools, which sometimes lie in different cities, it could be a randomization between different programs at the same school, etc. To better understand what drives siblings to imitate we will look at these samples separately and evaluate the different magnitudes. Is it the case that siblings mainly follow to the same school, but are not as interested in studying the same field at different schools? Could this be because siblings prefer to live in the same city, creating easier access to e.g. housing? What if said city is the home town of the family?
We will do this by including specifications where we only look at margins where the field, institution, or city varies, and redefine the dependent variable to be
Moreover, we will test the aggregate models in different subgroups. We will divide the sample into three socio-economic status groups by the education level of the siblings' parents. Our hypothesis is that any effect will be attenuated by higher socio-economic status, as kids with highly educated parents have much better access to information about university education. We will also look at the effects separately by gender.
@Joensen2017_spillovers_educational find an effect of sibling imitation but only for sibling pairs where the age difference is small. We will study how the age difference interacts with our treatment using an interaction model similar to the one presented above, and also just by dividing the sample into two groups,
They also study heterogeneity through birth-order and gender interaction effects. We will test their claims by limiting our sample to the interaction between first- and second-born siblings, and also look at genders separately. @Joensen2017_spillovers_educational argue that competition is an important factor driving imitation, and find that a large part of the imitation comes from brother pairs. A younger brother is 70% more likely to choose a STEM field if his older brother did so. However, competition could also have the opposite effect, where the younger sibling does not want to risk losing.
There is a different mechanism potentially at play when younger siblings imitate the choices of the older. Having an older sibling at a specific school or in a certain city could decrease the transaction costs of moving there. The younger sibling could perhaps move in with the older sibling. There are a number of ways to study whether this material transmission mechanism is driving the results. We will check if any results remain after removing those students who only apply to the exact same field-institution combination as their older sibling. We will also estimate institution-specific imitation effects and compare their size to the main results.
Last, to increase the sample size we will also test the effect including the siblings of those that lose lotteries and who are thus assigned to their less preferred option. How much does losing the lottery increase the likelihood that the sibling applies there instead?
The causal model of information transmission above makes it clear that it is not only the admission in itself that has an impact. Information is transmitted between siblings all throughout the older's studies (
Let
and instrument for information transmission with the first stage
This specification produces LATE estimates for the complier group. However, as we argued above, the effect is likely heterogeneous and the estimates from this model will actually be a weighted average of many different treatment dimensions. Our hypothesis for this aggregate effect is that
Supplementary to the main specifications above, we will estimate a number of alternative models. The purpose is to (1) distinguish the information transmission mechanisms from other possible causes of imitation and inspiration, (2) analyze how the effect varies across different interesting sub groups, and (3) test the robustness of the findings.
A problem with the above tests is that the data actually consist of multiple different experiments, one for each admission margin, and it is not completely clear what the
To get around these issues we will estimate the treatment effect separately for different choices, and test our hypotheses also on these disaggregated models,
with one first stage for each preferred choice
Each instrument
Testing our hypotheses on this disaggregated model is somewhat complicated. An F-test should be significant, as we believe the full model does have explanatory power. We can also rank all choices by quality and should then see a more positive effect for better alternatives. However, since the supplementary analysis should be seen mostly as exploratory, we will refine this analysis after we have received the data.
If selection varies systematically not only by what choice the applicant is admitted to but also by what their less preferred choice is (as @Kirkeboen2016_field_study argues it does when looking at financial returns) we should estimate the second stage and the set of first stage equations separately for each such next best choice
To check robustness further we will also test different definitions of the outcome variables and endogenous variables, and vary the bandwidth around the admission cutoff for the regression discontinuity approach. But since we have yet to explore the data, it is not clear exactly how to specify these supplementary tests. There will probably be many aspects of the data set that could create confounders, and we will want to study if these affect our results. Any such supplementary analysis will be important, but should be seen as exploratory rather than as evidence for or against our hypotheses.
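The bandwidth check mentioned above could look roughly like the following sketch. It is only an illustration under stated assumptions: the plan does not specify the regression discontinuity estimator, so this uses a plain difference in mean outcomes on either side of the cutoff, and it assumes that scores at or above the cutoff count as admitted. The function name `rd_jump_by_bandwidth` and the input layout are hypothetical.

```python
from statistics import mean
from typing import Dict, List, Tuple


def rd_jump_by_bandwidth(
    obs: List[Tuple[float, float]],
    cutoff: float,
    bandwidths: List[float],
) -> Dict[float, float]:
    """Difference in mean outcome just above vs. just below the cutoff.

    obs: (older sibling's application score, younger sibling's outcome).
    Assumption: a score at or above the cutoff means predicted admission.
    Bandwidths with no observations on one side are skipped.
    """
    jumps: Dict[float, float] = {}
    for h in bandwidths:
        above = [y for s, y in obs if cutoff <= s <= cutoff + h]
        below = [y for s, y in obs if cutoff - h <= s < cutoff]
        if above and below:
            jumps[h] = mean(above) - mean(below)
    return jumps
```

Comparing the estimated jump across several bandwidths gives a first impression of how sensitive a result is to the choice of window; a local linear fit would be a natural refinement once the data are available.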
We have yet to define exactly many of the main variables of the models presented above. How to measure outcomes is far from obvious. We want variables that do not induce unnecessary noise, but that do respond to changes in the instrument.
Since we have not yet explored the data, many of the exact definitions of variables below have been made based on very little information. To avoid data mining, the current definitions will be used in the main specification, but we will also want to study how changing them affects results. These variations should be seen as robustness checks and considered exploratory. If it happens that a different definition seems to perform better in the first data set, we will hopefully be able to test this on the supplementary data that will be shared with us at a later stage. But then we will first register a new version of this pre-analysis plan where it has been clearly specified.
The main identifying variable (instrument in 2SLS) is the result of the admission lottery, but we will also study the effects using the regression discontinuity approach, with the variable being predicted admission based on what side of the grade cutoff the applicant's score was.
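As a minimal sketch of how the lottery result works as an instrument: with a single binary instrument and no covariates, the 2SLS estimate equals the Wald ratio, the reduced-form difference in outcomes divided by the first-stage difference in treatment. The function name `wald_estimate` and the choice of enrollment as the endogenous variable are assumptions made for illustration, not definitions from this plan.

```python
from statistics import mean
from typing import List


def wald_estimate(z: List[int], d: List[int], y: List[float]) -> float:
    """IV (Wald) estimate with one binary instrument and no controls.

    z: 1 if the older sibling won the admission lottery, else 0.
    d: 1 if the older sibling took up the preferred choice, else 0
       (hypothetical endogenous variable).
    y: outcome of the younger sibling.
    """
    y1 = mean(yi for zi, yi in zip(z, y) if zi == 1)
    y0 = mean(yi for zi, yi in zip(z, y) if zi == 0)
    d1 = mean(di for zi, di in zip(z, d) if zi == 1)
    d0 = mean(di for zi, di in zip(z, d) if zi == 0)
    first_stage = d1 - d0
    if first_stage == 0:
        raise ValueError("instrument has no first stage")
    # Reduced form divided by first stage.
    return (y1 - y0) / first_stage
```

The main specification with controls would need a full 2SLS routine; the ratio form is only meant to make the roles of the reduced form and the first stage explicit.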
When estimating the IV equations, the endogenous variable also needs to be clearly defined. A major part of the information about a field of study is transmitted from older to younger sibling in the early stages of the education and is captured by
As the older sibling continues through their studies, they do learn more and can potentially supply new valuable information (
Optimally, we want to measure not only the information transmission, but rather a news effect that takes the priors of the younger sibling into account, to distinguish between information that is new and surprising, and that which is not. The interaction with
For the first research question there are two potential ways to specify the dependent variable: either a binary measure of whether or not the younger sibling includes the older sibling's choice in their application, or a discrete measure of the actual rank of the older sibling's choice. The benefit of the second is that it captures an interesting intensive margin of preferences, but its problem is that the length of the total application list is not fixed.
- The primary, binary, measure is $Y^1_{s(i)} = \mathbb{I}(R_{s(i)}(j) = 1)$, an indicator function that is equal to $1$ if the younger sibling ranks the older sibling's preferred choice $j$ as their most preferred choice, and $0$ otherwise.
- The discrete, supplementary, measure is $Y^1_{s(i)} = \frac{R_{s(i)}(j)}{\max(R_{s(i)})}$, the ranking of $j$ in the younger sibling's application, divided by their total number of applications (to get around the problem of variable ranking length).
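The two imitation measures can be sketched in a few lines. The function name `imitation_outcomes` and the representation of an application as an ordered list of choice labels are hypothetical. The definitions above do not say how to code a younger sibling who does not rank choice $j$ at all; the sketch assumes the binary measure is $0$ and the normalized rank is missing in that case.

```python
from typing import List, Optional, Tuple


def imitation_outcomes(
    younger_ranking: List[str], older_choice: str
) -> Tuple[int, Optional[float]]:
    """Return (binary measure, normalized rank) for one sibling pair.

    younger_ranking: the younger sibling's application, most preferred first.
    older_choice: the older sibling's preferred choice j.
    """
    if older_choice not in younger_ranking:
        # Assumption: an unranked choice gives binary 0 and a missing rank.
        return 0, None
    rank = younger_ranking.index(older_choice) + 1  # R(j), 1 = most preferred
    binary = 1 if rank == 1 else 0                  # I(R(j) = 1)
    normalized = rank / len(younger_ranking)        # R(j) / max(R)
    return binary, normalized
```

For a younger sibling who lists three choices and ranks the older sibling's choice second, this gives a binary measure of 0 and a normalized rank of 2/3.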
We define
For the second research question, the dependent variable should be a measure of performance or grit that captures how the younger sibling is motivated to study harder by the fact that the older sibling is admitted. For our main test we will use a measure of the change in the younger sibling's GPA. The score will be standardized over the whole applicant pool, separately for each type of admission group score.
- The main specification will use the difference in standardized GPA between elementary school and high school for those siblings that did not yet have a high school degree when the older sibling applied, $Y^2_{s(i)} = \Delta\text{GPA}_{s(i)}$. However, this requires younger siblings to be in the early years of high school at the time of applications since their elementary school grades need to have been set beforehand. This limits the sample to some extent.
- To increase sample size we will also use an exploratory version which measures the change across individuals, and where the outcome variable is $Y^2_{s(i)}=\text{GPA}_{s(i)}$.
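A minimal sketch of the standardization and the GPA difference follows. The helper names are hypothetical, and two details are assumptions rather than part of the plan: the population standard deviation is used, and a group without any score variation is assigned a standardized score of zero.

```python
from collections import defaultdict
from statistics import mean, pstdev
from typing import Dict, List, Tuple


def standardize_by_group(scores: List[Tuple[str, float]]) -> List[float]:
    """Z-score each (admission group, score) pair within its own group."""
    by_group: Dict[str, List[float]] = defaultdict(list)
    for group, score in scores:
        by_group[group].append(score)
    stats = {g: (mean(v), pstdev(v)) for g, v in by_group.items()}
    standardized = []
    for group, score in scores:
        mu, sd = stats[group]
        # Assumption: no variation in a group means z = 0 (avoids dividing by 0).
        standardized.append(0.0 if sd == 0 else (score - mu) / sd)
    return standardized


def delta_gpa(z_elementary: float, z_high_school: float) -> float:
    """Change in standardized GPA from elementary school to high school."""
    return z_high_school - z_elementary
```

Standardizing within each admission group keeps scores on different scales comparable before the difference is taken.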
In the interaction specification for this research question we define
We will use the same control variables for both research questions and include them in the main specification to increase precision. As a robustness check we will estimate the model without controls as well. The variables are:
- Gender of both siblings and a gender interaction effect
- Age at application and age difference between siblings
- Cohort (year fixed effects)
- Number of siblings
- Foreign background (binary, according to SCB's definition)
- Application score of the older sibling
- Preferred choice (or field/institution) fixed effects, to control for differences in preferences across sibling pairs
- When studying imitation: the grades of the younger sibling that were set at the time when the older sibling was randomly admitted
- When not specifically studying subgroups by parental education:
    - fixed effects for parents' completed level of education, separately for each parent and each level of education (primary, secondary, tertiary)
    - parental income: mean of household disposable income (`DispInk`, by consumption unit) when applicant is between ages 13 and 16
    - dummy variables for whether either parent studied the same field as the one the older sibling is applying to
When constructing the data set we will have to make a number of decisions on what data to keep and exactly how to measure each feature. Before matching the SCB data to the application data we will construct an application data set that contains one observation per individual, focusing on the relevant admission margins where randomization has occurred (and thus only include a preferred choice
- Keep only applications to degree programs and drop applications to free-standing courses.
- Remove invalid applications. This could be when the student ends up not being eligible or when application data is missing for some reason.
- Keep only the first application period for each individual to a degree program where they either (a) participate in a lottery, or (b) have an application score close enough to a cutoff. Set the choice over which the randomization was performed to choice $j$ (the preferred choice).
- Identify the correct treatment margin, i.e. what the applicant would be admitted to if the lottery failed, and set this to the next-best choice $k$.
- When there are multiple randomizations, keep the margin that includes a successful admission.
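The filtering steps above can be sketched as follows. Every field name (`person`, `period`, `is_degree_program`, `valid`, `in_lottery`, `score`, `cutoff`, `admitted`, `rank`) is hypothetical, since the layout of the application data is not yet known, and the `bandwidth` that defines "close enough to a cutoff" is left as a parameter. Identifying the next-best choice $k$ is omitted because it depends on admission rules not described here.

```python
from typing import Dict, List


def select_margins(applications: List[dict], bandwidth: float) -> Dict[str, dict]:
    """Pick one randomization margin (preferred choice j) per applicant."""
    candidates: Dict[str, List[dict]] = {}
    for app in applications:
        # Drop free-standing courses and invalid applications.
        if not app["is_degree_program"] or not app["valid"]:
            continue
        near_cutoff = abs(app["score"] - app["cutoff"]) <= bandwidth
        if app["in_lottery"] or near_cutoff:
            candidates.setdefault(app["person"], []).append(app)

    selected: Dict[str, dict] = {}
    for person, apps in candidates.items():
        # Keep only the first application period with a relevant margin.
        first_period = min(a["period"] for a in apps)
        in_first = [a for a in apps if a["period"] == first_period]
        # Prefer a margin with a successful admission, then the
        # higher-ranked choice (assumption for breaking remaining ties).
        in_first.sort(key=lambda a: (not a["admitted"], a["rank"]))
        selected[person] = in_first[0]
    return selected
```

Applicants without any lottery or near-cutoff application do not appear in the result, which matches the restriction to margins where randomization has occurred.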
As we discussed above we will have to pool applications into aggregated fields or institutions when looking at the effect heterogeneously, and collapse choices into these pooled variables. For example, if a student applied to medical school in two different cities as their preferred choice, then to three engineering schools, and last to a business school; we collapse their choice into (
Sometimes there will be multiple relevant randomization margins for one individual. When using the admission lottery we will then use the first lottery that the applicant wins. I.e. if the applicant is in a lottery for field
This will yield a data set of applicants that have been randomly admitted to field
After joining the data we will also drop all those cases where the younger sibling already has some university education when the older is treated. We will also drop any observations where the dependent variable is missing. For example, if a younger sibling is not in high school when the older is treated, we cannot calculate