-
Notifications
You must be signed in to change notification settings - Fork 4
/
Copy pathRubin_all.Rmd
630 lines (319 loc) · 136 KB
/
Rubin_all.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
---
title: "Rubin"
author: "边蓓蕾"
date: "2017/3/2"
output: html_document
---
Abstract. Donald Bruce Rubin is John L. Loeb Professor of Statistics at Harvard University. He has made fundamental contributions to statistical methods for missing data, causal inference, survey sampling, Bayesian in- ference, computing and applications to a wide range of disciplines, including psychology, education, policy, law, economics, epidemiology, public health and other social and biomedical sciences.
摘要:Donald Bruce Rubin是哈佛大学的约翰·洛布统计学教授。他在缺失数据,因果推断,抽样调查,贝叶斯推断等统计学方法上做出过重要贡献,其成果惠及心理学,教育,政策,法律,经济,流行病,公共卫生以及其他社会及生物医学领域的计算及应用。
Don was born in Washington, D.C. on December 22, 1943, to Harriet and Allan Rubin. One year later, his family moved to Evanston, Illinois, where he grew up. He developed a keen interest in physics and math- ematics in high school. In 1961, he went to college at Princeton University, intending to major in physics, but graduated in psychology in 1965. He began gradu- ate school in psychology at Harvard, then switched to Computer Science (MS, 1966) and eventually earned a Ph.D. in Statistics under the direction of Bill Cochran in 1970. After graduating from Harvard, he taught for a year in Harvard’s Department of Statistics, and then in 1971 he began working at Educational Testing Ser- vice (ETS) and served as a visiting faculty member at Princeton’s new Statistics Department. He held several visiting academic appointments in the next decade at Harvard, UC Berkeley, University of Texas at Austin and the University of Wisconsin at Madison. He was a full professor at the University of Chicago in 1981– 1983, and in 1984 moved back to the Harvard Statistics Department, where he remains until now, and where he served as chair from 1985 to 1994 and from 2000 to 2004.
1943年12月22日,Don出生于华盛顿。一年后,全家迁居伊利诺伊州埃文斯顿市,他在那里长大。高中时他对物理和数学产生了浓厚的兴趣。1961年,Don就读普林斯顿大学,打算主修物理,却在1965年毕业于心理系。然后Don进入哈佛大学心理学研究生院,1966年转读计算机系硕士,最后师从Bill Cochran,于1970年获得统计学博士学位。自哈佛毕业后,他在哈佛大学统计系教过一年书,1971年开始在ETS工作并成为普林斯顿大学新统计系的访问学者。十年中,他在哈佛大学、加州大学伯克利分校、得克萨斯大学奥斯汀分校、威斯康星麦迪逊分校主持过多个学术会议。在1981-1983年间,他是芝加哥大学的全职教授,于1984年重返哈佛大学统计系工作至今,并分别在1985-1994年,2000-2004年担任主席。
Don has advised or coadvised over 50 Ph.D. stu- dents, written or edited 12 books, and published nearly 400 articles. According to Google Scholar, by May 2014, Rubin’s academic work has 150,000 citations, 16,000 in 2013 alone, placing him at the top with the most cited scholars in the world.
Don指导和共同指导一共超过50名博士生,撰写和编辑过12本著作,发表了近400篇文章。根据谷歌学术截止到2014年5月的统计,Rubin的学术成果已有15万次引用,单单2013年就有1万6千次引用,在全球学者中名列前茅。
For his many contributions, Don has been hon- ored by election to Membership in the US National Academy of Sciences, the American Academy of Arts and Sciences, the British Academy, and Fellowship in the American Statistical Association, Institute of Mathematical Statistics, International Statistical Insti- tute, Guggenheim Foundation, Humboldt Foundation and Woodrow Wilson Society. He has also received the Samuel S. Wilks Medal from the American Sta- tistical Association, the Parzen Prize for Statistical In- novation, the Fisher Lectureship and the George W. Snedecor Award of the Committee of Presidents of Sta- tistical Societies. He was named Statistician of the Year by the American Statistical Association’s Boston and Chicago Chapters. In addition, he has received hon- orary degrees from Bamberg University, Germany and the University of Ljubljana, Slovenia.
由于他的诸多贡献,Don被推选为美国国家科学院成员,美国艺术与科学学院成员,英国国家学术院成员,美国统计学会理事,国际数理统计协会理事,国际统计学会理事,古根海姆基金会理事,洪堡基金会理事,伍德罗·威尔逊基金会理事。他获得了美国统计学会的塞缪尔·威尔克斯奖、针对统计创新的帕仁奖,以及统计学会会长委员会的费希尔讲席奖和斯尼德克奖。他被美国统计学会的波士顿和芝加哥分会誉为年度统计学家。此外,他还获得了德国的班贝格大学以及斯洛文尼亚的卢布尔雅那大学的荣誉学位。
Besides being a statistician, he is a music lover, au- diophile and fan of classic sports cars.
他不但是一个统计学家,还是一个音乐爱好者,古典乐,体育,汽车发烧友。
This interview was initiated on August 7, 2013, dur- ing the Joint Statistical Meetings 2013 in Montreal, in anticipation of Rubin’s 70th birthday, and completed at various times over the following months.
本访谈自2013年8月7日JSM大会期间,Rubin70岁生日之时开始,在接下来的几个月中逐步完成的。
# 开始
Fan: Let’s begin with your childhood. I understand you grew up in a family of lawyers, which must have heavily influenced you intellectually. Can you talk a little about your family?
Fan:让我们从你的小时候说起吧。我知道你生于一个律师家庭,这肯定对你有着深远的影响。能不能介绍一下你的家庭?
Don: Yes. My father was the youngest of four broth- ers, all of whom were lawyers, and we used to have stimulating arguments about all sorts of topics. Prob- ably the most argumentative uncle was Sy (Seymour Rubin, senior partner at Arnold, Fortas and Porter, diplomat, and professor of law at American Univer- sity), from D.C., who had framed personal letters of thanks for service from all the presidents starting with Harry Truman and going through Jerry Ford, as well as from some contenders, such as Adlai Stevenson, and various Supreme Court Justices. I found this impres- sive but daunting. The relevance of this is that it clearly created in me a deep respect for the principles of our le- gal system, to which I find statistics highly relevant— this has obviously influenced my own application of statistics to law, for example, concerning issues as di- verse as the death penalty, affirmative action and the tobacco litigation.
Don:是的。我父亲是他们4兄弟中最年轻的,他们4兄弟都是律师,我们过去在所有的话题上都有激烈的辩论。可能最好辩的是Sy[Seymour Rubin, Arnold, Fortas和Porter的高级合伙人,外交官,美国大学的法学教授]了,他住在华盛顿,他从总统那里获得了许多感谢信,同样也从一些竞争对手那里获得了很多感谢信,如Adlai Stevenson以及不同的最高法院法官。这让我印象深刻,但也让我感到害怕,因为这让我对我们的法律体系所遵守的原则印象深刻,在这里面,我发现统计学与它高度相关,这明显触动了我,并因此做了统计学在法律上的应用,例如考虑那些包括死刑,平权法案和烟草诉讼的问题。
Fabri: We will surely get back to these issues later, but was there anyone else who influenced your interest in statistics?
Fabri: 等下我们再回到这些问题,但还有其他人影响了你对统计学的兴趣吗?
Don: Probably the most influential was Mel, my mother’s brother, a dentist (then a bachelor). He loved to gamble small amounts, either in the bleachers at Wrigley Field, betting on the outcome of the next pitch, while watching the Cubs lose, or at Arlington Race track, where I was taught at a young age how to read the Racing Form and estimate the “true” odds from the various displayed betting pools, while losing two dol- lar bets. Wednesday and Saturday afternoons, during the warm months when I was a preteen, were times to learn statistics—even if at various bookie joints that were sometimes raided. As I recall, I was a decent stu- dent of his, but still lost small amounts.
Don:可能影响最深的是我舅舅Mel,他是一名牙医(还是一名学士)。他喜欢小赌,(比如)在芝加哥箭牌球场的露天看台看到比赛输了就会开始赌一下场比赛的结果,或者在阿灵顿赛道,在那里,他教导了我如何阅读赛马新闻,根据不同的赌注估计胜算,但那时候我还很小,还输了两美元。当我还是个青春期少年的时候,我会在天气温暖的周三和周六的下午学习统计,即使不同的赌注组合有时候会输光。我记得那时我是他相当优秀的学生,但我仍然输了一小部分。
There were two other important influences on my statistical interests from the late 1950s and early 1960s. First, there was an old friend of my father’s from their government days together, a Professor Emeritus of Economics at UC Berkeley, George Mehren, with whom I had many entertaining and educational (to me) arguments, which generated a respect for economics that continues to grow to this day. And second, my wonderful teacher of physics at Evanston Township High School—Robert Anspaugh—who tried to teach me to think like a real scientist, and how to use mathe- matics in the pursuit of science.
还有两个对我的统计学兴趣有很重要的影响发生在20世纪50年代末和60年代初。第一个是(源于)我父亲在政府的老朋友,加州大学伯克利分校经济系的退休教授,George Mehren,我跟着他学习,有了许多文化和教育上的观点,这让我对经济学从那时持续到今天一直保持着尊敬。第二个是我在高中的物理老师,他试着教导我像一个科学家一样思考问题,教我如何用数学解决科学问题。
By the time I left high school for college, I appre- ciated some statistical thinking from gambling, some scientific thinking from physics, and I had deep respect for disciplines other than formal mathematics, in par- ticular, physics and the law. These, in hindsight, are exposures that were crucial to the kind of statistics to which I gravitated in my later years. More details of the influence of my mentors can be found in Rubin (2014b).
在我离开高中进入大学的时候,我很感激我从赌博中学到了统计学思想,从物理中学到了科学思想,我对除了数学以外的很多学科都保持尊敬,特别是物理和法律。事后这些都是我后来被统计学吸引的关键因素。关于更多导师如何影响我的细节可以从这里找到(Rubin(2014b))。
# COLLEGE TIME AT PRINCETON
# 在普林斯顿的大学时代
Fan: You entered Princeton in 1961, first as a physics major, but later changed to psychology. Why the change and why psychology?
Fan: 你在1961年进入普林斯顿,开始主修物理,但后来改修心理学,为什么?
Don: That’s a good question. Inspired by Anspaugh, I wanted to become a physicist. I was lined up for a BA in three years when I entered Princeton, and unknown to me before I entered, also lined up for a crazy plan to get a Ph.D. in physics in five years, in a program being reconditely planned by John Wheeler, a very well-known professor of physics there (and Richard Feynman’s Ph.D. advisor years earlier). In retrospect, this was a wildly over-ambitious agenda, at least for me. For a combination of complications, including the Vietnam War (and its associated drafts) and Profes- sor Wheeler’s sabbatical at a critical time, I think no one succeeded in completing a five-year Ph.D. from entry. In any case, there were many kids like me at Princeton then, who, even though primarily interested in math and physics, were encouraged to explore other subjects. I did that, and one of the courses I took was on personality theory, taught by a wonderful professor, Silvan Tomkins, who later became a good friend. At the end of my second year, I switched from Physics to Psychology, where my mathematical and scientific background seemed both rare and appreciated—it was an immature decision (not sure what a mature one would have been), but a fine one for me because it in- troduced me to some new ways of thinking, as well as to new fabulous academic mentors.
Don:这个问题好。在Anspaugh的鼓舞下,我想成为一个物理学家。我进入普林斯顿被安排3年拿到BA学位,但进来之前我并不知道我还被安排了一个5年拿到物理学博士的疯狂计划,这是物理系非常有名的John Wheeler教授做的计划。回想起来,那对于我来说是一个雄心勃勃的日程。结合一些其他的复杂因素,包括越南战争,Wheeler教授在关键时刻休假,我觉得没人能在5年内完成博士学位。不管怎样,在普林斯顿还有很多孩子和我一样,即使开始对数学和物理感兴趣,也被鼓励去其他学科探索。我就是这么做的,我上的一门课叫人格理论,是一个很好的教授教的,后来我们成为了朋友。在第二学年末,我从物理转到了心理系,在那里,我的数学和科学背景看起来都很弱,喜欢心理学是一个不成熟的决定(我也不确定什么是成熟的),但对我来说,它好的方面是带给了我一些全新的思考方式,也让我认识了一些新的著名学者。
Fabri: You had some computing skills which were uncommon then, right? So you started to use comput- ers quite early.
Fabri: 后来你有了一些不寻常的计算能力对吗?所以你很早就开始用计算机了。
Don: Yes. Sometime between my first and second year at Princeton, I taught myself Fortran. As you men- tioned, those skills were not common, even at places like Princeton then.
Don:是的。我在普林斯顿的第一和第二学年期间,就自学了Fortran。就像你说的,这些技能就算是在当时的普林斯顿,也不常见。
Fabri: Was learning Fortran just a matter of having fun or did you actually use these skills to solve prob- lems?
Fabri: 你当时学Fortran是兴趣还是真的要用这些技能解决问题?
Don: It was for solving problems. When I was in the Psychology Department, I was helping to support myself by coding some of the early batch computer packages for PSTAT, a Princeton statistical software package, which competed with BMDP of UCLA at the time. I also wrote various programs for simulating hu- man behavior.
Don:是为了解决问题。当时在心理系,我为了支持自己的学业,便为PSTAT编写计算机软件包,那是普林斯顿的一个统计软件包,当时和UCLA的BMDP是竞争关系。我也写了一些模拟人类行为的程序。
Fan: In your senior year at Princeton, you applied for Ph.D. programs in psychology and were accepted by several very good places.
Fan:你在普林斯顿高年级的时候,申请了心理学的博士项目,并且被许多不错的地方录取了。
Don: Yes, I was accepted by Stanford, Michigan and Harvard. I met some extraordinary people during my visits to these programs. I went out to Stanford first, and met William Estes, a quiet but wonderful profes- sor with strong mathematical skills and a wry wit, who later moved to Harvard. Michigan had a very strong mathematical psychology program, and when I visited in the spring of 1965, I was hosted primarily by a very promising graduating Ph.D. student, Amos Tversky, who was doing extremely interesting work on human behavior and how people handled risks. In later years, he connected with another psychologist, Daniel Kah- neman, and they wrote a series of extremely influential papers in psychology and economics, which eventually led to Kahneman’s winning the Nobel Prize in Eco- nomics in 2002; Tversky passed away in 1996 and was thus not eligible for the Nobel Prize. Kahneman (who recently was awarded a National Medal of Science by President Obama) always acknowledges that the Nobel Prize was really a joint award (to Tversky and him). I was on a committee sometime last year with Kahne- man, and it was interesting to find out that I had known Tversky longer than he had.
Don:是的,我被斯坦福,密歇根和哈佛录取了。在访问这些项目的过程中,我认识了很多杰出人物。我先去了斯坦福,认识了William Estes,一个很安静但是很厉害的教授,他有很强的数学技能并且风趣幽默,后来他去了哈佛。密歇根的数学心理学项目非常强,当我1965年春天访问的时候,一个非常有前途的博士生,Amos Tversky,接待了我,他当时在做很有趣的有关人类行为和人是如何处理风险的项目。后来的几年,他和另一个心理学家Daniel Kahneman一起在心理学和经济学领域写了一系列极有影响力的文章,也使得Kahneman最终在2002年获得诺贝尔经济学奖。Tversky在1996年去世,因此不能获得诺贝尔奖,但Kahneman经常声明那个诺贝尔奖是两人共同的奖项。去年我和Kahneman一起担任评审,非常有意思的是,我认识Tversky的时间比他长。
Fan: But ultimately you chose Harvard.
Fan: 但最终你选择了哈佛。
Don: Well, we all make strange decisions. The rea- son was that I had an east-coast girlfriend who had an- other year in college.
Don:我们所有人都会做奇怪的决定。其实(选择哈佛的)原因是我在东海岸的女朋友还要再读一年。
# GRADUATE YEARS AT HARVARD
# 在哈佛的研究生时代
Fabri: You first arrived at Harvard in 1965 as a Ph.D. student in psychology, which was in the Department of Social Relations then, but were soon disappointed, and switched to computer science. What happened?
Fabri: 1965年你最初到哈佛是社会关系学院心理学系的博士生,但你很快就失望,并转到了计算机系,(这之中)发生了什么?
Don: When I visited Harvard in the summer of 1965, some senior people in Social Relations appeared to find my background, in subjects like math and physics, at- tractive, so they promised me that I could skip some of the basic more “mathy” requirements. But when I arrived there, the chair of the department, a sociolo- gist, told me something like, “No, no, I looked over your transcript and found your undergraduate educa- tion scientifically deficient because it lacked ‘methods and statistics’ courses. You will have to take them now or withdraw.” Because of all the math and physics that I’d had at Princeton, I felt insulted! I had to get out of there. Because I had independent funding from an NSF graduate fellowship, I looked around. At the time, the main applied math appeared being done in the Di- vision of Engineering and Applied Physics, which re- cently became the Harvard’s “School of Engineering and Applied Sciences.” The division had several sec- tions; one of them was computer science (CS), which seemed happy to have me.
Don:我1965年夏天访问哈佛大学的时候,社会关系学院高年级的人知道了我的背景,数学、物理是很吸引人的学科,他们向我保证可以跳过一些数学类基础课的要求。但当我到了那,系主任,一个社会学家告诉我,“不,不,我看了你的成绩单,觉得你的本科教育缺乏科学性,因为缺少‘方法和统计’这样的课程,你必须要修,不然就退学”。由于我在普林斯顿上完了所有数学和物理的课程,我感觉这很侮辱,我必须要离开那。但因为我拿了NSF的研究生全额奖学金,我开始寻找(新的系)。在那时,工程和应用物理系新开了应用数学,它最近成为了哈佛的工程和应用科学学院。它有很多专业,其中一个就是计算机,看起来很愿意接收我。
Fan: But you got bored again soon. Was this because you found the problems in CS not interesting or chal- lenging enough?
Fan: 但是你很快再次觉得很无聊。是因为你觉得计算机没意思或者不够挑战吗?
Don: No, not really that. There were several reasons. First, there was a big emphasis on automatic language translation, because it was cold war time, and it ap- peared that CS got a lot of money for computational linguistics from ARPA (Advanced Research Projects Agency), now known as DARPA. The Soviet Union, from behind the iron curtain, produced a huge num- ber of documents in Russian, but evidently there were not enough people in the US to translate them. A com- plication is that there are sentences that you could not translate without their context. I still remember one example: “Time flies fast,” a three-word sentence that has three different meanings depending on which of the three words is the verb. If this three-word sentence cannot be automatically translated, how can one get an automatic (i.e., by computer) translation of a complex paragraph? Related to this was Noam Chomsky’s work on transformational grammars, down the river at MIT.
Don:不,不是那样的,有好多因素。首先,它很注重自动化语言翻译,因为那是冷战时期,计算机系得到了来自ARPA(高级研究项目代理,现在是DARPA)的很多经费,用于计算语言学。苏联,从铁幕背后生产了大量的文件,但很显然美国没有足够的人手去翻译。(而且)复杂的是有些句子你无法在没有语境的情况下翻译。我记得一个例子:“时光飞逝”,(这)一个三个单词的句子,可能有三个意思,这取决于这三个单词哪个是动词。如果这个三个单词的句子不能被自动化翻译,(电脑)又怎么能自动化翻译一个复杂的段落呢?这自然是和麻省理工Noam Chomsky的翻译语法相关的。
Second, although I found some real math courses and the ones in CS on mathy topics, such as computa- tional complexity, which dealt with Turing machines, Godel’s theorem, etc., interesting, I found many of the courses dull. Much of the time they were about programming. I remember one of my projects was to write a program to project 4-dimensional figures into 2- dimensions, and then rotate them using a DEC PDP-1. It took an enormous number of hours. Even though my program worked perfectly, I felt it was a gigantic waste of time. I also got a C+ in that course because I never went to any of the classes. Now, having dealt with many students, I would be more sympathetic that I deserved a C+, but not when I was a kid. At that time, I figured there must be something better to do than rotating 4D objects and getting a C+. But marching through rice paddies in Vietnam or departing for some- where in Canada didn’t seem appealing. So after pick- ing up a MS degree in CS in 1966, although I stayed another year in CS, I was ready to try something else.
其次,虽然我发现一些数学课程和一些计算机领域的数学主题很有趣,如计算复杂度,用来解决图灵机,哥德尔定理等,但我发现(还是有)许多课程很无聊,大部分都和编程有关。我记得一个项目是写一个程序将一个4维图片转成二维,然后用一个DEC PDP-1做旋转。这个花费了我好几个小时,即使我的程序很完美,我仍然觉得这很浪费时间。我在那门课得了C+,因为我从来没去上过课。现在,我现在对这个C+有不同的看法,但是当我还小的时候,我不这样想。那个时候,我想一定会有比把4D物体旋转并得到一个C+更好的事情去做。但是参加越南战争,或者离开去加拿大看起来都没有吸引力,所以在1966年拿到计算机硕士学位之后,我准备尝试些别的。
Fabri: How did statistics end up in your path?
Fabri: (那么)统计是如何成为你的最终之路的?
Don: A summer job in Princeton in 1966 led to it. I did some programming for John Tukey in For- tran, LISP and COBOL. I also did some consulting for a Princeton sociology professor, Robert Althauser, basically writing programs to do matched sampling, matching blacks and whites, to study racial disparity in dropout rates at Temple University. I had a conversa- tion with Althauser about how psychology and then CS weren’t working out for me at Harvard. Because Bob was doing some semi-technical things in sociology, he knew of Fred Mosteller, although not personally, and also knew that Harvard had a decade-old Statistics Department that was founded in 1957. He suggested that I contact Mosteller. After getting back to Harvard, I talked to Fred, and he suggested that I take some stat courses. So in my third year in Harvard, I took mostly stat courses and did OK in them. And the Stat depart- ment said “Yes” to me. It also helped to have my own NSF funding, which I had from the start; they kept re- newing for some reason, showing their bad taste prob- ably, but it worked out well for me. Anyway, at the end of my third year at Harvard, I had switched to statistics, my third department in four years.
Don:1966年夏天我在普林斯顿的工作成就了这件事。我用Fortran,LISP和COBOL为John Tukey写了一些程序,(同时)我也为普林斯顿社会学教授Robert Althauser做了一些咨询,写一些程序匹配抽样,匹配黑人和白人,研究天普大学辍学率的种族差异。我跟Althauser谈过关于心理学是怎样的以及为什么计算机专业在哈佛不适合我。因为Bob在社会学上做一些半技术的东西,他认识Fred Mosteller,虽然不是个人名义,也知道哈佛有一个1957年成立的统计系。他建议我联系Mosteller。在回到哈佛后,我和Fred谈了一下,他建议我修一些统计课程。所以在第三年,我上的大部分都是统计课程并且还不错。统计系接收了我并帮我拿到了我自己的NSF资助,从我一开始入学就有。由于某些原因,他们不断重新开始,这可能显示了他们的坏品味,但是对我来说很好。不管怎样,在哈佛的第三年末,我转到了统计系,四年中的第三个系。
Fabri: Besides Mosteller, who else was on the statis- tics faculty then? It was a quite new department, as you said.
Fabri: 除了Mosteller,统计系还有哪些老师?就像你所说的那样,它当时是个很新的系。
Don: The other senior people were Bill Cochran and Art Dempster, who had recently been promoted to tenure. The junior ones were Paul Holland; Jay Gold- man, a probabilist; and Shulamith Gross from Berke- ley, a student of Erich Lehmann’s.
Don:其他的高级教师还有Bill Cochran和Art Dempster,他们最近升为终身教授。初级的有Paul Holland;概率论专家Jay Goldman;来自伯克利的Shulamith Gross,Erich Lehmann的学生。
Fabri: And you decided to work with Bill.
Fabri: 然后你决定和Bill合作。
Don: Actually, I first talked to Fred. Fred always had a lot of projects going; one was with John Tukey and he proposed that I work on it. I told him that I had this matched sampling project of my own, and he sug- gested that I talk to Cochran—Cochran a few years ear- lier was an advisor for the Surgeon General’s report on smoking and lung cancer. It was obviously based on observational data, not on randomized experiments, and Fred said that Cochran knew all about these issues in epidemiology and biostatistics. So I went to knock on Bill’s door. He answered with a grumpy sounding “yes,” I went in and he said, “No, not now, later!” So I thought “Hmmm, rough guy,” but actually he was a sweetheart, with a great Scottish dry sense of humor and a love of scotch and cigarettes (I understand the former, although not the latter).
Don:事实上,我先找了Fred谈话。他总是同时在做很多项目,其中一个是和John Tukey的,他建议我做这个。我告诉他我自己做过这个抽样的项目了,他建议我去找Cochran—Cochran谈,他几年前是为Surgeon General汇报过吸烟和肺癌项目的。这个项目很明显是基于观察数据,而不是随机试验,Fred说Cochran知道所有的关于流行病和生物统计的问题。所以我去敲了Bill的门。他用脾气暴躁的声音说了“请进”,我进去了但他却说,“不,不是现在,一会儿!”。然后我想,“哦,(这是个)粗人”。但事实上他心地善良,有很强的苏格兰幽默感,喜欢苏格兰威士忌和香烟(前者我理解,后者我不能理解)
Fabri: Cochran did have a lasting influence on you, right?
Fabri: Cochran一直对你有着影响,对吗?
Don: Yes, he had a tremendous influence on me. Once I was doing some irrelevant math on matching, which I now see popping up again in the literature. I showed that to Bill, and he asked me, “Do you think that’s important, Don?” I said, “Well, I don’t know.” Then he said, “It is not important to me. If you want to work on it, go find someone else to advise you. I care about statistical problems that matter, not about making things epsilon better.” Another person who was very influential was Art Dempster. Once I did some con- sulting for Data Text, a collection of batch computer programs like PSTAT or BMDP. I was designing pro- grams to calculate analyses of variance, do regressions, ordinary least squares, matrix inversions, all when you have, in hindsight, limited computing power. For ad- vice on some of those I talked to Dempster, who al- ways has great multivariate insights based on his deep understanding of geometry—very Fisherian.
Don:是的,他对我有巨大的影响。有一次我正在做一些不相关的数学匹配,如今我在文献里又看到了。我给Bill看,他问我,“Don,你认为那重要吗?”我说,“我不知道。”然后他说,“这对我不重要。如果你还想做这方面,去找其他人来给你建议。我关心那背后的统计问题,而不是把事情做的更好。”另一个对我有很大影响的是Art Dempster。有一次我为数据文本做一些咨询,一系列的计算机程序像PSTAT或者BMDP。我设计计算机程序计算方差,做回归,普通最小二乘,矩阵求逆,所有都局限于计算。(后来)Dempster和我聊天,基于他对几何的理解已经和Fisher相差无几以及他对多元的深刻理解,他给了我一些建议。
Fan: Your Ph.D. thesis was on matching, which is the start of your life-long pursuit of causal inference. How did your interest in causal inference start?
Fan: 你的博士论文是关于匹配的,从什么时候开始你毕生致力于因果推断的研究?你什么时候对因果推断产生兴趣的?
Don: When I worked with Althauser on the racial disparity problem, I always emphasized to him that it was inherently descriptive, not really causal. I re- membered enough from my physics education in high school and Princeton that association is not causation. So I was probably not intrigued by causal inference per se, but rather by the confusion that the social scien- tists had about it. You have to describe a real or hy- pothetical experiment where you could intervene, and after you intervene, you see how things change, not in time but between intervention (i.e., treatment) groups. If you are not talking about intervention, you can’t talk about causality. For some reason, when I look at old philosophy, it seems to me that they didn’t get it right, whereas in previous centuries, some experimenters got it. They bred cows, or mated hunting falcons. If you mated excellent female and male falcons, the resulting next generation of falcons would generally be better hunters than those resulting from random mating. In the 20th century, many scientists and experimentalists got it.
Don: 当我和Althauser一起研究种族差异问题的时候,我总是跟他强调结果本质上是描述性的,并不是真正的因果关系。我记得我在高中和在普林斯顿学物理的时候就知道关联不是因果。我可能对因果推断并不好奇,但是我对一些社会科学家对它好奇很不解。我必须要描述一个真实的或者假设的实验,你可以介入它,在你介入之后,你会看到事情是如何变化的,不是瞬时的,而是在不同的介入之间(比如不同的治疗)。如果你不谈介入,你就不能谈因果。由于某些原因,当我看旧哲学的时候,对我来说那可能不对,然而在前几个世纪,一些实验员验证了它。他们养奶牛,或者让猎鹰交配。如果你让优秀的公母猎鹰交配,下一代的狩猎水平要高于随机交配的结果。在20世纪,许多科学家和实验员得到了这个结果。
Fabri: So you were only doing descriptive compar- isons in your Ph.D. thesis, and the notation of potential outcomes was not there.
Fabri:所以你在你的博士论文里只做了描述性的比较,潜在的结果注释不在里面。
Don: Partly correct. At that time, the notation of po- tential outcomes was in my mind, because that is the way that Cochran initiated discussions of randomized experiments in the class he taught in 1968. Initially, it was all based on randomization, unbiasedness, Fisher’s test, etc. But the concepts had to be flipped into or- dinary least squares (OLS) regression and analysis of variance tables, because nobody could compute any- thing difficult back then. One of the lessons in Bill’s class in regression and experimental design was to use the abbreviated Dolittle method to invert matrices, by hand! So you really couldn’t do randomization tests in any generality. The other reason I was interested in ex- periments and social science was my family history. There was always this legal question lurking: “But for this alleged misconduct, what would have happened?”
Don:一部分是对的。在那个时候,潜在结果的注释在我脑袋里,因为那个是Cochran1968年在他的随机实验课堂上讨论的方法。最开始全部都是基于随机,无偏,Fisher检验等等。但是这些概念必须要被注入普通最小二乘回归,方差分析表,因为没人能计算出任何困难的东西。Bill有一堂回归与实验设计课,就是用缩略的Dolittle方法手工求矩阵的逆!所以你真的难以对随机检验做任何普适的推广。另一个让我对实验和社会科学感兴趣的原因是我的家庭史。总是有这样一个法律问题埋在我心里:“这个(事情)涉嫌渎职,会发生什么呢?”
Fan: What was your first job after getting your Ph.D. degree in 1970?
Fan: 在你1970年拿到博士学位你的第一份工作是什么?
Don: I stayed at Harvard for one more year, as an instructor in the Statistics Department, partly sup- ported by teaching, partly supported by the Cambridge Project, which was an ARPA funded Harvard–MIT joint effort; the idea was to bring the computer science technologies of MIT and the social sciences research of Harvard together to do wonderful things in the social sciences. In the Statistics Department, I was coteaching with Bob Rosenthal the “Statistics for Psychologists” course that, ironically, the Social Relations Department wanted me to take five years earlier, thereby driving me out of their department! Bob had, and has, tremendous intuition for experimental design and other practical is- sues, and we have written many things together.
Don:我在哈佛多待了一年,作为统计系的讲师,一部分工资来自教学,一部分来自剑桥项目,这是一个ARPA支持的哈佛-麻省理工联合项目;旨在将麻省理工的计算机科学技术和哈佛大学的社会科学研究结合到一起,在社会科学上做一些有趣的事情。在统计系,我和Bob Rosenthal一起教“统计心理学”,讽刺的是,5年前社会科学院就想让我上这门课,(正是)由于这门课,他们把我逐出了他们院!Bob对于实验设计和其他一些实践问题很有灵感,我们一起写了很多东西。
# THE ETS DECADE: MISSING DATA, EM AND CAUSAL INFERENCE
# ETS的10年:缺失数据,EM和因果推断
Fan: After that one year, you went for a position at ETS in Princeton instead of a junior faculty position in a research university. It was quite an unusual choice, given that you could probably have found a position in a respected university statistics department easily.
Fan: 一年后,你在普林斯顿的ETS找了一个职位,而不是大学里的高级教职。这看起来是个很不寻常的选择,因为(当时)你已经能轻易在一个有名望的统计系找到一份教职。
Don: Right—many people thought I was goofy. I did have several good offers, one was to stay at Harvard, and another was to go to Dartmouth. But I met Al Beaton, who was later my boss at ETS in Princeton, at a conference in Madison, Wisconsin, and he offered me a job, which I took. Al had a doctorate in education at Harvard, and had worked with Dempster on compu- tational issues, such as the “sweep operator.” He was a great guy with a deep understanding of practical com- puting issues. Also, he appreciated my research. Be- cause I was an undergrad at Princeton, it was almost like going home. For several years, I taught one course at Princeton. Between the jobs at ETS and Princeton, I was earning twice what the Harvard salary would have been, which allowed me to buy a house on an acre and a half, with a garage for rebuilding an older Mer- cedes roadster, etc. A different style of life from that in Cambridge.
Don:很多人认为我是一个傻瓜。我有很多不错的机会,一个是待在哈佛,另一个是去达特茅斯。但我遇到了Al Beaton,后来他是我在普林斯顿ETS的老板,在威斯康星麦迪逊的一个会议上,他给了我这份工作,我接受了。Al在哈佛有一个博士学位,他跟Dempster一起做一些计算问题,例如“扫描算子”。他是个很好的人,并且对计算问题有很深刻的理解;同时他也很欣赏我的研究。由于我之前曾是普林斯顿的本科生,(因此)这种感觉大概就像是回家。我在普林斯顿有好几年就只教一门课。在ETS和普林斯顿的工作中,我的工资是在哈佛的两倍,这让我买了一栋1.5英亩的房子,有一个车库,还改造了旧的奔驰跑车等等。这是和在剑桥完全不同的生活方式。
Fan: You seem to have had a lot of freedom to pur- sue research at the ETS. What was your responsibility at ETS?
Fan: 看起来你在ETS做研究非常自由。你在ETS的职责是什么?
Don: The position at ETS was like an academic posi- tion with teaching responsibilities replaced by consult- ing on ETS’s social science problems, including psy- chological and educational testing ones. I found con- sulting much easier for me than teaching, and ETS had interesting problems. Also there were many very good people around, like Fred Lord, who was highly respected in psychometrics. The Princeton faculty was great, too: Geoffrey Watson (of the Durbin–Watson statistic) was the chair; Peter Bloomfield was there as a junior faculty member before he moved to North Car- olina; and of course Tukey was still there, even though he spent a lot of time at Bell Labs. John was John, hav- ing a spectacular but very unusual way of thinking— obviously a genius. Stuart Hunter was in the Engineer- ing School then. These were fine times for me, with tremendous freedom to pursue what I regarded as im- portant work.
Don:在ETS的工作有点像教职,只是在做ETS的社会科学的咨询而不是教学了,包括心理教育的问题。我发现相比教学,咨询梗简单,ETS有很多有意思的问题。而且周围有很多不错的人,比如Fred Lord,在心理测验学上很有名望。普林斯顿的老师们也很好:Geoffrey Watson(杜宾-瓦特森统计量命名人)是系主任;Peter Bloomfield在搬到北卡之前是初级教授以及Tukey一直在那,即使他在Bell实验室花了很多时间。John显然是一个有很不寻常的思考方式的天才。Stuart Hunter后来去了工程学院。当时对我来说是一段好时光,我有大量自由的时间来追求我认为重要的工作。
Fabri: By any measure, your accomplishments in the ETS years were astounding. In 1976, you published the paper “Inference and Missing Data” in Biometrika (Rubin, 1976) that lays the foundation for modern anal- ysis of missing data; in 1977, with Arthur Dempster and Nan Laird, you published the EM paper “Max- imum Likelihood from Incomplete Data via the EM Algorithm” in JRSS-B (Dempster, Laird and Rubin, 1977); in 1974, 1977, 1978, you published a series of papers that lay the foundation for the Rubin Causal Model (Rubin, 1974, 1977, 1978a). What was it like for you at that time? How come so many groundbreak- ing ideas exploded in your mind at the same time?
Fabri: 不管怎样看,你在ETS的几年的成就是令人震惊的。在1976年,你在Biometrika上发表了《Inference and Missing Data》(Rubin, 1976),为现代缺失数据的分析奠定了基础;在1977年,你和Arthur Dempster,Nan Laird一起在JRSS-B上发表了《Maximum Likelihood from Incomplete Data via the EM Algorithm》(Dempster, Laird and Rubin, 1977);在1974,1977,1978年,你发表了一系列奠定了因果模型基础的文章(Rubin, 1974, 1977, 1978a)。你在那个时候是什么样的?你是如何在同一时间作出这么多开创性的工作的?
Don: Probably the most important reason is that I al- ways worried about solving real problems. I didn’t read the literature to uncover a hot topic to write about. I al- ways liked math, but I never regarded much of math- ematical statistics as real math—much of it is just so tedious. Can you keep track of these epsilons?
Don:可能最重要的原因是我总是很担忧如何解决实际问题。我没有读过写热门话题的文献。我一直喜欢数学,但是我不认为数理统计是真正的数学,大部分数学问题是很枯燥的。你能一直追溯这些epsilon吗?[编者注:这里的 epsilon 指数学中 epsilon-delta 语言中的 epsilon]
Fabri: There is no coincidence that all these papers share the common theme of missing data.
Fabri: 没想到很巧合的是所有这些论文都有共同的主题:缺失数据。
Don: That’s right. That theme arose when I was a graduate student. The first paper I wrote on missing data, which is also my first sole-authored paper, was on analysis of variance designs, a quite algorithmic pa- per. It was always clear to me, from the experimental design course from Cochran that you should set up ex- periments as missing data problems, with all the poten- tial outcomes under the not-taken treatments missing. But nobody did observational studies that way, which seemed very odd to me. Indeed, nobody was using po- tential outcomes outside the context of randomized ex- periments, and even there, most writers dropped poten- tial outcomes in favor of least squares when actually doing things.
Don:是的。在我还是研究生的时候这个主题就出现了。我写的第一篇关于缺失数据的论文,也是我的第一篇唯一作者的论文,是关于实验设计的方差分析,一个非常偏算法的论文。我很清楚,从Cochran的实验课上,你应该把实验问题当作缺失数据问题,对于没有处理的,潜在结果是缺失的。但是没有人做观测的研究,对我来说很奇怪。事实上,没有人用随机实验以外的潜在结果,甚至,很多学者在真正做事情的时候为了支持最小二乘,直接丢掉潜在结果。
Fan: What was the state of research on missing data before you came into the scene?
Fan: 在你进入这个领域之前,缺失数据是什么样的研究状态?
Don: It was extremely ad hoc. The standard ap- proach to missing data then was comparing the biases of filling in the means, or of regression imputation un- der different situations, but almost always under an im- plicit “missing completely at random” assumption. The purely technical sides of these papers are solid. But I found there were always counter examples to the pro- priety of the specific methods being considered, and to explore them, one almost needed a master’s thesis for each situation. I would rather address the class of prob- lems with some generality. There is a mechanism that creates missing data, which is critical for deciding how to deal with the missing data. That idea of formal indi- cators for missing data goes way back in the contexts of experimental design and survey design. I am consis- tently amazed how this was not used in observational studies until I did so in the 1970s; maybe someone did, but I’ve looked for years and haven’t found anything. But probably because the missing data paper was done in a relatively new way, I had great difficulty in getting it published (more details in Rubin, 2014a).
Don:它非常有局限性。标准的解决缺失数据的方法是比较用均值填补后的偏差,或者在不同的情况下,用回归插补,但是都要基于“随机缺失”的假设。这些论文在纯技术层面上都非常扎实。但是我发现对于科学的方法,总有反例,为了探索这个问题,可能每种情况都是一篇硕士论文。我宁愿用一种更普适的方法来解决这类问题。有一个机制能够生成缺失数据,这对于如何解决缺失数据很关键。那个思想可以还原到实验设计和调查设计上。直到我20世纪70年代做这件事的时候,我很惊讶为什么没有用到观察研究上;可能有人做了,但这么多年我什么都没找到。但可能因为缺失数据的论文相对比较新,发表的过程中我遇到很大的困难(更多细节见 Rubin, 2014a)。
Fan: The EM algorithm is another milestone in mod- ern statistics; it is also relevant in computer science and one of the most important algorithm in data mining. Though similar ideas had been used in several specific contexts before, nobody had realized the generality of EM. How did Dempster, Laird and you discover the generality?
Fan: EM算法是现代统计的另一个里程碑;它也和计算机相关,是数据挖掘中最重要的算法之一。虽然在这之前有类似的想法,但是没有人意识到EM算法的普适性。你和Dempster, Laird是如何发现EM的普适性的?
Don: In those early years at ETS, I had the free- dom to remain in close contact with the Harvard peo- ple, Cochran, Dempster, Holland and Rosenthal, which was very important to me. I always enjoyed talking to Dempster, who is a very principled and deep thinker. I was able to arrange some consulting projects at ETS to bring him to Princeton. Once we were talking about some missing data problem, and we started discussing filling these values in, but I knew it wouldn’t work in generality. I pointed to a paper by Hartley and Hock- ing (1971), where they deserted the approach of itera- tively filling in missing values, as in Hartley (1956) for the counted data case, and went to Newton–Raphson, I think, in the normal case. Even though aspects of EM were known for years, and Hartley and others were sort of nibbling around the edges of EM, apparently nobody put it all together as a general algorithm. Art and I real- ized that you have to fill in sufficient statistics. I had all these examples like t distributions, factor analysis (the ETS guys loved that), latent class models. And Art had a great graduate student, Nan Laird, available to work on parts of it, and we started writing it up. The EM paper was accepted right away by JRSS-B, even with invited discussions.
Don:在ETS的早些年,我可以自由地和哈佛的人联系,比如Cochran, Dempster, Holland和Rosenthal,这对我来说非常重要。我非常喜欢和Dempster聊天,他很有原则,思考问题很深入。我能在ETS安排一些咨询项目从而带他来到普林斯顿。有一次我们聊缺失数据的问题,我们开始讨论插补值,但是我知道它不具有普适性。我指出一篇论文,Hartley和Hocking (1971)写的,里面用了迭代的方法来插补缺失数据,正如Hartley(1956)的计数数据的情形,后来发展到了牛顿算法,我认为,这才是一个普遍情形。即使EM的种种已经被知道很多年了,Hartley和其他人有点咬在EM的边缘,很明显没有人把它总结成一个普适性的算法。Art和我意识到必须填补充分统计量。我做了所有的例子,如t分布,因子分析(ETS的人所喜欢的),隐类模型。Art有一个很好的研究生,Nan Laird,他可以做一部分工作,于是我们开始写起来。EM的论文被JRSS-B接收,并且我们也被邀请讨论。
Fan: Now let’s talk more about causal inference. You are known for proposing the general potential out- come framework. It was Neyman who first mentioned the notation of potential outcomes in his Ph.D. thesis (Neyman, 1990), but the notation seemed to have long been neglected.
Fan: 让我们再说说因果推断。你因为提出了普适的潜在结果框架而出名。最初是Neyman在他的博士论文里提到的,但是这个注释似乎被忽略了很久。
Don: Yes, it was ignored outside randomized exper- iments. Within randomized studies, the notion became standard and used, for example, in Kempthorne’s work, but as I mentioned earlier, ignored otherwise.
Don:是的,它被忽略在随机试验之外了。在随机领域,这个概念变得标准和通用,比如在Kempthorne的工作中,但是正如我之前提到的,它在其他地方都被忽略了。
Fan: Were you aware of Neyman’s work before?
Fan: 你之前注意到Neyman的工作了吗?
Don: No. I wasn’t aware of his work defining po- tential outcomes until 1990 when his Ph.D. thesis was translated into English, although I attributed much of the perspective to him because of his work on surveys in Neyman (1934) and onward (see Rubin, 1990a, fol- lowed by Rubin, 1990b).
Don:没有。虽然我对他在调查有关Neyman (1934)和onward(见Rubin, 1990a及Rubin, 1990b)的工作上十分敬佩,但我都没有注意到他的工作中定义了潜在结果,直到1990年他的博士论文被翻译成英文。
Fabri: You actually met Neyman when you visited Berkeley in the mid-1970s. During all those lunches, had you ever discussed causal inference and potential outcomes with him?
Fabri: 事实上你在20世纪70年代中去伯克利访问见到了Neyman。在吃午饭的时候,你和他讨论过因果推断和潜在结果么?
Don: I did. In fact, I had an office right next to his. Neyman came to Berkeley in the late 30s. He was very impressive, not only as a mathematical statistician, but also as an individual. There was a tremendous aura about him. Shortly after arriving in Berkeley, I gave a talk on missing data and causal inference. The next day, I went to lunch with Neyman and I said something like, “It seems to me that formulating causal prob- lems in terms of missing potential outcomes is an ob- vious thing to do, not just in randomized experiments, but also in observational studies.” Neyman answered to the effect that (remarkable in hindsight because he did so without acknowledging that he was the person who first formulated potential outcomes), “No, causal- ity is far too speculative in nonrandomized settings.” He repeated something like this quote from his biog- raphy, “...Without randomization an experiment has little value irrespective of the subsequent treatment.” (Also see my comment on this conversion in Rubin, 2010.) Then he went to say politely but firmly, “Let’s not talk about that, let’s instead talk about astronomy.” He was very into astronomy at the time.
Don:讨论过。事实上,我的办公室在他右边。Neyman在30年代末来到伯克利。他给人的印象很深刻,他不仅仅是一个数理统计学家,也很有个人魅力。他有一种巨大的光环。在来到伯克利不久,我做了一个缺失数据和因果推断的演讲。第二天,我和Neyman吃午饭,我说,“看起来就缺失潜在结果的因果推断问题是很明显现在要做的事情,不仅仅在随机实验中,在观测研究中也应该做。”Neyman回答(事后很显然他这么做是因为并没有把自己当作第一个提出潜在结果的人),“不,因果在非随机的设定里过于投机了。”他在他的自传中也重复了类似的话,“……没有随机性的实验的后续处理是没有价值的。”(也可以在我对这次访谈的评论 Rubin, 2010.里可以看到)然后他很有礼貌很严肃的说,“我们不讨论这个,我们讨论天文学。”他当时非常痴迷天文学。
Fabri: You probably learned the reasons why he was so involved in the frequentist approach.
Fabri: 你可能知道了为什么他这么支持频率派方法。
Don: Yes. I remember we once had a conversation about what confidence intervals really meant and why the formal Neyman–Pearson approach seemed irrele- vant to me. He said something like, “You misinterpret what we have done. We were doing the mathematics; go back and read my 1934 paper where I first defined a confidence interval.” He defined it as a procedure that has the correct coverage for all prior distributions (see page 589, Neyman, 1934). If you think of that, you are forced to include all point mass priors and, therefore, you are forced to do Neyman–Pearson. He went on to say (approximately), “If you are a real scientist with a class of problems to work on, you don’t care about all point-mass priors, you only care about the priors for the class of problems you will be working on. But if you are doing the mathematics, you can’t talk about the problems you or anyone is working on.” I tried to make this point in a comment (Rubin, 1995), but it didn’t seem to resonate to others.
Don:是的。我记得有一次我们谈论置信区间的真实含义,以及为什么Neyman–Pearson的方法看起来跟我的毫不相关。他说,“你错误地解释了我们已经做的。我们做的是数学;回去读我1934年的论文,里面我定义了置信区间。”他把它定义为对所有先验分布有正确覆盖的过程(见589页, Neyman, 1934)。如果你这么认为,你必须包括所有点的先验,你必须做Neyman–Pearson。(好像)他又继续说道,“如果你是一个真正的科学家,在研究一类问题时,你不要关注所有点的先验,你只需要关注你在做的那类问题的先验。但是如果你在做数学,你不能只谈论你或者其他人在做的东西。”我试图在一篇评论中指出这一点(Rubin, 1995),但似乎并没有引起共鸣。
Fabri: In his famous 1986 JASA paper, Paul Hol- land coined the term “Rubin Causal Model (RCM),” referring to the potential outcome framework to causal inference (Holland, 1986). Can you explain why, if you think so, the term “Rubin Causal Model” is a fair de- scription of your contribution to this topic?
Fabri: 在1986年那篇著名的JASA文章里,Paul Holland提出了“Rubin因果模型(RCM)”,指的是因果推断潜在的结果框架(Holland, 1986)。你能解释下为什么“Rubin因果模型”是你对这个话题的贡献的相对描述吗?如果你也这么认为的话。
Don: Actually Angrist, Imbens and I had a rejoin- der in our 1996 JASA paper (Angrist, Imbens and Ru- bin, 1996), where we explain why we think it is fair. Neyman is pristinely associated with the development of potential outcomes in randomized experiments, no doubt about that. But in the 1974 paper (Rubin, 1974), I made the potential outcomes approach for defining causal effects front and center, not only in randomized experiments, but also in observational studies, which apparently had never been done before. As Neyman told me back in Berkeley, in some sense, he didn’t believe in doing statistical inference for causal effects outside of randomized experiments.
Don:事实上Angrist, Imbens和我在我们1996年的JASA论文(Angrist, Imbens and Rubin, 1996)里一起反驳了这种说法,我们解释了为什么我们认为是相对的。毫无疑问Neyman是最初和随机实验中的潜在结果的发展相关的人。但是在1974年的文章里(Rubin, 1974),我用潜在结果的方法来定义了随机试验和观察研究里因果效应的开始和中间应该是怎样的,这很明显是之前没做过的。正如Neyman在伯克利告诉我的,在某种意义上,他不相信在随机实验之外做因果效应的统计推断。
Fan: Also there are features in the RCM, such as the definition of the assignment mechanism, that belong to you.
Fan: RCM也有很多特点,比如分配机制的定义,这个是你定义的。
Don: Yes, it was crucial to realize that random- ized experiments are embedded in a larger class of as- signment mechanisms, which was not in the literature. Also, in the 1978 paper (Rubin, 1978a), I proposed three integral parts to this RCM framework: potential outcomes, assignment mechanisms, and a (Bayesian) model for the science (the potential outcomes and co- variates). The last two parts were not only something that Neyman never did, he possibly wouldn’t even like the third part. In fact, I think it is unfair to attribute something to someone who is dead, who may not ap- prove of the content being attributed. If the funda- mental idea is clear, such as with Fisher’s random- ization test of a sharp null hypothesis, sure, attribute that idea to Fisher no matter what the test statistic, as in Brillinger, Jones and Tukey (1978). Panos Toulis (a fine Harvard Ph.D. student) helped me track down this statement that I remembered reading in my ETS days from a manuscript John gave to me:
Don:是的,意识到随机实验是嵌套在一个大的分配机制中是很关键的,这个不在论文里。在1978年的论文里(Rubin, 1978a),我给RCM框架提出了3个完整的部分:潜在结果,分配机制以及一个贝叶斯模型(潜在结果和协变量)。后两个部分不仅是Neyman从来没做过的,甚至他很可能不喜欢第三部分。事实上,我认为我把某事归因于死去的人是不公平的,他们可能不赞成被归因的内容。但如Brillinger, Jones and Tukey (1978)中所说的,如果基本的想法很清晰,例如Fisher的随机检验有尖锐的零假设,也可以把这个想法归因于Fisher,不管是什么统计检验。Panos Toulis(一个很好的哈佛的博士生)帮我追溯了这个陈述,我记得在ETS的那些日子我读了John给我的一个稿子:
“In the precomputer era, the fact that almost all work could be done once and for all was of great impor- tance. As a consequence, the advantages of randomiza- tion approaches—except for those few cases where the randomization distributions could be dealt with once and for all—were not adequately valued.
“在前计算机时代,几乎所有的工作都能执行一次,所有的都很重要。因此,随机方法的优势在于,除了那些很少数的情况下随机分布能被一次性处理之外,对于大多数情况,(一次性处理)并没有足够的价值。
One reason for this undervaluation lay in the fact that, so long as randomization was confined to spe- cially manageable key statistics, there seemed no way to introduce into the randomization approach the insights—some misleading and some important and valuable—into what test statistics would be highly sen- sitive to the changes that it was most desired to detect. The disappearance of this situation with the rise of the computer seems not to have received the attention that it deserves.” (Brillinger, Jones and Tukey, 1978, Chap- ter 25, page F-5.)
(随机方法)被低估的一个原因是,只要随机化被限制在一些特定的容易处理的关键统计问题上,看起来就没办法把随机方法的见解——一些会误导的,一些重要的和有价值的见解——引入到对变化很敏感,最需要检验的检验统计量中。随着计算机的起步,这种现象消失了,而且看起来并没有引起它应有的关注度。”(Brillinger, Jones and Tukey, 1978, 25章, F-5页.)
Fabri: Here I am quoting an interesting question by Tom Belin regarding potential outcomes: “Do you be- lieve potential outcomes exist in people as fixed quan- tities, or is the notion that potential outcomes are a de- vice to facilitate causal inference?”
Fabri: 我在这引用Tom Belin提的关于潜在结果的一个有趣的问题:“你认为潜在结果像固定数值一样存在于人群中,还是解决因果推断问题的一个手段?”
Don: Definitely the latter. Among other things, a per- son’s potential outcomes could change over time, and how do we know the people who were studied in the past are still exchangeable with people today? But there are lots of devices like that in science.
Don:当然是后者。在其他事情中,一个人的潜在结果可能一直在变,我们怎么知道我们过去研究的人和现在研究的一直可交换?但是在科学中有很多那样的手段。
Fan: In the RCM, cause/intervention should always be defined before you start the analysis. In other words, the RCM is a framework to investigate the “effects of a cause,” but not the “causes of an effect.” Some criti- cize this as a major limitation. Do you regard this as a limitation? Do you think it is ever possible to draw in- ference on the causes of effects from data, or is it, per se, an interesting question worth further investigation?
Fan: 在RCM中,原因/干预在开始分析前就应该被定义。换句话说,RCM是一个调查“原因效应”的框架,而不是(调查)“一个影响的原因”。一些人批评说这是其主要的局限。你觉得这是个局限吗?你认为可能从效应中推断原因吗?或者,它是个值得研究的有意思的问题吗?
Don: I regard “the cause” of an event topic as more of a cocktail conversation topic than a scientific in- quiry, because it leads to an essentially infinite regress. Someone says, “He died of lung cancer because he smoked three packs a day”; then someone else coun- ters, “Oh no, he died of lung cancer because both of his parents smoked three packs a day and, therefore, there was no hope of his doing anything other than smoking three packs a day”; then another one says, “No, no, his parents smoked because his grandparents smoked— they lived in North Carolina where, back then, every- one smoked three packs a day, so the cause is where the grandparents lived,” and so on. How far back should you go? You can’t talk sensibly about the cause of an event; you can talk about “but for that cause (and there can be many ‘but for’s), what would have happened?” All these questions can be addressed hypothetically. But the cause? The notion is meaningless to me.
Don:我觉得谈论一个事件的原因,这个话题更像喝酒时的谈话,而不是一个科学的谈话,因为它会导致(进程)无限倒退。有人说,“他死于肺癌是因为一天抽3包烟”;有人反驳,“不,他死于肺癌是因为他的父母每天抽3包烟,因此,除了抽3包烟以外,他没希望了”;另一个人说,“不,他的父母抽烟是因为他的祖父母抽烟,他们生活在北卡,那里的每个人都是一天抽3包烟,所以原因是他的祖父母居住的地方”等等。你要追溯到多远?你不能说一个事情的原因;你可以说“除了那个原因(能有很多“除了”)”,可能会发生什么?所有这些问题都能用假设解决。但是说原因?这个概念对我来说没意义。
Fabri: Do you feel that you benefit from knowing about history of statistics when you are thinking about fundamentals of statistics?
Fabri: 你觉得当你思考统计学的基础问题时,了解统计学的发展历史对你有帮助吗?
Don: I know some history, but not a large amount. The most important lessons occur when I feel that I understand why one of the giants, like Fisher or Ney- man, got something wrong. When you understand why a mediocre thinker got something wrong, you learn lit- tle, but when you understand why a genius got some- thing wrong, you learn a tremendous amount.
Don:我知道一些历史,但是不多。最重要的课程发生在当我感觉到我理解了像Fisher或者Neyman这样的天才为什么会出错。当你理解了为什么一个平庸的思想家会出错,你学到的还是少,但是当你理解了为什么天才会出错,你就学到很多了。
# BACK TO HARVARD: PROPENSITY SCORE, MULTIPLE IMPUTATION AND MORE
# 回到哈佛:倾向得分,多重插补法及其他
Fabri: After those productive years at ETS, you spent some time at the EPA (US Environmental Protection Agency). Why did you decide to move, given that you were apparently doing very well at the ETS?
Fabri:您在ETS研究了很多成果之后,就前往EPA(美国环境保护组织)待了一段时间。那么既然您在ETS做的十分出色,为什么突然决定到EPA去?
Don: It started partly from my joking answer to the question, “How long have you been at ETS?” I answered, “Too long.” The problems that I had dealt with at ETS started to appear repetitive, and I felt that I had made important contributions to them including EM and multiple imputation ideas, which were being used to address some serious issues, like test equating, and formulating the right ways to collect data. So I wanted to try something else. At the time, David Rosenbaum was the head of the Office of Radiation Programs at the EPA. He had the grand idea of putting together a team of applied mathematicians and statisticians. Somehow he found my name and invited me to D.C. to find out whether I wanted to lead such a group. Basically, I had the freedom to hire several people of my choice, and I had a good government salary (at the level of “Senior Executive Service”). So I said, “Let’s see whom I can get.” I was able to convince both Rod Little (who was in England at that time) and Paul Rosenbaum (whom I advised while I was still at ETS), as well as Susan Hinkins, who wrote a thesis on missing data at Montana State University, and two others. That was shortly before the presidential election. Then the Democrats lost and Reagan was to come in, and everything seemed to be falling apart. All of a sudden, many of the people above my level at the EPA (most of whom were presidential appointments), had to prepare to turn in their resignations, and had to be concerned about their next positions.
Don:我之前对于问题“你在ETS工作了多长时间”的开玩笑的回答解释了我的部分想法。我当时回答说“很长的时间”。当时我在ETS处理的问题开始变得重复;而且我觉得我已经帮他们在EM和多重插补法方面做出了很重要的帮助,EM和多重插补法已经可以用来处理一些例如测验等值化,公式化收集数据等重要的问题,所以我想尝试一些其他的东西。当时David Rosenbaum是EPA中放射性项目研究办公室的主任。他有一个想法,想召集一些应用数学家和统计学家组建一支队伍。不知怎么的他找到了我,想看看我是否愿意领导这样一支队伍,并邀请我去D.C.。我当时基本上可以自主选择聘用一些人,而且政府给我拨发了不菲的薪水(待遇同高级管理人员)。所以我说,“我先看看我能找到谁。”我可以叫来Rod Little(他当时在英国)、Paul Rosenbaum(我在ETS时曾给他提供过建议)和Susan Hinkins(他在蒙大拿州立大学写了一篇有关缺失数据的论文) 以及另外两个人。那会很快就要进行总统选举了。之后民主党下台,共和党掌权,似乎所有的事情就都乱套了。突然很多在EPA比我的职位高的人(他们中大部分人都是总统任命的)开始准备递交他们的辞职信,也在担忧他们接下来会担任什么职务。
Fabri:So the EPA project ended before it even got started.
Fabri:所以EPA的项目还没开始就结束了。
Don: It didn’t start at all in some sense. I formally signed on at the beginning of December, and after one pay period, I turned in my resignation. But I felt responsible to find jobs for all these people I brought there. Eventually, Susan Hinkins got connected with Fritz Scheuren at the IRS; Paul Rosenbaum got a position at the University of Wisconsin at Madison; Rod got a job related to the Census. One nice thing about that short period of time is that, through the projects I was in charge of, I made several good connections, such as to Herman Chernoff and George Box. George and I really hit it off, primarily because of his insistence on statistics having connections to real problems, but also because of his wonderful sense of humor, which was witty and ribald, and his love of good spirits. In any case, the EPA position led to an invitation to visit Box at the Math Research Center at the University of Wisconsin, which I gladly accepted. That gave me the chance to finish writing the propensity score papers with Paul (Rosenbaum and Rubin, 1983a, 1983b, 1984a).
Don:在某种意义上这个项目就是没开始。我在十二月初正式签约,不过只在一个支付期之后,我就递交了自己的辞职信,但我觉得我有义务为我带来的这些人提供工作。最终,Susan Hinkins和IRS的Fritz Scheuren取得了联系;Paul Rosenbaum在威斯康辛大学麦迪逊分校得到了一个职位;Rod也得到了一份有关普查的工作。在这么短的工作时间内,有一件特别棒的事情是,我通过我管理的这个项目,和Herman Chernoff、George Box等人取得了很好的联系。George和我真的很搭,主要是因为他坚持认为统计应该和实际问题相联系,但也有一部分原因是由于他那粗犷而又很诙谐的幽默感,以及他的积极向上。在任何情况下,我都很同意将威斯康辛大学数学研究中心中EPA的位置留给Box。那也给了我一个机会来和Paul一起完成倾向评分(Rosenbaum and Rubin, 1983a, 1983b, 1984a)的论文。
Fan: Since you mentioned propensity score, arguably the most popular causal inference technique in a wide range of applied disciplines, can you give some insights on the “natural history” of propensity score?
Fan:既然你提到了倾向得分,这项可以说是在大量应用学科中最流行的因果推理技术,那么您能够把倾向评分的发展历程分享给大家吗?
Don: I first met Paul in 1978, when I came to Har-vard on a Guggenheim fellowship; he was a first-year Ph.D. student, extremely bright and devoted. Back in my Princeton days I did some consulting for a psychologist at Rutgers, June Reinisch, who later became the first director of the Kinsey Institute after Kinsey. She was very interested in studying the nature-nurture controversy——what makes men and women so different? She and her husband, who was also a psychologist, were doing experiments on rats and pigs. They injected hormones into the uteri of pregnant animals, and thereby exposed the fetuses to different prebirth environments; this kind of randomized experiment is obviously unethical to do with humans. One of the problems Paul and I were working on for this project, also as part of Paul’s thesis, was matching—matching background characteristics of exposed and unexposed. The covariates included a lot of continuous and discrete variables, some of which were rare events like certain serious diseases prior to, or during, early pregnancy. Soon it became clear that standard matching approaches, like Mahalanobis matching, do not work well in such high dimensional settings. You have to find some type of summaries of these variables and balance the summaries in the treatment and control groups, not individual to individual. Then we realized if you have an assignment mechanism, you can match on the individual assignment probabilities, which is essentially the Horvitz–Thompson idea, to eliminate all systematic bias. I don’t remember the exact details, but I think we first got the propensity score idea when working on a Duke data bank on coronary artery bypass surgery, but refined it for the Reinisch data, which is very similar in principle. Again, the idea of the propensity score is motivated by addressing real problems, but with generality.
Don:我在1978年,为古根海姆奖去往哈佛时,第一次遇见了Paul;他是个一年级博士生,非常聪明且勤奋。在回到普林斯顿的日子里,我为一名罗格斯大学的心理学家,June Reinisch,做了一些咨询,他后来成为了继Kinsey后的第一位Kinsey研究所的主任。她对于天性和教养问题的研究非常有兴趣——什么导致了男人与女人如此不同?她和同为心理学家的丈夫,在老鼠和猪身上做了一些实验。他们将荷尔蒙注射进了怀孕动物的子宫,从而使得暴露给胎儿的产前环境不同。这种随机性的试验显然在人类身上做是不道德的。我和Paul为这个项目工作时出现了一个问题,是曝光和未曝光的匹配背景特征的问题,同样这也是Paul论文中的一部分内容。这项数据的协方差包括很多离散变量和连续变量,这些变量中有一些是很少见的情况,比如说在早期妊娠期间或者之前出现的各种严重的疾病。但不久我们就发现标准匹配方法,比如说马氏匹配,很明显并不适用在高维的设定中。你需要找到某种变量的分类形式并且在治疗组和控制组中平衡这些集合,(变量在其中)并不是一个对应一个那样。然后我们意识到如果可以有一个分配机制,你就可以对个人的分配概率进行匹配来消除所有的系统误差,本质上是Horvitz–Thompson估计的思想。我记不太清具体的了,但我记得我们在利用杜克数据银行有关冠状动脉搭桥手术的数据时,第一次有了倾向得分的想法。这些数据后来改名为了Reinisch数据,但大致上还是类似的。所以由于实际问题,我们再次有了研究具有普遍性的倾向得分的想法。
Fan: Multiple Imputation (MI) is another very influential contribution of yours. Your book “Multiple Im-putation for Nonresponse in Sample Surveys” (Rubin, 1987a) has commonly been cited as the origin of MI. But my understanding is that you first developed the idea and coined the term much earlier.
Fan:多重插补法(MI)是您另一个有极高影响力的贡献的地方。你的书“Multiple Im-putation for Nonresponse in Sample Surveys”(Rubin, 1987a)通常被认为是MI的起源。但我觉得您第一次提出这个想法并且创造了这个术语要早得多。
Don: Correct, I first wrote about MI in an ASA proceedings paper in 1978 (Rubin, 1972, 1978b). That’s where “the 18+ years” comes from when I wrote “Multiple Imputation After 18+ Years” (Rubin, 1996).
Don:是的,我第一次写有关MI的文章是在1978年在一份ASA的记录文件中(Rubin, 1972, 1978b).“Multiple Imputation After 18+ Years” (Rubin, 1996)中“the 18+ years”这个词就是来自于那篇文章。
Fabri: MI has been developed in the context of missing data, but it applicability seems to be far beyond missing data.
Fabri:MI过去在缺失数据的背景下不断发展,但是它的适用性似乎远远超过缺失数据这部分。
Don: Yes, MI has been applied and will be, I think, all over the place. The reason I titled the book that way, “Multiple Imputation for Nonresponse in Sample Surveys,” is that it was obvious to me that in the settings where you need to create public-use data sets, you had to have a separation between the person who fixed up the missing data problem and the many people who might do analyses of the data. So there was an obvious need to do something like this, because users could not possibly have the collection of tools and resources to do the imputation, for example, using confidential information. My Ph.D. students, Trivellore Raghunathan(Raghu) and Jerry Reiter, have made wonderful contributions to confidentiality using MI. Of course, other great Ph.D. students of mine Nat Schenker, Kim Hung Lee, Xiao-Li Meng, Joe Schafer, as well as many others, have also made major contributions to MI. The development of MI really reflects the collective efforts from these people and others like Rod Little and his colleagues and students.
Don:是的,我认为MI现在已经并且将会应用于各个领域。我之所以把书名起为“Multiple Imputation for Nonresponse in Sample Surveys”,是因为很显然在你需要创造公共的数据集时,你需要将修复缺失数据问题的人和作分析的人们进行分离。所以我们很有必要做出一些类似(多重插补法)的东西,因为用户可能没有工具和资源进行收集来插补,比如使用一些机密信息时。我的博士生,Trivellore Raghunathan(Raghu)和Jerry Reiter对于利用MI处理机密信息做出了巨大的贡献。当然,我其他的很棒的博士生Nat Schenker, Kim Hung Lee, Xiao-Li Meng, Joe Schafer,以及其他的很多人,也对MI做出了很大的贡献。MI的发展的确是来自于这些人以及其他人如 Rod Little和他的同事学生们的集体努力。
Fabri: Rod Little once half-jokingly said, “Want to be highly cited? Coauthor a book with Rubin!” And indeed he wrote the book “Statistical Analysis with Miss-ing Data” with you (Little and Rubin, 1987, 2002), which is now regarded as the classic textbook on missing data. There have been a lot of new advances and changes in missing data since then. Will we see a new edition of the book that incorporates these developments sometime soon?
Fabri:Rod Little曾经有一次半开玩笑的说:“(你的作品)想要被高度引用吗?和Rubin合著一本书吧!”而且他的确和您合著了“Statistical Analysis with Miss-ing Data” (Little and Rubin, 1987, 2002),这本书现在被认为是缺失数据的经典教材。但从那时起缺失数据方面有了很大的提高和改变。我们之后会不会看到您这本书在整合了这些发展之后的新版本?
Don: Oh yes, we are working on that now. The main changes from 1987 to 2002 reflect the greater acceptability of Bayesian methods and MCMC type computations. Rod is a fabulous coauthor, a much more fluid writer than I am. I believe this third edition will have even more major changes than the 2002 one did from the 1987 one, but again many driven by computational advances.
Don:嗯是的,我们现在正在做。从1987到2002年的变化主要体现在贝叶斯方法和MCMC型计算更大的可接受性。Rod是位难以置信的合著者,一名比我厉害得多的作家。我相信第三版的主要变化要多于从1987年那版到2002年那版的变化,但这些改变同样也来自于计算的进步。
# ON BAYESIAN
# 关于贝叶斯
Fan: In the 1978 Annals paper (Rubin, 1978a), you gave, for the first time, a rigorous formulation of Bayesian inference for causal effects. But the Bayesian approach to causal inference did not have much following until very recently, and the field of causal inference is still largely frequentist. How do you view the role of Bayesian approach in causal inference?
Fan:在1978年的年报中(Rubin,1978a),您第一次给出了因果效应的贝叶斯推理的严格表述。但是针对因果推断的贝叶斯方法直到最近才有一些后续,而且因果推断领域依然注意是频率统计方面。您怎么看待因果推断的贝叶斯方法?
Don: I believe being Bayesian is the right way to approach things, because the basic frequentist approach, such as the Fisherian tests and Neyman’s unbiased estimates and confidence intervals, usually does not work in complicated problems with many nuisance un-knowns. So you have to go Bayesian to create procedures. You can go partially Bayesian using things like posterior predictive checks, where you put down a null that you may discover evidence against, or direct likelihood approaches as in Frumento et al. (2012); if the data are consistent with a null that is interesting, you live with it. But Neyman-style frequentist evaluations of Bayesian procedures are still relevant.
Don:我相信贝叶斯是正确的逼近事情的方法,因为基本的频率统计方法,比如说Fisherian测试以及Neyman的无偏估计和置信区间经常并不适用处理一些麻烦复杂的未知问题。所以你需要用贝叶斯来模拟过程。当你需要带上和数据一致的空值进行处理时,你可以利用贝叶斯的部分内容比如可能记下一个与事实的空的后测检验,或者(在Frumento等人 (2012)中的)似然法。但是传统风格的频率统计的贝叶斯方法仍然是有重大意义的。
Fan: But why is the field of causal inference still predominantly frequentist?
Fan:但是为什么因果推断的领域仍然主要是频率统计方面呢?
Don: I think there are several reasons. First, there are many Bayesian statisticians who are far more interested in MCMC algebra and algorithms, and do not get into the science. Second, I regard the method of moments (MOM) frequentist approach as pedagogically easier for motivating and revealing sources of information. Take the simple instrumental variable setting with one-sided noncompliance. Here, it is very easy to look at the simple MOM estimate to see where information comes from. With Bayesian methods, the answer is, in some sense, just there in front of you. But when you ask where the information comes from, you have to start with any value, and iterate using conditional expectations, or draws from the current joint distributions. You have to have far more sophisticated mathematical thinking to understand fully Bayesian ideas. There are these problems with missing data (as in my discussion of Efron, 1994) where there are unique, consistent estimates of some parameters using MOM, but for which the joint MLE is on the boundary. So I think it is often easier, pedagogically, to motivate simple estimators and simple procedures, and not try to be efficient when you convey ideas. In causal inference, that corresponds to talking about unbiased or nearly unbiased estimates of causal estimands, as in Rubin (1977). There are other reasons having to do with the current education of most statisticians.
Don:我认为主要有这几个原因。首先,有很多的贝叶斯统计学家对MCMC代数和算法要感兴趣得多,而且还没有科学的研究。第二,我认为矩量法(MOM)的频率方法都教学中能更简单的找到信息来源。拿简单的有片面违规设置的工具变量举个例子。我们能够通过简单的MOM估计来看出信息来自哪里。利用贝叶斯方法,答案就仿佛直接放在你面前一样。但当你询问信息从哪里来的时,你需要从任何值开始,重复利用条件期望,或者绘制当前的联合分布。你需要复杂的数学思维方式来完全理解贝叶斯的思想。有缺失数据的一些问题(如我在Efron,1994讨论中讨论的)要利用MOM对一些参数进行特别的一致性估计,但其联合极大似然估计是在边界上。所以我觉得在教学方面,当你传播思想时,应该鼓励使用简单的估计和过程,并且不要太追求效率。在因果推断中,那和讨论因果中无偏或者几乎无偏的估计是一致的,就是在Rubin(1977)中所说的那样。此外大多数统计学家在最近的教育中(这样做)还有一些其他的原因。
Fan: After EM, starting from the early 1980s, you were heavily involved in developing methods for Bayesian computing, including the Bayesian bootstrap (Rubin, 1981), the sampling importance-resampling (SIR) algorithm (Rubin, 1987b), and (lesser-acknowledged) “approximate Bayesian computation (ABC)” (Rubin, 1984, Section 3.1).
Fan:在研究出EM之后,从20世纪80年代初开始,您参与研究了贝叶斯计算方法,包括贝叶斯自举(Rubin, 1981),采样重要性重采样(SIR)算法(Rubin,1987b),和(很少人承认的)“近似贝叶斯计算(ABC)”(Rubin, 1984,3.1版)
Don: It was clear then that computers were going to allow Bayes to work far more broadly than earlier. You, as well as others such as Simon Tavare, Christian Robert and Jean-Michel Marin, are giving me credit for first proposing ABC. Thanks! Although, frankly, I never thought that would be a useful algorithm except in problems with simple sufficient statistics.
Don:现在很明显电脑能够让贝叶斯应用的范围比之前广很多。你和其他人比如Simon Tavare, Christian Robert and Jean-Michel Marin,对我提出的ABC给予了肯定。谢谢!但坦白来说,我从未想过在简单的充分统计的问题之外,它可以变成如此有用的算法。
Fabri: But you do not seem to have followed up much on these ideas later, even if you have used them. Also you do not label yourself as a Bayesian or a frequentist, even if all these papers made extraordinary contributions to Bayesian inference with fundamental and big ideas.
Fabri:但是您后来虽然使用了这些想法,但似乎并没有根据这些想法接着研究。同样即便所有这些论文都对贝叶斯推理中基本而又庞大的思想做出了非凡的贡献,您也没有把自己定位成贝叶斯统计学家或者一位频率统计学家。
Don: First of all, fundamentally I am hostile to all “religions.” I recently heard a talk by Raghu in Bamberg, Germany, where he said that in his world they have zillions of gods, and I think that is right; you should have zillions of gods, one for this good idea, one for that good idea. And different people can create different gods to whatever extent they want to. I am not a fully-pledged member of the Bayesian camp—I like being friends with them, but I never want to be religiously Bayesian. My attitude is that any complication that creates problems for one form of inference creates problems for all forms of inference, just in different ways. For example, the fact that confounded treatment assignments cause problems for frequentist inference is obvious. Does it generate problems for the Bayesian?Yeah, that point was made in the 1978 Annals paper: Randomization matters to a Bayesian, although not in the same way as to a frequentist, that is, not as the basis for inference, but it affects the likelihood function.
Don:首先,我基本上对所有的“信仰”都持反对意见。我最近听了一篇演讲,演讲人是德国班贝格的Raghu,他提到在他的世界中有无数个神,我觉得他是对的;首先需要有无数个神,这个神提供了这个好主意,那个神提供了那个好主意,而且不同的人可以创造不同的神来做到他们想要的程度。我不是完全同意贝叶斯学派的一员——我喜欢成为他们的朋友,但我不会变成完全信仰贝叶斯的人。我的态度是任何有可能为某一种推断形式造成麻烦的复杂情况同样也可以对所有的推断形式造成麻烦,只不过是不同的方式。比如,讨厌的分配疗法显然会使频率论推断出现问题。那么贝叶斯会出现问题吗?是的,这个问题在1978年的年报汇总有所提及:贝叶斯中的随机问题。尽管不是像影响到了基本推断那样导致频率统计学派不出现问题,但它也影响到了近似的作用。
There is something I am currently working on with a Ph.D. student, Viviana Garcia, that builds on a paper I wrote with Paul Rosenbaum in 1984 (Rosenbaum and Rubin, 1984b), which is the only Bayesian paper that Paul has ever written, at least with me. In that paper, we did some simulations to show there is an effect on Bayesian inference of the stopping rule. We show that if you have a stopping rule and use the “wrong” prior to do the analysis, like a uniform improper prior, but the data are coming from a “correct” prior, and you look at the answer you get from the right prior and from the “wrong” prior, they are different. The portion of the right posterior that you cover using the “wrong” posterior is incorrect. This extends to all situations and it is related to all of these ignorability theorems, and it means that you need to have the right model with respect to the right measure. Of course achieving this is impossible in practice and, therefore, leads to the need for frequentist (Neymanian) evaluations of the operating characteristics of Bayesian procedures when using incorrect models (Rubin, 1984). Bayes works, in principle, there is no doubt, but it can be so hard! It can work, in practice, but you must have some other principles floating around somewhere to evaluate the consequences—how wrong your conclusions can be. So you must have something to fall back on, and I think that is where these frequentist evaluations are extremely useful, not the unconditional Neyman–Pearson frequentist evaluations for all point mass priors (which were critical as mathematical demonstrations that we cannot achieve the ideal goal in any generality), but evaluations for the class of problems that you are dealing with in your situation.
我最近正在和我的博士生Viviana Garcia基于我和Paul Rosenbaum在1984年写的文章(Rosenbaum and Rubin, 1984b)继续进行研究,那可能是Paul唯一的贝叶斯文章,至少是和我一起写的唯一的。在那篇文章中,我们做了一些模拟来展示对终止规则的贝叶斯推断有一些影响。我们发现如果你有终止规则并优先使用“错误”来进行分析,就像一致先验那样数据是从“正确”的那面得来的,那么你会发现你从正确那面和错误那面得到的答案是不一样的。你用“错误”的那部分来覆盖正确的部分是不对的。这可以延伸到所有的情况,而且也与所有这些理论有关。这也意味着你需要有针对正确措施的正确模型。当然,这是在实践中不可能实现的,因此,当我们在贝叶斯方法中使用模型不当时,也导致了对频率(neymanian)估计的操作特性的需要(Rubin,1984)。毫无疑问贝叶斯的方法在原理上行得通,但是它的难度非常大!它在实践中可以使用,但是你需要一些其他的原则作为辅助来估计你的结论有多大的偏差。所以你必须有所依靠,我认为这就是频率学的估计方法非常有用的地方,不是无条件的Neyman–Pearson概率估计的先验知识(这是关键的我们无法实现的数学证明),而是在这种情况下估计你所要处理的问题的分类。
Fan: The 1984 Annals paper “Bayesianly Justifiable and Relevant Frequency Calculations for the Ap-plied Statistician” (Rubin, 1984) is one of my all-time favorite papers. This paper, as the earlier paper by George Box (Box, 1980), deals with the “calibrated Bayes” paradigm with generality, which can be viewed as a compromising or midground between the Bayesian and frequentist paradigms. It has a profound influence on many of us. In particular, Rod Little has strongly advocated “calibrated Bayes” as the 21st cen-tury roadmap of statistics in several of his prominent talks, including the 2005 ASA President’s Invited Address and the 2012 Fisher Lecture. What was the background and reasons for you to write that paper?
Fan:1984年年报上的“Bayesianly Justifiable and Relevant Frequency Calculations for the Ap-plied Statistician” (Rubin, 1984)是我一直最喜爱的文章之一。这篇文章,和早期George Box所写的文章(Box, 1980),都处理了具有一般性的“校准贝叶斯”范式,可以被认为是妥协或者折中的贝叶斯和频率论范式。它对我们许多人都产生了深刻的影响。尤其是是,在包括2005 ASA总统邀请函中和2012年的 Fisher讲座的几个重要会谈中,Rod Little都大力提倡“校准贝叶斯”作为第二十一世纪的路线图。您当时为什么写那篇论文,背景是什么样的?
Don: Interesting question. I was visiting Box at the Mathematics Research Center in 1981–1982 and wrote Rubin (1983) partly during that period—I think it’s a good paper with some good ideas, but without a satisfying big picture. That dissatisfaction led to that 1984 paper—what is the big picture? It took me a very long time to “get it right,” but it all seems very obvious to me now. The idea of posterior predictive checks has been further articulated and advanced in Meng (1994), Gelman, Meng and Stern (1996), and the multiauthored book “Bayesian Data Analysis” (Gelman et al., 1995, 2003, 2014).
Don:非常有趣的问题。我1981至1982年去拜访了在数学研究中心任职的Box,并且在那个时期写了部分的文章(Rubin,1983)——我认为这是一篇有着好的想法的好论文,但并不能让我满意。当时的不满意造就了1984年的那篇论文——什么是我想要的情况?我花了很长时间来构建,但现在对我来说已经很明显了。后测检验的想法在Meng (1994), Gelman, Meng and Stern (1996)的帮助下,在合著的书籍 《Bayesian Data Analysis》 (Gelman等人, 1995, 2003, 2014)中有了进一步的明确和提高。
Fabri: Can you talk a little more about the “Bayesian Data Analysis” book, probably one of the most popular Bayesian textbooks?
Fabri:您能不能针对《Bayesian Data Analysis》说得再多一些?这本书可能是最受欢迎的贝叶斯教材之一。
Don: Yup, I think that the Gelman et al. book might be the most popular Bayesian text. It started out as notes by John Carlin for a Bayesian course that he taught when I was Chair sometime in the mid or late 1980s. Andy must have been a Ph.D. student at that time, with tremendous energy for scholarship. John was heading back to Australia, which is his homeland, and somehow the department had some extra teaching money, and we wanted to keep John around for a year—I do not remember the details. But I do remem-ber that the idea of turning the notes for the course into a full text was percolating. Also Hal Stern was an Associate Professor with us at that time, and so the four of us decided to make it happen. We basically divided up chapters and started writing. Even though John’s initial notes were the starting basis, things changed as soon as Andy “took charge.” Quickly, Andy and Hal were the most active. Andy, with Hal, were even more dominant in the second edition, where I added some parts, edited others, but clearly this was Andy’s show. The third edition, which just came out in early 2014, was even more extreme, with Andy adding two coauthors (David Dun-son and Aki Vehtari) because he liked their work, and they had been responsive to Andy’s requests. As the old man of the group, I just requested that I be the last author; Andy obviously was the first author, and the second and third were as in the first edition. In some ways, I feel like I’m an associate editor of a journal that has Andy as the editor! We get along fine, and clearly it’s a successful book.
Don:好,我认为Gelman等人编的这本书可能的确是最受欢迎的贝叶斯教材。它起初是John Carlin在1980年代中期或后期为准备他教的一门贝叶斯课程而做的笔记,当时我还是主任。Andy 当时应该还是一个获得全额奖学金的博士生。我记不太清了,好像是John要回到他的家乡澳大利亚,而且不知怎么的部门有一些额外的教育资金,所以我们想让John再待一年。但我还记得我们慢慢想到为课程准备的笔记转换为一本教材。同样当时Hal Stern是我们那的副教授。所以我们四个决定让想法成为现实。我们大体分了章节,然后开始编写。尽管最初是基于John的笔记来写,但逐渐就变成了Andy在负责。很快,Andy和Hal成为了最活跃的。在第二版中,Andy和Hal更加活跃,我当时加了一些内容,尽管编辑的人增加了,但是主要的还是Andy的功劳。2014年初出的第三版甚至更极端,Andy又加了两名合著者(David Dun-son和Aki Vehtari),因为Andy喜欢他们的著作,而且他们同意了Andy的请求。作为这个团体中的“老人”,我请求把自己当成是最后的作者,Andy显然会是第一作者,然后第二第三作者和第一版的是相同的。在某些方面,我觉得我是一个杂志的副主编,而安迪则是编辑!我们相处的很好,而且显然这是一本很成功的书籍。
Fan: A revolutionary development in statistics since the early 90s was the MCMC methodology. You left your mark in this with Gelman, proposing the Gelman–Rubin statistic for convergence check (Gelman and Rubin, 1992), which seems to be very much connected to some of your previous work.
Fan:自90年代初,统计学界有一场革命性的发展就是MCMC方法。您和Gelman一起留下了自己的一笔,提出了Gelman-Rubin统计收敛检查(Gelman and Rubin, 1992),这个成果似乎和您之前的某些成果有很多的联系。
Don: Correct. We embedded the convergence check problem into the combination of the multiple imputation and multiple chains frameworks, using the idea of the combining rules for MI. The idea of using multiple chains—that comes from physics—and was Andy’s knowledge, not mine. My contribution was to suggest using modified MI combining rules to help do the assessment of convergence. The idea is powerful because it is so simple. If the starting value does not matter, which is the whole point, then it doesn’t matter, period. The real issue should be how you choose the functions of the estimands that you are assessing, and as always, you want convergence to asymptotic normality to be good for these functions, so that the simple justification for the Gelman–Rubin statistic is roughly accurate.
Don:是的。我们利用MI的组合规则的想法,将收敛检查问题嵌入多重插补和多重链框架之中。但利用多重链这个来自于物理学的想法是Andy提出的,不是我。我的贡献只是建议使用改进的MI组合规则来帮助做评估收敛。这个想法非常好,因为它是如此简单。如果整点起始值不影响过程,那么就没关系。真正的问题应该是如何选择你评估的作用,和往常一样,你想要收敛到渐近正态来有利于使用这些作用,所以对Gelman–Rubin 统计进行简单的调整是需要很准确的。
# THE 1990S: COLLABORATING WITH ECONOMISTS
# 1990年代:与经济学家合作
Fabri: In the 1990s, you started to work with economists. With Joshua Angrist, and particularly with Guido Imbens, you wrote a series of very influential papers, connecting the potential outcomes framework to causal inference with instrumental variables. Can you tell us how this collaboration started?
Fabri:在1990年代,您开始和经济学家合作。您和Joshua Angrist,尤其是和Guido Imbens,写了一系列很有影响力的论文,将潜在结果框架与与工具变量因果推论联系了起来。您能告诉我们这次合作是怎么开始的吗?
Don: Absolutely. I always liked economics; many economists are great characters! It was in the early 90s when Guido came to my office as a junior faculty member in the Harvard Economics Department and basi-cally said, “I think I have something that may interest you.” I had never met him before, and he was asking if the concept of instrumental variables already had a history in statistics. Guido and Josh Angrist had already defined the LATE (local average treatment effect) in an Econometrica paper (Imbens and Angrist, 1994)—although I think CACE (Complier Average Causal Ef-fect) is a much better name because it is more descrip-tive and more precise—local can be local for anything, local for Boston, local for females, etc. Then I asked in return, “Well tell me the setup, I have never heard of it in statistics before” and while he was explaining I started thinking, “Gosh, there is something important here! I have never seen it before,” and then I said, “Let’s meet tomorrow and talk about it more,” because these kinds of assumptions (monotonicity and the “exclusion restriction”) were fascinating to me, and it was clear that there was something there that I had never really thought hard about; it was great. That eventually led to the instrument variables paper (Angrist, Imbens and Rubin, 1996) and the later Bayesian paper (Imbens and Rubin, 1997).
Don:当然可以。我一直都很喜欢经济;许多经济学家都非常好!在90年代早期,Guido作为哈佛经济学系初级教员,来到了我的办公室,说:“我认为我有些东西你会感兴趣。”我之前从未见过他,他还问我工具变量的概念是不是在统计学中已经有一段时间了。Guido和Josh Angrist当时已经在计量经济学论文中定义了”局部平均处理效应”(LATE,local average treatment effect)(Imbens and Angrist, 1994),尽管我认为“编译器的平均因果关系” (CACE,Complier Average Causal Effect)是一个更好的名字,因为它更多更精确的描述本地的什么,比如当地的Boston,当地的女性等等。然后我反问他:“好,那么请告诉我是什么样的吧,我之前从未在统计中听说过”。当他解释的时候,我开始想:“天啊!这些东西一定很重要,我之前从未见过。”然后我说“我们明天见面详谈吧。”因为这些类型的假设(单调性和“排除限制”)吸引着我,而且很明显是我之前没想过没听过的,这些都很棒。这最终导致了有关工具变量的论文(Angrist, Imbens and Rubin, 1996)和之后的贝叶斯论文 (Imbens and Rubin, 1997)。
A closely related development was a project I was consulting on for AMGEN at about the same time, for a product for the treatment of ALS (amyotrophic lateral sclerosis), or Lou Gehrig’s disease, which is a progressive neuromuscular disease that eventually destroys motor neurons, and death follows. The new product was to be compared to the control treatment where the primary outcome was quality of life (QOL) two years post-randomization, as measured by “forced vital capacity” (FVC), essentially, how big a balloon you can blow up. In fact, many people do not reach the endpoint of two-year post-randomization survival, and so two-year QOL is “truncated” or “censored” by death. People were trying to fit this problem into a “missing data” framework, but I realized right away that it was something different.
当时同一时间还有一个我为AMGEN提供咨询的项目,这个项目和(刚才所说的)研究密切相关。我是为了研究用于治疗ALS(肌萎缩性侧索硬化症),或Lou Gehrig氏病的产品,Lou Gehrig氏病是一种渐进的神经肌肉疾病,会破坏运动神经元,最终死亡。新产品是进行对照处理,主要的结果是检测生活质量(QQL),根据用力肺活量(FVC),即你能够一口气吹起多大的气球,来进行两年后的随机分组。事实上,很多人都到不了两年后随机分组时就死了,所以两年期的生活质量检测是被死亡“截断”了的。人们试图通过缺失数据的框架来修补这个问题,但我马上意识到这是和之前不同的东西。
Fan: Essentially both ideas are special cases of the general idea of Principal Stratification, which we can discuss in a moment.
Fan:这两个主意都是有关主分层想法的特殊案例,我们可以讨论一下。
Don: Yes, indeed. These meetings with Guido and this way of thinking were so much more articulated and close to the thinking of European economists in the 30s and 40s, like Tinbergen and Haavelmo, than many subsequent economists who seemed sometimes to be too into their OLS algebra in some sense. There was some correspondence between one of the two—Haavelmo, I think—and Neyman on these hypothetical experiments on supply and demand. European brains were talking to each other, and not simply exchanging technical mathematics!
Don:是的,的确是这样。我和Guido的会面以及这种思维方式都很明确和接近在30年代和40年代欧洲经济学家的思维,比如说Tinbergen和Haavelmo,而不像之后很多在某种意义上太投入于他们的普通最小二乘法的经济学家。他们两中的一个——应该是Haavelmo——和Neyman在这些有关供应和需求的假设试验上有很多的来往联系。欧洲人的确是在相互进行交谈,而不是简单地交流应用数学的内容!
Fabri: I know that many years before you met Guido, with other statisticians, like Tukey, you had discussions about the way economists were treating selection problems, or missing data problems. But you had some adventurous, to say the least, previous experiences with economists dealing with problems that you had worked on, which they had almost neglected completely.
Fabric:我知道在你遇见Guido之前很多年,你和其他的统计学家,比如Tukey,有一些有关经济学家如何处理选择问题以及缺失数据问题的讨论。但是你有些冒险,之前你一直致力于解决经济学家的问题,他们至少可以说是几乎完全忽视。
Don: Yes, James Heckman was tracking my work in the early 1980s when I came to Chicago after ETS. The public exchange came out in the ETS volume edited by Howard Wainer (which is where Glynn, Laird and Rubin, 1986, appears), with comments from Heckman, Tukey, Hartigan and others.
Don:是的,在1980年代初期我离开ETS到芝加哥后,James Heckman就开始继承我的工作。书里显著的变化就是ETS卷由Howard Wainer主编出版(里面有Glynn, Laird and Rubin,1986的文章),以及有来自Heckman,Tukey,Hartigan以及其他人的评论。
Fabri: Economics is a field where the idea of causality is crucial; did you find interest in economics also for this very reason? The problems they have are usually very interesting.
Fabri:经济学这个领域的因果理念非常的关键,您也是因为这个原因对经济学感兴趣的吗?经济中出现的的问题通常都很有意思。
Don: There are often interesting questions from social science students that come up in class. One recent example is how do we answer questions like “What would the Americas be like if they were not settled by Europeans?” I asked the questioner, “Who would they be settled by instead? By the Chinese? By the Africans? What are you talking about? What are we comparing the current American world to?” Another example comes from an undergraduate thesis that I directed, by Alice Xiang, which won both the Hoopes Prize and the economics’ Harris Prize for an outstanding honors thesis. The thesis is on the causal effect of racial affirmative action in law school admissions on some outcomes versus the same proportion of affirmative action admissions but counter-factually based on socioeconomic status. This is not just for cocktail conversation—it was a case recently before the US Supreme Court, Fisher v. University of Texas, which was kicked back to the lower court to reconsider, and additionally the issue was recently affected by a state law in Michigan. There is an amicus brief sent to the US Supreme Court to which Guido (Imbens), former Ph.D. students, Dan Ho, Jim Greiner and I (with others) contributed.
Don:班上社会科学的学生经常会提出一些非常有意思的问题。最近有一个例子是我们应该如何回答类似“如果不是欧洲人定居,那么美国会变成什么样”的问题。我问那个问问题的人:“谁会替代欧洲人定居在美国?中国人?非洲人?你想要说什么?我们拿现在的美国和谁去比较?”另一个例子是由Alice Xiang提出的,他通过本科的毕业论文赢得了胡普斯奖和经济学奖。那篇文章是关于法学院中基于种族活动通过的比例与基于社会经济地位活动通过的比例所进行的比较。这不是简单的一个故事,而是一个事件,最近在美国最高法院中有一个德克萨斯大学的案件,这一案件被驳回了下级法院重新考虑,另外这一案件最近也受到密歇根州法律的影响。Guido(Imbens)以前的博士生,Dan Ho,Jim Greiner和我(和其他人)写了一个法庭简短递送到了美国最高法院。
Such careful formulation of questions is something critical, and to me is central to the field of statistics. It is crucial to formulate clearly your causal question. What is the alternative intervention you are considering, when you talk about the causal effect of affirmative action on graduation rates or barpassage rates? Immediately formulating the problem as an OLS regression is the wrong way to do this,at least to me.
这种需要仔细制定公式的问题是非常关键的,对我来说是统计学的范畴。清楚地阐述你的因果关系是非常重要的。当你讨论到活动通过率和毕业率或者禁止通行率之间的因果关系时,你正在考虑替代的是什么?至少对我来说,立刻将其公式化转为OLS进行解决是错误的。
Fan: You apparently have a long interest in law; besides the aforementioned “affirmative action” thesis, you have done some interesting work in applied statistics in law.
Fan:您看来对于法律一直都很有兴趣;除了上述“通过率行动”的论文,您还做过什么有趣的工作,将统计应用于法律中?
Don: Yes. Paul Rosenbaum was, I think, the first of my Harvard students who did something about statistics in law. Either his qualifying paper or a class paper in 1978 was on the effect of the death penalty. Jim Greiner, another great Ph.D. student of mine, who had a law degree before entering Harvard Statistics, wrote his Ph.D. thesis (and subsequently several important papers) on potential outcomes and causal effects of immutable characteristics. He is now a full professor at the Harvard Law School. There were also several previous undergraduate students of mine who were inter-ested in statistics and law, but (sadly) most went to law school. Since 1980, I have been involved in many legal topics.
Don:是的,还有。我认为Paul Rosenbaum是我在哈佛中第一个把统计应用于法律中的学生。无论是他1978年的学年论文还是课堂论文,都是有关死刑的影响。Jim Greiner,我另一个很棒的博士生,在进入哈佛统计系前有一个法学学位,他的博士论文(以及很多重要的论文)都是有关不可变性的潜在结果和因果效应。他现在是哈佛法学院的全职教授。我同样之前有一些本科生对统计学和法学都很感兴趣,但(遗憾的是)大部分都去了法学院。从1980年起,我就参与了很多的法律话题。
# THE NEW MILLENNIUM: PRINCIPAL STRATIFICATION
# 新千年:主分层
Fabri: The work you did with Guido, as well as the work on censoring due to death, led to your paper on Principal Stratification (Frangakis and Rubin, 2002), coauthored with this brilliant student of yours, Con-stantine Frangakis, who happens to be Fan’s advisor.
Fabri:您和Guido所做的工作,以及由于死亡而进行的普查工作,促使您和您极棒的学生,Constantine Frangakis一起发表了有关主分层的论文(Frangakis and Rubin, 2002),他碰巧也是Fan的顾问。
Don: Yes, Constantine is fabulous, but the original title of that paper was very long, same with the title of his thesis. It went on and on, with probably a few Latin,a few Italian, a few French and a few Greek words!Of course I was exasperated, so I convinced him to simplify the paper’s title to “Principal Stratification in Causal Inference.” He is brilliant, so good that he has no trouble dealing with all the complexity in his own mind, but therefore he struggles at times pulling out the kernels of all these ideas, making them simple.
Don:是的,Constantine非常好,但是最起初的论文题目非常长,和他的论文题目一样。它不断变长,有一些拉丁字母,一些意大利文,一些法文和一些希腊字母!当然我很恼火,所以我说服了他来简化文章的标题,变成了“Principal Stratification in Causal Inference”。他很聪明,所以它对于处理脑中的复杂事情没有问题,但他也尝试着把所有这些想法的精华拿出来,让他们变得简单。
Fan: What do you think is the most remarkable thing about the development of Principal Stratification?
Fan:您认为在主分层的发展过程中哪件事情是最值得铭记的?
Don: It is a whole new collection of ways of thinking about what the real information is in causal problems. Once you understand what the real information is, you can start thinking about how you can get the answers to questions that you want to extract from that information; you always have to make assumptions, and it forces you to explicate what these assumptions are, not in terms of OLS, which no social scientist or doctor would really understand—but in terms of scientific or medical entities. And because you have to make assumptions, be honest and state them clearly. For example, I like your papers (Mealli and Pacini, 2013; Mattei, Li and Mealli, 2013) about multiple post-randomization outcomes, where you discuss that for some outcomes, exclusion restriction or other struc-tural assumptions may be more plausible.
Don:这是一种全新的收集因果问题中真正有用的信息的方式。一旦你理解了什么是真正有用的信息,你可以开始考虑如何从那些信息中提取出问题的答案。你需要作假设,这会迫使你需要说明这些假设是什么,不是通过OLS那种没有社会科学家或者博士能够理解的角度,而是从科学或者医学的角度。而且由于你需要作出假设,所以要坦诚清楚地陈述出来。举个例子,我喜欢你有关多随机化结果的论文(Mealli and Pacini, 2013; Mattei, Li and Mealli, 2013),你在论文中提到了对于一些结果,排除限制或其他结构的假设可能是更合理的。
Fabri: Principal Stratification is sometimes compared to other tools for doing so-called mediation analysis—what is your view about inferring on mediation effects?
Fabri:主分层有时会和为了所谓的中介分析的其他工具相比较,您是怎么看待它在中介作用上的影响的?
Don: I think we (Don and Fabri) discussed a paper recently in JRSS-A, and those discussions summarize my–our view on that. Essentially, some of the people writing about mediation seem to misunderstand what a function is. They write down something that has two arguments inside parenthesis, with a comma separating them, and they seem to think that therefore something is well defined!
Don:我觉得咱们(Don和Fabri)是在讨论最近JRSS—A上的一篇文章,而且这些讨论结果总结了我们的观点。从本质上讲,一些些有关中介作用文章的人似乎误解了它的作用是什么。他们写了一些东西,括号里面有两个参数,用逗号分隔,然后他们就觉得这个就已经定义很清楚了。
Fan: Even though causal inference has gained increasing attention in statistics and beyond, there seems to be a lot of misunderstanding, misuse, misinterpretation and mystifying of causal inference. Why? And what needs to be done to change?
Fan:尽管因果推断在统计界受到越来越多的关注,现在似乎仍然有很多的误解,误用,将其神秘化,为什么?我们需要如何改变?
Don: I think it is partly because causal inference is a very different topic from many topics in statis-tics in that it does not demand a lot of technical advanced mathematical knowledge, but does demand a lot of conceptual and basic mathematical sophistication. Principal Stratification is one such example. Writing down notation does not take the place of understanding what the notation means and how to prove things mathematically. Also partly because causal inference has become a popular topic, it has been flooded with publications that are often done casually. For some fields, it is important to bridge the “old” (everything-based-on-OLS) thinking with the newer ideas. That’s a battle Guido and I constantly had to deal with when writing our book (Imbens and Rubin, 2015).
Don:我认为这是因为因果推理是一个在统计学中与其它很不同的话题,而且它也不需要很多技巧性很高等的数学知识,但需要大量的概念和基本数学的较高程度。主分层就是这样的一个例子。记下符号并不能代替理解符号的意思以及知道如何用数学来证明事物。另外还有一部分原因是因果推断已经成为一个热门的话题,它在出版物中已经时不时被提到了。在某些领域,它是联系老方法(基于OLS的一切事物)和新想法的桥梁。那是一场我和Guido在写书(Imbens and Rubin, 2015)时需要打赢的战役。
Fan: You mentioned the book; when will it finally come out? It has been forthcoming for the last ten years or so.
Fan:您提到了那本书,那本书什么时候能最终发行?它在过去的十年左右就开始准备出版了。
Don: (Laughing) Come on, Fan, that’s not fair! Has it only been ten years? We have promised the publisher (Cambridge University Press) that it will be ready by September 30, 2013. It will be about 500 pages, 25 chapters. It will be followed by another volume, dealing with topics that we could not get to in the volume due to length, such as principal stratification beyond IV settings, or because we believe the topics have not been sharply and cleanly formulated yet, such as regression discontinuity designs, or using propensity scores with multiple treatments. Also in this volume, we didn’t discuss so-called case–control studies, which are the meat of much of epidemiology; it is very important to embed these studies into a framework that makes sense, not just teach them as a bag of tricks.
Don:(大笑)嘿,Fan,那不公平!到现在只有十年吗?我们许诺给出版商(剑桥大学出版社)在2013年9月30号它就可以完成。它会有500页,25章。这将是另一卷来面对我们由于长度而无法处理的话题,比如说在四维智商的主分层,或者是一些因为相信这个话题还没有直截了当地用公式表示出来的话题,比如回归不连续的设计,或者将倾向得分和多重处理相结合。同样在这一本中,我们不会考虑所谓的案例——控制研究,那是许多流行病学的菜。将这些研究嵌入到一个有意义的框架中是非常重要的,而不仅仅是只把它们作为一个技巧包。
# MENTORING, CONSULTING AND EDITORSHIP
# 指导,咨询和编辑
Fabri: You have advised over 50 Ph.D. students and many BA students as well. This sounds like a job inter-view, but what is your teaching philosophy?
Fabri:您曾经给超过50名博士生以及许多本科生提过建议。这听上去像求职面试,那您的教学理念是什么?
Don: My view is that one should approach teaching very differently depending on the kind of students you have and their goals. Harvard has tremendous under-graduate and graduate students, but their strengths vary and their objectives vary. A long time ago I decided that I don’t have the desire or ability to be an entertainer in class, that is, to entertain to get their attention. If they find me entertaining, fine; but it is better if they find the topic I am presenting entertaining.
Don:我的观点是,一个人应该以不同的方式对待教学,这取决于你有什么样的学生和他们的目标。哈佛大学毕业生和研究生都很庞大,但他们的优势各不相同,他们的目标也各不相同。很久以前我就发现我没有欲望或能力成为一个在课堂上活跃气氛,得到他们注意的人。如果他们觉得我很有意思,这很好;但我觉得如果他们觉得我正在讲述的话题很有意思,那就更好了。
Fabri: Many of your students went on to become leaders and not only in academia. And you often say that the thing that you are the most proud of is your students. Though it is clearly impossible to talk about them here one by one, can you share some of your fond memories of the students?
Fabri:您的很多学生成为了各行各业的领袖。您经常说您最自豪的就是您的学生。虽然在这里一个接一个地谈论他们显然是不可能的,但您能否分享一些您和学生的美好回忆吗?
Don: Fabri, that is a killer question unless we have another day for this. What I can say is that it has been a great pleasure to supervise so many very talented students. I could start listing my superb Ph.D. students at the University of Chicago and at Harvard. All of my Ph.D. students are talented in many, and sometimes different, dimensions: among them there are two COPSS award winners, one president of the ASA, one president of ENAR, two JSM program chairs, and other such honors, and many of them made substantial contributions to government, academia and industry.
Don:Fabri,这可是个很能打发时光的话题啊,我们可以改天再聊。我能说的是,能教授很多很有天赋的学生的确是一大乐趣。我可以开始列出我在芝加哥大学和哈佛的优秀博士生。我所有的博士生在各行各业都非常出彩:在这其中有两位COPSS奖的获奖者,有一位ASA的主席,有一位ENAR的主席,两位JSM的项目主任以及其他很多。他们很多人都为政府、学术界。工业做出了不可替代的贡献。
Fan: You also have advised a large number of undergraduate students on a wide range of topics. This is quite uncommon because some people find mentoring undergraduates more challenging and less rewarding than mentoring graduate students. What is your take on this?
Fan:您还建议了许多本科生广泛研究主题。这是相当罕见的,因为一些人认为本科生导师比研究生导师更有挑战性而更少回报。您对此有什么看法?
Don: I am not completely innocent on this charge. I have no interest in “babysitting” and trying to mo-tivate unmotivated students, either undergraduate or graduate. But Harvard does attract some extremely tal-ented and motivated undergraduates, some of whom I had the pleasure to advise. Five have won Hoopes and other prizes for outstanding undergraduate theses.
Don:对于这种想法我也有一些责任,我对于“保姆”式的教学以及激励那些自身没有动力的人没有兴趣,无论是本科生还是研究生。但是哈佛确实吸引了一些非常有天赋而又努力的本科生,我很想为他们提供建议。他们中有五个由于优秀本科毕业论文而获得了胡普斯奖。
Fabri: Now let’s talk about writing, which both Fan and I, as many others, have some quite memorable first-hand experience. You are known as a perfectionist in writing. As you mentioned, you are willing to with-draw accepted papers if you are not a hundred percent satisfied with them.
Fabri:现在我们来讨论一些写作,Fan和我,和其他很多人一样,有一些非常难忘的第一手经验。您是一位出色的作家,正如您所提到的那样,只要您对论文不是百分百愿意,就算别人接受了,您也希望撤回自己的论文。
Don: Yes, as you guys know, I am generally a pain in the neck as a coauthor. I have withdrawn three accepted papers, and tried to improve them; all eventually got reaccepted. One of these is the paper with you guys and others on multiple imputation for the CDC Anthrax vaccine trial (Li et al., 2014). You were not too happy about it initially.
Don:是的,正如你们所知道的那样,我通常有点讨厌合著。我当时拿回了三篇已经通过的文章,然后尝试再修改它们;最终所有的文章都再次通过了。其中之一就是与你和其他人对CDC的炭疽疫苗试验的多重插补法(李等人,2014)的论文。你最初不是很高兴。
Fabri: (Laughing) Yeah, we tried to revolt without success. A different question: How do you approach rejections? Do you have some advice for young statis-ticians on that?
Fabri:(大笑)是的,我们试图反抗却没有成功。一个很特别的问题:您如何处理拒绝的情况呢?您能不能给年轻的统计学家们提供一些建议?
Don: Over the years I had many papers immediately rejected or rejected with the suggestion that it would not be wise to resubmit. However, in almost all of these cases, this treatment led to markedly improved publications, somewhere. In fact, I think that the drafts that have been repeatedly rejected possibly represent my best contributions. Certainly, the repeated rejections, combined with my trying to address various comments, led to better exposition and sometimes better problem formulation, too. The most important idea is: Do not think that people who are critics are hostile. In the vast majority of cases, editors and reviewers are giving up their time to try to help authors, and, I believe, are often especially generous and helpful to younger or inexperienced authors. Do not read into rejection letters personal attacks, which are extremely rare. So my advice is: Quality trumps quantity, and stick with good ideas even when you have to do polite battle with editors and reviewers—they are not perfect judges, but they are, almost uniformly, on your side. More details of these are given in Rubin (2014b).
Don:多年来我有许多文章被立刻拒绝或者被拒绝伴随着“提交是不明智的”建议。然而,几乎在所有这些情况下,这种对待方式引领着国家不断提高。事实上,我认为那些一再被否决的草稿可能代表了我最佳的贡献。当然,一再拒绝,加上我试图解决的各种评论,可能会产生更好的阐述以及更好的解决办法。最重要的一点是:不要认为批评你的人是有敌意的。在绝大多数情况下,我相信,编辑和审稿人都会放弃自己的时间来帮助作者而且往往会在面对年轻的或经验不足的作者时更不吝啬自己的帮助。别把拒绝理解为个人攻击,这是极为少见的。所以我的建议是:质量胜于数量,坚持自己好的想法,即使你需要不断地和编辑与审稿人进行礼貌的战斗,他们不是完美的评判人,但他们几乎都是站在你这边的。更多的细节可以在Rubin(2014b)中找到。
Fan: In 1978, you became the Coordinating and Applications Editor of JASA. Is there anything particularly unique about your editorship?
Fan:在1978年,您成为了JASA的协调与应用编辑。在您作为编辑的日子里,有什么独特的事情吗?
Don: As author, I am willing to withdraw accepted papers. As a new editor, at least then, I was also willing to suggest to authors that they withdraw papers ac-cepted by the previous editors! I took some heat for that at the beginning. I read through all the papers that the previous editorial board had accepted and were awaiting copyediting for publication; for the ones that I thought were bad (I remember there were about eight), I wrote, “Dear authors, I think you should consider withdrawing this paper,” with long explanations of why I thought it would be an embarrassment to them if the paper were published. Fabri knows that I can be brutally frank about such suggestions.
Don:作为作者,我愿意撤回那些之前已经接受的文章。作为一名新编辑,至少在那会,我还是愿意建议那些作者撤回那些之前编辑接受了的文章的。我起初还是非常费了很大的心力的。我阅读了所有之前编辑答应登刊并且等待印刷出版的论文;对于那些我认为不好(我记得一共有八篇),我给他们写到:“亲爱的作者,我认为你应该考虑撤回这篇论文”后面附着我为什么举得如果论文出版了对他们来说并不是很好的长长的解释。Fabri知道我当时那些建议有多直白坦诚。
Fan: Did they comply?
Fan:他们同意了吗?
Don: Yes, all but one. This one author fought, and I kept saying, “You have to fix this up.” Eventually, the changes made the paper OK. For the other ones, the authors agreed with my criticisms: Just because the previous editor didn’t get a good reviewer or they over-looked mistakes, does not mean the paper should appear. But I was not very popular, at least at first.
Don:是的,只有一个没同意。这个作者不断抗争,但我坚持说:“你需要改一下它。“渐渐地,他做的改变让论文可以通过了。还有另外一个作者,他对我的意见表示赞成:只是因为先前的编辑没有好好翻看文章或者发现问题,并不代表这篇论文就可以在刊物上出现。但我并不是很感冒,至少起初是这样。
Fabri: You have done a wide range of consulting. What is the role that consulting plays in your research?
Fabri:您之前做过广泛的咨询,咨询在您的研究中扮演什么样的角色?
Don: To me consulting is always a stimulating source of problems. As I mentioned before, for example, propensity score technology partly came from the consulting work we did for June Reinisch.
Don:对于我来说咨询就是问题解决的一个来源。正如我之前所提到的那样,比如说倾向得分的技术有部分来自于我们为June Reinisch所做的咨询工作。
Fabri: One of the more controversial cases in which you are involved as a consultant is the US tobacco litigation case, in which you represented the tobacco companies as an expert witness. Would you mind sharing some of your thoughts on this case?
Fabri:有一个很具争议性的案件是当时您作为顾问参与的美国烟草诉讼案,在这其中您代表烟草公司作为一个专家证人。您能给我们分享一下您当时的想法吗?
Don: Happy to. This comes from my family background dealing with lawyers. We have a legal system where certain things are legal, certain things are not. You should generally obey laws even if you don’t like them, or you should try to change them. If a company is making a legal product, and they are advertising it legally under current laws, then accept it or work to change the laws. If they lie, punish them for lying, if that is legal to do. You never see a commercial for sporty cars that show the cars going around corners extremely slowly and safely. How do they advertise cars?They usually show them sweeping around corners, and say “Don’t do this on your own.” Things that are enjoyable typically have uncertainties or risks associated with them. Flying to Europe to visit Fabri has risks!
Don:乐意之至。这是来自于和律师有关的家族背景,我们有一套法律系统告诉我们什么是合法的,什么不是。你应该遵守法律即使你不喜欢它们,或者你应该尝试改变它们。如果一个公司在生产合法的产品,而且他们在现有的法律条款下合法的利用广告,那么我们就应该接受它或者尝试改变法律。如果他们欺骗我们了,惩罚他们是合法的,那么他们就应该因此受到惩罚。你从来没有看到一个跑车的广告上面写着汽车拐弯时平稳安全。他们如何宣传汽车?他们通常会展示汽车开过拐角,并说“请勿模仿”。那些看上去令人愉快的事情通常有不确定性或与他们相关的风险。比如飞往欧洲去拜访Fabri是有风险的!
Certainly I do not doubt that no matter how I would intervene to reduce cigarette smoking, lung cancer rates would drop. But what intervention that would reduce smoking would involve reducing illegal conduct of the cigarette industry—that is the essence of the legal question.
当然,我毫不怀疑,无论我如何干预来减少吸烟,肺癌率都将不断下降。但什么样的减少吸烟的干预能够减少卷烟行业的非法行为,是这个法律问题的实质。
When I was first contacted by a tobacco lawyer, I was very reluctant to consult for them, and I feared strong pressure to be dishonest, which was absent throughout. The original topic was simply to comment on the ways the plaintiffs’ experts were handling missing data. On examination, their methods seemed to me to be not the best available and, at worst, silly (e.g., when missing “marital status,” call them “married”). As I continued to read these initial reports, I was appalled that hundreds of billions of dollars could be sought on the basis of such analyses. From a broader perspective, the logic underlying most of the analyses also seemed to me entirely confused. For example, alleged misconduct seemed to play no role in nearly all calculations, and phrases such as “caused by” or “attributable to,” were used nearly interchangeably and often apparently without thought. Should nearly a trillion dollars in damages be awarded on the basis of faulty logic and bad statistical analyses because we “know” the defendant is evil and guilty? If the issue were assessing the tobacco industry a trillion dollar fine for lying about its products, I would be amazed but mute. But these reports were using statistical arguments to set the numbers—is it acceptable to use bad statistics to set numbers because we “know” the defendant is guilty? What sort of precedent does that imply? The ethics of this consulting is discussed at some length in Rubin (2002).
Don:当我第一次和烟草行业的律师接触,我对向他们提供咨询是很不情愿的,我害怕不诚实会成为我巨大的压力,但实际上是不存在的。最初的主题仅仅是评判原告专家处理缺失数据的方式。在检查中,他们的方法在我看来不是最好的,甚至说是愚蠢的(例如,当不清楚“婚姻状况”时,叫他们“已婚”)。当我继续审查这些一手资料时,我很震惊数千亿美元在这种基础的数据分析中都浪费了。从更广泛的角度来看,大多数分析的逻辑似乎也完全混淆了。例如,所谓的不当行为似乎在几乎所有的计算中都没有出现,而且如“引起”或“归属于”的短语也经常互换,相互混淆。近一兆美元的损失在对错误的逻辑和糟糕的统计分析的基础上,因为我们“知道”的被告人的申辩是邪恶的有罪的,就应该归罪于被告方吗?如果问题是烟草行业对其产品的虚假广告为其带来万亿美元的利润,我会震惊但是沉默。但是,只是因为我们“知道”被告有罪,我们就可以接受这些报告使用错误的统计参数来设置数字吗?这意味着什么样的先例?这种伦理咨询在Rubin作了较为详细的讨论(2002)。
Fabri: We have talked quite a lot about statistics. Let’s talk about some of your other passions in life, for example, music, audio systems and sports cars.
Fabri:关于统计我们已经谈论了很多。让我们聊聊你人生中的其他爱好,例如音乐,音频系统和跑车。
Don: There are other passions, too, and their order is very age dependent (I leave more to your perceptions). When a kid, for example, sports cars, both driving them and rebuilding them, was the top of those three hobbies. But age (poorer vision, slower reflexes, more aches and pains, etc.) shifted the balance more to music, both live and recorded—luckily my ears are still good enough to enjoy these, but as more age catches up, things may shift.
Don:我当然有很多乐趣,而且他们的顺序就是按照年龄来排列的(我留下了让你自己想象的空间)。当我是个小孩子,比如,一辆跑车,驾驶它还有重建它,是这三个中最重要的爱好。但当年龄增长(更差的视力,更慢的反应,更多的伤痛等等)会将爱好的平衡更多的转换为音乐,尤其是现场直播或者刻录的。很幸运我的耳朵足够好,能够享受这些,但随着年龄继续增加,爱好可能还会再变化。
Fan and Fabri: Well, it has been nearly three hours since we started the conversation. Here is the final question before letting you go for dinner: What is your short advice to young researchers in statistics?
Fan and Fabri:好的,我们已经聊了接近三个小时了。在您去吃晚饭前,我们有最后一个问题想问您,您对统计这个行业里年轻的研究员有什么简短的建议吗?
Don: Have fun! Don’t be grumpy. If lucky, you may live to have a wonderful 70th birthday celebration!
Don:享受统计!不要暴躁。如果幸运的话,你可以有一个精彩的七十岁生日!
# ACKNOWLEDGMENTS
# 致谢
We thank Elizabeth Zell, Guido Imbens, Tom Belin, Rod Little, Dale Rinkel and Alan Zaslavsky for helpful suggestions. This work is partially funded by NSF-SES Grant 1155697.
我们特别感谢Elizabeth Zell, Guido Imbens, Tom Belin, Rod Little, Dale Rinkel以及 Alan Zaslavsky提出的宝贵意见。这项工作有美国国家科学基金会的部分资金补助。