From 3352e282146b0f89735288320b80b7d15b47889f Mon Sep 17 00:00:00 2001 From: Yuxuan Li Date: Mon, 7 Aug 2017 18:49:49 +0800 Subject: [PATCH] Update Rubin_all.Rmd --- Rubin_all.Rmd | 122 +++++++++++++++++++++++++------------------------- 1 file changed, 61 insertions(+), 61 deletions(-) diff --git a/Rubin_all.Rmd b/Rubin_all.Rmd index 0125e73..756b61c 100644 --- a/Rubin_all.Rmd +++ b/Rubin_all.Rmd @@ -1,13 +1,13 @@ --- title: "Rubin" -author: "边蓓蕾" +author: "边蓓蕾" "李宇轩" date: "2017/3/2" output: html_document --- -Abstract. Donald Bruce Rubin is John L. Loeb Professor of Statistics at Harvard University. He has made fundamental contributions to statistical methods for missing data, causal inference, survey sampling, Bayesian in- ference, computing and applications to a wide range of disciplines, including psychology, education, policy, law, economics, epidemiology, public health and other social and biomedical sciences. +Abstract. Donald Bruce Rubin is John L. Loeb Professor of Statistics at Harvard University. He has made fundamental contributions to statistical methods for missing data, causal inference, survey sampling, Bayesian inference, computing and applications to a wide range of disciplines, including psychology, education, policy, law, economics, epidemiology, public health and other social and biomedical sciences. -摘要:Donald Bruce Rubin是哈佛大学的约翰·洛布统计学教授。他在缺失数据,因果推断,抽样调查,贝叶斯推断等统计学方法上做出过重要贡献,其成果惠及心理学,教育,政策,法律,经济,流行病,公共卫生以及其他社会及生物医学领域的计算及应用。 +摘要:Donald Bruce Rubin是哈佛大学的约翰·洛布统计学教授。他在缺失数据(missing data),因果推断(causal inference),抽样调查(survey sampling),贝叶斯推断(Bayesian inference)等统计学方法上做出过重要贡献,其成果惠及心理学,教育,政策,法律,经济,流行病,公共卫生以及其他社会及生物医学领域的计算及应用。 Don was born in Washington, D.C. on December 22, 1943, to Harriet and Allan Rubin. One year later, his family moved to Evanston, Illinois, where he grew up. He developed a keen interest in physics and math- ematics in high school. In 1961, he went to college at Princeton University, intending to major in physics, but graduated in psychology in 1965. He began gradu- ate school in psychology at Harvard, then switched to Computer Science (MS, 1966) and eventually earned a Ph.D. in Statistics under the direction of Bill Cochran in 1970. After graduating from Harvard, he taught for a year in Harvard’s Department of Statistics, and then in 1971 he began working at Educational Testing Ser- vice (ETS) and served as a visiting faculty member at Princeton’s new Statistics Department. He held several visiting academic appointments in the next decade at Harvard, UC Berkeley, University of Texas at Austin and the University of Wisconsin at Madison. He was a full professor at the University of Chicago in 1981– 1983, and in 1984 moved back to the Harvard Statistics Department, where he remains until now, and where he served as chair from 1985 to 1994 and from 2000 to 2004. @@ -15,11 +15,11 @@ Don was born in Washington, D.C. on December 22, 1943, to Harriet and Allan Rubi Don has advised or coadvised over 50 Ph.D. stu- dents, written or edited 12 books, and published nearly 400 articles. According to Google Scholar, by May 2014, Rubin’s academic work has 150,000 citations, 16,000 in 2013 alone, placing him at the top with the most cited scholars in the world. -Don指导和共同指导一共超过50名博士生,撰写和编辑过12本著作,发表了近400篇文章。根据谷歌学术截止到2014年5月的统计,Rubin的学术成果已有15万次引用,单单2013年就有1万6千次引用,在全球学者中名列前茅。 +Don指导和共同指导了超过50名博士生,撰写和编辑过12本著作,发表了近400篇文章。根据谷歌学术截止到2014年5月的统计,Rubin的学术成果已有15万次引用,单单2013年就有1万6千次引用,在全球学者中名列前茅。 -For his many contributions, Don has been hon- ored by election to Membership in the US National Academy of Sciences, the American Academy of Arts and Sciences, the British Academy, and Fellowship in the American Statistical Association, Institute of Mathematical Statistics, International Statistical Insti- tute, Guggenheim Foundation, Humboldt Foundation and Woodrow Wilson Society. He has also received the Samuel S. Wilks Medal from the American Sta- tistical Association, the Parzen Prize for Statistical In- novation, the Fisher Lectureship and the George W. Snedecor Award of the Committee of Presidents of Sta- tistical Societies. He was named Statistician of the Year by the American Statistical Association’s Boston and Chicago Chapters. In addition, he has received hon- orary degrees from Bamberg University, Germany and the University of Ljubljana, Slovenia. +For his many contributions, Don has been hon- ored by election to Membership in the US National Academy of Sciences, the American Academy of Arts and Sciences, the British Academy, and Fellowship in the American Statistical Association, Institute of Mathematical Statistics, International Statistical Insti- tute, Guggenheim Foundation, Humboldt Foundation and Woodrow Wilson Society. He has also received the Samuel S. Wilks Medal from the American Statistical Association, the Parzen Prize for Statistical In- novation, the Fisher Lectureship and the George W. Snedecor Award of the Committee of Presidents of Statistical Societies. He was named Statistician of the Year by the American Statistical Association’s Boston and Chicago Chapters. In addition, he has received honorary degrees from Bamberg University, Germany and the University of Ljubljana, Slovenia. -由于他的诸多贡献,Don被推选为美国国家科学院成员,美国艺术与科学学院成员,英国国家学术院成员,美国统计学会理事,国际数理统计协会理事,国际统计学会理事,古根海姆基金会理事,洪堡基金会理事,伍德罗·威尔逊基金会理事。他获得了美国统计学会的塞缪尔·威尔克斯奖、针对统计创新的帕仁奖,以及统计学会会长委员会的费希尔讲席奖和斯尼德克奖。他被美国统计学会的波士顿和芝加哥分会誉为年度统计学家。此外,他还获得了德国的班贝格大学以及斯洛文尼亚的卢布尔雅那大学的荣誉学位。 +由于他的诸多贡献,Don被推选为美国国家科学院成员,美国艺术与科学学院成员,英国国家学术院成员,美国统计学会理事,国际数理统计协会理事,国际统计学会理事,古根海姆基金会理事,洪堡基金会理事,伍德罗·威尔逊基金会理事。他获得了美国统计学会的塞缪尔·威尔克斯奖(Samuel S. Wilks Medal)、针对统计创新的帕仁奖(the Parzen Prize),以及统计学会会长委员会的费希尔讲席奖(the Fisher Lectureship)和斯尼德克奖(the George W. Snedecor Award)。他被美国统计学会的波士顿和芝加哥分会誉为年度统计学家。此外,他还获得了德国的班贝格大学以及斯洛文尼亚的卢布尔雅那大学的荣誉学位。 Besides being a statistician, he is a music lover, au- diophile and fan of classic sports cars. @@ -35,7 +35,7 @@ Fan: Let’s begin with your childhood. I understand you grew up in a family of Fan:让我们从你的小时候说起吧。我知道你生于一个律师家庭,这肯定对你有着深远的影响。能不能介绍一下你的家庭? -Don: Yes. My father was the youngest of four broth- ers, all of whom were lawyers, and we used to have stimulating arguments about all sorts of topics. Prob- ably the most argumentative uncle was Sy (Seymour Rubin, senior partner at Arnold, Fortas and Porter, diplomat, and professor of law at American Univer- sity), from D.C., who had framed personal letters of thanks for service from all the presidents starting with Harry Truman and going through Jerry Ford, as well as from some contenders, such as Adlai Stevenson, and various Supreme Court Justices. I found this impres- sive but daunting. The relevance of this is that it clearly created in me a deep respect for the principles of our le- gal system, to which I find statistics highly relevant— this has obviously influenced my own application of statistics to law, for example, concerning issues as di- verse as the death penalty, affirmative action and the tobacco litigation. +Don: Yes. My father was the youngest of four brothers, all of whom were lawyers, and we used to have stimulating arguments about all sorts of topics. Prob- ably the most argumentative uncle was Sy (Seymour Rubin, senior partner at Arnold, Fortas and Porter, diplomat, and professor of law at American Univer- sity), from D.C., who had framed personal letters of thanks for service from all the presidents starting with Harry Truman and going through Jerry Ford, as well as from some contenders, such as Adlai Stevenson, and various Supreme Court Justices. I found this impres- sive but daunting. The relevance of this is that it clearly created in me a deep respect for the principles of our le- gal system, to which I find statistics highly relevant— this has obviously influenced my own application of statistics to law, for example, concerning issues as diverse as the death penalty, affirmative action and the tobacco litigation. Don:是的。我父亲是他们4兄弟中最年轻的,他们4兄弟都是律师,我们过去在所有的话题上都有激烈的辩论。可能最好辩的是Sy[Seymour Rubin, Arnold, Fortas和Porter的高级合伙人,外交官,美国大学的法学教授]了,他住在华盛顿,他从总统那里获得了许多感谢信,同样也从一些竞争对手那里获得了很多感谢信,如Adlai Stevenson以及不同的最高法院法官。这让我印象深刻,但也让我感到害怕,因为这让我对我们的法律体系所遵守的原则印象深刻,在这里面,我发现统计学与它高度相关,这明显触动了我,并因此做了统计学在法律上的应用,例如考虑那些包括死刑,平权法案和烟草诉讼的问题。 @@ -45,7 +45,7 @@ Fabri: 等下我们再回到这些问题,但还有其他人影响了你对统 Don: Probably the most influential was Mel, my mother’s brother, a dentist (then a bachelor). He loved to gamble small amounts, either in the bleachers at Wrigley Field, betting on the outcome of the next pitch, while watching the Cubs lose, or at Arlington Race track, where I was taught at a young age how to read the Racing Form and estimate the “true” odds from the various displayed betting pools, while losing two dol- lar bets. Wednesday and Saturday afternoons, during the warm months when I was a preteen, were times to learn statistics—even if at various bookie joints that were sometimes raided. As I recall, I was a decent stu- dent of his, but still lost small amounts. -Don:可能影响最深的是我舅舅Mel,他是一名牙医(还是一名学士)。他喜欢小赌,(比如)在芝加哥箭牌球场的露天看台看到比赛输了就会开始赌一下场比赛的结果,或者在阿灵顿赛道,在那里,他教导了我如何阅读赛马新闻,根据不同的赌注估计胜算,但那时候我还很小,还输了两美元。当我还是个青春期少年的时候,我会在天气温暖的周三和周六的下午学习统计,即使不同的赌注组合有时候会输光。我记得那时我是他相当优秀的学生,但我仍然输了一小部分。 +Don:可能影响最深的是我舅舅Mel,他是一名牙医(还是一名学士)。他喜欢小赌,(比如)在芝加哥箭牌球场的露天看台看到比赛输了就会开始赌一下场比赛的结果,或者在阿灵顿赛道,在那里,他教导了我如何阅读赛马新闻,根据不同的赌注估计胜算,但那时候我还很小,还输了两美元。当我还是个青春期少年的时候,我会在天气温暖的周三和周六的下午学习统计,即使不同的赌注组合有时候会输光[这句话存在疑问]。我记得那时我是他相当优秀的学生,但我仍然输了一小部分。 There were two other important influences on my statistical interests from the late 1950s and early 1960s. First, there was an old friend of my father’s from their government days together, a Professor Emeritus of Economics at UC Berkeley, George Mehren, with whom I had many entertaining and educational (to me) arguments, which generated a respect for economics that continues to grow to this day. And second, my wonderful teacher of physics at Evanston Township High School—Robert Anspaugh—who tried to teach me to think like a real scientist, and how to use mathe- matics in the pursuit of science. @@ -65,13 +65,13 @@ Fan: 你在1961年进入普林斯顿,开始主修物理,但后来改修心 Don: That’s a good question. Inspired by Anspaugh, I wanted to become a physicist. I was lined up for a BA in three years when I entered Princeton, and unknown to me before I entered, also lined up for a crazy plan to get a Ph.D. in physics in five years, in a program being reconditely planned by John Wheeler, a very well-known professor of physics there (and Richard Feynman’s Ph.D. advisor years earlier). In retrospect, this was a wildly over-ambitious agenda, at least for me. For a combination of complications, including the Vietnam War (and its associated drafts) and Profes- sor Wheeler’s sabbatical at a critical time, I think no one succeeded in completing a five-year Ph.D. from entry. In any case, there were many kids like me at Princeton then, who, even though primarily interested in math and physics, were encouraged to explore other subjects. I did that, and one of the courses I took was on personality theory, taught by a wonderful professor, Silvan Tomkins, who later became a good friend. At the end of my second year, I switched from Physics to Psychology, where my mathematical and scientific background seemed both rare and appreciated—it was an immature decision (not sure what a mature one would have been), but a fine one for me because it in- troduced me to some new ways of thinking, as well as to new fabulous academic mentors. -Don:这个问题好。在Anspaugh的鼓舞下,我想成为一个物理学家。我进入普林斯顿被安排3年拿到BA学位,但进来之前我并不知道我还被安排了一个5年拿到物理学博士的疯狂计划,这是物理系非常有名的John Wheeler教授做的计划。回想起来,那对于我来说是一个雄心勃勃的日程。结合一些其他的复杂因素,包括越南战争,Wheeler教授在关键时刻休假,我觉得没人能在5年内完成博士学位。不管怎样,在普林斯顿还有很多孩子和我一样,即使开始对数学和物理感兴趣,也被鼓励去其他学科探索。我就是这么做的,我上的一门课叫人格理论,是一个很好的教授教的,后来我们成为了朋友。在第二学年末,我从物理转到了心理系,在那里,我的数学和科学背景看起来都很弱,喜欢心理学是一个不成熟的决定(我也不确定什么是成熟的),但对我来说,它好的方面是带给了我一些全新的思考方式,也让我认识了一些新的著名学者。 +Don:这是个好问题。在Anspaugh的鼓舞下,我想成为一个物理学家。我进入普林斯顿被安排3年拿到BA学位,但进来之前我并不知道我还被安排了一个5年拿到物理学博士的疯狂计划,这是物理系非常有名的John Wheeler教授做的计划。回想起来,那对于我来说是一个雄心勃勃的日程。结合一些其他的复杂因素,包括越南战争,Wheeler教授在关键时刻休假,我觉得没人能在5年内完成博士学位。不管怎样,在普林斯顿还有很多孩子和我一样,即使开始对数学和物理感兴趣,也被鼓励去其他学科探索。我就是这么做的,我上的一门课叫人格理论,是一个很好的教授教的,后来我们成为了朋友。在第二学年末,我从物理转到了心理系,在那里,我的数学和科学背景看起来都很弱,喜欢心理学是一个不成熟的决定(我也不确定什么是成熟的),但对我来说,它好的方面是带给了我一些全新的思考方式,也让我认识了一些新的著名学者。 Fabri: You had some computing skills which were uncommon then, right? So you started to use comput- ers quite early. Fabri: 后来你有了一些不寻常的计算能力对吗?所以你很早就开始用计算机了。 -Don: Yes. Sometime between my first and second year at Princeton, I taught myself Fortran. As you men- tioned, those skills were not common, even at places like Princeton then. +Don: Yes. Sometime between my first and second year at Princeton, I taught myself Fortran. As you mentioned, those skills were not common, even at places like Princeton then. Don:是的。我在普林斯顿的第一和第二学年期间,就自学了Fortran。就像你说的,这些技能就算是在当时的普林斯顿,也不常见。 @@ -117,11 +117,11 @@ Fan: 但是你很快再次觉得很无聊。是因为你觉得计算机没意思 Don: No, not really that. There were several reasons. First, there was a big emphasis on automatic language translation, because it was cold war time, and it ap- peared that CS got a lot of money for computational linguistics from ARPA (Advanced Research Projects Agency), now known as DARPA. The Soviet Union, from behind the iron curtain, produced a huge num- ber of documents in Russian, but evidently there were not enough people in the US to translate them. A com- plication is that there are sentences that you could not translate without their context. I still remember one example: “Time flies fast,” a three-word sentence that has three different meanings depending on which of the three words is the verb. If this three-word sentence cannot be automatically translated, how can one get an automatic (i.e., by computer) translation of a complex paragraph? Related to this was Noam Chomsky’s work on transformational grammars, down the river at MIT. -Don:不,不是那样的,有好多因素。首先,它很注重自动化语言翻译,因为那是冷战时期,计算机系得到了来自ARPA(高级研究项目代理,现在是DARPA)的很多经费,用于计算语言学。苏联,从铁幕背后生产了大量的文件,但很显然美国没有足够的人手去翻译。(而且)复杂的是有些句子你无法在没有语境的情况下翻译。我记得一个例子:“时光飞逝”,(这)一个三个单词的句子,可能有三个意思,这取决于这三个单词哪个是动词。如果这个三个单词的句子不能被自动化翻译,(电脑)又怎么能自动化翻译一个复杂的段落呢?这自然是和麻省理工Noam Chomsky的翻译语法相关的。 +Don:不,不是那样的,有好多因素。首先,它很注重自动化语言翻译,因为那是冷战时期,计算机系得到了来自ARPA(高级研究项目代理,现在是DARPA)的很多经费,用于计算语言学。苏联,从铁幕背后生产了大量的文件[这句话存在疑问],但很显然美国没有足够的人手去翻译。(而且)复杂的是有些句子你无法在没有语境的情况下翻译。我记得一个例子:“时光飞逝”,(这)一个三个单词的句子,可能有三个意思,这取决于这三个单词哪个是动词。如果这个三个单词的句子不能被自动化翻译,(电脑)又怎么能自动化翻译一个复杂的段落呢?这自然是和麻省理工Noam Chomsky的翻译语法相关的[down the river 我翻译成了自然,但我觉得这块可能还是有点问题]。 Second, although I found some real math courses and the ones in CS on mathy topics, such as computa- tional complexity, which dealt with Turing machines, Godel’s theorem, etc., interesting, I found many of the courses dull. Much of the time they were about programming. I remember one of my projects was to write a program to project 4-dimensional figures into 2- dimensions, and then rotate them using a DEC PDP-1. It took an enormous number of hours. Even though my program worked perfectly, I felt it was a gigantic waste of time. I also got a C+ in that course because I never went to any of the classes. Now, having dealt with many students, I would be more sympathetic that I deserved a C+, but not when I was a kid. At that time, I figured there must be something better to do than rotating 4D objects and getting a C+. But marching through rice paddies in Vietnam or departing for some- where in Canada didn’t seem appealing. So after pick- ing up a MS degree in CS in 1966, although I stayed another year in CS, I was ready to try something else. -其次,虽然我发现一些数学课程和一些计算机领域的数学主题很有趣,如计算复杂度,用来解决图灵机,哥德尔定理等,但我发现(还是有)许多课程很无聊,大部分都和编程有关。我记得一个项目是写一个程序将一个4维图片转成二维,然后用一个DEC PDP-1做旋转。这个花费了我好几个小时,即使我的程序很完美,我仍然觉得这很浪费时间。我在那门课得了C+,因为我从来没去上过课。现在,我现在对这个C+有不同的看法,但是当我还小的时候,我不这样想。那个时候,我想一定会有比把4D物体旋转并得到一个C+更好的事情去做。但是参加越南战争,或者离开去加拿大看起来都没有吸引力,所以在1966年拿到计算机硕士学位之后,我准备尝试些别的。 +其次,虽然我发现一些数学课程和一些计算机领域的数学主题很有趣,如计算复杂度,用来解决图灵机,哥德尔定理等,但我发现(还是有)许多课程很无聊,大部分都和编程有关。我记得一个项目是写一个程序将一个4维图片转成二维,然后用一个DEC PDP-1做旋转。这个问题花费了我好几个小时,即使我的程序很完美,我仍然觉得这很浪费时间。我在那门课得了C+,因为我从来没去上过课。现在,我现在对这个C+有不同的看法,但是当我还小的时候,我不这样想。那个时候,我想一定会有比把4D物体旋转并得到一个C+更好的事情去做。但是参加越南战争,或者离开去加拿大看起来都没有吸引力,所以在1966年拿到计算机硕士学位之后,我准备尝试些别的。 Fabri: How did statistics end up in your path? @@ -129,7 +129,7 @@ Fabri: (那么)统计是如何成为你的最终之路的? Don: A summer job in Princeton in 1966 led to it. I did some programming for John Tukey in For- tran, LISP and COBOL. I also did some consulting for a Princeton sociology professor, Robert Althauser, basically writing programs to do matched sampling, matching blacks and whites, to study racial disparity in dropout rates at Temple University. I had a conversa- tion with Althauser about how psychology and then CS weren’t working out for me at Harvard. Because Bob was doing some semi-technical things in sociology, he knew of Fred Mosteller, although not personally, and also knew that Harvard had a decade-old Statistics Department that was founded in 1957. He suggested that I contact Mosteller. After getting back to Harvard, I talked to Fred, and he suggested that I take some stat courses. So in my third year in Harvard, I took mostly stat courses and did OK in them. And the Stat depart- ment said “Yes” to me. It also helped to have my own NSF funding, which I had from the start; they kept re- newing for some reason, showing their bad taste prob- ably, but it worked out well for me. Anyway, at the end of my third year at Harvard, I had switched to statistics, my third department in four years. -Don:1966年夏天我在普林斯顿的工作成就了这件事。我用Fortran,LISP和COBOL为John Tukey写了一些程序,(同时)我也为普林斯顿社会学教授Robert Althauser做了一些咨询,写一些程序匹配抽样,匹配黑人和白人,研究天普大学辍学率的种族差异。我跟Althauser谈过关于心理学是怎样的以及为什么计算机专业在哈佛不适合我。因为Bob在社会学上做一些半技术的东西,他认识Fred Mosteller,虽然不是个人名义,也知道哈佛有一个1957年成立的统计系。他建议我联系Mosteller。在回到哈佛后,我和Fred谈了一下,他建议我修一些统计课程。所以在第三年,我上的大部分都是统计课程并且还不错。统计系接收了我并帮我拿到了我自己的NSF资助,从我一开始入学就有。由于某些原因,他们不断重新开始,这可能显示了他们的坏品味,但是对我来说很好。不管怎样,在哈佛的第三年末,我转到了统计系,四年中的第三个系。 +Don:1966年夏天我在普林斯顿的工作成就了这件事。我用Fortran,LISP和COBOL为John Tukey写了一些程序,(同时)我也为普林斯顿社会学教授Robert Althauser做了一些咨询,写一些程序匹配抽样,匹配黑人和白人,研究天普大学辍学率的种族差异。我跟Althauser谈过关于心理学是怎样的以及为什么计算机专业在哈佛不适合我。因为Bob在社会学上做一些半技术的东西,他认识Fred Mosteller,虽然不是个人名义,也知道哈佛有一个1957年成立的统计系。他建议我联系Mosteller。在回到哈佛后,我和Fred谈了一下,他建议我修一些统计课程。所以在第三年,我上的大部分都是统计课程,感觉还不错。统计系接收了我并帮我拿到了我自己的NSF资助,从我一开始入学就有。由于某些原因,他们不断重新开始,这可能显示了他们的品味不怎么样,但是对我来说很好。不管怎样,在哈佛的第三年末,我转到了统计系,四年中的第三个系。 Fabri: Besides Mosteller, who else was on the statis- tics faculty then? It was a quite new department, as you said. @@ -161,7 +161,7 @@ Fan: 你的博士论文是关于匹配的,从什么时候开始你毕生致力 Don: When I worked with Althauser on the racial disparity problem, I always emphasized to him that it was inherently descriptive, not really causal. I re- membered enough from my physics education in high school and Princeton that association is not causation. So I was probably not intrigued by causal inference per se, but rather by the confusion that the social scien- tists had about it. You have to describe a real or hy- pothetical experiment where you could intervene, and after you intervene, you see how things change, not in time but between intervention (i.e., treatment) groups. If you are not talking about intervention, you can’t talk about causality. For some reason, when I look at old philosophy, it seems to me that they didn’t get it right, whereas in previous centuries, some experimenters got it. They bred cows, or mated hunting falcons. If you mated excellent female and male falcons, the resulting next generation of falcons would generally be better hunters than those resulting from random mating. In the 20th century, many scientists and experimentalists got it. -Don: 当我和Althauser一起研究种族差异问题的时候,我总是跟他强调结果本质上是描述性的,并不是真正的因果关系。我记得我在高中和在普林斯顿学物理的时候就知道关联不是因果。我可能对因果推断并不好奇,但是我对一些社会科学家对它好奇很不解。我必须要描述一个真实的或者假设的实验,你可以介入它,在你介入之后,你会看到事情是如何变化的,不是瞬时的,而是在不同的介入之间(比如不同的治疗)。如果你不谈介入,你就不能谈因果。由于某些原因,当我看旧哲学的时候,对我来说那可能不对,然而在前几个世纪,一些实验员验证了它。他们养奶牛,或者让猎鹰交配。如果你让优秀的公母猎鹰交配,下一代的狩猎水平要高于随机交配的结果。在20世纪,许多科学家和实验员得到了这个结果。 +Don: 当我和Althauser一起研究种族差异问题的时候,我总是跟他强调结果本质上是描述性的,并不是真正的因果关系。我记得我在高中和在普林斯顿学物理的时候就知道关联不是因果。我可能对因果推断并不好奇,但是我对一些社会科学家对它好奇很不解。我必须要描述一个真实的或者假设的实验,你可以干预它,在你干预之后,你会看到事情是如何变化的,不是瞬时的,而是在不同的干预之间(比如不同的治疗)[这里面的介入我都改成了干预,可以再看看是否有问题]。如果你不谈干预,你就不能谈因果。由于某些原因,当我看旧哲学的时候,对我来说那可能不对,然而在前几个世纪,一些实验员验证了它。他们养奶牛,或者让猎鹰交配。如果你让优秀的公母猎鹰交配,下一代的狩猎水平要高于随机交配的结果。在20世纪,许多科学家和实验员得到了这个结果。 Fabri: So you were only doing descriptive compar- isons in your Ph.D. thesis, and the notation of potential outcomes was not there. @@ -169,15 +169,15 @@ Fabri:所以你在你的博士论文里只做了描述性的比较,潜在的 Don: Partly correct. At that time, the notation of po- tential outcomes was in my mind, because that is the way that Cochran initiated discussions of randomized experiments in the class he taught in 1968. Initially, it was all based on randomization, unbiasedness, Fisher’s test, etc. But the concepts had to be flipped into or- dinary least squares (OLS) regression and analysis of variance tables, because nobody could compute any- thing difficult back then. One of the lessons in Bill’s class in regression and experimental design was to use the abbreviated Dolittle method to invert matrices, by hand! So you really couldn’t do randomization tests in any generality. The other reason I was interested in ex- periments and social science was my family history. There was always this legal question lurking: “But for this alleged misconduct, what would have happened?” -Don:一部分是对的。在那个时候,潜在结果的注释在我脑袋里,因为那个是Cochran1968年在他的随机实验课堂上讨论的方法。最开始全部都是基于随机,无偏,Fisher检验等等。但是这些概念必须要被注入普通最小二乘回归,方差分析表,因为没人能计算出任何困难的东西。Bill有一堂回归与实验设计课,就是用缩略的Dolittle方法手工求矩阵的逆!所以你真的难以对随机检验做任何普适的推广。另一个让我对实验和社会科学感兴趣的原因是我的家庭史。总是有这样一个法律问题埋在我心里:“这个(事情)涉嫌渎职,会发生什么呢?” +Don:你说的部分对。在那个时候,潜在结果的注释在我脑袋里,因为那个是Cochran1968年在他的随机实验课堂上讨论的方法。最开始全部都是基于随机,无偏,Fisher检验等等。但是这些概念必须要被注入普通最小二乘回归,方差分析表中,因为没人能计算出任何困难的东西。Bill有一堂回归与实验设计课,就是用缩略的Dolittle方法手工求矩阵的逆!所以你真的难以对随机检验做任何普适的推广。另一个让我对实验和社会科学感兴趣的原因是我的家庭史。总是有这样一个法律问题埋在我心里:“(如果)这个(事情)涉嫌渎职,会发生什么呢?” Fan: What was your first job after getting your Ph.D. degree in 1970? -Fan: 在你1970年拿到博士学位你的第一份工作是什么? +Fan: 在你1970年拿到博士学位后你的第一份工作是什么? Don: I stayed at Harvard for one more year, as an instructor in the Statistics Department, partly sup- ported by teaching, partly supported by the Cambridge Project, which was an ARPA funded Harvard–MIT joint effort; the idea was to bring the computer science technologies of MIT and the social sciences research of Harvard together to do wonderful things in the social sciences. In the Statistics Department, I was coteaching with Bob Rosenthal the “Statistics for Psychologists” course that, ironically, the Social Relations Department wanted me to take five years earlier, thereby driving me out of their department! Bob had, and has, tremendous intuition for experimental design and other practical is- sues, and we have written many things together. -Don:我在哈佛多待了一年,作为统计系的讲师,一部分工资来自教学,一部分来自剑桥项目,这是一个ARPA支持的哈佛-麻省理工联合项目;旨在将麻省理工的计算机科学技术和哈佛大学的社会科学研究结合到一起,在社会科学上做一些有趣的事情。在统计系,我和Bob Rosenthal一起教“统计心理学”,讽刺的是,5年前社会科学院就想让我上这门课,(正是)由于这门课,他们把我逐出了他们院!Bob对于实验设计和其他一些实践问题很有灵感,我们一起写了很多东西。 +Don:我在哈佛作为统计系的讲师多待了一年,我的工资一部分来自教学,一部分来自剑桥项目,这是一个ARPA支持的哈佛-麻省理工联合项目:旨在将麻省理工的计算机科学技术和哈佛大学的社会科学研究结合到一起,在社会科学上做一些有趣的事情。在统计系,我和Bob Rosenthal一起教“统计心理学”,讽刺的是,5年前社会科学院就想让我上这门课,(正是)由于这门课,他们把我逐出了他们院!Bob对于实验设计和其他一些实践问题很有灵感,我们一起写了很多东西。 # THE ETS DECADE: MISSING DATA, EM AND CAUSAL INFERENCE # ETS的10年:缺失数据,EM和因果推断 @@ -188,7 +188,7 @@ Fan: 一年后,你在普林斯顿的ETS找了一个职位,而不是大学里 Don: Right—many people thought I was goofy. I did have several good offers, one was to stay at Harvard, and another was to go to Dartmouth. But I met Al Beaton, who was later my boss at ETS in Princeton, at a conference in Madison, Wisconsin, and he offered me a job, which I took. Al had a doctorate in education at Harvard, and had worked with Dempster on compu- tational issues, such as the “sweep operator.” He was a great guy with a deep understanding of practical com- puting issues. Also, he appreciated my research. Be- cause I was an undergrad at Princeton, it was almost like going home. For several years, I taught one course at Princeton. Between the jobs at ETS and Princeton, I was earning twice what the Harvard salary would have been, which allowed me to buy a house on an acre and a half, with a garage for rebuilding an older Mer- cedes roadster, etc. A different style of life from that in Cambridge. -Don:很多人认为我是一个傻瓜。我有很多不错的机会,一个是待在哈佛,另一个是去达特茅斯。但我遇到了Al Beaton,后来他是我在普林斯顿ETS的老板,在威斯康星麦迪逊的一个会议上,他给了我这份工作,我接受了。Al在哈佛有一个博士学位,他跟Dempster一起做一些计算问题,例如“扫描算子”。他是个很好的人,并且对计算问题有很深刻的理解;同时他也很欣赏我的研究。由于我之前曾是普林斯顿的本科生,(因此)这种感觉大概就像是回家。我在普林斯顿有好几年就只教一门课。在ETS和普林斯顿的工作中,我的工资是在哈佛的两倍,这让我买了一栋1.5英亩的房子,有一个车库,还改造了旧的奔驰跑车等等。这是和在剑桥完全不同的生活方式。 +Don:很多人认为我是一个傻瓜。我有很多不错的机会,一个是待在哈佛,另一个是去达特茅斯。但我遇到了Al Beaton,后来他是我在普林斯顿ETS的老板,在威斯康星麦迪逊的一个会议上,他给了我这份工作,我接受了。Al在哈佛有一个博士学位,他跟Dempster一起做一些计算问题,例如“扫描算子”(sweep operator)。他是个很好的人,并且对计算问题有很深刻的理解;同时他也很欣赏我的研究。由于我之前曾是普林斯顿的本科生,(因此)这种感觉大概就像是回家。我在普林斯顿有好几年就只教一门课。在ETS和普林斯顿的工作中,我的工资是在哈佛的两倍,这让我买了一栋1.5英亩的房子,一个车库,还改造了旧的奔驰跑车等等。这是和在剑桥完全不同的生活方式。 Fan: You seem to have had a lot of freedom to pur- sue research at the ETS. What was your responsibility at ETS? @@ -196,7 +196,7 @@ Fan: 看起来你在ETS做研究非常自由。你在ETS的职责是什么? Don: The position at ETS was like an academic posi- tion with teaching responsibilities replaced by consult- ing on ETS’s social science problems, including psy- chological and educational testing ones. I found con- sulting much easier for me than teaching, and ETS had interesting problems. Also there were many very good people around, like Fred Lord, who was highly respected in psychometrics. The Princeton faculty was great, too: Geoffrey Watson (of the Durbin–Watson statistic) was the chair; Peter Bloomfield was there as a junior faculty member before he moved to North Car- olina; and of course Tukey was still there, even though he spent a lot of time at Bell Labs. John was John, hav- ing a spectacular but very unusual way of thinking— obviously a genius. Stuart Hunter was in the Engineer- ing School then. These were fine times for me, with tremendous freedom to pursue what I regarded as im- portant work. -Don:在ETS的工作有点像教职,只是在做ETS的社会科学的咨询而不是教学了,包括心理教育的问题。我发现相比教学,咨询梗简单,ETS有很多有意思的问题。而且周围有很多不错的人,比如Fred Lord,在心理测验学上很有名望。普林斯顿的老师们也很好:Geoffrey Watson(杜宾-瓦特森统计量命名人)是系主任;Peter Bloomfield在搬到北卡之前是初级教授以及Tukey一直在那,即使他在Bell实验室花了很多时间。John显然是一个有很不寻常的思考方式的天才。Stuart Hunter后来去了工程学院。当时对我来说是一段好时光,我有大量自由的时间来追求我认为重要的工作。 +Don:在ETS的工作有点像教职,只是在ETS做的是的社会科学的咨询而不是教学了,包括心理教育的问题。我发现相比教学,咨询梗简单,ETS有很多有意思的问题。而且周围有很多不错的人,比如Fred Lord,在心理测验学上很有名望。普林斯顿的老师们也很好:Geoffrey Watson(杜宾-瓦特森统计量命名人)是系主任;Peter Bloomfield在搬到北卡之前是初级教授以及Tukey一直在那,即使他在Bell实验室花了很多时间。John显然是一个有很不寻常的思考方式的天才。Stuart Hunter后来去了工程学院。当时对我来说是一段好时光,我有大量自由的时间来追求我认为重要的工作。 Fabri: By any measure, your accomplishments in the ETS years were astounding. In 1976, you published the paper “Inference and Missing Data” in Biometrika (Rubin, 1976) that lays the foundation for modern anal- ysis of missing data; in 1977, with Arthur Dempster and Nan Laird, you published the EM paper “Max- imum Likelihood from Incomplete Data via the EM Algorithm” in JRSS-B (Dempster, Laird and Rubin, 1977); in 1974, 1977, 1978, you published a series of papers that lay the foundation for the Rubin Causal Model (Rubin, 1974, 1977, 1978a). What was it like for you at that time? How come so many groundbreak- ing ideas exploded in your mind at the same time? @@ -204,7 +204,7 @@ Fabri: 不管怎样看,你在ETS的几年的成就是令人震惊的。在1976 Don: Probably the most important reason is that I al- ways worried about solving real problems. I didn’t read the literature to uncover a hot topic to write about. I al- ways liked math, but I never regarded much of math- ematical statistics as real math—much of it is just so tedious. Can you keep track of these epsilons? -Don:可能最重要的原因是我总是很担忧如何解决实际问题。我没有读过写热门话题的文献。我一直喜欢数学,但是我不认为数理统计是真正的数学,大部分数学问题是很枯燥的。你能一直追溯这些epsilon吗?[编者注:这里的 epsilon 指数学中 epsilon-delta 语言中的 epsilon] +Don:可能最重要的原因是我总是很担忧如何解决实际问题。我没有读过写热门话题的文献。我一直喜欢数学,但是我不认为数理统计是真正的数学,大部分数学问题是很枯燥的。你能一直追溯这些epsilon吗?[编者注:这里的 epsilon 指数学中 epsilon-delta 语言中的 epsilon][这里有待推敲] Fabri: There is no coincidence that all these papers share the common theme of missing data. @@ -220,7 +220,7 @@ Fan: 在你进入这个领域之前,缺失数据是什么样的研究状态? Don: It was extremely ad hoc. The standard ap- proach to missing data then was comparing the biases of filling in the means, or of regression imputation un- der different situations, but almost always under an im- plicit “missing completely at random” assumption. The purely technical sides of these papers are solid. But I found there were always counter examples to the pro- priety of the specific methods being considered, and to explore them, one almost needed a master’s thesis for each situation. I would rather address the class of prob- lems with some generality. There is a mechanism that creates missing data, which is critical for deciding how to deal with the missing data. That idea of formal indi- cators for missing data goes way back in the contexts of experimental design and survey design. I am consis- tently amazed how this was not used in observational studies until I did so in the 1970s; maybe someone did, but I’ve looked for years and haven’t found anything. But probably because the missing data paper was done in a relatively new way, I had great difficulty in getting it published (more details in Rubin, 2014a). -Don:它非常有局限性。标准的解决缺失数据的方法是比较用均值填补后的偏差,或者在不同的情况下,用回归插补,但是都要基于“随机缺失”的假设。这些论文在纯技术层面上都非常扎实。但是我发现对于科学的方法,总有反例,为了探索这个问题,可能每种情况都是一篇硕士论文。我宁愿用一种更普适的方法来解决这类问题。有一个机制能够生成缺失数据,这对于如何解决缺失数据很关键。那个思想可以还原到实验设计和调查设计上。直到我20世纪70年代做这件事的时候,我很惊讶为什么没有用到观察研究上;可能有人做了,但这么多年我什么都没找到。但可能因为缺失数据的论文相对比较新,发表的过程中我遇到很大的困难(更多细节见 Rubin, 2014a)。 +Don:它非常有局限性[ad hoc 待推敲]。标准的解决缺失数据的方法是比较用均值填补后的偏差,或者在不同的情况下,用回归插补,但是都要基于“随机缺失”的假设。这些论文在纯技术层面上都非常扎实。但是我发现对于科学的方法,总有反例,为了探索这个问题,可能每种情况都是一篇硕士论文。我宁愿用一种更普适的方法来解决这类问题。有一个机制能够生成缺失数据,这对于如何解决缺失数据很关键。那个思想可以还原到实验设计和调查设计上。直到我20世纪70年代做这件事的时候,我很惊讶为什么没有用到观察研究上;可能有人做了,但这么多年我什么都没找到。但可能因为缺失数据的论文相对比较新,发表的过程中我遇到很大的困难(更多细节见 Rubin, 2014a)。 Fan: The EM algorithm is another milestone in mod- ern statistics; it is also relevant in computer science and one of the most important algorithm in data mining. Though similar ideas had been used in several specific contexts before, nobody had realized the generality of EM. How did Dempster, Laird and you discover the generality? @@ -228,7 +228,7 @@ Fan: EM算法是现代统计的另一个里程碑;它也和计算机相关, Don: In those early years at ETS, I had the free- dom to remain in close contact with the Harvard peo- ple, Cochran, Dempster, Holland and Rosenthal, which was very important to me. I always enjoyed talking to Dempster, who is a very principled and deep thinker. I was able to arrange some consulting projects at ETS to bring him to Princeton. Once we were talking about some missing data problem, and we started discussing filling these values in, but I knew it wouldn’t work in generality. I pointed to a paper by Hartley and Hock- ing (1971), where they deserted the approach of itera- tively filling in missing values, as in Hartley (1956) for the counted data case, and went to Newton–Raphson, I think, in the normal case. Even though aspects of EM were known for years, and Hartley and others were sort of nibbling around the edges of EM, apparently nobody put it all together as a general algorithm. Art and I real- ized that you have to fill in sufficient statistics. I had all these examples like t distributions, factor analysis (the ETS guys loved that), latent class models. And Art had a great graduate student, Nan Laird, available to work on parts of it, and we started writing it up. The EM paper was accepted right away by JRSS-B, even with invited discussions. -Don:在ETS的早些年,我可以自由地和哈佛的人联系,比如Cochran, Dempster, Holland和Rosenthal,这对我来说非常重要。我非常喜欢和Dempster聊天,他很有原则,思考问题很深入。我能在ETS安排一些咨询项目从而带他来到普林斯顿。有一次我们聊缺失数据的问题,我们开始讨论插补值,但是我知道它不具有普适性。我指出一篇论文,Hartley和Hocking (1971)写的,里面用了迭代的方法来插补缺失数据,正如Hartley(1956)的计数数据的情形,后来发展到了牛顿算法,我认为,这才是一个普遍情形。即使EM的种种已经被知道很多年了,Hartley和其他人有点咬在EM的边缘,很明显没有人把它总结成一个普适性的算法。Art和我意识到必须填补充分统计量。我做了所有的例子,如t分布,因子分析(ETS的人所喜欢的),隐类模型。Art有一个很好的研究生,Nan Laird,他可以做一部分工作,于是我们开始写起来。EM的论文被JRSS-B接收,并且我们也被邀请讨论。 +Don:在ETS的早些年,我可以自由地和哈佛的人联系,比如Cochran, Dempster, Holland和Rosenthal,这对我来说非常重要。我非常喜欢和Dempster聊天,他很有原则,思考问题很深入。我能在ETS安排一些咨询项目从而带他来到普林斯顿[这句话待推敲]。有一次我们聊缺失数据的问题,我们开始讨论插补值,但是我知道它不具有普适性。我指出一篇论文,Hartley和Hocking (1971)写的,里面用了迭代的方法来插补缺失数据,正如Hartley(1956)的计数数据的情形,后来发展到了牛顿算法,我认为,这才是一个普遍情形。即使EM的种种已经被知道很多年了,Hartley和其他人有点咬在EM的边缘,很明显没有人把它总结成一个普适性的算法。Art和我意识到必须填补充分统计量。我做了所有的例子,如t分布,因子分析(ETS的人所喜欢的),隐类模型。Art有一个很好的研究生,Nan Laird,他可以做一部分工作,于是我们开始写起来。EM的论文被JRSS-B接收,并且我们也被邀请讨论。 Fan: Now let’s talk more about causal inference. You are known for proposing the general potential out- come framework. It was Neyman who first mentioned the notation of potential outcomes in his Ph.D. thesis (Neyman, 1990), but the notation seemed to have long been neglected. @@ -252,7 +252,7 @@ Fabri: 事实上你在20世纪70年代中去伯克利访问见到了Neyman。在 Don: I did. In fact, I had an office right next to his. Neyman came to Berkeley in the late 30s. He was very impressive, not only as a mathematical statistician, but also as an individual. There was a tremendous aura about him. Shortly after arriving in Berkeley, I gave a talk on missing data and causal inference. The next day, I went to lunch with Neyman and I said something like, “It seems to me that formulating causal prob- lems in terms of missing potential outcomes is an ob- vious thing to do, not just in randomized experiments, but also in observational studies.” Neyman answered to the effect that (remarkable in hindsight because he did so without acknowledging that he was the person who first formulated potential outcomes), “No, causal- ity is far too speculative in nonrandomized settings.” He repeated something like this quote from his biog- raphy, “...Without randomization an experiment has little value irrespective of the subsequent treatment.” (Also see my comment on this conversion in Rubin, 2010.) Then he went to say politely but firmly, “Let’s not talk about that, let’s instead talk about astronomy.” He was very into astronomy at the time. -Don:讨论过。事实上,我的办公室在他右边。Neyman在30年代末来到伯克利。他给人的印象很深刻,他不仅仅是一个数理统计学家,也很有个人魅力。他有一种巨大的光环。在来到伯克利不久,我做了一个缺失数据和因果推断的演讲。第二天,我和Neyman吃午饭,我说,“看起来就缺失潜在结果的因果推断问题是很明显现在要做的事情,不仅仅在随机实验中,在观测研究中也应该做。”Neyman回答(事后很显然他这么做是因为并没有把自己当作第一个提出潜在结果的人),“不,因果在非随机的设定里过于投机了。”他在他的自传中也重复了类似的话,“……没有随机性的实验的后续处理是没有价值的。”(也可以在我对这次访谈的评论 Rubin, 2010.里可以看到)然后他很有礼貌很严肃的说,“我们不讨论这个,我们讨论天文学。”他当时非常痴迷天文学。 +Don:讨论过。事实上,我的办公室在他右边。Neyman在30年代末来到伯克利。他给人的印象很深刻,他不仅仅是一个数理统计学家,也很有个人魅力。他自带大光环。在来到伯克利不久,我做了一个缺失数据和因果推断的演讲。第二天,我和Neyman吃午饭,我说,“看起来就缺失潜在结果的因果推断问题是很明显现在要做的事情,不仅仅在随机实验中,在观测研究中也应该做。”Neyman回答(事后很显然他这么做是因为并没有把自己当作第一个提出潜在结果的人),“不,因果在非随机的设定里过于投机了。”他在他的自传中也重复了类似的话,“……没有随机性的实验的后续处理是没有价值的。”(也可以在我对这次访谈的评论 Rubin, 2010.里可以看到)然后他很有礼貌但也很严肃的说,“我们不讨论这个,我们讨论天文学。”他当时非常痴迷天文学。 Fabri: You probably learned the reasons why he was so involved in the frequentist approach. @@ -268,7 +268,7 @@ Fabri: 在1986年那篇著名的JASA文章里,Paul Holland提出了“Rubin因 Don: Actually Angrist, Imbens and I had a rejoin- der in our 1996 JASA paper (Angrist, Imbens and Ru- bin, 1996), where we explain why we think it is fair. Neyman is pristinely associated with the development of potential outcomes in randomized experiments, no doubt about that. But in the 1974 paper (Rubin, 1974), I made the potential outcomes approach for defining causal effects front and center, not only in randomized experiments, but also in observational studies, which apparently had never been done before. As Neyman told me back in Berkeley, in some sense, he didn’t believe in doing statistical inference for causal effects outside of randomized experiments. -Don:事实上Angrist, Imbens和我在我们1996年的JASA论文(Angrist, Imbens and Rubin, 1996)里一起反驳了这种说法,我们解释了为什么我们认为是相对的。毫无疑问Neyman是最初和随机实验中的潜在结果的发展相关的人。但是在1974年的文章里(Rubin, 1974),我用潜在结果的方法来定义了随机试验和观察研究里因果效应的开始和中间应该是怎样的,这很明显是之前没做过的。正如Neyman在伯克利告诉我的,在某种意义上,他不相信在随机实验之外做因果效应的统计推断。 +Don:事实上Angrist, Imbens和我在我们1996年的JASA论文(Angrist, Imbens and Rubin, 1996)里一起反驳了这种说法,我们解释了为什么我们认为是相对的。毫无疑问Neyman是最初和随机实验中的潜在结果的发展相关的人。但是在1974年的文章里(Rubin, 1974),我用潜在结果的方法来定义了随机试验和观察研究里因果效应的开始和中间应该是怎样的[这段话待推敲],这很明显是之前没做过的。正如Neyman在伯克利告诉我的,在某种意义上,他不相信在随机实验之外做因果效应的统计推断。 Fan: Also there are features in the RCM, such as the definition of the assignment mechanism, that belong to you. @@ -280,11 +280,11 @@ Don:是的,意识到随机实验是嵌套在一个大的分配机制中是 “In the precomputer era, the fact that almost all work could be done once and for all was of great impor- tance. As a consequence, the advantages of randomiza- tion approaches—except for those few cases where the randomization distributions could be dealt with once and for all—were not adequately valued. -“在前计算机时代,几乎所有的工作都能执行一次,所有的都很重要。因此,随机方法的优势在于,除了那些很少数的情况下随机分布能被一次性处理之外,对于大多数情况,(一次性处理)并没有足够的价值。 +“在前计算机时代,几乎所有的工作都能执行一次,所有的都很重要。因此,随机方法的优势在于,除了那些很少数的情况下随机分布能被一次性处理之外,对于大多数情况,(一次性处理)并没有足够的价值[我觉得一次性处理不太准确,这块可能需要再推敲一下]。 One reason for this undervaluation lay in the fact that, so long as randomization was confined to spe- cially manageable key statistics, there seemed no way to introduce into the randomization approach the insights—some misleading and some important and valuable—into what test statistics would be highly sen- sitive to the changes that it was most desired to detect. The disappearance of this situation with the rise of the computer seems not to have received the attention that it deserves.” (Brillinger, Jones and Tukey, 1978, Chap- ter 25, page F-5.) -(随机方法)被低估的一个原因是,只要随机化被限制在一些特定的容易处理的关键统计问题上,看起来就没办法把随机方法的见解——一些会误导的,一些重要的和有价值的见解——引入到对变化很敏感,最需要检验的检验统计量中。随着计算机的起步,这种现象消失了,而且看起来并没有引起它应有的关注度。”(Brillinger, Jones and Tukey, 1978, 25章, F-5页.) +(随机方法)被低估的一个原因是,只要随机化被限制在一些特定的容易处理的关键统计问题上,看起来就没办法把随机方法的见解——一些会误导的,一些重要的和有价值的见解——引入到对变化很敏感,最需要检验的检验统计量中[这句话待推敲]。随着计算机的起步,这种现象消失了,而且看起来并没有引起它应有的关注度。”(Brillinger, Jones and Tukey, 1978, 25章, F-5页.) Fabri: Here I am quoting an interesting question by Tom Belin regarding potential outcomes: “Do you be- lieve potential outcomes exist in people as fixed quan- tities, or is the notion that potential outcomes are a de- vice to facilitate causal inference?” @@ -320,7 +320,7 @@ Fabri:您在ETS研究了很多成果之后,就前往EPA(美国环境保护 Don: It started partly from my joking answer to the question, “How long have you been at ETS?” I answered, “Too long.” The problems that I had dealt with at ETS started to appear repetitive, and I felt that I had made important contributions to them including EM and multiple imputation ideas, which were being used to address some serious issues, like test equating, and formulating the right ways to collect data. So I wanted to try something else. At the time, David Rosenbaum was the head of the Office of Radiation Programs at the EPA. He had the grand idea of putting together a team of applied mathematicians and statisticians. Somehow he found my name and invited me to D.C. to find out whether I wanted to lead such a group. Basically, I had the freedom to hire several people of my choice, and I had a good government salary (at the level of “Senior Executive Service”). So I said, “Let’s see whom I can get.” I was able to convince both Rod Little (who was in England at that time) and Paul Rosenbaum (whom I advised while I was still at ETS), as well as Susan Hinkins, who wrote a thesis on missing data at Montana State University, and two others. That was shortly before the presidential election. Then the Democrats lost and Reagan was to come in, and everything seemed to be falling apart. All of a sudden, many of the people above my level at the EPA (most of whom were presidential appointments), had to prepare to turn in their resignations, and had to be concerned about their next positions. -Don:我之前对于问题“你在ETS工作了多长时间”的开玩笑的回答解释了我的部分想法。我当时回答说“很长的时间”。当时我在ETS处理的问题开始变得重复;而且我觉得我已经帮他们在EM和多重插补法方面做出了很重要的帮助,EM和多重插补法已经可以用来处理一些例如测验等值化,公式化收集数据等重要的问题,所以我想尝试一些其他的东西。当时David Rosenbaum是EPA中放射性项目研究办公室的主任。他有一个想法,想召集一些应用数学家和统计学家组建一支队伍。不知怎么的他找到了我,想看看我是否愿意领导这样一支队伍,并邀请我去D.C.。我当时基本上可以自主选择聘用一些人,而且政府给我拨发了不菲的薪水(待遇同高级管理人员)。所以我说,“我先看看我能找到谁。”我可以叫来Rod Little(他当时在英国)、Paul Rosenbaum(我在ETS时曾给他提供过建议)和Susan Hinkins(他在蒙大拿州立大学写了一篇有关缺失数据的论文) 以及另外两个人。那会很快就要进行总统选举了。之后民主党下台,共和党掌权,似乎所有的事情就都乱套了。突然很多在EPA比我的职位高的人(他们中大部分人都是总统任命的)开始准备递交他们的辞职信,也在担忧他们接下来会担任什么职务。 +Don:我之前对于问题“你在ETS工作了多长时间”的开玩笑的回答解释了我的部分想法。我当时回答说“很长的时间”。当时我在ETS处理的问题开始变得重复;而且我觉得我已经帮他们在EM和多重插补法(multiple imputation)方面做出了很重要的帮助,EM和多重插补法已经可以用来处理一些例如测验等值化(test equating),公式化收集数据等重要的问题,所以我想尝试一些其他的东西。当时David Rosenbaum是EPA中放射性项目研究办公室的主任。他有一个想法,想召集一些应用数学家和统计学家组建一支队伍。不知怎么的他找到了我,想看看我是否愿意领导这样一支队伍,并邀请我去D.C.。我当时基本上可以自主选择聘用一些人,而且政府给我拨发了不菲的薪水(待遇同高级管理人员)。所以我说,“我先看看我能找到谁。”我可以叫来Rod Little(他当时在英国)、Paul Rosenbaum(我在ETS时曾给他提供过建议)和Susan Hinkins(他在蒙大拿州立大学写了一篇有关缺失数据的论文) 以及另外两个人。那会很快就要进行总统选举了。之后民主党下台,共和党掌权,似乎所有的事情就都乱套了。突然很多在EPA比我的职位高的人(他们中大部分人都是总统任命的)开始准备递交他们的辞职信,也在担忧他们接下来会担任什么职务。 Fabri:So the EPA project ended before it even got started. @@ -328,19 +328,19 @@ Fabri:所以EPA的项目还没开始就结束了。 Don: It didn’t start at all in some sense. I formally signed on at the beginning of December, and after one pay period, I turned in my resignation. But I felt responsible to find jobs for all these people I brought there. Eventually, Susan Hinkins got connected with Fritz Scheuren at the IRS; Paul Rosenbaum got a position at the University of Wisconsin at Madison; Rod got a job related to the Census. One nice thing about that short period of time is that, through the projects I was in charge of, I made several good connections, such as to Herman Chernoff and George Box. George and I really hit it off, primarily because of his insistence on statistics having connections to real problems, but also because of his wonderful sense of humor, which was witty and ribald, and his love of good spirits. In any case, the EPA position led to an invitation to visit Box at the Math Research Center at the University of Wisconsin, which I gladly accepted. That gave me the chance to finish writing the propensity score papers with Paul (Rosenbaum and Rubin, 1983a, 1983b, 1984a). -Don:在某种意义上这个项目就是没开始。我在十二月初正式签约,不过只在一个支付期之后,我就递交了自己的辞职信,但我觉得我有义务为我带来的这些人提供工作。最终,Susan Hinkins和IRS的Fritz Scheuren取得了联系;Paul Rosenbaum在威斯康辛大学麦迪逊分校得到了一个职位;Rod也得到了一份有关普查的工作。在这么短的工作时间内,有一件特别棒的事情是,我通过我管理的这个项目,和Herman Chernoff、George Box等人取得了很好的联系。George和我真的很搭,主要是因为他坚持认为统计应该和实际问题相联系,但也有一部分原因是由于他那粗犷而又很诙谐的幽默感,以及他的积极向上。在任何情况下,我都很同意将威斯康辛大学数学研究中心中EPA的位置留给Box。那也给了我一个机会来和Paul一起完成倾向评分(Rosenbaum and Rubin, 1983a, 1983b, 1984a)的论文。 +Don:在某种意义上这个项目就是没开始。我在十二月初正式签约,不过只在一个支付期之后,我就递交了自己的辞职信,但我觉得我有义务为我带来的这些人提供工作。最终,Susan Hinkins和IRS的Fritz Scheuren取得了联系;Paul Rosenbaum在威斯康辛大学麦迪逊分校得到了一个职位;Rod也得到了一份有关普查的工作。在这么短的工作时间内,有一件特别棒的事情是,我通过我管理的这个项目,和Herman Chernoff、George Box等人取得了很好的联系。George和我真的很搭,主要是因为他坚持认为统计应该和实际问题相联系,但也有一部分原因是由于他那粗犷而又很诙谐的幽默感,以及他的积极向上。在任何情况下,我都很同意将威斯康辛大学数学研究中心中EPA的位置留给Box。那也给了我一个机会来和Paul一起完成倾向评分(propensity score)(Rosenbaum and Rubin, 1983a, 1983b, 1984a)的论文。 Fan: Since you mentioned propensity score, arguably the most popular causal inference technique in a wide range of applied disciplines, can you give some insights on the “natural history” of propensity score? -Fan:既然你提到了倾向得分,这项可以说是在大量应用学科中最流行的因果推理技术,那么您能够把倾向评分的发展历程分享给大家吗? +Fan:既然你提到了倾向得分,这项可以说是在大量的应用学科中最流行的因果推理技术,那么您能够把倾向评分的发展历程分享给大家吗? Don: I first met Paul in 1978, when I came to Har-vard on a Guggenheim fellowship; he was a first-year Ph.D. student, extremely bright and devoted. Back in my Princeton days I did some consulting for a psychologist at Rutgers, June Reinisch, who later became the first director of the Kinsey Institute after Kinsey. She was very interested in studying the nature-nurture controversy——what makes men and women so different? She and her husband, who was also a psychologist, were doing experiments on rats and pigs. They injected hormones into the uteri of pregnant animals, and thereby exposed the fetuses to different prebirth environments; this kind of randomized experiment is obviously unethical to do with humans. One of the problems Paul and I were working on for this project, also as part of Paul’s thesis, was matching—matching background characteristics of exposed and unexposed. The covariates included a lot of continuous and discrete variables, some of which were rare events like certain serious diseases prior to, or during, early pregnancy. Soon it became clear that standard matching approaches, like Mahalanobis matching, do not work well in such high dimensional settings. You have to find some type of summaries of these variables and balance the summaries in the treatment and control groups, not individual to individual. Then we realized if you have an assignment mechanism, you can match on the individual assignment probabilities, which is essentially the Horvitz–Thompson idea, to eliminate all systematic bias. I don’t remember the exact details, but I think we first got the propensity score idea when working on a Duke data bank on coronary artery bypass surgery, but refined it for the Reinisch data, which is very similar in principle. Again, the idea of the propensity score is motivated by addressing real problems, but with generality. -Don:我在1978年,为古根海姆奖去往哈佛时,第一次遇见了Paul;他是个一年级博士生,非常聪明且勤奋。在回到普林斯顿的日子里,我为一名罗格斯大学的心理学家,June Reinisch,做了一些咨询,他后来成为了继Kinsey后的第一位Kinsey研究所的主任。她对于天性和教养问题的研究非常有兴趣——什么导致了男人与女人如此不同?她和同为心理学家的丈夫,在老鼠和猪身上做了一些实验。他们将荷尔蒙注射进了怀孕动物的子宫,从而使得暴露给胎儿的产前环境不同。这种随机性的试验显然在人类身上做是不道德的。我和Paul为这个项目工作时出现了一个问题,是曝光和未曝光的匹配背景特征的问题,同样这也是Paul论文中的一部分内容。这项数据的协方差包括很多离散变量和连续变量,这些变量中有一些是很少见的情况,比如说在早期妊娠期间或者之前出现的各种严重的疾病。但不久我们就发现标准匹配方法,比如说马氏匹配,很明显并不适用在高维的设定中。你需要找到某种变量的分类形式并且在治疗组和控制组中平衡这些集合,(变量在其中)并不是一个对应一个那样。然后我们意识到如果可以有一个分配机制,你就可以对个人的分配概率进行匹配来消除所有的系统误差,本质上是Horvitz–Thompson估计的思想。我记不太清具体的了,但我记得我们在利用杜克数据银行有关冠状动脉搭桥手术的数据时,第一次有了倾向得分的想法。这些数据后来改名为了Reinisch数据,但大致上还是类似的。所以由于实际问题,我们再次有了研究具有普遍性的倾向得分的想法。 +Don:我在1978年,为古根海姆奖去往哈佛时,第一次遇见了Paul;他是个一年级博士生,非常聪明且勤奋。在回到普林斯顿的日子里,我为一名罗格斯大学的心理学家,June Reinisch,做了一些咨询,他后来成为了继Kinsey后的第一位Kinsey研究所的主任。她对于天性和教养问题的研究非常有兴趣——什么导致了男人与女人如此不同?她和同为心理学家的丈夫,在老鼠和猪身上做了一些实验。他们将荷尔蒙注射进了怀孕动物的子宫,从而使得暴露给胎儿的产前环境不同。这种随机性的试验显然在人类身上做是不道德的。我和Paul为这个项目工作时出现了一个问题,是曝光和未曝光的匹配背景特征的问题[这句话待推敲],同时这也是Paul论文中的一部分内容。这项数据的协方差包括很多离散变量和连续变量,这些变量中有一些是很少见的情况,比如说在早期妊娠期间或者之前出现的各种严重的疾病。但不久我们就发现标准匹配方法,比如说马氏匹配(Mahalanobis matching),很明显并不适用在高维的设定中。你需要找到某种变量的分类形式并且在治疗组和控制组中平衡这些集合[这句话待推敲],(变量在其中)并不是一个对应一个那样。然后我们意识到如果可以有一个分配机制,你就可以对个人的分配概率进行匹配来消除所有的系统误差,本质上是Horvitz–Thompson估计的思想[这句话待推敲]。我记不太清具体的了,但我记得我们在利用杜克数据银行有关冠状动脉搭桥手术的数据时,第一次有了倾向得分的想法。这些数据后来改名为了Reinisch数据,但大致上还是类似的。所以由于实际问题,我们再次有了研究具有普遍性的倾向得分的想法。 Fan: Multiple Imputation (MI) is another very influential contribution of yours. Your book “Multiple Im-putation for Nonresponse in Sample Surveys” (Rubin, 1987a) has commonly been cited as the origin of MI. But my understanding is that you first developed the idea and coined the term much earlier. -Fan:多重插补法(MI)是您另一个有极高影响力的贡献的地方。你的书“Multiple Im-putation for Nonresponse in Sample Surveys”(Rubin, 1987a)通常被认为是MI的起源。但我觉得您第一次提出这个想法并且创造了这个术语要早得多。 +Fan:多重插补法(MI)是您另一个有极高影响力的贡献的地方。你的书“Multiple Im-putation for Nonresponse in Sample Surveys”(Rubin, 1987a)通常被认为是MI的起源。但我觉得您第一次提出这个想法并创造了这个术语的时间要早得多。 Don: Correct, I first wrote about MI in an ASA proceedings paper in 1978 (Rubin, 1972, 1978b). That’s where “the 18+ years” comes from when I wrote “Multiple Imputation After 18+ Years” (Rubin, 1996). @@ -368,11 +368,11 @@ Don:嗯是的,我们现在正在做。从1987到2002年的变化主要体现 Fan: In the 1978 Annals paper (Rubin, 1978a), you gave, for the first time, a rigorous formulation of Bayesian inference for causal effects. But the Bayesian approach to causal inference did not have much following until very recently, and the field of causal inference is still largely frequentist. How do you view the role of Bayesian approach in causal inference? -Fan:在1978年的年报中(Rubin,1978a),您第一次给出了因果效应的贝叶斯推理的严格表述。但是针对因果推断的贝叶斯方法直到最近才有一些后续,而且因果推断领域依然注意是频率统计方面。您怎么看待因果推断的贝叶斯方法? +Fan:在1978年的年报中(Rubin,1978a),您第一次给出了因果效应的贝叶斯推断的严格表述。但是针对因果推断的贝叶斯方法直到最近才有一些后续,而且因果推断领域依然主要是频率统计方面。您怎么看待因果推断的贝叶斯方法? Don: I believe being Bayesian is the right way to approach things, because the basic frequentist approach, such as the Fisherian tests and Neyman’s unbiased estimates and confidence intervals, usually does not work in complicated problems with many nuisance un-knowns. So you have to go Bayesian to create procedures. You can go partially Bayesian using things like posterior predictive checks, where you put down a null that you may discover evidence against, or direct likelihood approaches as in Frumento et al. (2012); if the data are consistent with a null that is interesting, you live with it. But Neyman-style frequentist evaluations of Bayesian procedures are still relevant. -Don:我相信贝叶斯是正确的逼近事情的方法,因为基本的频率统计方法,比如说Fisherian测试以及Neyman的无偏估计和置信区间经常并不适用处理一些麻烦复杂的未知问题。所以你需要用贝叶斯来模拟过程。当你需要带上和数据一致的空值进行处理时,你可以利用贝叶斯的部分内容比如可能记下一个与事实的空的后测检验,或者(在Frumento等人 (2012)中的)似然法。但是传统风格的频率统计的贝叶斯方法仍然是有重大意义的。 +Don:我相信贝叶斯是正确的逼近事情的方法,因为基本的频率统计方法,比如说Fisherian测试以及Neyman的无偏估计和置信区间经常并不适用于处理一些麻烦复杂的未知问题。所以你需要用贝叶斯来模拟过程。当你需要带上和数据一致的空值进行处理时,你可以利用贝叶斯的部分内容比如可能记下一个事实的空集的后测检验[这句话待推敲],或者(在Frumento等人 (2012)中的)似然法。但是传统风格的频率统计的贝叶斯方法仍然是有重大意义的。 Fan: But why is the field of causal inference still predominantly frequentist? @@ -380,11 +380,11 @@ Fan:但是为什么因果推断的领域仍然主要是频率统计方面呢 Don: I think there are several reasons. First, there are many Bayesian statisticians who are far more interested in MCMC algebra and algorithms, and do not get into the science. Second, I regard the method of moments (MOM) frequentist approach as pedagogically easier for motivating and revealing sources of information. Take the simple instrumental variable setting with one-sided noncompliance. Here, it is very easy to look at the simple MOM estimate to see where information comes from. With Bayesian methods, the answer is, in some sense, just there in front of you. But when you ask where the information comes from, you have to start with any value, and iterate using conditional expectations, or draws from the current joint distributions. You have to have far more sophisticated mathematical thinking to understand fully Bayesian ideas. There are these problems with missing data (as in my discussion of Efron, 1994) where there are unique, consistent estimates of some parameters using MOM, but for which the joint MLE is on the boundary. So I think it is often easier, pedagogically, to motivate simple estimators and simple procedures, and not try to be efficient when you convey ideas. In causal inference, that corresponds to talking about unbiased or nearly unbiased estimates of causal estimands, as in Rubin (1977). There are other reasons having to do with the current education of most statisticians. -Don:我认为主要有这几个原因。首先,有很多的贝叶斯统计学家对MCMC代数和算法要感兴趣得多,而且还没有科学的研究。第二,我认为矩量法(MOM)的频率方法都教学中能更简单的找到信息来源。拿简单的有片面违规设置的工具变量举个例子。我们能够通过简单的MOM估计来看出信息来自哪里。利用贝叶斯方法,答案就仿佛直接放在你面前一样。但当你询问信息从哪里来的时,你需要从任何值开始,重复利用条件期望,或者绘制当前的联合分布。你需要复杂的数学思维方式来完全理解贝叶斯的思想。有缺失数据的一些问题(如我在Efron,1994讨论中讨论的)要利用MOM对一些参数进行特别的一致性估计,但其联合极大似然估计是在边界上。所以我觉得在教学方面,当你传播思想时,应该鼓励使用简单的估计和过程,并且不要太追求效率。在因果推断中,那和讨论因果中无偏或者几乎无偏的估计是一致的,就是在Rubin(1977)中所说的那样。此外大多数统计学家在最近的教育中(这样做)还有一些其他的原因。 +Don:我认为主要有这几个原因。首先,有很多的贝叶斯统计学家对MCMC代数和算法要感兴趣得多,而且还没有科学的研究。第二,我认为矩量法(MOM)的频率方法在教学中能更简单的找到信息来源。拿简单的有片面违规设置的工具变量举个例子[这句话待推敲],我们能够通过简单的MOM估计来看出信息来自哪里。利用贝叶斯方法,答案就仿佛直接放在你面前一样。但当你询问信息从哪里来的时,你需要从任何值开始,重复利用条件期望,或者绘制当前的联合分布。你需要有复杂的数学思维方式来完全理解贝叶斯的思想。有缺失数据的一些问题(如我在Efron,1994讨论中讨论的)要利用MOM对一些参数进行特别的一致性估计[这句话里的unique, consistent estimates,我翻译成了特别的一致性估计,可能有点问题],但其联合极大似然估计是在边界上。所以我觉得在教学方面,当你传播思想时,应该鼓励使用简单的估计和过程,而且不要太追求效率。在因果推断中,那和讨论因果中无偏或者几乎无偏的估计是一致的,就是在Rubin(1977)中所说的那样。此外大多数统计学家在最近的教育中(这样做)还有一些其他的原因。 Fan: After EM, starting from the early 1980s, you were heavily involved in developing methods for Bayesian computing, including the Bayesian bootstrap (Rubin, 1981), the sampling importance-resampling (SIR) algorithm (Rubin, 1987b), and (lesser-acknowledged) “approximate Bayesian computation (ABC)” (Rubin, 1984, Section 3.1). -Fan:在研究出EM之后,从20世纪80年代初开始,您参与研究了贝叶斯计算方法,包括贝叶斯自举(Rubin, 1981),采样重要性重采样(SIR)算法(Rubin,1987b),和(很少人承认的)“近似贝叶斯计算(ABC)”(Rubin, 1984,3.1版) +Fan:在研究出EM之后,从20世纪80年代初开始,您参与研究了贝叶斯计算方法,包括贝叶斯自举(Bayesian bootstrap)(Rubin, 1981),采样重要性重采样(SIR, the sampling importance-resampling)算法(Rubin,1987b),和(很少人承认的)“近似贝叶斯计算(ABC,approximate Bayesian computation)(Rubin, 1984,3.1版) Don: It was clear then that computers were going to allow Bayes to work far more broadly than earlier. You, as well as others such as Simon Tavare, Christian Robert and Jean-Michel Marin, are giving me credit for first proposing ABC. Thanks! Although, frankly, I never thought that would be a useful algorithm except in problems with simple sufficient statistics. @@ -396,15 +396,15 @@ Fabri:但是您后来虽然使用了这些想法,但似乎并没有根据这 Don: First of all, fundamentally I am hostile to all “religions.” I recently heard a talk by Raghu in Bamberg, Germany, where he said that in his world they have zillions of gods, and I think that is right; you should have zillions of gods, one for this good idea, one for that good idea. And different people can create different gods to whatever extent they want to. I am not a fully-pledged member of the Bayesian camp—I like being friends with them, but I never want to be religiously Bayesian. My attitude is that any complication that creates problems for one form of inference creates problems for all forms of inference, just in different ways. For example, the fact that confounded treatment assignments cause problems for frequentist inference is obvious. Does it generate problems for the Bayesian?Yeah, that point was made in the 1978 Annals paper: Randomization matters to a Bayesian, although not in the same way as to a frequentist, that is, not as the basis for inference, but it affects the likelihood function. -Don:首先,我基本上对所有的“信仰”都持反对意见。我最近听了一篇演讲,演讲人是德国班贝格的Raghu,他提到在他的世界中有无数个神,我觉得他是对的;首先需要有无数个神,这个神提供了这个好主意,那个神提供了那个好主意,而且不同的人可以创造不同的神来做到他们想要的程度。我不是完全同意贝叶斯学派的一员——我喜欢成为他们的朋友,但我不会变成完全信仰贝叶斯的人。我的态度是任何有可能为某一种推断形式造成麻烦的复杂情况同样也可以对所有的推断形式造成麻烦,只不过是不同的方式。比如,讨厌的分配疗法显然会使频率论推断出现问题。那么贝叶斯会出现问题吗?是的,这个问题在1978年的年报汇总有所提及:贝叶斯中的随机问题。尽管不是像影响到了基本推断那样导致频率统计学派不出现问题,但它也影响到了近似的作用。 +Don:首先,我基本上对所有的“信仰”都持反对意见。我最近听了一篇演讲,演讲人是德国班贝格的Raghu,他提到在他的世界中有无数个神,我觉得他是对的;首先需要有无数个神,这个神提供了这个好主意,那个神提供了那个好主意,而且不同的人可以创造不同的神来做到他们想要的程度。我不是完全同意贝叶斯学派的一员——我喜欢成为他们的朋友,但我不会变成完全信仰贝叶斯的人。我的态度是任何有可能为某一种推断形式造成麻烦的复杂情况同样也可以对所有的推断形式造成麻烦,只不过是不同的方式。比如,讨厌的分配疗法显然会使频率论推断出现问题。那么贝叶斯会出现问题吗?是的,这个问题在1978年的年报汇总有所提及:贝叶斯中的随机问题。尽管其影响方式不是像基本推断那样影响到了频率统计学派,但它也影响到了近似的作用。[这句话待推敲] There is something I am currently working on with a Ph.D. student, Viviana Garcia, that builds on a paper I wrote with Paul Rosenbaum in 1984 (Rosenbaum and Rubin, 1984b), which is the only Bayesian paper that Paul has ever written, at least with me. In that paper, we did some simulations to show there is an effect on Bayesian inference of the stopping rule. We show that if you have a stopping rule and use the “wrong” prior to do the analysis, like a uniform improper prior, but the data are coming from a “correct” prior, and you look at the answer you get from the right prior and from the “wrong” prior, they are different. The portion of the right posterior that you cover using the “wrong” posterior is incorrect. This extends to all situations and it is related to all of these ignorability theorems, and it means that you need to have the right model with respect to the right measure. Of course achieving this is impossible in practice and, therefore, leads to the need for frequentist (Neymanian) evaluations of the operating characteristics of Bayesian procedures when using incorrect models (Rubin, 1984). Bayes works, in principle, there is no doubt, but it can be so hard! It can work, in practice, but you must have some other principles floating around somewhere to evaluate the consequences—how wrong your conclusions can be. So you must have something to fall back on, and I think that is where these frequentist evaluations are extremely useful, not the unconditional Neyman–Pearson frequentist evaluations for all point mass priors (which were critical as mathematical demonstrations that we cannot achieve the ideal goal in any generality), but evaluations for the class of problems that you are dealing with in your situation. -我最近正在和我的博士生Viviana Garcia基于我和Paul Rosenbaum在1984年写的文章(Rosenbaum and Rubin, 1984b)继续进行研究,那可能是Paul唯一的贝叶斯文章,至少是和我一起写的唯一的。在那篇文章中,我们做了一些模拟来展示对终止规则的贝叶斯推断有一些影响。我们发现如果你有终止规则并优先使用“错误”来进行分析,就像一致先验那样数据是从“正确”的那面得来的,那么你会发现你从正确那面和错误那面得到的答案是不一样的。你用“错误”的那部分来覆盖正确的部分是不对的。这可以延伸到所有的情况,而且也与所有这些理论有关。这也意味着你需要有针对正确措施的正确模型。当然,这是在实践中不可能实现的,因此,当我们在贝叶斯方法中使用模型不当时,也导致了对频率(neymanian)估计的操作特性的需要(Rubin,1984)。毫无疑问贝叶斯的方法在原理上行得通,但是它的难度非常大!它在实践中可以使用,但是你需要一些其他的原则作为辅助来估计你的结论有多大的偏差。所以你必须有所依靠,我认为这就是频率学的估计方法非常有用的地方,不是无条件的Neyman–Pearson概率估计的先验知识(这是关键的我们无法实现的数学证明),而是在这种情况下估计你所要处理的问题的分类。 +我最近正在和我的博士生Viviana Garcia基于我和Paul Rosenbaum在1984年写的文章(Rosenbaum and Rubin, 1984b)继续进行研究,那可能是Paul唯一的贝叶斯文章,至少是和我一起写的唯一的。在那篇文章中,我们做了一些模拟来展示对终止规则的贝叶斯推断有一些影响。我们发现如果你有终止规则并优先使用“错误”来进行分析,就像一致先验那样数据是从“正确”的那面得来的,那么你会发现你从正确那面和错误那面得到的答案是不一样的。你用“错误”的那部分来覆盖正确的部分是不对的[这句话待推敲]。这可以延伸到所有的情况,而且也与所有这些理论有关。这也意味着你需要有针对正确措施的正确模型。当然,这是在实践中不可能实现的,因此,当我们在贝叶斯方法中使用模型不当时,也导致了对频率(neymanian)估计的操作特性的需要(Rubin,1984)。毫无疑问贝叶斯的方法在原理上行得通,但是它的难度非常大!它在实践中可以使用,但是你需要一些其他的原则作为辅助来估计你的结论有多大的偏差。所以你必须有所依靠,我认为这就是频率学的估计方法非常有用的地方,它不是无条件的Neyman–Pearson概率的先验估计(关键是这是我们无法实现的数学证明),而是在这种情况下估计你所要处理的问题的分类。 Fan: The 1984 Annals paper “Bayesianly Justifiable and Relevant Frequency Calculations for the Ap-plied Statistician” (Rubin, 1984) is one of my all-time favorite papers. This paper, as the earlier paper by George Box (Box, 1980), deals with the “calibrated Bayes” paradigm with generality, which can be viewed as a compromising or midground between the Bayesian and frequentist paradigms. It has a profound influence on many of us. In particular, Rod Little has strongly advocated “calibrated Bayes” as the 21st cen-tury roadmap of statistics in several of his prominent talks, including the 2005 ASA President’s Invited Address and the 2012 Fisher Lecture. What was the background and reasons for you to write that paper? -Fan:1984年年报上的“Bayesianly Justifiable and Relevant Frequency Calculations for the Ap-plied Statistician” (Rubin, 1984)是我一直最喜爱的文章之一。这篇文章,和早期George Box所写的文章(Box, 1980),都处理了具有一般性的“校准贝叶斯”范式,可以被认为是妥协或者折中的贝叶斯和频率论范式。它对我们许多人都产生了深刻的影响。尤其是是,在包括2005 ASA总统邀请函中和2012年的 Fisher讲座的几个重要会谈中,Rod Little都大力提倡“校准贝叶斯”作为第二十一世纪的路线图。您当时为什么写那篇论文,背景是什么样的? +Fan:1984年年报上的“Bayesianly Justifiable and Relevant Frequency Calculations for the Ap-plied Statistician” (Rubin, 1984)是我一直最喜爱的文章之一。这篇文章,和早期George Box所写的文章(Box, 1980),都处理了具有一般性的“校准贝叶斯(calibrated Bayes)”范式,可以被认为是妥协或者说是折中的贝叶斯和频率论范式。它对我们许多人都产生了深刻的影响。尤其是是,在包括2005 ASA总统邀请函中和2012年的 Fisher讲座的几个重要会谈中,Rod Little都大力提倡“校准贝叶斯”作为第二十一世纪的路线图。您当时为什么写那篇论文,背景是什么样的? Don: Interesting question. I was visiting Box at the Mathematics Research Center in 1981–1982 and wrote Rubin (1983) partly during that period—I think it’s a good paper with some good ideas, but without a satisfying big picture. That dissatisfaction led to that 1984 paper—what is the big picture? It took me a very long time to “get it right,” but it all seems very obvious to me now. The idea of posterior predictive checks has been further articulated and advanced in Meng (1994), Gelman, Meng and Stern (1996), and the multiauthored book “Bayesian Data Analysis” (Gelman et al., 1995, 2003, 2014). @@ -416,15 +416,15 @@ Fabri:您能不能针对《Bayesian Data Analysis》说得再多一些?这 Don: Yup, I think that the Gelman et al. book might be the most popular Bayesian text. It started out as notes by John Carlin for a Bayesian course that he taught when I was Chair sometime in the mid or late 1980s. Andy must have been a Ph.D. student at that time, with tremendous energy for scholarship. John was heading back to Australia, which is his homeland, and somehow the department had some extra teaching money, and we wanted to keep John around for a year—I do not remember the details. But I do remem-ber that the idea of turning the notes for the course into a full text was percolating. Also Hal Stern was an Associate Professor with us at that time, and so the four of us decided to make it happen. We basically divided up chapters and started writing. Even though John’s initial notes were the starting basis, things changed as soon as Andy “took charge.” Quickly, Andy and Hal were the most active. Andy, with Hal, were even more dominant in the second edition, where I added some parts, edited others, but clearly this was Andy’s show. The third edition, which just came out in early 2014, was even more extreme, with Andy adding two coauthors (David Dun-son and Aki Vehtari) because he liked their work, and they had been responsive to Andy’s requests. As the old man of the group, I just requested that I be the last author; Andy obviously was the first author, and the second and third were as in the first edition. In some ways, I feel like I’m an associate editor of a journal that has Andy as the editor! We get along fine, and clearly it’s a successful book. -Don:好,我认为Gelman等人编的这本书可能的确是最受欢迎的贝叶斯教材。它起初是John Carlin在1980年代中期或后期为准备他教的一门贝叶斯课程而做的笔记,当时我还是主任。Andy 当时应该还是一个获得全额奖学金的博士生。我记不太清了,好像是John要回到他的家乡澳大利亚,而且不知怎么的部门有一些额外的教育资金,所以我们想让John再待一年。但我还记得我们慢慢想到为课程准备的笔记转换为一本教材。同样当时Hal Stern是我们那的副教授。所以我们四个决定让想法成为现实。我们大体分了章节,然后开始编写。尽管最初是基于John的笔记来写,但逐渐就变成了Andy在负责。很快,Andy和Hal成为了最活跃的。在第二版中,Andy和Hal更加活跃,我当时加了一些内容,尽管编辑的人增加了,但是主要的还是Andy的功劳。2014年初出的第三版甚至更极端,Andy又加了两名合著者(David Dun-son和Aki Vehtari),因为Andy喜欢他们的著作,而且他们同意了Andy的请求。作为这个团体中的“老人”,我请求把自己当成是最后的作者,Andy显然会是第一作者,然后第二第三作者和第一版的是相同的。在某些方面,我觉得我是一个杂志的副主编,而安迪则是编辑!我们相处的很好,而且显然这是一本很成功的书籍。 +Don:好,我认为Gelman等人编的这本书可能的确是最受欢迎的贝叶斯教材。它起初是John Carlin在1980年代中期或后期为准备他教的一门贝叶斯课程而做的笔记,当时我还是主任。Andy 当时应该还是一个获得全额奖学金的博士生。我记不太清了,好像是John要回到他的家乡澳大利亚,而且不知怎么的部门有一些额外的教育资金,所以我们想让John再待一年。但我还记得我们慢慢想到把为课程准备的笔记转换为一本教材。同样当时Hal Stern是我们那的副教授。所以我们四个决定让想法成为现实。我们大体分了章节,然后开始编写。尽管最初是基于John的笔记来写,但逐渐就变成了Andy在负责。很快,Andy和Hal成为了最活跃的。在第二版中,Andy和Hal更加活跃,我当时加了一些内容,尽管编辑的人增加了,但是主要的还是Andy的功劳。2014年初出的第三版甚至更极端,Andy又加了两名合著者(David Dun-son和Aki Vehtari),因为Andy喜欢他们的著作,而且他们同意了Andy的请求。作为这个团体中的“老人”,我请求把自己当成是最后的作者,Andy显然会是第一作者,然后第二第三作者和第一版的是相同的。在某些方面,我觉得我是一个杂志的副主编,而安迪则是编辑!我们相处的很好,而且显然这是一本很成功的书籍。 Fan: A revolutionary development in statistics since the early 90s was the MCMC methodology. You left your mark in this with Gelman, proposing the Gelman–Rubin statistic for convergence check (Gelman and Rubin, 1992), which seems to be very much connected to some of your previous work. -Fan:自90年代初,统计学界有一场革命性的发展就是MCMC方法。您和Gelman一起留下了自己的一笔,提出了Gelman-Rubin统计收敛检查(Gelman and Rubin, 1992),这个成果似乎和您之前的某些成果有很多的联系。 +Fan:自90年代初,统计学界有一场革命性的发展就是MCMC方法。您和Gelman一起留下了自己的一笔,提出了Gelman-Rubin统计收敛检查(the Gelman–Rubin statistic for convergence check )(Gelman and Rubin, 1992),这个成果似乎和您之前的某些成果有很多的联系。 Don: Correct. We embedded the convergence check problem into the combination of the multiple imputation and multiple chains frameworks, using the idea of the combining rules for MI. The idea of using multiple chains—that comes from physics—and was Andy’s knowledge, not mine. My contribution was to suggest using modified MI combining rules to help do the assessment of convergence. The idea is powerful because it is so simple. If the starting value does not matter, which is the whole point, then it doesn’t matter, period. The real issue should be how you choose the functions of the estimands that you are assessing, and as always, you want convergence to asymptotic normality to be good for these functions, so that the simple justification for the Gelman–Rubin statistic is roughly accurate. -Don:是的。我们利用MI的组合规则的想法,将收敛检查问题嵌入多重插补和多重链框架之中。但利用多重链这个来自于物理学的想法是Andy提出的,不是我。我的贡献只是建议使用改进的MI组合规则来帮助做评估收敛。这个想法非常好,因为它是如此简单。如果整点起始值不影响过程,那么就没关系。真正的问题应该是如何选择你评估的作用,和往常一样,你想要收敛到渐近正态来有利于使用这些作用,所以对Gelman–Rubin 统计进行简单的调整是需要很准确的。 +Don:是的。我们利用MI的组合规则的想法,将收敛检查问题嵌入多重插补和多重链(multiple chains frameworks)框架之中。但利用多重链这个来自于物理学的想法是Andy提出的,不是我。我的贡献只是建议使用改进的MI组合规则来帮助做评估收敛。这个想法非常好,因为它是如此简单。如果整点起始值不影响过程,那么就没关系。真正的问题应该是如何选择你评估的作用,和往常一样,你想要收敛到渐近正态来有利于使用这些作用,因此对Gelman–Rubin 统计进行简单的调整是需要很准确的。 # THE 1990S: COLLABORATING WITH ECONOMISTS @@ -436,7 +436,7 @@ Fabri:在1990年代,您开始和经济学家合作。您和Joshua Angrist, Don: Absolutely. I always liked economics; many economists are great characters! It was in the early 90s when Guido came to my office as a junior faculty member in the Harvard Economics Department and basi-cally said, “I think I have something that may interest you.” I had never met him before, and he was asking if the concept of instrumental variables already had a history in statistics. Guido and Josh Angrist had already defined the LATE (local average treatment effect) in an Econometrica paper (Imbens and Angrist, 1994)—although I think CACE (Complier Average Causal Ef-fect) is a much better name because it is more descrip-tive and more precise—local can be local for anything, local for Boston, local for females, etc. Then I asked in return, “Well tell me the setup, I have never heard of it in statistics before” and while he was explaining I started thinking, “Gosh, there is something important here! I have never seen it before,” and then I said, “Let’s meet tomorrow and talk about it more,” because these kinds of assumptions (monotonicity and the “exclusion restriction”) were fascinating to me, and it was clear that there was something there that I had never really thought hard about; it was great. That eventually led to the instrument variables paper (Angrist, Imbens and Rubin, 1996) and the later Bayesian paper (Imbens and Rubin, 1997). -Don:当然可以。我一直都很喜欢经济;许多经济学家都非常好!在90年代早期,Guido作为哈佛经济学系初级教员,来到了我的办公室,说:“我认为我有些东西你会感兴趣。”我之前从未见过他,他还问我工具变量的概念是不是在统计学中已经有一段时间了。Guido和Josh Angrist当时已经在计量经济学论文中定义了”局部平均处理效应”(LATE,local average treatment effect)(Imbens and Angrist, 1994),尽管我认为“编译器的平均因果关系” (CACE,Complier Average Causal Effect)是一个更好的名字,因为它更多更精确的描述本地的什么,比如当地的Boston,当地的女性等等。然后我反问他:“好,那么请告诉我是什么样的吧,我之前从未在统计中听说过”。当他解释的时候,我开始想:“天啊!这些东西一定很重要,我之前从未见过。”然后我说“我们明天见面详谈吧。”因为这些类型的假设(单调性和“排除限制”)吸引着我,而且很明显是我之前没想过没听过的,这些都很棒。这最终导致了有关工具变量的论文(Angrist, Imbens and Rubin, 1996)和之后的贝叶斯论文 (Imbens and Rubin, 1997)。 +Don:当然可以。我一直都很喜欢经济;许多经济学家都非常好!在90年代早期,Guido作为哈佛经济学系初级教员,到我的办公室说:“我认为我有些东西你会感兴趣。”我之前从未见过他,他还问我工具变量(instrumental variables)的概念是不是很久之前就在统计学中已经有了。Guido和Josh Angrist当时已经在计量经济学论文中定义了”局部平均处理效应”(LATE,local average treatment effect)(Imbens and Angrist, 1994),尽管我认为“编译器的平均因果关系” (CACE,Complier Average Causal Effect)是一个更好的名字,因为它更多更精确的描述本地的什么,比如当地的Boston,当地的女性等等。然后我反问他:“好,那么请告诉我是什么样的吧,我之前从未在统计中听说过”。当他解释的时候,我开始想:“天啊!这些东西一定很重要,我之前从未见过。”然后我说“我们明天见面详谈吧。”因为这些类型的假设(单调性和“排除限制(exclusion restriction)”)吸引着我,而且很明显这都是我之前没想过没听过的,这些都很棒。这最终导致了有关工具变量的论文(Angrist, Imbens and Rubin, 1996)和之后的贝叶斯论文 (Imbens and Rubin, 1997)。 A closely related development was a project I was consulting on for AMGEN at about the same time, for a product for the treatment of ALS (amyotrophic lateral sclerosis), or Lou Gehrig’s disease, which is a progressive neuromuscular disease that eventually destroys motor neurons, and death follows. The new product was to be compared to the control treatment where the primary outcome was quality of life (QOL) two years post-randomization, as measured by “forced vital capacity” (FVC), essentially, how big a balloon you can blow up. In fact, many people do not reach the endpoint of two-year post-randomization survival, and so two-year QOL is “truncated” or “censored” by death. People were trying to fit this problem into a “missing data” framework, but I realized right away that it was something different. @@ -448,11 +448,11 @@ Fan:这两个主意都是有关主分层想法的特殊案例,我们可以 Don: Yes, indeed. These meetings with Guido and this way of thinking were so much more articulated and close to the thinking of European economists in the 30s and 40s, like Tinbergen and Haavelmo, than many subsequent economists who seemed sometimes to be too into their OLS algebra in some sense. There was some correspondence between one of the two—Haavelmo, I think—and Neyman on these hypothetical experiments on supply and demand. European brains were talking to each other, and not simply exchanging technical mathematics! -Don:是的,的确是这样。我和Guido的会面以及这种思维方式都很明确和接近在30年代和40年代欧洲经济学家的思维,比如说Tinbergen和Haavelmo,而不像之后很多在某种意义上太投入于他们的普通最小二乘法的经济学家。他们两中的一个——应该是Haavelmo——和Neyman在这些有关供应和需求的假设试验上有很多的来往联系。欧洲人的确是在相互进行交谈,而不是简单地交流应用数学的内容! +Don:是的,的确是这样。我和Guido的会面以及这种思维方式都很明确,也很接近在30年代和40年代欧洲经济学家的思维,比如说Tinbergen和Haavelmo,而不像之后很多在某种意义上太投入于他们的普通最小二乘法的经济学家。他们两中的一个——应该是Haavelmo——和Neyman在这些有关供应和需求的假设试验上有很多的来往联系。欧洲人的确是在相互交谈(具体问题),而不是简单地交流如何应用数学![这句话待推敲] Fabri: I know that many years before you met Guido, with other statisticians, like Tukey, you had discussions about the way economists were treating selection problems, or missing data problems. But you had some adventurous, to say the least, previous experiences with economists dealing with problems that you had worked on, which they had almost neglected completely. -Fabric:我知道在你遇见Guido之前很多年,你和其他的统计学家,比如Tukey,有一些有关经济学家如何处理选择问题以及缺失数据问题的讨论。但是你有些冒险,之前你一直致力于解决经济学家的问题,他们至少可以说是几乎完全忽视。 +Fabric:我知道在你遇见Guido之前很多年,你和其他的统计学家,比如Tukey,有一些有关经济学家如何处理选择问题以及缺失数据问题的讨论[selection problems我翻译成了选择问题,可能要再推敲一下]。但是你有些冒险,之前你一直致力于解决经济学家的问题,他们至少可以说是几乎完全忽视。 Don: Yes, James Heckman was tracking my work in the early 1980s when I came to Chicago after ETS. The public exchange came out in the ETS volume edited by Howard Wainer (which is where Glynn, Laird and Rubin, 1986, appears), with comments from Heckman, Tukey, Hartigan and others. @@ -464,11 +464,11 @@ Fabri:经济学这个领域的因果理念非常的关键,您也是因为这 Don: There are often interesting questions from social science students that come up in class. One recent example is how do we answer questions like “What would the Americas be like if they were not settled by Europeans?” I asked the questioner, “Who would they be settled by instead? By the Chinese? By the Africans? What are you talking about? What are we comparing the current American world to?” Another example comes from an undergraduate thesis that I directed, by Alice Xiang, which won both the Hoopes Prize and the economics’ Harris Prize for an outstanding honors thesis. The thesis is on the causal effect of racial affirmative action in law school admissions on some outcomes versus the same proportion of affirmative action admissions but counter-factually based on socioeconomic status. This is not just for cocktail conversation—it was a case recently before the US Supreme Court, Fisher v. University of Texas, which was kicked back to the lower court to reconsider, and additionally the issue was recently affected by a state law in Michigan. There is an amicus brief sent to the US Supreme Court to which Guido (Imbens), former Ph.D. students, Dan Ho, Jim Greiner and I (with others) contributed. -Don:班上社会科学的学生经常会提出一些非常有意思的问题。最近有一个例子是我们应该如何回答类似“如果不是欧洲人定居,那么美国会变成什么样”的问题。我问那个问问题的人:“谁会替代欧洲人定居在美国?中国人?非洲人?你想要说什么?我们拿现在的美国和谁去比较?”另一个例子是由Alice Xiang提出的,他通过本科的毕业论文赢得了胡普斯奖和经济学奖。那篇文章是关于法学院中基于种族活动通过的比例与基于社会经济地位活动通过的比例所进行的比较。这不是简单的一个故事,而是一个事件,最近在美国最高法院中有一个德克萨斯大学的案件,这一案件被驳回了下级法院重新考虑,另外这一案件最近也受到密歇根州法律的影响。Guido(Imbens)以前的博士生,Dan Ho,Jim Greiner和我(和其他人)写了一个法庭简短递送到了美国最高法院。 +Don:班上社会科学的学生经常会提出一些非常有意思的问题。最近有一个例子是我们应该如何回答类似“如果不是欧洲人定居,那么美国会变成什么样”的问题。我问那个问问题的人:“谁会替代欧洲人定居在美国?中国人?非洲人?你想要说什么?我们拿现在的美国和谁去比较?”另一个例子是由Alice Xiang提出的,他通过本科的毕业论文赢得了胡普斯奖(the Hoopes Prize)和经济学奖(the economics’ Harris Prize)。那篇文章是关于法学院中基于种族活动通过的比例与基于社会经济地位活动通过的比例所进行的比较。这不是简单的一个故事,而是一个事件,德克萨斯大学在美国最高法院中就有一个这样的案件,后来这一案件被驳回了下级法院重新考虑,另外这一案件最近也受到密歇根州法律的影响。Guido(Imbens)以前的博士生,Dan Ho,Jim Greiner和我(和其他人)写了一个顾问建议递送到了美国最高法院。 Such careful formulation of questions is something critical, and to me is central to the field of statistics. It is crucial to formulate clearly your causal question. What is the alternative intervention you are considering, when you talk about the causal effect of affirmative action on graduation rates or barpassage rates? Immediately formulating the problem as an OLS regression is the wrong way to do this,at least to me. -这种需要仔细制定公式的问题是非常关键的,对我来说是统计学的范畴。清楚地阐述你的因果关系是非常重要的。当你讨论到活动通过率和毕业率或者禁止通行率之间的因果关系时,你正在考虑替代的是什么?至少对我来说,立刻将其公式化转为OLS进行解决是错误的。 +这种需要仔细制定公式的问题是非常关键的,对我来说是统计学的范畴。清楚地阐述你的因果关系是非常重要的,当你讨论到活动通过率和毕业率或者禁止通行率之间的因果关系时,你正在考虑替代的是什么[这句话待推敲]?至少对我来说,立刻将其公式化转为OLS进行解决是错误的。 Fan: You apparently have a long interest in law; besides the aforementioned “affirmative action” thesis, you have done some interesting work in applied statistics in law. @@ -476,7 +476,7 @@ Fan:您看来对于法律一直都很有兴趣;除了上述“通过率行 Don: Yes. Paul Rosenbaum was, I think, the first of my Harvard students who did something about statistics in law. Either his qualifying paper or a class paper in 1978 was on the effect of the death penalty. Jim Greiner, another great Ph.D. student of mine, who had a law degree before entering Harvard Statistics, wrote his Ph.D. thesis (and subsequently several important papers) on potential outcomes and causal effects of immutable characteristics. He is now a full professor at the Harvard Law School. There were also several previous undergraduate students of mine who were inter-ested in statistics and law, but (sadly) most went to law school. Since 1980, I have been involved in many legal topics. -Don:是的,还有。我认为Paul Rosenbaum是我在哈佛中第一个把统计应用于法律中的学生。无论是他1978年的学年论文还是课堂论文,都是有关死刑的影响。Jim Greiner,我另一个很棒的博士生,在进入哈佛统计系前有一个法学学位,他的博士论文(以及很多重要的论文)都是有关不可变性的潜在结果和因果效应。他现在是哈佛法学院的全职教授。我同样之前有一些本科生对统计学和法学都很感兴趣,但(遗憾的是)大部分都去了法学院。从1980年起,我就参与了很多的法律话题。 +Don:是的,还有。我认为Paul Rosenbaum是我在哈佛中第一个把统计应用于法律中的学生。无论是他1978年的学年论文还是课堂论文,写的都是有关死刑的影响。Jim Greiner,我另一个很棒的博士生,在进入哈佛统计系前有一个法学学位,他的博士论文(以及很多重要的论文)都是有关不可变性(immutable characteristics)的潜在结果和因果效应。他现在是哈佛法学院的全职教授。我之前同样有一些本科生对统计学和法学都很感兴趣,但(遗憾的是)大部分都去了法学院。从1980年起,我就参与了很多的法律话题。 # THE NEW MILLENNIUM: PRINCIPAL STRATIFICATION @@ -484,11 +484,11 @@ Don:是的,还有。我认为Paul Rosenbaum是我在哈佛中第一个把统 Fabri: The work you did with Guido, as well as the work on censoring due to death, led to your paper on Principal Stratification (Frangakis and Rubin, 2002), coauthored with this brilliant student of yours, Con-stantine Frangakis, who happens to be Fan’s advisor. -Fabri:您和Guido所做的工作,以及由于死亡而进行的普查工作,促使您和您极棒的学生,Constantine Frangakis一起发表了有关主分层的论文(Frangakis and Rubin, 2002),他碰巧也是Fan的顾问。 +Fabri:您和Guido所做的工作,以及由于死亡而进行的普查工作促使您和您一个很棒的学生,Constantine Frangakis一起发表了有关主分层的论文(Frangakis and Rubin, 2002),他碰巧也是Fan的顾问。 Don: Yes, Constantine is fabulous, but the original title of that paper was very long, same with the title of his thesis. It went on and on, with probably a few Latin,a few Italian, a few French and a few Greek words!Of course I was exasperated, so I convinced him to simplify the paper’s title to “Principal Stratification in Causal Inference.” He is brilliant, so good that he has no trouble dealing with all the complexity in his own mind, but therefore he struggles at times pulling out the kernels of all these ideas, making them simple. -Don:是的,Constantine非常好,但是最起初的论文题目非常长,和他的论文题目一样。它不断变长,有一些拉丁字母,一些意大利文,一些法文和一些希腊字母!当然我很恼火,所以我说服了他来简化文章的标题,变成了“Principal Stratification in Causal Inference”。他很聪明,所以它对于处理脑中的复杂事情没有问题,但他也尝试着把所有这些想法的精华拿出来,让他们变得简单。 +Don:是的,Constantine非常好,但是最起初的论文题目非常长,和他的论文题目一样。它不断变长,有一些拉丁字母,一些意大利文,一些法文和一些希腊字母!当然我很恼火,所以我说服了他来简化文章的标题,变成了“Principal Stratification in Causal Inference”。他很聪明,所以他能够轻松处理脑中的复杂事情,但他也尝试着把所有这些想法的精华拿出来,让他们变得简单。 Fan: What do you think is the most remarkable thing about the development of Principal Stratification? @@ -496,15 +496,15 @@ Fan:您认为在主分层的发展过程中哪件事情是最值得铭记的 Don: It is a whole new collection of ways of thinking about what the real information is in causal problems. Once you understand what the real information is, you can start thinking about how you can get the answers to questions that you want to extract from that information; you always have to make assumptions, and it forces you to explicate what these assumptions are, not in terms of OLS, which no social scientist or doctor would really understand—but in terms of scientific or medical entities. And because you have to make assumptions, be honest and state them clearly. For example, I like your papers (Mealli and Pacini, 2013; Mattei, Li and Mealli, 2013) about multiple post-randomization outcomes, where you discuss that for some outcomes, exclusion restriction or other struc-tural assumptions may be more plausible. -Don:这是一种全新的收集因果问题中真正有用的信息的方式。一旦你理解了什么是真正有用的信息,你可以开始考虑如何从那些信息中提取出问题的答案。你需要作假设,这会迫使你需要说明这些假设是什么,不是通过OLS那种没有社会科学家或者博士能够理解的角度,而是从科学或者医学的角度。而且由于你需要作出假设,所以要坦诚清楚地陈述出来。举个例子,我喜欢你有关多随机化结果的论文(Mealli and Pacini, 2013; Mattei, Li and Mealli, 2013),你在论文中提到了对于一些结果,排除限制或其他结构的假设可能是更合理的。 +Don:这是一种全新的收集因果问题中真正有用的信息的方式。一旦你理解了什么是真正有用的信息,你可以开始考虑如何从那些信息中提取出问题的答案。你需要作假设,这会迫使你需要说明这些假设是什么,不是通过OLS那种没有社会科学家或者博士能够理解的角度,而是从科学或者医学的角度。而且由于你需要作出假设,所以要坦诚清楚地陈述出来。举个例子,我喜欢你有关多随机化结果(multiple post-randomization outcomes)的论文(Mealli and Pacini, 2013; Mattei, Li and Mealli, 2013),你在论文中提到了对于一些结果,排除限制或其他结构的假设可能是更合理的。 Fabri: Principal Stratification is sometimes compared to other tools for doing so-called mediation analysis—what is your view about inferring on mediation effects? -Fabri:主分层有时会和为了所谓的中介分析的其他工具相比较,您是怎么看待它在中介作用上的影响的? +Fabri:主分层有时会和为了所谓的中介分析(mediation analysis)的其他工具相比较,您是怎么看待它在中介作用上的影响的? Don: I think we (Don and Fabri) discussed a paper recently in JRSS-A, and those discussions summarize my–our view on that. Essentially, some of the people writing about mediation seem to misunderstand what a function is. They write down something that has two arguments inside parenthesis, with a comma separating them, and they seem to think that therefore something is well defined! -Don:我觉得咱们(Don和Fabri)是在讨论最近JRSS—A上的一篇文章,而且这些讨论结果总结了我们的观点。从本质上讲,一些些有关中介作用文章的人似乎误解了它的作用是什么。他们写了一些东西,括号里面有两个参数,用逗号分隔,然后他们就觉得这个就已经定义很清楚了。 +Don:我觉得咱们(Don和Fabri)是在讨论最近JRSS—A上的一篇文章,而且这些讨论结果总结了我们的观点。从本质上讲,写了一些有关中介作用文章的人似乎误解了它的作用是什么。他们写了一些东西,括号里面有两个参数,用逗号分隔,然后他们就觉得这个就已经定义很清楚了。 Fan: Even though causal inference has gained increasing attention in statistics and beyond, there seems to be a lot of misunderstanding, misuse, misinterpretation and mystifying of causal inference. Why? And what needs to be done to change? @@ -512,7 +512,7 @@ Fan:尽管因果推断在统计界受到越来越多的关注,现在似乎 Don: I think it is partly because causal inference is a very different topic from many topics in statis-tics in that it does not demand a lot of technical advanced mathematical knowledge, but does demand a lot of conceptual and basic mathematical sophistication. Principal Stratification is one such example. Writing down notation does not take the place of understanding what the notation means and how to prove things mathematically. Also partly because causal inference has become a popular topic, it has been flooded with publications that are often done casually. For some fields, it is important to bridge the “old” (everything-based-on-OLS) thinking with the newer ideas. That’s a battle Guido and I constantly had to deal with when writing our book (Imbens and Rubin, 2015). -Don:我认为这是因为因果推理是一个在统计学中与其它很不同的话题,而且它也不需要很多技巧性很高等的数学知识,但需要大量的概念和基本数学的较高程度。主分层就是这样的一个例子。记下符号并不能代替理解符号的意思以及知道如何用数学来证明事物。另外还有一部分原因是因果推断已经成为一个热门的话题,它在出版物中已经时不时被提到了。在某些领域,它是联系老方法(基于OLS的一切事物)和新想法的桥梁。那是一场我和Guido在写书(Imbens and Rubin, 2015)时需要打赢的战役。 +Don:我认为这是因为因果推理是一个在统计学中与其它很不同的话题,而且它也不需要很多技巧性以及很高等的数学知识,但需要大量的概念和较高程度的基础数学。主分层就是这样的一个例子。记下符号并不能代替理解符号的意思以及知道如何用数学来证明事物;另外还有一部分原因是因果推断已经成为一个热门的话题,它在出版物中已经时不时被提到了。在某些领域,它是联系老方法(基于OLS的一切事物)和新想法的桥梁。那是一场我和Guido在写书(Imbens and Rubin, 2015)时需要打赢的战役。 Fan: You mentioned the book; when will it finally come out? It has been forthcoming for the last ten years or so. @@ -520,7 +520,7 @@ Fan:您提到了那本书,那本书什么时候能最终发行?它在过 Don: (Laughing) Come on, Fan, that’s not fair! Has it only been ten years? We have promised the publisher (Cambridge University Press) that it will be ready by September 30, 2013. It will be about 500 pages, 25 chapters. It will be followed by another volume, dealing with topics that we could not get to in the volume due to length, such as principal stratification beyond IV settings, or because we believe the topics have not been sharply and cleanly formulated yet, such as regression discontinuity designs, or using propensity scores with multiple treatments. Also in this volume, we didn’t discuss so-called case–control studies, which are the meat of much of epidemiology; it is very important to embed these studies into a framework that makes sense, not just teach them as a bag of tricks. -Don:(大笑)嘿,Fan,那不公平!到现在只有十年吗?我们许诺给出版商(剑桥大学出版社)在2013年9月30号它就可以完成。它会有500页,25章。这将是另一卷来面对我们由于长度而无法处理的话题,比如说在四维智商的主分层,或者是一些因为相信这个话题还没有直截了当地用公式表示出来的话题,比如回归不连续的设计,或者将倾向得分和多重处理相结合。同样在这一本中,我们不会考虑所谓的案例——控制研究,那是许多流行病学的菜。将这些研究嵌入到一个有意义的框架中是非常重要的,而不仅仅是只把它们作为一个技巧包。 +Don:(大笑)嘿,Fan,那是不对的!到现在只有十年吗?我们许诺给出版商(剑桥大学出版社)在2013年9月30号它就可以完成。它会有500页,25章。它的前一卷是有关解决我们由于长度而无法处理的话题,比如说静脉之外的主分层[这句话待推敲],或者是一些因为相信这个话题还没有直截了当地用公式表示出来的话题,比如回归不连续的设计(regression discontinuity designs),或者将倾向得分和多重处理(multiple treatments)相结合,它会在这一卷发行之后再发行。同样在这一本中,我们不会考虑所谓的案例——控制研究,那是许多流行病学的菜(,但不是我们的)。将这些研究嵌入到一个有意义的框架中,而不仅仅是只把它们作为一个技巧包是非常重要的。[这段话中间following volume那句话可能需要再推敲一下] # MENTORING, CONSULTING AND EDITORSHIP @@ -540,11 +540,11 @@ Fabri:您的很多学生成为了各行各业的领袖。您经常说您最自 Don: Fabri, that is a killer question unless we have another day for this. What I can say is that it has been a great pleasure to supervise so many very talented students. I could start listing my superb Ph.D. students at the University of Chicago and at Harvard. All of my Ph.D. students are talented in many, and sometimes different, dimensions: among them there are two COPSS award winners, one president of the ASA, one president of ENAR, two JSM program chairs, and other such honors, and many of them made substantial contributions to government, academia and industry. -Don:Fabri,这可是个很能打发时光的话题啊,我们可以改天再聊。我能说的是,能教授很多很有天赋的学生的确是一大乐趣。我可以开始列出我在芝加哥大学和哈佛的优秀博士生。我所有的博士生在各行各业都非常出彩:在这其中有两位COPSS奖的获奖者,有一位ASA的主席,有一位ENAR的主席,两位JSM的项目主任以及其他很多。他们很多人都为政府、学术界。工业做出了不可替代的贡献。 +Don:Fabri,这可是个很能打发时光的话题啊,我们可以改天再聊。我能说的是,能教授很多很有天赋的学生的确是一大乐趣。我可以开始列出我在芝加哥大学和哈佛的很多优秀博士生。我所有的博士生在各行各业都非常出彩:在这其中有两位COPSS奖的获奖者,有一位ASA的主席,有一位ENAR的主席,两位JSM的项目主任以及其他很多。他们很多人都为政府、学术界、工业做出了不可替代的贡献。 Fan: You also have advised a large number of undergraduate students on a wide range of topics. This is quite uncommon because some people find mentoring undergraduates more challenging and less rewarding than mentoring graduate students. What is your take on this? -Fan:您还建议了许多本科生广泛研究主题。这是相当罕见的,因为一些人认为本科生导师比研究生导师更有挑战性而更少回报。您对此有什么看法? +Fan:您还建议许多本科生研究主题要广泛。这是相当罕见的,因为一些人认为本科生导师比研究生导师更有挑战性而更少回报。您对此有什么看法? Don: I am not completely innocent on this charge. I have no interest in “babysitting” and trying to mo-tivate unmotivated students, either undergraduate or graduate. But Harvard does attract some extremely tal-ented and motivated undergraduates, some of whom I had the pleasure to advise. Five have won Hoopes and other prizes for outstanding undergraduate theses. @@ -564,7 +564,7 @@ Fabri:(大笑)是的,我们试图反抗却没有成功。一个很特别 Don: Over the years I had many papers immediately rejected or rejected with the suggestion that it would not be wise to resubmit. However, in almost all of these cases, this treatment led to markedly improved publications, somewhere. In fact, I think that the drafts that have been repeatedly rejected possibly represent my best contributions. Certainly, the repeated rejections, combined with my trying to address various comments, led to better exposition and sometimes better problem formulation, too. The most important idea is: Do not think that people who are critics are hostile. In the vast majority of cases, editors and reviewers are giving up their time to try to help authors, and, I believe, are often especially generous and helpful to younger or inexperienced authors. Do not read into rejection letters personal attacks, which are extremely rare. So my advice is: Quality trumps quantity, and stick with good ideas even when you have to do polite battle with editors and reviewers—they are not perfect judges, but they are, almost uniformly, on your side. More details of these are given in Rubin (2014b). -Don:多年来我有许多文章被立刻拒绝或者被拒绝伴随着“提交是不明智的”建议。然而,几乎在所有这些情况下,这种对待方式引领着国家不断提高。事实上,我认为那些一再被否决的草稿可能代表了我最佳的贡献。当然,一再拒绝,加上我试图解决的各种评论,可能会产生更好的阐述以及更好的解决办法。最重要的一点是:不要认为批评你的人是有敌意的。在绝大多数情况下,我相信,编辑和审稿人都会放弃自己的时间来帮助作者而且往往会在面对年轻的或经验不足的作者时更不吝啬自己的帮助。别把拒绝理解为个人攻击,这是极为少见的。所以我的建议是:质量胜于数量,坚持自己好的想法,即使你需要不断地和编辑与审稿人进行礼貌的战斗,他们不是完美的评判人,但他们几乎都是站在你这边的。更多的细节可以在Rubin(2014b)中找到。 +Don:多年来我有许多文章被立刻拒绝或者被拒绝伴随着“提交是不明智的”建议。然而,几乎在所有这些情况下,这种对待方式引领着国家不断提高。事实上,我认为那些一再被否决的草稿可能代表了我最佳的贡献。当然,一再拒绝,加上我试图解决的各种评论,可能就会有更好的阐述以及更好的解决办法。最重要的一点是:不要认为批评你的人是有敌意的。在绝大多数情况下,我相信,编辑和审稿人都会放弃自己的时间来帮助作者而且往往会在面对年轻的或经验不足的作者时更不吝啬自己的帮助。别把拒绝理解为个人攻击,这是极为少见的。所以我的建议是:质量胜于数量,坚持自己好的想法,即使你需要不断地和编辑与审稿人进行礼貌的战斗,他们不是完美的评判人,但他们几乎都是站在你这边的。更多的细节可以在Rubin(2014b)中找到。 Fan: In 1978, you became the Coordinating and Applications Editor of JASA. Is there anything particularly unique about your editorship? @@ -572,7 +572,7 @@ Fan:在1978年,您成为了JASA的协调与应用编辑。在您作为编辑 Don: As author, I am willing to withdraw accepted papers. As a new editor, at least then, I was also willing to suggest to authors that they withdraw papers ac-cepted by the previous editors! I took some heat for that at the beginning. I read through all the papers that the previous editorial board had accepted and were awaiting copyediting for publication; for the ones that I thought were bad (I remember there were about eight), I wrote, “Dear authors, I think you should consider withdrawing this paper,” with long explanations of why I thought it would be an embarrassment to them if the paper were published. Fabri knows that I can be brutally frank about such suggestions. -Don:作为作者,我愿意撤回那些之前已经接受的文章。作为一名新编辑,至少在那会,我还是愿意建议那些作者撤回那些之前编辑接受了的文章的。我起初还是非常费了很大的心力的。我阅读了所有之前编辑答应登刊并且等待印刷出版的论文;对于那些我认为不好(我记得一共有八篇),我给他们写到:“亲爱的作者,我认为你应该考虑撤回这篇论文”后面附着我为什么举得如果论文出版了对他们来说并不是很好的长长的解释。Fabri知道我当时那些建议有多直白坦诚。 +Don:作为作者,我愿意撤回那些之前已经接受的文章。作为一名新编辑,至少在那会,我还是愿意建议那些作者撤回那些之前编辑接受了的文章的。我起初还是非常费了很大的心力的。我阅读了所有之前编辑答应登刊并且等待印刷出版的论文;对于那些我认为不好(我记得一共有八篇),我给他们写到:“亲爱的作者,我认为你应该考虑撤回这篇论文”后面附着我为什么觉得如果论文出版了对他们来说并不是很好的长长的解释。Fabri知道我当时那些建议有多直白坦诚。 Fan: Did they comply? @@ -596,7 +596,7 @@ Fabri:有一个很具争议性的案件是当时您作为顾问参与的美国 Don: Happy to. This comes from my family background dealing with lawyers. We have a legal system where certain things are legal, certain things are not. You should generally obey laws even if you don’t like them, or you should try to change them. If a company is making a legal product, and they are advertising it legally under current laws, then accept it or work to change the laws. If they lie, punish them for lying, if that is legal to do. You never see a commercial for sporty cars that show the cars going around corners extremely slowly and safely. How do they advertise cars?They usually show them sweeping around corners, and say “Don’t do this on your own.” Things that are enjoyable typically have uncertainties or risks associated with them. Flying to Europe to visit Fabri has risks! -Don:乐意之至。这是来自于和律师有关的家族背景,我们有一套法律系统告诉我们什么是合法的,什么不是。你应该遵守法律即使你不喜欢它们,或者你应该尝试改变它们。如果一个公司在生产合法的产品,而且他们在现有的法律条款下合法的利用广告,那么我们就应该接受它或者尝试改变法律。如果他们欺骗我们了,惩罚他们是合法的,那么他们就应该因此受到惩罚。你从来没有看到一个跑车的广告上面写着汽车拐弯时平稳安全。他们如何宣传汽车?他们通常会展示汽车开过拐角,并说“请勿模仿”。那些看上去令人愉快的事情通常有不确定性或与他们相关的风险。比如飞往欧洲去拜访Fabri是有风险的! +Don:乐意之至。这是来自于和律师有关的家族背景,我们有一套法律系统告诉我们什么是合法的,什么不是。即使你不喜欢它们,你也应该遵守法律,或者你应该尝试改变它们。如果一个公司在生产合法的产品,而且他们在现有的法律条款下合法的利用广告,那么我们就应该接受它或者尝试改变法律。如果他们欺骗我们了,惩罚他们是合法的,那么他们就应该因此受到惩罚。你从来没有看到一个跑车的广告上面写着汽车拐弯时平稳安全。他们如何宣传汽车?他们通常会展示汽车开过拐角,并说“请勿模仿”。那些看上去令人愉快的事情通常有不确定性或与他们相关的风险。比如飞往欧洲去拜访Fabri是有风险的! Certainly I do not doubt that no matter how I would intervene to reduce cigarette smoking, lung cancer rates would drop. But what intervention that would reduce smoking would involve reducing illegal conduct of the cigarette industry—that is the essence of the legal question. @@ -604,7 +604,7 @@ Certainly I do not doubt that no matter how I would intervene to reduce cigarett When I was first contacted by a tobacco lawyer, I was very reluctant to consult for them, and I feared strong pressure to be dishonest, which was absent throughout. The original topic was simply to comment on the ways the plaintiffs’ experts were handling missing data. On examination, their methods seemed to me to be not the best available and, at worst, silly (e.g., when missing “marital status,” call them “married”). As I continued to read these initial reports, I was appalled that hundreds of billions of dollars could be sought on the basis of such analyses. From a broader perspective, the logic underlying most of the analyses also seemed to me entirely confused. For example, alleged misconduct seemed to play no role in nearly all calculations, and phrases such as “caused by” or “attributable to,” were used nearly interchangeably and often apparently without thought. Should nearly a trillion dollars in damages be awarded on the basis of faulty logic and bad statistical analyses because we “know” the defendant is evil and guilty? If the issue were assessing the tobacco industry a trillion dollar fine for lying about its products, I would be amazed but mute. But these reports were using statistical arguments to set the numbers—is it acceptable to use bad statistics to set numbers because we “know” the defendant is guilty? What sort of precedent does that imply? The ethics of this consulting is discussed at some length in Rubin (2002). -Don:当我第一次和烟草行业的律师接触,我对向他们提供咨询是很不情愿的,我害怕不诚实会成为我巨大的压力,但实际上是不存在的。最初的主题仅仅是评判原告专家处理缺失数据的方式。在检查中,他们的方法在我看来不是最好的,甚至说是愚蠢的(例如,当不清楚“婚姻状况”时,叫他们“已婚”)。当我继续审查这些一手资料时,我很震惊数千亿美元在这种基础的数据分析中都浪费了。从更广泛的角度来看,大多数分析的逻辑似乎也完全混淆了。例如,所谓的不当行为似乎在几乎所有的计算中都没有出现,而且如“引起”或“归属于”的短语也经常互换,相互混淆。近一兆美元的损失在对错误的逻辑和糟糕的统计分析的基础上,因为我们“知道”的被告人的申辩是邪恶的有罪的,就应该归罪于被告方吗?如果问题是烟草行业对其产品的虚假广告为其带来万亿美元的利润,我会震惊但是沉默。但是,只是因为我们“知道”被告有罪,我们就可以接受这些报告使用错误的统计参数来设置数字吗?这意味着什么样的先例?这种伦理咨询在Rubin作了较为详细的讨论(2002)。 +Don:当我第一次和烟草行业的律师接触,我对向他们提供咨询是很不情愿的,我害怕不诚实会成为我巨大的压力,但实际上是不存在的。最初的主题仅仅是评判原告专家处理缺失数据的方式。在检查中,他们的方法在我看来不是最好的,甚至说是愚蠢的(例如,当不清楚“婚姻状况”时,叫他们“已婚”)。当我继续审查这些一手资料时,我很震惊数千亿美元在这种基础的数据分析中都浪费了。从更广泛的角度来看,大多数分析的逻辑似乎也完全混淆了。例如,所谓的不当行为似乎在几乎所有的计算中都没有出现,而且如“引起”或“归属于”的短语也经常互换,相互混淆。近一兆美元的损失在对错误的逻辑和糟糕的统计分析的基础上,就因为我们“知道”的被告人的申辩是邪恶的有罪的,我们就应该归罪于被告方吗?如果问题是烟草行业对其产品的虚假广告为其带来了万亿美元的利润,我会震惊,但会保持沉默。但是,只是因为我们“知道”被告有罪,我们就可以接受这些报告使用错误的统计参数来设置数字吗?这意味着什么样的先例?这种伦理咨询在Rubin作了较为详细的讨论(2002)。 Fabri: We have talked quite a lot about statistics. Let’s talk about some of your other passions in life, for example, music, audio systems and sports cars. @@ -612,7 +612,7 @@ Fabri:关于统计我们已经谈论了很多。让我们聊聊你人生中的 Don: There are other passions, too, and their order is very age dependent (I leave more to your perceptions). When a kid, for example, sports cars, both driving them and rebuilding them, was the top of those three hobbies. But age (poorer vision, slower reflexes, more aches and pains, etc.) shifted the balance more to music, both live and recorded—luckily my ears are still good enough to enjoy these, but as more age catches up, things may shift. -Don:我当然有很多乐趣,而且他们的顺序就是按照年龄来排列的(我留下了让你自己想象的空间)。当我是个小孩子,比如,一辆跑车,驾驶它还有重建它,是这三个中最重要的爱好。但当年龄增长(更差的视力,更慢的反应,更多的伤痛等等)会将爱好的平衡更多的转换为音乐,尤其是现场直播或者刻录的。很幸运我的耳朵足够好,能够享受这些,但随着年龄继续增加,爱好可能还会再变化。 +Don:我当然有很多乐趣,而且他们的顺序就是按照年龄来排列的(我留下了让你自己想象的空间)。当我是个小孩子,比如,一辆跑车,驾驶它还有修好它,是这三个中最重要的爱好。但当年龄增长(更差的视力,更慢的反应,更多的伤痛等等)会将爱好的平衡更多的转换为音乐,尤其是现场直播或者刻录的。很幸运我的耳朵足够好,能够享受这些,但随着年龄继续增加,爱好可能还会再变化。 Fan and Fabri: Well, it has been nearly three hours since we started the conversation. Here is the final question before letting you go for dinner: What is your short advice to young researchers in statistics?