传播心理案例集锦

刘海明 · 发表于 2019-7-18 22:36:09

【案例】

刘海明 · 发表于 2019-7-18 22:36:31

刘海明 · 发表于 2019-7-18 22:36:55

【案例】避免“德西效应”蔓延

“德西效应”含义

德西效应（Westerners effect）认为适度的奖励有利于巩固个体的内在动机，但过多的奖励却有可能降低个体对事情本身的兴趣．降低其内在动机。

著名实验

心理学家德西在1971年做了一个专门的实验。他让大学生做被试者，在实验室里解有趣的智力难题。

实验分三个阶段，第一阶段，所有的被试者都无奖励；第二阶段，将被试者分为两组，实验组的被试者完成一个难题可得到1美元的报酬，而控制组的被试者跟第一阶段相同，无报酬；第三阶段，为休息时间，被试者可以在原地自由活动，并把他们是否继续去解题作为喜爱这项活动的程度指标。

实验组(奖励组)被试者在第二阶段确实十分努力，而在第三阶段继续解题的人数很少，表明兴趣与努力的程度在减弱，而控制组(无奖励组)被试者有更多人花更多的休息时间在继续解题，表明兴趣与努力的程度在增强。即奖励组对解题的兴趣衰减得快，而无奖励组在进入第三阶段后，仍对解题保持了较大得兴趣。

实验结果说明：在某些情况下，人们在外在报酬和内在报酬兼得的时候，不但不会增强工作动机，反而会减低工作动机。此时，动机强度会变成两者之差。人们把这种规律称为德西效应。

可见，进行一项愉快的活动(即内感报酬)，如果提供外部的物质奖励(外加报酬)，反而会减少这项活动对参与者的吸引力。

“德西效应”的应用启示

“德西效应”与学生奖励

当学生尚没有形成自发内在学习动机时，教师可以从外界给以激励刺激，以推动学生的学习活动，这种奖励是必要和有效的。但是，如果学习活动本身已经使学生感到很有兴趣，此时再给学生奖励不仅显得多此一举，还有可能适得其反。一味奖励会使学生把奖励看成学习的目的，导致学习目标的转移，而只专注于当前的名次和奖赏物。

因此，作为教师，要特别注意正确使用奖励的方法而不滥用奖励，要避免“德西效应”。

老人与喧哗不止的孩子

一位老人在一个小乡村里休养，但附近却住着一些十分顽皮的孩子，他们天天互相追逐打闹，喧哗的吵闹声使老人无法好好休息，在屡禁不止的情况下，老人想出了一个办法——他把孩子们都叫到一起，告诉他们谁叫的声音越大，谁得到的奖励就越多，他每次都根据孩子们吵闹的情况给予不同的奖励。到孩子们已经习惯于获取奖励的时候，老人开始逐渐减少所给的奖励，最后无论孩子们怎么吵，老人一分钱也不给。

结果，孩子们认为受到的待遇越来越不公正，认为“不给钱了谁还给你叫”，再也不到老人所住的房子附近大声吵闹了。

讨论：内部动机、外部动机及其操纵

人的动机分两种：内部动机和外部动机。如果按内部动机去行动，我们就是自己的主人。如果驱使我们的是外部动机，我们就会被外部因素所左右，成为它的奴隶。老人的算计很简单，就是将孩子们的内部动机———“快乐地玩”———变成了外部动机———“为美分玩”，而他操纵着外部因素，所以也操纵了孩子们的行为。当有一天满足不了孩子的愿望了，自然就有办法对付这些顽皮的孩子了。

德西的实验结论以及趣闻轶事对我们改进教育方式很有启迪，作为家庭教育，是孩子教育的一部分，家长首先应该引导孩子树立远大的理想，增进孩子对学习的情感和兴趣，激发孩子对学习活动本身的动机，帮助孩子获得成功和乐趣；其次不能以自己的人生观、价值观看待成长的下一代，尤其一些经商的家长，总认为金钱是万能的，这在教育孩子学习的问题上显然是行不通的，这样做会使孩子迷失方向，感受不到在实现理想的过程中的成功体验，孩子在学习上怕吃苦，干任何事都和利益挂钩，在集体里无服务意识，怕吃亏，我们一直所倡导的中华民族美德中的无私奉献精神不具备，就连最起码的道德底线也达不到。

再者，家长的奖励可以是对孩子的学习有利的，诸如买书，或体育器械，对他的心身健康有帮助，如果奖励一些诸如手机之类的东西，引起学生以此在同学之间炫耀、攀比，给学校教育带来了不良影响，有些学生上课铃声不断，就像我前面列举的那位同学手机的彩铃声，她干扰了正常的教学秩序。而中学生玩手机带来的弊端远远不止这些。听说，就是在发达的西方国家，中学生是不准带手机的，《素质教育在美国》一书中谈到，在美国，学生如有不当的东西带进学校，如手机，学校一律没收，并不再还给学生。

相比之下，我们也没收，但没收后要求学生家长来校认领。曾经一名高三年级的家长认领时还跟老师说，你们学校管得太严了，现在都啥时代了，我的孩子在初中就给他买了手机……

道德教育效率不高，教学效果不好，家庭教育的偏差是不是也难辞其咎？

天下的父母都希望自己的孩子有出息，以后过上美满幸福的生活，这一愿望能不能实现，就看我们为孩子的成长做了什么，更要看我们是怎样做的。切记，勿让“德西效应”在教育中滋生蔓延。

引起“德西效应”滋生蔓延的因素

不当的薪酬奖励

薪酬是企业管人的一个有效硬件，直接影响到员工的工作情绪，但是每一个公司都不轻易使用这件精确制导武器。如果使用不好，可能会带来“德西效应”，不仅不能激励员工，还可能造成负面影响。在IBM有一句拗口的话：加薪非必然！IBM的工资水平在外企中不是最高的，也不是最低的，但IBM有一个让所有员工坚信不疑的游戏规则：干得好加薪是必然的。

1996年初IBM推出个人业绩评估计划(PBC)。PBC从三个方面win(致胜)、executive(执行)、team(团队精神)来考察员工工作的情况。IBM薪酬政策的精神是通过有竞争力的策略，吸引和激励业绩表现优秀的员工继续在岗位上保持高水平。IBM独特而有效的薪金管理，能够通过薪金管理达到奖励先进、督促平庸。IBM将外在报酬和内在报酬相互挂钩而且有效地避免了“德西效应”的产生，这种管理已经发展成为了一种高效绩文化(high performance culture)。

一个私人企业老总每每向人抱怨自己的高级人才大量走失：“我已经连续给他们涨了很多次工资了，怎么看不到一点成效呢？”就薪金这个角度来看，原有的外加报酬如果距离人才需要满足的水平太远，直接激励的原有强度又不足，必然导致“德西效应”。如果人才觉得工作本身所具有的外在报酬和内在报酬都不尽如人意，即使外在报酬不断增加，也无法达到他的预期，转投他处是必然的结局。

在实际工作中，有些单位的表彰评比活动过多过滥，并不一定能起到好的效果。其主要原因，就是这种评比表彰往往流于形式，没有真正起到树立典型、弘扬先进的作用。如果对干部职工完成了应完成的任务、履行了应履行的义务、遵守了应遵守的规章制度这些本来就应该做到的一般行为，当作突出表现大张旗鼓的进行表彰，甚至为了照顾情绪，拿表彰送人情，“排排坐，吃果果”，对今后的工作就可能出现负效应。

人们就会把这些一般行为当成是一般人难以做到、应是“积极分子”的专利，做到了就应该受到领导的褒奖，如果得不到就会失去心理平衡和工作的动力。可以说，这种送人情的表彰是一种短视行为。当然，作为领导，应该注意发现每一位下属的“闪光点”，在适当场合恰如其分地进行表扬激励。但必须注意，这种表扬是有限度的，是在平时工作中随时进行的，真正树立典型。

只有这样，人们才会把应承担的义务看作是“应该做的”、“必须做的”，做不到应该受到严厉批评，做到了不应当“邀功请赏”，只有做得好才会立功受奖。

不当奖励

奖励不可以随便泛滥，因为当一个人进行一项愉快的活动时，给他提供奖励结果反而会减少这项活动对他内在的吸引力。在某些时候，当外加报酬和内感报酬兼得，不但不会使工作的动机力量倍增，积极性更高，反而其效果会降低，变成二者之差。

以教育为例，在新课程改革的形势下，广大教师对“兴趣是最好的老师”有了更深刻的意义认识，对“学习过程是学生积极主动自主构建的过程”等新的课程思想有了认同。于是，在课堂教学中，教师为了调动学生兴趣采用了丰富多彩的教学形式和教学手段，确实收到了较好的学习效果。

例如，一些小学教师喜欢采取物质奖励的形式：给学生准备了粘贴画、有的准备了小风车、还有的直接把香甜的水果奖给学生。于是，课堂上出现了空前的气氛高潮，学生个个精神抖擞，争先恐后，场面热烈异常。这里，我们教师的这种为教育而奉献的精神着实可贵。为了讲好一节课，我们的教师无怨无悔地掏了自己原本就不丰厚的腰包。这种物质的投入也确实换来了课堂上学生的兴高采烈、情趣盎然。但当我们冷静之后，透视这精彩的背后，就不难发现物质奖励的种种弊端。

兴趣不持久

兴趣不持久是“德西效应”的孪生姐妹。“德西效应”告诉我们：在生活中，教学中，培养个人积极主动、持之以恒的兴趣和坚忍不拔的意志，仅靠物质的刺激远远不够。虽然“重赏之下，必有勇夫”，但由物质刺激所激发的兴趣，在一定程度上是淡薄的，也是短暂的。

教师在课堂教学中以物质奖励所激发出来的学生学习兴趣只能是暂时的，无法保持持久。

思想不集中

不要随意奖励学生，因为这很可能导致他们思想不集中。因为当教师对学生实施物质奖励的时候，由于小学生的年龄特点，决定了学生对物质的奖品会更感兴趣。因学生急切地想得到奖品，就要迫不及待地争取“先机”，而使其不能很好的集中思维、组织语言，导致思维深度不够，学习效果不佳。而且当获奖学生拿到奖品后，由于其自控力差，常常是马上欣喜于自己所得，开始专注于奖赏物，摩挲把玩，甚至想立刻品尝那诱人的瓜果呢！而其他学生也会把视线和思维降落到奖赏物及获奖学生身上。教师的问题在此时就变得那么微不足道，甚至还需要教师再多费一番工夫来组织教学呢！

心理不平衡

课堂中教师要实施物质奖励，奖品的数量往往是少量的，只能是一小部分学生才有幸获得。但课堂中由于教师给学生发言的机会的不均等造成学生所获奖的机率也极其的不均衡。应该说许多没得到奖的学生也具备获得奖品的能力，只是教师没给机会。因此我们看到了在奖品发放完了之后课堂上马上有了唏嘘声、哀叹声或不满声。

得到奖品的自然是乐滋滋了，没得到的呢，马上像泄了气的皮球，无精打采了。有甚者还表现出极度的不满，怨老师对自己的不理睬，怨获奖学生也没什么了不起的！造成学生情绪的不快，心理的偏激。甚至有的学生还对此产生逆反心理，无论教师拿什么作奖都表现得无动于衷，“我不稀罕！我也不学了！”效果真的不敢恭维。

欺骗色彩奖励

课堂教学中我们会看到这种现象：教师声情并茂的渲染气氛说谁有好的表现教师将有精美的礼物送给他。可是当学生表现突出时，教师拿出的奖品却是用彩纸剪出的苹果、桃子或者是刻得简单的小花，对学生而言毫无吸引力，毫无用处。获奖学生甚至是极不情愿的领取了奖品，情绪表现是大失所望的。

在第一个学生受到这种欺骗后，其他学生的情趣也深受打击。这样的物质奖励不能激起学生兴趣，可以说是费时低效的，情绪调动显得十分牵强。

注重物质所得

学生会过于注重物质所得，影响人生观的确立。课堂教学中教师频繁使用物质奖励会导致学生把学习看成是带有物质所得的一种活动，会改变学生的学习方向，把获奖看成是学习的目标。久之，学生会过于看重物质的获取。学习、做事都带上了物质索取的色彩，成为有偿活动而不是自发行为。最终会影响其正确人生观的确立。

苏霍姆林斯基说过：如果你只指望靠表面看得见的刺激来激发学生的兴趣，那就永远也培养不出学生对脑力劳动的真正热爱。要力求使学生亲自去发现兴趣的源泉，使他们在这种发现中感到自己付出劳动并得到了进步。这本身就是一个最重要的兴趣来源。

课堂教学中，如果未知本身已经使人感到很有兴趣，此时再给奖励不仅显得多此一举，还有可能适得其反。这里提醒教师：课堂教学中要学会正确使用鼓励方法而不要滥用奖励，要避免产生“德西效应”。

编辑：冉玲琳

原文链接：https://mp.weixin.qq.com/s/nTo2NMmF7tKm_F5k9J28eA

刘海明 · 发表于 2019-7-18 22:37:10

【案例】
编辑推荐：假新闻、名人八卦与政治阴谋论
令人沮丧的新闻纷至沓来，美国一份八卦小报却刊登了一则好消息：詹妮弗·安妮斯顿(Jennifer Aniston)和布拉德·皮特(Brad Pitt)上演大复合，而且安妮斯顿怀孕了！
别太认真。时报记者已向安妮斯顿的公关查证：这是假新闻。在各式出版品走向寒冬之际，虚假消息充斥在媒体和网络上，真相似乎不再重要，人们只选择相信符合自己期望的“事实”。
就像有些人爱看名人八卦一样，特朗普的支持者似乎更愿意相信这样的故事：特朗普从真人秀中的精明商人到试图粉碎精英阶级的总统候选人，最终入主白宫，成为誓言打倒“深层政府”的总统。在这个故事中，特别检察官穆勒扮演着安妮斯顿故事中安吉丽娜·朱莉(Angelina Jolie)的角色，试图挫败主人公的成长。
那些具有吸引力的叙事，不论是八卦还是政治，都很容易得到受众。但当穆勒就通俄门调查发布最终报告时，希望我们能够看到，真相比人们想要相信的故事更加强大。

编辑：邢海波

刘海明 · 发表于 2019-7-18 22:37:28

【案例】

社交媒体为何让我们感到更加孤独？

[size=0.75]2017年 3月 6日

分享平台微博

分享平台人人网

美国心理学家认为，推特（Twitter）、脸书（Facebook）、Pinterest等社交媒体令人感到更加孤独。

[color=rgb(80, 80, 80) !important]ADVERTISEMENT

最新研究报告指出，每天用社交媒体超过2个小时会使一个人的社会隔离感增加一倍。

报告认为，社交媒体上展示他人理想化的生活，可能引发羡慕嫉妒。

相关研究还调查了Instagram、Snapchat以及Tumblr用户。

美国匹兹堡大学儿科教授伊丽莎白·米勒是报告的作者之一。

她表示："我们还不知道这种现象的根本原因—是社交媒体的使用，还是所感知的社会隔离"。

"可能的原因是，年轻人最初感到孤独就去玩社交媒体；或者是由于他们大量使用社交媒体，越觉得与真实的世界隔离。"

报告提出的理论认为，一个人在网上的时间越多，在真实世界中的互动就越少。

社交媒体的使用还可能引发更多被排斥的感觉。例如，看到朋友们参加活动充满乐趣的照片而自己却没有受到邀请。

研究团队针对近2000名19-32岁的成年人开展问卷调查。

匹兹堡大学医学院教授布莱恩·普里马克教授说："这是一个重要研究课题，因为年轻人的精神健康问题和社会孤立感已经达到流行病的程度"。

他指出，"我们本质上是社会动物，但现代生活方式容易将我们分隔，而不是把我们聚在一起。"

"社交媒体表平面上看起来，提供了填补社会空虚的机会，但我认为这一研究显示，它可能不是人们希望得到的解决方式。"

https://www.bbc.com/zhongwen/simp/world-39181015

刘海明 · 发表于 2019-7-18 22:38:38

【案例】数据告诉你，为什么总是谣言比真相跑得快（中英文）

文章来源：科学杂志/再建巴别塔

原文链接：https://mp.weixin.qq.com/s/mIBZ6GdiShy9MMpeZn--jg

翻译：刘航、陈天蓝

校对：罗人杰

虚假消息以及其对政治、经济、社会生态可能产生的影响引发了全世界的担忧。为了探究虚假消息究竟是如何传播的，Vosoughi et al.把2006至2017年发布在推特上的传言级联(rumor cascades)搜集在一起进行研究。研究数据显示，大约有126，000条传言被近三百万人传播。虚假消息往往比真实消息传得更广：位列传言级联前1%的内容被散布到了一千至十万人中；然而真实消息的受众却很少能超过一千人。同时，虚假消息的传播速度也比真实消息快得多。消息本身的新奇度和受众的情绪体验可能是造成上述现象的原因。

我们调查了2006至2017年推特上所有核实过的真假消息不同的散布程度。（研究的数据由三百万人转发了超过四百五十万次的126,000条消息组成。）我们根据来自六个独立的事实核查机构的信息来判定消息的真假，它们的判定结果有着95-98%的一致性。虚假消息在任何种类的消息中都明显比真实消息散布得更广、更快、更深入，这一现象在政治消息方面尤为突出，甚至超过了恐怖主义、自然灾害、科学资讯、都市传说、金融消息等方面的传言。我们发现虚假消息比真实消息更加新奇，这点说明了人们更加倾向于分享新奇的消息。虚假的故事激起人们的恐惧、憎恶与惊讶，而真实的故事则激发人们的期望、悲伤、喜悦与信任。与普遍看法不同，机器人加速真假消息传播的程度是相同的，这暗示着虚假消息的传播速度超过真实消息并不是因为机器人，而是人类自身的原因。

关于决策、合作、交流与市场的基础理论都认为对于事实和准确的定义对于几乎每个人的行为决策都至关重要，然而真实与虚假的消息却同样通过网络媒介快速传播。定义真假已经变成一项政府理所当然的工作，而不是人们基于各种事实多层次的讨论。我们的经济体系也未能免于虚假消息的干扰。错误传言影响了股价，并动摇了人们对于大规模投资的积极性。例如，一条声称奥巴马在一场爆炸中受伤的推特使1300亿美金的股值人间蒸发。受网上流传的假消息的影响，我们对于一切消息的原有反应均受到了破坏。

新的技术在促进即时消息交换与大规模消息级联的同时，也助长了虚假消息的传播。然而，尽管我们越来越依靠这些新技术获得消息，我们却很少知道它们到底在多大程度上助长虚假消息的传播。关于虚假消息传播的坊间分析受到了媒体足够的重视，可是很少有大规模的实证调查来探究虚假消息的渗透过程及其社会根源。关于虚假消息传播的研究目前仅局限于小范围的、特殊的样本，忽视了两大重要的科学问题：真实消息与虚假消息的散布过程究竟有何不同？个人判断中的哪些因素造成了这些不同呢？

目前的研究都着重于单一传言的传播，比如希格斯玻色子的发现(the discovery of the Higgs boson)、2010年的海地地震；或者是研究发源于同一个灾难性事件的多种传言，比如2013年波士顿马拉松爆炸案；又或者建立传言散布的理论模型、发明传言甄别技术与可信度衡量办法、探求限制传言传播的方法。但是，几乎没有研究能够彻底地探究为什么虚假消息与真实消息的传播过程不同。

比如说，尽管Del Vicario et al.和 Bessi et al.研究了科学消息与阴谋论的传播，他们并没有衡量这些消息的真实性。科学消息与阴谋论并不一定都是真的，而且它们在文风上截然不同，这种文风的不同对于它们的传播有着重要的影响，但与它们的真实性毫无关系。为了理解虚假消息是如何传播的，我们有必要将真实与虚假的科学消息、阴谋论区分开来，分别研究它们的散布过程，并且将消息按照主题、文风的不同进行分类比较。迄今为止唯一通过真实性来辨别传言的是Friggeri et al.的研究。他分析了散布在Facebook上的4000条传言，但是他的侧重点在于事实调查是如何影响传言传播的，而非虚假消息与真实消息的散布过程有何不同。

在如今的政治生态与学术文献中，围绕着“伪造消息”、社交网络上针对美国内政的境外干涉以及我们对于何谓消息、伪造消息、虚假消息、传言、传言级联的理解产生了许多不固定的术语。在以往，我们用真实性判定伪造消息，但是如今“伪造消息”一词在我们的政治与媒体生态中被高度极化。政客们会利用一种精明的政治策略，将不利于自己身份的消息判定为不可靠的或编造的，并将有利于自身的消息列为可靠消息。由于这个原因，“伪造消息”这个术语已经失去了它本来的意思，从而失去了其学术性。因此，在这篇论文中，我们有意地避免使用“伪造消息”这个术语，而是使用更加客观明确的 “真实消息”与“虚假消息”。尽管“伪造消息”与“误报”都暗示着对事实的有意歪曲，我们在论文中并不会探究传言制造者的企图。相反，我们会将焦点放在真实性与被验证为真或假的消息上。

同时，我们有意地接纳了对于“消息”这一词的宽泛的定义。传统看法认为人们对于某一具体事件的阐释与评价是构成“消息”的基础，但现在人们把推特上任何一个公开的说法都叫消息。我们将消息定义为任何含有个人主张的言论，而将传言定义为事件或言论在推特上散布的社会现象。就是说，传言本质上是社会性的，它涉及人与人之间的观点交流。而消息，不管是否被分享，只是一种言论而已。

当一名用户通过发推特、传照片、贴文章链接等方式就某一主题发表个人言论时，传言级联便开始产生了。他人会通过转发的方式扩大传言的影响。一条传言的扩散过程可以看成是一个或多个级联的集合（级联是指由同一个消息来源不停转发从而形成的传言扩散模式）。比如说，一个人可以通过发表对某一具体事件的言论来触发一个传言级联，而第二个人则基于相同的事件建构起独立于第一层级的第二层传言级联。如果两个层级之间互相独立，那么它们就是同一传言的两个级联。级联的规模由转发数量决定，而级联的层数则由用户基于同一事件单独发帖的次数决定。比如说，如果10个人分别发了有关传言A的帖子，但是没有人转发，那么传言A就有10个层级，每个层级的规模为1。同样，如果2个人分别发了关于传言B的帖子，每个帖子都分别有100个人转发，那么传言B就有2个层级，每个层级的规模为100。

我们利用从推特创始之初（2006年）至2017年所有经核实的传言级联中提取出的综合数据探究了真实消息、虚假消息与半真半假消息不同的散布过程。数据包含了被三百万人转发了超过四百五十万次的126,000条消息。我们将那六所独立的事实核查机构(snopes.com, politifact.com, factcheck.org, truthorfiction.com, hoax-slayer.com, and urbanlegends.about.com)调查过的所有传言级联都作为调查样本（这六个机构的判定结果有着95-98%的高度一致性），解构传言的标题、正文以及结论，并自动收集推特上这些传言相应的级联。我们收集了传言所有的英文回复并且利用文字识别技术从图片中提取文字。对于每一条转发，我们都提取出原帖以及所有对原帖的转发。接着，我们量化了级联的深度（原帖被不同用户转发的次数），规模（级联中涉及的用户数），最大广度（在任何深度中级联中所能容纳的最大用户数），和构造式病毒(structural virality)（这是一种插入内容之中的测度，这些内容通过单一的庞大消息源或者多层级模式传播——在这种模式中每个个体的直接参与都是整个传播的一部分）。

当一个传言被转发，级联的深度、规模、最大广度和构造式病毒都会上升(图 1A)。在级联数1~1000的区间内，虚假传言占更大的比例；而在级联数大于1000的区间内，真实传言占更大的比例(图 1B)。政治方面的传言也呈现这一特征(图 1D)。虚假传言的总量在2013、2015年末达到高峰，2016年末再次登顶，与最近的总统选举存在关联(图 1C)。数据还显示，政治方面的虚假传言在2012与2016年总统选举时显著增加，而在2014年俄罗斯合并克里米亚半岛时，半真半假的传言陡增(图 1E)。政治传言是我们数据中最大的传言类别，它含有45000个级联，之后依次是都市传说、商业、恐怖主义、科学、环境、自然灾害方面的传言。

▲传言级联

(A)传言级联的一个例子，以及它的深度、规模、最大广度和构造式扩散过程。“Nodes”指的是推特用户。

(B)真、假、混合型（半真半假）级联的互补累积分布函数（The complementary cumulative distribution functions (CCDFs)），该函数描述了拥有特定级联数的传言在其类别中所占的比例。

(D)所有真、假、混合型政治方面的传言的互补累积分布函数(CCDFs)。

(E) 2006~2017年推特上所有散布的真、假、混合型政治方面传言的季度计数（Quarterly counts），在每个类别中都标注出了具体的样本。

(F)七种最常见类别传言级联的总数直方图。

当我们分析真假消息的扩散过程时，我们发现虚假消息在任何消息类别中都明显比真实消息散布得更广、更快、更深入。相比于真实级联，明显更多的虚假级联超过了深度10，而虚假级联的前0.01%比真实级联在推特中多散布了8个单位，比原帖多散布了19个单位(图 2A)。虚假消息也比真实消息传到了更多人耳中。位列传言级联前1%的内容被散布到了一千至十万人中；与之形成鲜明反差的是，真实消息的受众却很少能超过一千人(图 2B)。虚假消息在级联的每一个深度上都比真实消息传到更多人耳中，这意味着许多人所转发的虚假消息比真实消息更多(图 2C)。病毒式传播助长了虚假消息的传播，也就是说，虚假消息不仅仅通过传统方式传播，相反，它们更多是采用以病毒式分支流程为特征的点对点传播模式(图 2D)。

▲真假传闻的互补累积分布函数(CCDFs)

(A)深度

(B)规模

(C)最大广度

(D)结构式病毒

(E and F)真假传言级联散布到某一(E)深度与某一(F)用户数量所需要的分钟数

(G)每个深度上不同的用户数

(H)真假级联每个深度的平均广度。在(H)中，图表呈对数正态分布。标准误差集中在传言层面。（也就是说，同一个传言的不同级联集中在一起）

真实消息若要传到1500人耳中，需要花比虚假消息多5倍的时间(图 2F)；若要形成一个深度为10的级联，则要花虚假消息20倍的时间(图 2E)。在每个深度的级联上，虚假消息都比真实消息散布得更广(图 2H)、被更多用户转发(图 2G).。

虚假的政治消息(图 1D)传播得更深入(图 3A)、更广泛(图 3C)、受众更多(图 3B)并且比任何其他类别的虚假消息都具有病毒性(图 3D)。虚假的政治消息的传播也更快达到一定深度(图 3E)，而且，它传到20000人耳中所需要的时间，几乎是其他类别的虚假消息传到10000人耳中所需要的时间的三分之一(图 3F)。虽然其他种类的虚假消息在1-10的深度抵达同样数量的独特用户，但虚假政治消息通常会在深度上超过10的情况下达到最独特的用户（图3G）。虽然所有其他种类的虚假消息以较浅的深度传播得稍微更广，虚假政治消息则以更大的深度传递地更广，表明更流行的虚假政治消息表现出更广和更快的扩散动态（图3H）。对所有消息种类的分析表明那些关于政治、都市传说和科学的消息传播到的人数最多，然而关于政治和都市传说的消息传播得最快，并且在结构式病毒方面，它们是最具病毒性的。

▲虚假政治和其他类型的传言级联的补充累积分布函数（CCDF）

(A) 深度

(B) 规模

(C) 最大广度

(D) 结构式病毒

(E和F)虚假政治消息和其他虚假消息级联散布到某一(E)深度和某一(F)用户数量所需要的分钟数

(G) 每个深度上不同的用户数

(H) 传言级联每个深度的平均广度。在(H)中，图表呈对数正态分布。标准误差集中在传言层面。

人们可能怀疑网络中结构因素或者在级联中的用户个体性格特征解释了为什么假比真以更快的速度行进：可能那些传递虚假消息的人“追随”更多的人，并且有更多的追随者，推文发得更频繁，他们更多是那些通过“验证”的用户，或者使用推特的时间更久。但是当我们比较涉及真假传言级联的用户时，发现在每种情况下，事实与此恰恰相反：传递虚假消息的用户明显有更少的追随者(K-S test = 0.104, P ~ 0.0)、自己也追随更少的人(K-S test = 0.136, P ~ 0.0)；在推特上明显不那么活跃(K-S test = 0.054, P ~ 0.0)；被验证的显然更少(K-S test = 0.004, P < 0.001)；使用推特的时间更少(K-S test = 0.125, P ~ 0.0)。尽管有这些差异，虚假消息的扩散依旧比事实更广更快，所以原因并非如此。

▲估计消息的传播，真假消息的新奇性和对消息的回应中含有的情绪内容之间的相关性的模型

(A) 关于这些测量测试关于参与真假传言级联的用户的描述性统计以及关于这些度量在真假传言级联中分布差异的K-S测试。

(B) 预计用户转发传言偏好的逻辑回归模型结果作为一个多变量函数展示在左边；系数：logit系数；z,得分。

(C) 与用户转发传言推文前60天中其推特语库中所显示内容相比较，在真（绿色）假（红色）传言推文中消息唯一性（IU）、缩放的Bhattacharyya距离（BD）和K-L散度（KL）的差异。

(D) 对真（绿色）假（红色）传言的回应中含有的情绪内容，由NRC分类为七个层次。

(E) 与用户观看传言推文前60天中看到的推文语库中内容相比较，真假传言推文IU,KL和BD的平均值和方差。以及关于它们在真假传言中差异的K-S测试。

(F) 对真假传言的回应中含有的情绪内容（由NRC分类为七个层次）的平均值和方差，以及关于它们在真假传言中差异的K-S测试。所有标准误差都集中在传言层面，并且所有模型都在传言级别上通过集中稳定的标准误差进行估计。

当我们预设一个转发偏好的模型时，我们发现虚假消息被转发的可能性比真实消息多70%，即使当我们控制了账户年龄，活动水平、转发者数量和原始推文作者的关注量以及原始推文作者是否是已经验证的用户后。由于用户的特征和网络框架不能解释真假消息传播的差异，我们寻找了它们传播差异的其他解释。

一种解释来自信息理论和贝叶斯决策理论。新奇吸引人们的注意力，促进了富有成效的决策制定，并且鼓励信息分享，因为新奇更新了我们对这个世界的理解。当一条消息是新奇的，它不仅令人惊讶，而且更有价值，无论是从信息的理论前景（此处它对做决策提供了巨大的帮助）还是从社会前景（此处它传达了人们处于“知道”或者有渠道获取独特的“内部”消息的社会地位）来看。因此我们测试了虚假消息是否比真实消息更新奇以及是否推特用户更偏向于转发新奇的消息。

为了评定新奇性，我们随机选择5000个传播真假传言的用户，并在决定转发传言前60天内他们所接触的推文中随机抽取25000个样本。之后我们指定了一个LDA模型(latent Dirichlet Allocation Topic model)（其中包含200个主题，并且在1000万条英语推文中试验过）计算传言推文和用户转发传言推文前接触的所有推文间的信息距离。它显示出我们数据中的每条推文在200个主题中的概率分布。然后，通过将传言推文的主题分布和用户转发前60天中所接触到推文的主题分布加以比较，我们测量了真假传言中信息的新奇程度。我们发现，在所有新奇性度量中，虚假传言比真实消息要新奇得多，显示出明显更高的消息唯一性（K-S检验=0.457，P~0.0），Kullback-Leibler（K-L）发散性（K-S检验=0.433，P~0.0）和Bhattacharyya距离（K-S检验=0.4）。15，P~0）（类似于Hellinger距离）。最后的两个指标用于测量输入推文主题内容的概率分布和用户先前接触的推文语库之间的差异。

尽管测量中虚假传言比真实消息更新奇，但用户却未必察觉到了这点。因此，通过比较用户对真假传言回应中的情绪内容，我们评估用户对包含真假传言的信息的看法。我们使用加拿大国家委员会（NRC）编制的标准词典对回复中的情绪进行分类，得到了一个详尽的包含140000英语词汇以及它们与8种情绪（基于Plutchik的工作，基本情绪是：愤怒、恐惧、期望、信任、惊讶、悲伤、快乐、厌恶）之间联系的列表，以及 32000个推特标签与其相关情绪的列表。我们从回复推文中移除禁用词和网址后，计算了推文中出现的与8种情绪相关联的单词比重，为每个回复设立一个情绪偏向（即以上归纳的情绪之一）。我们发现虚假传言所激发的回复中表达了更多的惊讶（KS测试= 0.205，P~0.0）（证实了新奇性的假设）和厌恶（KS测试= 0.102，P~0.0），然而真实传言所激发的回复中则表达了更多的悲伤（KS测试= 0.037，P~0.0）、期望（KS测试= 0.038，P~0.0）、愉快（KS测试=0.061，P~0.0）和信任（KS测试= 0.060，P~0.0）（图4，D和F）。虚假消息回复中表达的情绪似乎显明了，除新奇之外，还有激发人们分享虚假消息的其他因素。我们不能认定新奇导致转发或者新奇是使虚假消息转发更多的唯一原因，即使我们的确发现虚假消息更新奇并且新奇的消息更可能被转发。

大量诊断统计和操作检查验证了我们的结果并证明它们的鲁棒性（译注：指算法的稳定性）。第一，由于每个真假传言都存在多层级联，因此与相同传言的级联相关联的方差和误差项将是相关的。因此，我们选择了集中稳定的标准误差，并计算了它们在传言水平上集中的所有方差。通过比较有无集中误差的分析来检测我们结果的鲁棒性，我们发现即使这种集中降低了我们估算的准确性，我们结果的方向、大小和重要性也没有改变，而且chi-square (P ~ 0.0) 和拟合优度检验（d = 3.4649×10-6，P~1.0）表明这些模型是很精确的。

第二，为了让六个组织核查推文事实，我们选择样本的限制中可能会出现选择偏好。事实核查可能会挑选某些类型的传言或许更偏向于它们。为了验证我们的分析在这一选择上的鲁棒性以及我们的结果对所有真假传言级联的普适性，我们独立检验另一个未经任何事实核查组织验证的传言级联样本。这些传言是由三个MIT和Wellesley大学的本科生查证的。自2016年起，我们训练这些学生使用我们自动传言探测算法在300万份英文推特中去检测传言。这些本科生助手们使用网上简单的搜索引擎调查了这些检测过的传言的真实性。在他们研究基础上，我们要求他们标记这些传言为真、假或者混合，并且移除掉所有以前被事实核查组织查证过的传言。我们的这些助手们独立工作且没有受到其他干涉，他们调查的13240个传言级联有90%的吻合度，达到了0.88的Fleiss’ kappa。当我们比较助手们达成一致的真假传言的传播动态时，发现与我们主数据预计的结果十分吻合。那些稳定数据中的虚假传言的深度、规模、最大广度、结构式病毒和速度，以及每个深度上的最大用户数量数值上都更大。当我们扩展到对那些仅获得了大多数人同意而不是有着一致意见的消息时，我们得到了同样的结果。

第三，尽管真假消息的传播方式的差异的确值得一探究竟，不管其中是否有机器人活动，但人们依旧可能担心我们关于人类判断的结论可能会因为我们分析中机器人的存在而脱轨。因此，在进行分析之前，我们用了一个复杂的机器检测算法来识别、移除所有的机器人。当我们把机器人的流量增加进分析之中，我们发现我们的主要结论都没有改变——在所有类别的消息中，虚假消息依旧比真实消息传播得更远、更快、更深、更广。当我们移除所有的由机器人开始的推文级联时（包括人类对机器人原始推文的转发），或者当我们用第二个独立的机器人检测算法，并且（为了证实我们分析的鲁棒性）改变算法探测的灵敏度阈值时，分析结果依然保持不变。机器人参与同时加速了真假传言的传播，它大致上同等地影响了它们的传播。这就表明虚假消息比真实消息传播得更远、更快、更深、更广的原因出于人类，而非机器人。

最后，更多对真假消息传播差异的行为解释的研究显然是必要的。尤其是需要与用户更直接的互动，通过采访、调查、临床实验甚至神经影像，对驱使人们传播真假消息的动机有更清晰的认识。在以后的工作中，我们支持人们运用这些方法或其他途径去调查驱使人们传播真假消息的因素。

虚假消息可能会导致恐怖袭击或者自然灾害期间资源的错误分配、商业投资失误和选举误导。不幸的是，即使网络虚假消息的数量明显增加，对于虚假消息传播方式和原因的科学理解目前还建立在临时的而非大规模、系统的研究上。我们对推特上传播的已验明的真假传言的分析证实：虚假消息的传递更具有渗透性，它同样推翻了关于虚假消息传播的传统观念，人们可能认为网络框架与个人偏好促进了虚假消息的传播，但是结果却恰恰相反。

尽管网络和个人因素更偏好于事实，但人们却更有可能转发虚假消息，推动虚假消息传播。此外，即使最近国会委员会就美国虚假消息问题举行的例会仍聚焦于自动机器人在传播虚假消息中扮演的角色，我们的结论也是：人类的行为比机器人更多促成真假消息传递的差异。这表明虚假消息遏制政策也应该要强调行为干涉，例如标榜、鼓励阻止假消息的传播，而不是完全集中在削减机器人。理解虚假消息如何传播只是朝着控制它迈出的第一步。我们希望我们的工作在虚假消息传播的原因、结果和可能的解决方法方面能激起的更大范围的研究。

Science 09 Mar 2018:

Vol. 359, Issue 6380, pp. 1146-1151

DOI: 10.1126/science.aap9559

Lies spread faster than the truth

There is worldwide concern over false news and the possibility that it can influence political, economic, and social well-being. To understand how false news spreads, Vosoughi et al. used a data set of rumor cascades on Twitter from 2006 to 2017. About 126,000 rumors were spread by ～3 million people. False news reached more people than the truth; the top 1% of false news cascades diffused to between 1000 and 100,000 people, whereas the truth rarely diffused to more than 1000 people. Falsehood also diffused faster than the truth. The degree of novelty and the emotional reactions of recipients may be responsible for the differences observed.

Science, this issue p. 1146

Abstract

We investigated the differential diffusion of all of the verified true and false news stories distributed on Twitter from 2006 to 2017. The data comprise ~126,000 stories tweeted by ~3 million people more than 4.5 million times. We classified news as true or false using information from six independent fact-checking organizations that exhibited 95 to 98% agreement on the classifications. Falsehood diffused significantly farther, faster, deeper, and more broadly than the truth in all categories of information, and the effects were more pronounced for false political news than for false news about terrorism, natural disasters, science, urban legends, or financial information. We found that false news was more novel than true news, which suggests that people were more likely to share novel information. Whereas false stories inspired fear, disgust, and surprise in replies, true stories inspired anticipation, sadness, joy, and trust. Contrary to conventional wisdom, robots accelerated the spread of true and false news at the same rate, implying that false news spreads more than the truth because humans, not robots, are more likely to spread it.

Foundational theories of decision-making (1–3), cooperation (4), communication (5), and markets (6) all view some conceptualization of truth or accuracy as central to the functioning of nearly every human endeavor. Yet, both true and false information spreads rapidly through online media. Defining what is true and false has become a common political strategy, replacing debates based on a mutually agreed on set of facts. Our economies are not immune to the spread of falsity either. False rumors have affected stock prices and the motivation for large-scale investments, for example, wiping out $130 billion in stock value after a false tweet claimed that Barack Obama was injured in an explosion (7). Indeed, our responses to everything from natural disasters (8, 9) to terrorist attacks (10) have been disrupted by the spread of false news online.

New social technologies, which facilitate rapid information sharing and large-scale information cascades, can enable the spread of misinformation (i.e., information that is inaccurate or misleading). But although more and more of our access to information and news is guided by these new technologies (11), we know little about their contribution to the spread of falsity online. Though considerable attention has been paid to anecdotal analyses of the spread of false news by the media (12), there are few large-scale empirical investigations of the diffusion of misinformation or its social origins. Studies of the spread of misinformation are currently limited to analyses of small, ad hoc samples that ignore two of the most important scientific questions: How do truth and falsity diffuse differently, and what factors of human judgment explain these differences?

Current work analyzes the spread of single rumors, like the discovery of the Higgs boson (13) or the Haitian earthquake of 2010 (14), and multiple rumors from a single disaster event, like the Boston Marathon bombing of 2013 (10), or it develops theoretical models of rumor diffusion (15), methods for rumor detection (16), credibility evaluation (17, 18), or interventions to curtail the spread of rumors (19). But almost no studies comprehensively evaluate differences in the spread of truth and falsity across topics or examine why false news may spread differently than the truth. For example, although Del Vicario et al. (20) and Bessi et al. (21) studied the spread of scientific and conspiracy-theory stories, they did not evaluate their veracity. Scientific and conspiracy-theory stories can both be either true or false, and they differ on stylistic dimensions that are important to their spread but orthogonal to their veracity. To understand the spread of false news, it is necessary to examine diffusion after differentiating true and false scientific stories and true and false conspiracy-theory stories and controlling for the topical and stylistic differences between the categories themselves. The only study to date that segments rumors by veracity is that of Friggeri et al. (19), who analyzed ~4000 rumors spreading on Facebook and focused more on how fact checking affects rumor propagation than on how falsity diffuses differently than the truth (22).

In our current political climate and in the academic literature, a ﬂuid terminology has arisen around “fake news,” foreign interventions in U.S. politics through social media, and our understanding of what constitutes news, fake news, false news, rumors, rumor cascades, and other related terms. Although, at one time, it may have been appropriate to think of fake news as referring to the veracity of a news story, we now believe that this phrase has been irredeemably polarized in our current political and media climate. As politicians have implemented a political strategy of labeling news sources that do not support their positions as unreliable or fake news, whereas sources that support their positions are labeled reliable or not fake, the term has lost all connection to the actual veracity of the information presented, rendering it meaningless for use in academic classiﬁcation. We have therefore explicitly avoided the term fake news throughout this paper and instead use the more objectively veriﬁable terms “true” or “false” news. Although the terms fake news and misinformation also imply a willful distortion of the truth, we do not make any claims about the intent of the purveyors of the information in our analyses. We instead focus our attention on veracity and stories that have been veriﬁed as true or false.

We also purposefully adopt a broad deﬁnition of the term news. Rather than deﬁning what constitutes news on the basis of the institutional source of the assertions in a story, we refer to any asserted claim made on Twitter as news (we defend this decision in the supplementary materials section on “reliable sources,” section S1.2). We deﬁne news as any story or claim with an assertion in it and a rumor as the social phenomena of a news story or claim spreading or diffusing through the Twitter network. That is, rumors are inherently social and involve the sharing of claims between people. News, on the other hand, is an assertion with claims, whether it is shared or not.

A rumor cascade begins on Twitter when a user makes an assertion about a topic in a tweet, which could include written text, photos, or links to articles online. Others then propagate the rumor by retweeting it. A rumor’s diffusion process can be characterized as having one or more cascades, which we deﬁne as instances of a rumor-spreading pattern that exhibit an unbroken retweet chain with a common, singular origin. For example, an individual could start a rumor cascade by tweeting a story or claim with an assertion in it, and another individual could independently start a second cascade of the same rumor (pertaining to the same story or claim) that is completely independent of the ﬁrst cascade, except that it pertains to the same story or claim. If they remain independent, they represent two cascades of the same rumor. Cascades can be as small as size one (meaning no one retweeted the original tweet). The number of cascades that make up a rumor is equal to the number of times the story or claim was independently tweeted by a user (not retweeted). So, if a rumor “A” is tweeted by 10 people separately, but not retweeted, it would have 10 cascades, each of size one. Conversely, if a second rumor “B” is independently tweeted by two people and each of those two tweets is retweeted 100 times, the rumor would consist of two cascades, each of size 100.

Here we investigate the differential diffusion of true, false, and mixed (partially true, partially false) news stories using a comprehensive data set of all of the fact-checked rumor cascades that spread on Twitter from its inception in 2006 to 2017. The data include ~126,000 rumor cascades spread by ~3 million people more than 4.5 million times. We sampled all rumor cascades investigated by six independent fact-checking organizations (snopes.com, politifact.com, factcheck.org, truthorfiction.com, hoax-slayer.com, and urbanlegends.about.com) by parsing the title, body, and verdict (true, false, or mixed) of each rumor investigation reported on their websites and automatically collecting the cascades corresponding to those rumors on Twitter. The result was a sample of rumor cascades whose veracity had been agreed on by these organizations between 95 and 98% of the time. We cataloged the diffusion of the rumor cascades by collecting all English-language replies to tweets that contained a link to any of the aforementioned websites from 2006 to 2017 and used optical character recognition to extract text from images where needed. For each reply tweet, we extracted the original tweet being replied to and all the retweets of the original tweet. Each retweet cascade represents a rumor propagating on Twitter that has been verified as true or false by the fact-checking organizations (see the supplementary materials for more details on cascade construction). We then quantified the cascades’ depth (the number of retweet hops from the origin tweet over time, where a hop is a retweet by a new unique user), size (the number of users involved in the cascade over time), maximum breadth (the maximum number of users involved in the cascade at any depth), and structural virality (23) (a measure that interpolates between content spread through a single, large broadcast and that which spreads through multiple generations, with any one individual directly responsible for only a fraction of the total spread) (see the supplementary materials for more detail on the measurement of rumor diffusion).

As a rumor is retweeted, the depth, size, maximum breadth, and structural virality of the cascade increase (Fig. 1A). A greater fraction of false rumors experienced between 1 and 1000 cascades, whereas a greater fraction of true rumors experienced more than 1000 cascades (Fig. 1B); this was also true for rumors based on political news (Fig. 1D). The total number of false rumors peaked at the end of both 2013 and 2015 and again at the end of 2016, corresponding to the last U.S. presidential election (Fig. 1C). The data also show clear increases in the total number of false political rumors during the 2012 and 2016 U.S. presidential elections (Fig. 1E) and a spike in rumors that contained partially true and partially false information during the Russian annexation of Crimea in 2014 (Fig. 1E). Politics was the largest rumor category in our data, with ~45,000 cascades, followed by urban legends, business, terrorism, science, entertainment, and natural disasters (Fig. 1F).

Fig. 1 Rumor cascades.

(A) An example rumor cascade collected by our method as well as its depth, size, maximum breadth, and structural virality over time. “Nodes” are users. (B) The complementary cumulative distribution functions (CCDFs) of true, false, and mixed (partially true and partially false) cascades, measuring the fraction of rumors that exhibit a given number of cascades. (C) Quarterly counts of all true, false, and mixed rumor cascades that diffused on Twitter between 2006 and 2017, annotated with example rumors in each category. (D) The CCDFs of true, false, and mixed political cascades. (E) Quarterly counts of all true, false, and mixed political rumor cascades that diffused on Twitter between 2006 and 2017, annotated with example rumors in each category. (F) A histogram of the total number of rumor cascades in our data across the seven most frequent topical categories.

When we analyzed the diffusion dynamics of true and false rumors, we found that falsehood diffused significantly farther, faster, deeper, and more broadly than the truth in all categories of information [Kolmogorov-Smirnov (K-S) tests are reported in tables S3 to S10]. A significantly greater fraction of false cascades than true cascades exceeded a depth of 10, and the top 0.01% of false cascades diffused eight hops deeper into the Twittersphere than the truth, diffusing to depths greater than 19 hops from the origin tweet (Fig. 2A). Falsehood also reached far more people than the truth. Whereas the truth rarely diffused to more than 1000 people, the top 1% of false-news cascades routinely diffused to between 1000 and 100,000 people (Fig. 2B). Falsehood reached more people at every depth of a cascade than the truth, meaning that many more people retweeted falsehood than they did the truth (Fig. 2C). The spread of falsehood was aided by its virality, meaning that falsehood did not simply spread through broadcast dynamics but rather through peer-to-peer diffusion characterized by a viral branching process (Fig. 2D).

Fig. 2 Complementary cumulative distribution functions (CCDFs) of true and false rumor cascades.

(A) Depth. (B) Size. (C) Maximum breadth. (D) Structural virality. (E and F) The number of minutes it takes for true and false rumor cascades to reach any (E) depth and (F) number of unique Twitter users. (G) The number of unique Twitter users reached at every depth and (H) the mean breadth of true and false rumor cascades at every depth. In (H), plot is lognormal. Standard errors were clustered at the rumor level (i.e., cascades belonging to the same rumor were clustered together; see supplementary materials for additional details).

It took the truth about six times as long as falsehood to reach 1500 people (Fig. 2F) and 20 times as long as falsehood to reach a cascade depth of 10 (Fig. 2E). As the truth never diffused beyond a depth of 10, we saw that falsehood reached a depth of 19 nearly 10 times faster than the truth reached a depth of 10 (Fig. 2E). Falsehood also diffused significantly more broadly (Fig. 2H) and was retweeted by more unique users than the truth at every cascade depth (Fig. 2G).

False political news (Fig. 1D) traveled deeper (Fig. 3A) and more broadly (Fig. 3C), reached more people (Fig. 3B), and was more viral than any other category of false information (Fig. 3D). False political news also diffused deeper more quickly (Fig. 3E) and reached more than 20,000 people nearly three times faster than all other types of false news reached 10,000 people (Fig. 3F). Although the other categories of false news reached about the same number of unique users at depths between 1 and 10, false political news routinely reached the most unique users at depths greater than 10 (Fig. 3G). Although all other categories of false news traveled slightly more broadly at shallower depths, false political news traveled more broadly at greater depths, indicating that more-popular false political news items exhibited broader and more-accelerated diffusion dynamics (Fig. 3H). Analysis of all news categories showed that news about politics, urban legends, and science spread to the most people, whereas news about politics and urban legends spread the fastest and were the most viral in terms of their structural virality (see fig. S11 for detailed comparisons across all topics).

Fig. 3 Complementary cumulative distribution functions (CCDFs) of false political and other types of rumor cascades.

(A) Depth. (B) Size. (C) Maximum breadth. (D) Structural virality. (E and F) The number of minutes it takes for false political and other false news cascades to reach any (E) depth and (F) number of unique Twitter users. (G) The number of unique Twitter users reached at every depth and (H) the mean breadth of these false rumor cascades at every depth. In (H), plot is lognormal. Standard errors were clustered at the rumor level.

One might suspect that structural elements of the network or individual characteristics of the users involved in the cascades explain why falsity travels with greater velocity than the truth. Perhaps those who spread falsity “followed” more people, had more followers, tweeted more often, were more often “verified” users, or had been on Twitter longer. But when we compared users involved in true and false rumor cascades, we found that the opposite was true in every case. Users who spread false news had significantly fewer followers (K-S test = 0.104, P ~ 0.0), followed significantly fewer people (K-S test = 0.136, P ~ 0.0), were significantly less active on Twitter (K-S test = 0.054, P ~ 0.0), were verified significantly less often (K-S test = 0.004, P < 0.001), and had been on Twitter for significantly less time (K-S test = 0.125, P ~ 0.0) (Fig. 4A). Falsehood diffused farther and faster than the truth despite these differences, not because of them.

Fig. 4 Models estimating correlates of news diffusion, the novelty of true and false news, and the emotional content of replies to news.

(A) Descriptive statistics on users who participated in true and false rumor cascades as well as K-S tests of the differences in the distributions of these measures across true and false rumor cascades. (B) Results of a logistic regression model estimating users’ likelihood of retweeting a rumor as a function of variables shown at the left. coeff, logit coefficient; z, z score. (C) Differences in the information uniqueness (IU), scaled Bhattacharyya distance (BD), and K-L divergence (KL) of true (green) and false (red) rumor tweets compared to the corpus of prior tweets the user was exposed to in the 60 days before retweeting the rumor tweet. (D) The emotional content of replies to true (green) and false (red) rumor tweets across seven dimensions categorized by the NRC. (E) Mean and variance of the IU, KL, and BD of true and false rumor tweets compared to the corpus of prior tweets the user has seen in the 60 days before seeing the rumor tweet as well as K-S tests of their differences across true and false rumors. (F) Mean and variance of the emotional content of replies to true and false rumor tweets across seven dimensions categorized by the NRC as well as K-S tests of their differences across true and false rumors. All standard errors are clustered at the rumor level, and all models are estimated with cluster-robust standard errors at the rumor level.

When we estimated a model of the likelihood of retweeting, we found that falsehoods were 70% more likely to be retweeted than the truth (Wald chi-square test, P ~ 0.0), even when controlling for the account age, activity level, and number of followers and followees of the original tweeter, as well as whether the original tweeter was a verified user (Fig. 4B). Because user characteristics and network structure could not explain the differential diffusion of truth and falsity, we sought alternative explanations for the differences in their diffusion dynamics.

One alternative explanation emerges from information theory and Bayesian decision theory. Novelty attracts human attention (24), contributes to productive decision-making (25), and encourages information sharing (26) because novelty updates our understanding of the world. When information is novel, it is not only surprising, but also more valuable, both from an information theoretic perspective [in that it provides the greatest aid to decision-making (25)] and from a social perspective [in that it conveys social status on one that is “in the know” or has access to unique “inside” information (26)]. We therefore tested whether falsity was more novel than the truth and whether Twitter users were more likely to retweet information that was more novel.

To assess novelty, we randomly selected ~5000 users who propagated true and false rumors and extracted a random sample of ~25,000 tweets that they were exposed to in the 60 days prior to their decision to retweet a rumor. We then specified a latent Dirichlet Allocation Topic model (27), with 200 topics and trained on 10 million English-language tweets, to calculate the information distance between the rumor tweets and all the prior tweets that users were exposed to before retweeting the rumor tweets. This generated a probability distribution over the 200 topics for each tweet in our data set. We then measured how novel the information in the true and false rumors was by comparing the topic distributions of the rumor tweets with the topic distributions of the tweets to which users were exposed in the 60 days before their retweet. We found that false rumors were significantly more novel than the truth across all novelty metrics, displaying significantly higher information uniqueness (K-S test = 0.457, P ~ 0.0) (28), Kullback-Leibler (K-L) divergence (K-S test = 0.433, P ~ 0.0) (29), and Bhattacharyya distance (K-S test = 0.415, P ~ 0.0) (which is similar to the Hellinger distance) (30). The last two metrics measure differences between probability distributions representing the topical content of the incoming tweet and the corpus of previous tweets to which users were exposed.

Although false rumors were measurably more novel than true rumors, users may not have perceived them as such. We therefore assessed users’ perceptions of the information contained in true and false rumors by comparing the emotional content of replies to true and false rumors. We categorized the emotion in the replies by using the leading lexicon curated by the National Research Council Canada (NRC), which provides a comprehensive list of ~140,000 English words and their associations with eight emotions based on Plutchik’s (31) work on basic emotion—anger, fear, anticipation, trust, surprise, sadness, joy, and disgust (32)—and a list of ~32,000 Twitter hashtags and their weighted associations with the same emotions (33). We removed stop words and URLs from the reply tweets and calculated the fraction of words in the tweets that related to each of the eight emotions, creating a vector of emotion weights for each reply that summed to one across the emotions. We found that false rumors inspired replies expressing greater surprise (K-S test = 0.205, P ~ 0.0), corroborating the novelty hypothesis, and greater disgust (K-S test = 0.102, P ~ 0.0), whereas the truth inspired replies that expressed greater sadness (K-S test = 0.037, P ~ 0.0), anticipation (K-S test = 0.038, P ~ 0.0), joy (K-S test = 0.061, P ~ 0.0), and trust (K-S test = 0.060, P ~ 0.0) (Fig. 4, D and F). The emotions expressed in reply to falsehoods may illuminate additional factors, beyond novelty, that inspire people to share false news. Although we cannot claim that novelty causes retweets or that novelty is the only reason why false news is retweeted more often, we do find that false news is more novel and that novel information is more likely to be retweeted.

Numerous diagnostic statistics and manipulation checks validated our results and confirmed their robustness. First, as there were multiple cascades for every true and false rumor, the variance of and error terms associated with cascades corresponding to the same rumor will be correlated. We therefore specified cluster-robust standard errors and calculated all variance statistics clustered at the rumor level. We tested the robustness of our findings to this specification by comparing analyses with and without clustered errors and found that, although clustering reduced the precision of our estimates as expected, the directions, magnitudes, and significance of our results did not change, and chi-square (P ~ 0.0) and deviance (d) goodness-of-fit tests (d = 3.4649 × 10–6, P ~ 1.0) indicate that the models are well specified (see supplementary materials for more detail).

Second, a selection bias may arise from the restriction of our sample to tweets fact checked by the six organizations we relied on. Fact checking may select certain types of rumors or draw additional attention to them. To validate the robustness of our analysis to this selection and the generalizability of our results to all true and false rumor cascades, we independently verified a second sample of rumor cascades that were not verified by any fact-checking organization. These rumors were fact checked by three undergraduate students at Massachusetts Institute of Technology (MIT) and Wellesley College. We trained the students to detect and investigate rumors with our automated rumor-detection algorithm running on 3 million English-language tweets from 2016 (34). The undergraduate annotators investigated the veracity of the detected rumors using simple search queries on the web. We asked them to label the rumors as true, false, or mixed on the basis of their research and to discard all rumors previously investigated by one of the fact-checking organizations. The annotators, who worked independently and were not aware of one another, agreed on the veracity of 90% of the 13,240 rumor cascades that they investigated and achieved a Fleiss’ kappa of 0.88. When we compared the diffusion dynamics of the true and false rumors that the annotators agreed on, we found results nearly identical to those estimated with our main data set (see fig. S17). False rumors in the robustness data set had greater depth (K-S test = 0.139, P ~ 0.0), size (K-S test = 0.131, P ~ 0.0), maximum breadth (K-S test = 0.139, P ~ 0.0), structural virality (K-S test = 0.066, P ~ 0.0), and speed (fig. S17) and a greater number of unique users at each depth (fig. S17). When we broadened the analysis to include majority-rule labeling, rather than unanimity, we again found the same results (see supplementary materials for results using majority-rule labeling).

Third, although the differential diffusion of truth and falsity is interesting with or without robot, or bot, activity, one may worry that our conclusions about human judgment may be biased by the presence of bots in our analysis. We therefore used a sophisticated bot-detection algorithm (35) to identify and remove all bots before running the analysis. When we added bot traffic back into the analysis, we found that none of our main conclusions changed—false news still spread farther, faster, deeper, and more broadly than the truth in all categories of information. The results remained the same when we removed all tweet cascades started by bots, including human retweets of original bot tweets (see supplementary materials, section S8.3) and when we used a second, independent bot-detection algorithm (see supplementary materials, section S8.3.5) and varied the algorithm’s sensitivity threshold to verify the robustness of our analysis (see supplementary materials, section S8.3.4). Although the inclusion of bots, as measured by the two state-of-the-art bot-detection algorithms we used in our analysis, accelerated the spread of both true and false news, it affected their spread roughly equally. This suggests that false news spreads farther, faster, deeper, and more broadly than the truth because humans, not robots, are more likely to spread it.

Finally, more research on the behavioral explanations of differences in the diffusion of true and false news is clearly warranted. In particular, more robust identification of the factors of human judgment that drive the spread of true and false news online requires more direct interaction with users through interviews, surveys, lab experiments, and even neuroimaging. We encourage these and other approaches to the investigation of the factors of human judgment that drive the spread of true and false news in future work.

False news can drive the misallocation of resources during terror attacks and natural disasters, the misalignment of business investments, and misinformed elections. Unfortunately, although the amount of false news online is clearly increasing (Fig. 1, C and E), the scientific understanding of how and why false news spreads is currently based on ad hoc rather than large-scale systematic analyses. Our analysis of all the verified true and false rumors that spread on Twitter confirms that false news spreads more pervasively than the truth online. It also overturns conventional wisdom about how false news spreads. Though one might expect network structure and individual characteristics of spreaders to favor and promote false news, the opposite is true. The greater likelihood of people to retweet falsity more than the truth is what drives the spread of false news, despite network and individual factors that favor the truth. Furthermore, although recent testimony before congressional committees on misinformation in the United States has focused on the role of bots in spreading false news (36), we conclude that human behavior contributes more to the differential spread of falsity and truth than automated robots do. This implies that misinformation-containment policies should also emphasize behavioral interventions, like labeling and incentives to dissuade the spread of misinformation, rather than focusing exclusively on curtailing bots. Understanding how false news spreads is the first step toward containing it. We hope our work inspires more large-scale research into the causes and consequences of the spread of false news as well as its potential cures.

Supplementary Materials

www.sciencemag.org/content/359/6380/1146/suppl/DC1

Materials and Methods

Figs. S1 to S20

Tables S1 to S39

References (37–75)

http://www.sciencemag.org/about/science-licenses-journal-article-reuse

This is an article distributed under the terms of the Science Journals Default License.

Acknowledgments: We are indebted to Twitter for providing funding and access to the data. We are also grateful to members of the MIT research community for invaluable discussions. The research was approved by the MIT institutional review board. The analysis code is freely available at https://goo.gl/forms/AKIlZujpexhN7fY33. The entire data set is also available, from the same link, upon signing an access agreement stating that (i) you shall only use the data set for the purpose of validating the results of the MIT study and for no other purpose; (ii) you shall not attempt to identify, reidentify, or otherwise deanonymize the data set; and (iii) you shall not further share, distribute, publish, or otherwise disseminate the data set. Those who wish to use the data for any other purposes can contact and make a separate agreement with Twitter.

编辑：吴悠

刘海明 · 发表于 2019-7-18 22:39:04

改变心理学的40项研究（1）

文章来源：成林思语

专题一、生物学和人类行为

1.一个脑还是两个脑？——脑割裂实验

【理论由来】

所谓割裂脑实验就是将大脑左、右两个半球之间的胼胝体割断，外界信息传至大脑半球皮层的某一部分后，不能同时又将此信息通过横向胼胝体纤维传至对侧皮层相对应的部分，每个半球各自独立地进行活动，彼此不能知道对侧半球的活动情况。

从1952年至 1961年的10年里，斯佩里先用猫、猴子、猩猩做了大量的割裂脑实验，取得了一些成绩，为以后做“裂脑人”的研究奠定了基础。从1961年开始，斯佩里把“裂脑人”作为研究大脑两半球各种机能的研究对象，对“裂脑人”长时间进行了一系列的实验研究。

file:///C:/Users/le/AppData/Local/Temp/ksohtml2008/wps12.png

【实验结论】

左半脑主要负责逻辑理解、记忆、时间、语言、判断、排列、分类、逻辑、分析、书写、推理、抑制、五感(视、听、嗅、触、味觉)等，思维方式具有连续性、延续性和分析性。因此左脑可以称作“意识脑”、“学术脑”、“语言脑”。

右半脑主要负责空间形象记忆、直觉、情感、身体协调、视知觉、美术、音乐节奏、想像、灵感、顿悟等，思维方式具有无序性、跳跃性、直觉性等。

斯佩里认为右脑具有图像化机能，如企划力、创造力、想像力；与宇宙共振共鸣机能，如第六感、透视力、直觉力、灵感、梦境等；超高速自动演算机能，如心算、数学；超高速大量记忆，如速读、记忆力。右脑像万能博士，善于找出多种解决问题的办法，许多高级思维功能取决于右脑。把右脑潜力充分挖掘出来，才能表现出人类无穷的创造才能。所以右脑又可以称作“本能脑”、“潜意识脑”、“创造脑”、“音乐脑”、“艺术脑”。右脑的神奇功能征服了全世界，斯佩里为全人类作出了卓越的贡献，受到全世界人民的爱戴，被誉为“右脑先生”、“世界右脑开发第一人”，斯佩里的重要研究成果是对人类大脑科学研究的重大里程碑。

　　人的左脑主要从事逻辑思维，右脑主要从事形象思维，是创造力的源泉，是艺术和经验学习的中枢，右脑的存储量是左脑的100万倍。然而现实生活中95％的人，仅仅只是使用了自己的左脑。科学家们指出，终其一生，大多数人只运用了大脑的3％—4％，其余的97％都蕴藏在右脑的潜意识之中，这是一个多么令人吃惊和遗憾的事实！人的大脑蕴藏着极大的潜能，这种潜能至今还“沉睡”着，所以深入挖掘左右两半球的智能区非常重要，而大脑潜能的开发重在右脑的开发。左脑是人的“本生脑”，记载着人出生以来的知识，管理的是近期的和即时的信息；右脑则是人的“祖先脑”，储存从古至今人类进化过程中的遗传因子的全部信息，很多本人没有经历的事情，一接触就能熟练掌握就是这个道理。右脑是潜能激发区，右脑会突然在人类的精神生活的深层展现出迹象；右脑是创造力爆发区，右脑不但有神奇的记忆能力又有高速信息处理能力，右脑发达的人会突然爆发出一种幻想、一项创新、一项发明等等。右脑是低耗高效工作区，右脑不需要很多能量就可以高速计算复杂的数学题，高速记忆、高质量记忆，具有过目不忘的本领，人的大量情绪行为也被右脑所控制。

file:///C:/Users/le/AppData/Local/Temp/ksohtml2008/wps14.png

2.小心视崖——婴儿视崖实验

【实验原理与逻辑】

视崖即"视觉的悬崖"。该实验设计的要点为排除过去经验对被试的影响，如果在排除过去经验的情况下，被试的结果和未排除时一样，那么即可证明知觉无需以往经验的支持，证明深度知觉和避免从高处跌落的能力是自动生成的，而非经验的产物。

【实验装置】

装置的中央有一个能容纳会爬的婴儿的平台，平台两边复盖着厚玻璃。平台与两边厚玻璃上铺着同样黑白相间的格子布料。一边的布料与玻璃紧贴，不造成深度，形成"浅滩"；另一边的布料与玻璃相隔数尺距离，造成深度，形成"悬崖"。

实验时，让婴儿的母亲先后站在装置的"深"、"浅"两侧招唤婴儿，观察婴儿是否拒绝从有深度错觉的"悬崖"一边爬向母亲，借以研究婴儿的深度知觉的发生。

实验证明，出生六个月左右（甚至更早些）的婴儿，尽管母亲向他（她）招手呼唤，诱导其爬向母亲，但均被拒绝。这说明他们已经具有了深度知觉的能力。动物实验表明，凡是一出生就会行走的动物，生后不久就能看出深度而躲避"悬崖"；而生后尚需经过一定时间才会行走的动物，则和人类的婴儿一样，要到会爬一段时间以后，才能看出深度而躲避"悬崖"。

吉布森和沃克（Gibson & Walk，1961）对36名6.5-14个月会爬的婴儿进行了视崖测试，结果表明：有足够大的视崖深度时（大约90cm或更多），只有不到10%（3名）的婴儿会越过悬崖爬向母亲，而有27名婴儿从中间爬向浅滩。当深侧的方格图案距离玻璃板越来越近时，就有越来越多的婴儿爬过深滩；当视崖深度是26cm时，有38%的婴儿爬过深滩；而1m时只有8%（主要是年龄较大的婴儿）。

沃克（Walk，1979）研究发现，当视崖深度为26cm时，68%的7-9个月的婴儿爬过深滩，而10-13个月的婴儿只有23%。其结论是：

（1）婴儿很早就有了深度知觉；

（2）婴儿深度知觉的能力随着年龄的递增在不断发展；

（3） 9个月以前的婴儿的深度知觉阈限为26cm。对婴儿视崖研究还发现，婴儿存在着深度恐惧。

file:///C:/Users/le/AppData/Local/Temp/ksohtml2008/wps16.png

专题二、知觉和意识

3.睡眠，毫无疑问就会做梦——梦的剥夺实验

科学家做了很多剥夺人做梦的实验，即当睡眠者一出现做梦的脑电波时，就立即被唤醒，如此反复进行。结果发现，对梦的剥夺，会导致人体一系列生生理异常。如脉搏、血压、体温以及皮肤电压均增加，植物神经系统机能有所减弱。同时还会引起人心理上一系列不良反应，如紧张、焦虑、易怒、记忆障碍、出现幻觉、定向障碍等。显然，做梦已成为人体一种正常的必不可少的生理过程。正常的梦境活动，成为保护肌体正常生命活动的重要困素之一。

日本山梨大学研究人员发现人脑中存在着两类相反的催眠肽，一类催无梦睡眠肽，另一类催有梦睡眠肽，将它施于动物，使其睡眠的有梦期延长，结果实验动物的平均寿命大大提高。我们爱做梦,实在是老天给我们的一种恩赐,即使没有生理上的寿命延长,实际上,我们能同时拥有两个世界,与不做梦的朋友相比,经历了更多.

梦对健康有积极作用

许多人都认为，一夜无梦是一件幸福的事。研究者却发现梦可以锻炼大脑的思维能力。接受临床实验的人群中有一部分在即将进入梦眠状态即被强行叫醒，结果他们即使在学习中勤奋努力也难以取得良好的成绩。梦对人脑的积极作用与计算机对数据的处理有相同之外，它使大脑皮层之间的联系进入一个更为良好的阶段。梦对保持人类的精神健康也起着非常积极的作用。许多患有慢性忧郁症的病人就长期被混乱的梦或缺乏梦的异常睡眠所困扰。当他们进行了通过电脑调节脑波的睡眠治疗后，病情有不同程度的好转。那么，为什么有些人不能做梦或者说完全回忆不了梦境呢？这是受到无线电波的干扰导致的。如果消除了这些干扰，4-6天后就可以重新享受做梦的惊险和乐趣。

做梦对人有许多好处，做梦可以锻炼脑的功能。做梦是人脑的一种工作程序，对大脑白天接受的信息进行整理，大脑白天不能处理的信息能在梦境中得到很好的处理，白天苦苦思素而无法解决的难题能在夜晚的梦境中迎刃而解。根据脑电图的测试也发现，人脑在做梦时的活动是相当强烈的，我们能够从做梦时测到快速的、紊乱的脑电波，其强度有时会超过觉醒时的强度。从这一点来看，做梦是锻炼人脑功能的一种自身需要。

file:///C:/Users/le/AppData/Local/Temp/ksohtml2008/wps18.png

经常听到有人抱怨"作了一夜的梦，一宿没休息好"，"晚上梦多，简直没有睡着"。那么真的是做梦耽误了休息吗？做梦是大脑不曾休息或休息不好的表现吗？这种观念是不对的，做梦并能回忆梦境并非睡眠不深的指标，也不能说做了梦就表示没有睡好。实际上，做梦是一种生理现象，不管你有没有梦的回忆，你每天晚上毕定要做4-5回梦。因此说整夜做梦或没有做梦都不现实，也谈不上梦多梦少。有研究表明，梦感与失眠的程度没有必然的联系。也就是说，有无梦感和梦感的程度均不能作为失眠与否及失眠程度的客观指征。

原文链接：https://mp.weixin.qq.com/s?__biz=MzAxOTk0NjM5OA==&mid=2247493732&idx=1&sn=f6e5cd91beaec9e577df8f28165d506e&chksm=9c3de695ab4a6f83e069691ef9723b79f8cf6c895d5fe5ee66327b4bb0f6a5c04536fb3ffb29&mpshare=1&scene=1&srcid=&from=singlemessage&ascene=1&devicetype=android-26&version=2700033c&nettype=WIFI&abtest_cookie=BAABAAoACwASABMABQAjlx4AVpkeAMeZHgDTmR4A3JkeAAAA&lang=zh_CN&pass_ticket=pTcrO38Y6sBf0DltFENi6koxIfI%2BpGzqDcdevGocGJ3G%2Bfx1ma5KyOedUkfsQx9E&wx_header=1

编辑：高杰

刘海明 · 发表于 2019-10-11 22:45:47

有个词叫“理性无知”，当无知符合某个人的心理需求，那么他就会保持无知，就像大跃进时期的农民当然知道肥猪赛大象的荒谬，工人也知道土法炼钢只能炼出什么东西。
但它符合人们的心理需要，所以没有人质疑。

编辑：陈茗

刘海明 · 发表于 2019-10-14 22:04:19

【案例】

中国传统心理学的创造性转化

中国的地域环境和生活方式造就了中国人特有的思维方式，并发展出中国特有的传统心理学，成为明显不同于西方心理学的一种独特体系。通过心理学的中西交流，使“中国传统心理学”创造性地转化成“中国心理学”，是任何拥有文化自觉意识的中国心理学者都难以拒绝的使命。

近几年来，我一直在思考这样一个问题：中国的心理学研究的突破点在哪里？表面来看，这是杞人忧天式的多虑，因为改革开放以来，尤其是2000年以来，中国心理学家们在国际上发表的研究成果数量逐年增加，呈现出一派繁荣景象。但站在国际的视野，可以看到能称之为“中国心理学”（Psychology of China）的东西似乎还有待进一步发现，迄今为止更多的只是一种“心理学在中国”（Psychology in China）的研究。令人困惑的问题在于，既然心理现象是古今中外人类社会日常普遍存在的现象，难道拥有几千年历史与文化的中国就没有自己的心理学吗？

过去七八十年来，我国老一辈心理学家张耀翔、潘菽、高觉敷、燕国材、杨鑫辉等人已经通过不懈努力，从中国文化典籍中陆续挖掘出了丰富的心理学思想。当然，他们一般认为中国过去只有心理学思想，没有心理学，如高觉敷说，“在西方心理学传入中国之前，我国是否就没有心理学呢？不错，我们没有心理学，但有心理学的思想。”（高觉敷，1985）

如果我们以上下五千年的视野审视东西方文化背景下的人类对自我心理的探究历史，就会注意到，至少在1582年西方传教士利玛窦来华传教之前（由此上溯至公元前551年孔子出生），文化迥异的中国与欧洲对人类自我心理的探究是各自独立进行的。也就是说，在过去2000多年的时间里，中国古代思想家们对人类心理的研究是自成一体的，这就是在中国儒、道、佛等文化中所表现出的一系列心理学思想。因此，“心理学”虽然是一个舶来词汇，但中国过去一直拥有自己的心理学是一个无可辩驳的事实。我们不妨称之为“中国传统心理学”（Chinese Traditional Psychology）。

在西方心理学进入中国之前，中国以自己的话语在中国本土地域言说自己的心理学。因此，“中国传统心理学”的话语体系与西方心理学在很大程度上是不兼容的。但是，在中国事实上已经融入全球、西方心理学已经建立并传入中国至少一百年的现实情况下，中国传统心理学必须要在与西方心理学的对话交流中才能新生。幸运的是，尽管中国传统心理学与西方心理学的话语体系在很大程度上不兼容，但普遍性的人性的存在以及多元文化的互补价值使心理学的中西对话交流变得不仅可能而且必要。

显然，中国传统心理学是普遍性的人性在中国的一种重要表现形式，是人类心理学的重要组成部分。儒道佛作为中国文化的主要代表，其所表现出的一系列心理学思想和内容（如“仁且智”“致良知”“情志相胜”“禅修顿悟”等），有一些富有人文特色，也有一些富含科学成分。中国的地域环境和生活方式造就了中国人特有的思维方式（典型如《易经》中展示的取象比类的思维方式），并发展出中国特有的传统心理学，成为明显不同于西方心理学的一种独特体系，有效地弥补了西方心理学的某些明显不足。

中国传统心理学与西方心理学的对话交流事实上已经发生。当老一辈心理学家从中国文化典籍中陆续挖掘出心理学思想的时候，中国传统心理学与西方心理学的对话交流便开始了，只不过这种对话交流的契合性还需要进一步提升，广度还需要进一步拓展，深度还需要进一步挖掘，方法还需要进一步丰富。

通过与西方心理学的对话交流，“中国传统心理学”必将如凤凰涅槃般浴火重生，最终产生“中国心理学”。这个过程，我称之为中国传统心理学的创造性转化。考虑到世界心理学还在科学化的进程中，这种创造性转化，就其本质而言可称为“科学的心理学诠释”。在这里，“诠释”是指中西文化互释，既包括“以西释中”，也包括“以中释西”。其具体的诠释路径应该是，在心理学科学化目标的指引下，先以西方心理学解析和批判中国传统心理学，逐步达成现代中国心理学，再以中国心理学解析和批判西方心理学，从而逐步达成科学心理学。这样由“以西释中”到“以中释西”，通过中西文化互释以建立和完善中国心理学，最终与现代西方心理学一起，共同建立科学心理学的大厦。通过心理学的中西交流，使“中国传统心理学”创造性地转化成“中国心理学”，是任何拥有文化自觉意识的中国心理学者都难以拒绝的使命。

“创造性转化”一词近几十年来在中国思想以及中国哲学研究领域广泛使用，就笔者目前所知，最早似源自于华人学者林毓生先生对中国传统人文的探讨。在中国的现代化进程中，如何面对中国传统文化确实是一个十分复杂的难题。以儒道佛为代表的中国传统文化，相比于从古希腊、古希伯来和古罗马文明为源头演化而来的西方文化而言，确实存在某些方面的不足。中国文化未能发展出抽象的形式逻辑，中国文化也未能发展出一套内在自洽的政治哲学体系，例如儒家提出的格物致知、诚意正心、修身齐家治国平天下的一套内圣外王哲学并没有内在逻辑上的必然性，但是，儒道佛中有关心身关系的自身实证（或曰“体证”，如禅修顿悟即是如此）研究存在着西方文化不擅长的一面。这一点，在当前中国心理学界依然只限于极少数心理学人的感同身受。中国的现代化必须植根于中国传统，但是不能囿于中国传统。因此，中国的现代化必须通过中国传统的创造性转化生发出来。同理，“中国心理学”也只能通过“中国传统心理学”的创造性转化生发出来。

过去几年来，我与罗劲教授（首都师范大学）、郭斯萍教授（广州大学）就中国传统心理学的创造性转化问题进行了多次交流，并联合罗非（中国科学院心理研究所）、彭彦琴（苏州大学）、应小萍（中国社会科学院社会学研究所）、舒曼（华东交通大学）、孙俊才（曲阜师范大学）等国内多位学者一起开展了有关中国传统心理学思想的现代诠释研究，已对有关“人贵论”“仁且智”“禅修顿悟”“信则灵”“情志相胜”等概念和命题作了初步的现代心理学诠释，希望能在这方面有所作为，为中国传统心理学的创造性转化、为中国心理学的建立做一点初步的工作（我们已将这些成果编撰为《儒道佛与认知神经科学》一书，即将由科学出版社出版）。这些研究成果也许能让我们窥见“中国心理学”的一鳞半爪，当然也可能是幼稚的、不成熟的，还需要深化，我们等待国内外同行的批评和建议。我坚信，无论是对于未来中国心理学的建立，还是对于科学心理学的发展，这样的工作都是非常必要的，并且是价值重大的。

（作者单位：南京师范大学心理学院）

原文链接：https://mp.weixin.qq.com/s/CFRw1ER7EXyw60GoLujbDA

编辑：董莉

刘海明 · 发表于 2019-10-19 20:56:39

【案例】

邓克效应，邓宁-克鲁格效应(Dunning–Kruger effect)，是一种认知偏差，能力欠缺的人有一种虚幻的自我优越感，错误地认为自己比真实情况更加优秀。Kruger和Dunning将其归咎于元认知上的缺陷，能力欠缺的人无法认识到自身的无能，不能准确评估自身的能力。简言之即：庸人容易因欠缺自知之明而自我膨胀。

编辑：董莉

帐号		自动登录	找回密码
密码			实名注册