Daryl Bem の予知実験の追試成功が科学界に与えた衝撃

October 29, 2025 · 239 min read

前置き

「Daryl Bem の予知実験」について Jimmy Akin が解説している動画を AI で整理した。

要旨

予知実験と科学論争

この文字起こしは、ポッドキャスト番組「Jimmy Akin's Mysterious World」のエピソード377の一部であり、‌‌心理学者ダリル・ベム‌‌による‌‌「未来を感じる（Feeling the Future）」‌‌と題された‌‌予知実験とそれを取り巻く科学論争‌‌について焦点を当てています。

ベムの実験は、被験者が将来の出来事についての情報を無意識に知覚する能力、すなわち‌‌予知能力‌‌を持っている可能性を強く示唆し、その結果は‌‌統計的に非常に有意‌‌であり、懐疑的な科学界に大きな衝撃を与えました。

この議論では、予知現象の‌‌科学的根拠‌‌、頻度論的統計と‌‌ベイズ統計‌‌の適用、そして‌‌再現性（レプリケーション）‌‌の重要性といった主要な科学的手法の問題が掘り下げられ、多くの懐疑論者が結果を受け入れ難いと感じたにもかかわらず、その後の‌‌メタ分析‌‌でもベムの結論が裏付けられたことが解説されています。

前置き
要旨
ダリル・ベムの予知実験と科学的論争に関するブリーフィング
「未来を感じる」— 心理学者ダリル・ベムの驚くべき9つの予知実験を徹底解説
科学を揺るがした「予知実験」から学ぶ、科学的思考のレッスン
予知実験が暴いた科学の脆さ：ダリル・ベム論文が心理学に残した教訓
研究論文レビュー：ダリル・ベムの『Feeling the Future』が提起した予知能力の科学的証拠とその論争
実験の概要と背景
1. 実験の背景と文脈
2. 実験の概要と方法論
予知実験の具体例
実験結果と統計的意義
科学界の論争と反応
再現性
情報源
文字起こし（話者識別）

ダリル・ベムの予知実験と科学的論争に関するブリーフィング

エグゼクティブ・サマリー

2011年、社会心理学者のダリル・ベムは、権威ある学術誌『Journal of Personality and Social Psychology』に『Feeling the Future』と題する論文を発表し、科学界に衝撃を与えた。この論文は、一般の人々が未来の出来事を無意識に予知する能力を持つことを示唆する9つの実験結果を報告した。実験のうち8つは統計的に有意な結果を示し、その総合的な結果は、偶然によって生じる確率が数十億分の1以下（6.5シグマ）という驚異的なものであった。

この発見は、特に懐疑的な科学者たちから激しい反発を招き、「狂気の沙汰」とまで評された。批判は主に、①探索的研究と確証的研究の混同、②ベイズ統計学ではなく頻度論的統計学を使用したこと、③未発表の研究が結果を歪める「ファイルドロワー効果」、④再現性の欠如、という4点に集中した。

しかし、これらの批判に対してベムと彼の擁護者たちは、統計的手法を用いた反論や、再現研究の重要性を強調することで応じた。ベム自身が実験手法を公開し、再現を奨励した結果、2016年には彼の研究を含む90の実験（33の研究室、14カ国）を対象としたメタ分析が発表された。この分析でも、予知の存在を支持する結果は6シグマを超え、ベムの当初の結論を強力に裏付けた。ベイズ統計学を用いた分析でも、その証拠は「決定的」と評価された。本資料は、ベムの実験、その結果、巻き起こった論争、そして再現研究による検証の全容を詳述する。

1. 序論：ダリル・ベムと『Feeling the Future』論文

ダリル・ベムは、コーネル大学の社会心理学者であり、当初は主流の心理学を専門としていたが、1990年代から超心理学の研究に関心を持つようになった。彼の研究の集大成として、2011年に発表された論文が『Feeling the Future: Experimental Evidence for Anomalous Retroactive Influences on Cognition and Affect』（未来を感じる：認知と感情に対する異常な遡及的影響の実験的証拠）である。

この論文が主流の心理学界で非常に権威のある学術誌に掲載されたことが、その影響を一層大きなものにした。論文の核心は、人間が未来の出来事から無意識に情報を受け取る能力（予知）を持つという仮説を検証した9つの実験の結果報告であった。

主な発見：

実施された9つの実験のうち、8つで統計的に有意な結果が得られた。
9つの実験結果を統合すると、そのp値は2.6 x 10⁻¹¹となり、偶然による結果である確率は約370億分の1に過ぎなかった。
この結果は統計的に6.5シグマに相当し、素粒子物理学において新粒子の発見を宣言するために必要とされる「ゴールドスタンダード」の5シグマを大きく上回っていた。

2. 9つの予知実験の概要

ベムの実験設計の独創性は、確立された標準的な心理学実験のプロトコルを基にしながら、原因（刺激）と結果（反応）の順序を逆転させた点にある。これにより、通常の因果関係ではなく、未来の出来事が現在の反応に影響を与える「遡及的因果（retro-causation）」、すなわち予知を検証することが可能になった。

実験番号	実験名	目的と手法
実験1	エロティックな刺激の予知的検出	参加者は2つのカーテンのどちらかの後ろに画像があると信じて選択する。選択後、コンピューターがランダムに画像の表示位置を決定する。セクシーな画像が未来に表示される位置を、偶然（50%）以上に正しく選択できるかを検証した。
実験2	ネガティブな刺激の予知的回避	参加者は2つの鏡像画像のどちらを好むかを選択する。選択後、コンピューターはランダムに片方を「ターゲット」と定め、選択が一致すればポジティブな画像を、不一致ならばネガティブな画像（攻撃的な犬など）をサブリミナルで表示する。未来の罰（ネガティブ画像）を無意識に避けられるかを検証した。
実験3	遡及的プライミング	参加者は画像がポジティブかネガティブかを判断する。判断後、コンピューターがランダムにポジティブまたはネガティブな単語（プライム）を表示する。未来に表示されるプライム単語と画像の感情価が一致する場合、判断時間が短縮されるかを検証した。
実験4	遡及的プライミング（再現）	実験3の再現。ただし、プライム単語を画像と意味的に関連するものに限定し、手続きを洗練させた。
実験5	遡及的馴化	参加者は2つの類似画像から好む方を選択する。選択後、コンピューターはランダムに片方をターゲットとし、その画像をサブリミナルで繰り返し表示する。未来で繰り返し接触する画像を、より好ましく感じるかを検証した。
実験6	遡及的馴化（再現）	実験5の再現。ただし、刺激にセクシーな画像を追加した。セクシーな画像は繰り返し見ると飽きが生じるため、未来で繰り返し表示される画像を逆に好まなくなるという仮説も検証した。
実験7	遡及的退屈誘導	参加者は2つの中立的な画像から好む方を選択する。選択後、コンピューターが片方をターゲットとし、その画像を意識的に知覚できる長さで繰り返し表示する。未来の反復表示によって生じる「退屈」を予知できるかを検証した。この実験のみが統計的有意水準に達しなかった（p値 = 0.096）。
実験8	遡及的想起促進	参加者に単語リストを提示し、覚えている単語を答えてもらう。その後、コンピューターがリストの半分の単語をランダムに選び、参加者にその単語を練習させる。未来で練習することになる単語を、現在においてより多く想起できるかを検証した。
実験9	遡及的想起促進（再現）	実験8の再現。練習方法を一部変更して実施した。

3. 科学界の反応と主要な批判

ベムの論文は、科学界、特に懐疑論者たちの間で激しい反発を引き起こした。彼の仮説は、「狂気が狂気の上に積み重なったもの」と見なされた。なぜなら、①予知が存在し、②訓練されていない一般人がそれを使用し、③しかも無意識かつ常時使用している、という三重の意味で常識に反すると考えられたからだ。

ある著名な心理学者は論文を読んで「物理的に気分が悪くなった」と述べ、また別の心理学者は「結果は明らかに不正操作されている」と断じた。これらの感情的な反応とは別に、科学的方法論に基づいた主要な批判が4つ提起された。

探索的研究と確証的研究の混同

批判者たちは、ベムが仮説を生成するための探索的研究と、仮説を厳密に検証するための確証的研究を明確に区別していなかった可能性を指摘した。実験の途中で手順を変更すると、意図した結果を導きやすくなる危険性がある。

ベイズ統計学の不使用

ベムは、心理学で標準的な頻度論的統計学（p値など）を用いた。しかし批判者たちは、特に「ありそうもない」仮説を評価する際には、事前の信念（prior beliefs）を考慮に入れるベイズ統計学を用いるべきだと主張した。

ファイルドロワー効果（出版バイアス）

学術誌は肯定的な結果（統計的に有意な結果）が出た研究を掲載し、否定的な結果（有意差なし）が出た研究を掲載しない傾向がある。このため、ベムの肯定的な結果だけが公表され、予知を否定する多くの未発表研究が「ファイルキャビネットの中（in file drawers）」に眠っているのではないかという懸念が示された。

再現性の必要性

科学において、ある発見が本物であると認められるためには、独立した第三者による再現実験で同じ結果が得られることが不可欠である。批判者たちは、ベムの結果が再現されるまでは結論を保留すべきだと主張した。

4. 批判への反論と再現研究の結果

ベムと彼の支持者たちは、これらの批判に対して詳細な反論を行った。そして最終的には、最も重要な検証である再現研究によって、その主張の妥当性が試されることになった。

ベム側の応答

探索的研究と確証的研究について：ベムは、研究の初期段階では後年ほど厳密ではなかった部分があったことを認めつつ、最終的な解決策は再現研究にあると主張した。
ベイズ統計学について：ベムは2人の統計学者と共に、自身のデータをベイズ統計学で再分析した論文を発表。その結果、予知仮説を支持するベイズ因子は13,669となり、一般的に「決定的な証拠」とされる基準値100をはるかに超えることを示した。
ファイルドロワー効果について： 2016年のメタ分析では、公表された効果を統計的に無意味なレベルまで引き下げるには、544件もの未発表の失敗研究が存在する必要があると算出された。超心理学分野の研究者の総数を考えると、これは非現実的な数字であると結論付けられた。

再現性の検証：2016年のメタ分析

ベムは当初から再現を奨励しており、実験に必要なプログラムや資料を誰でも利用できるように公開していた。これにより、世界中の研究室で再現実験が行われた。

2016年、ベムと彼の共同研究者たちは、これまでの再現研究を統合したメタ分析の結果を発表した。

対象：ベム自身の研究を含む、14カ国33の研究室で実施された90の実験。
頻度論的分析の結果：
- 全90実験を統合した結果は6シグマを上回り、p値は1.2 x 10⁻¹⁰（偶然である確率は約100億分の1）であった。
- ベムの最初の9実験を除外しても、p値は1.1 x 10⁻⁵（偶然である確率は約10万分の1）であり、依然として非常に高い統計的有意性を示した。
ベイズ分析の結果：
- 全実験を統合したベイズ因子は51億（5.1 x 10⁹）に達した。
- ベムの実験を除外しても、ベイズ因子は3,853であり、どちらも「決定的な証拠」の基準値100を大幅に超えていた。

このメタ分析は、ベムの実験結果が単なる偶然や研究上の欠陥によるものではなく、再現性のある現象であることを強力に示唆している。

5. 信仰の観点からの考察

カトリック教会の理解では、ベムが発見したような現象は信仰と必ずしも矛盾しない。特に、教会の博士と称される聖人たちは、人間が持つ自然な能力としての予知を認めていた。

聖グレゴリウス大教皇：「魂は時として、それ自身の精妙な力によって未来を予見することがある」と述べており、人間の魂が持つ自然な能力としての未来予知の可能性に言及した。
聖トマス・アクィナス：神から与えられる超自然的な預言とは区別される「自然的預言」という概念を提唱した。この自然的預言は、 infallible（誤り得ない）ではなく、蓋然的な未来の知識をもたらすものであり、現代の超心理学が示す予知の確率論的な性質と一致する。

したがって、ベムの研究結果は、キリスト教の教えと根本的に対立するものではないと解釈できる。

6. 結論

ダリル・ベムが2011年に発表した『Feeling the Future』論文は、一般人が無意識に予知能力を使用している可能性を示唆し、科学界に大きな論争を巻き起こした。当初は懐疑論者から方法論的な欠陥や統計解釈の誤りを厳しく批判されたが、ベムとその支持者たちはこれらの批判に反論し、最終的には科学的検証の根幹である再現研究にその結論を委ねた。

2016年の大規模なメタ分析は、独立した多数の研究室による再現実験を統合した結果、ベムの当初の発見を裏付ける強力な証拠を示した。頻度論的統計とベイズ統計の両方で得られた結果は、予知現象が統計的に頑健で再現可能であることを示唆している。この一連の出来事は、常識に反すると思われる現象であっても、厳密な科学的手法によって検証され、その証拠が積み重ねられた重要な事例と言える。

「未来を感じる」— 心理学者ダリル・ベムの驚くべき9つの予知実験を徹底解説

導入：科学界を揺るがした「予知」の実験

この記事の目的は、コーネル大学の社会心理学者ダリル・ベムが行った、「未来予知」に関する9つの画期的な実験の概要を、専門用語をできるだけ避け、平易な言葉で分かりやすく解説することです。

2011年、ベムは「Feeling the Future（未来を感じる）」と題された論文を発表しました。この論文が、エリートで非常に尊敬されている主流の心理学雑誌『Journal of Personality and Social Psychology』に掲載されたことで、科学界に衝撃が走りました。なぜなら、彼の実験結果は、私たちの常識を根底から揺るがすものだったからです。

この研究が検証しようとした核心的な問いは、非常にシンプルかつ深遠なものです。

人間の脳は、まだ起きていない未来の出来事を無意識に察知できるのか？

この問いに、ベムは厳密な科学実験の手法で挑みました。

1. ベム実験の核心的アイデア：時間を逆転させる

ベムの実験がいかに独創的であったか、その秘密は「時間の逆転」というシンプルなアイデアにあります。彼は、すでに確立されている心理学の実験モデルを取り上げ、その「原因（刺激）」と「結果（反応）」の順序を単純に逆転させたのです。

この「時間逆転」のコンセプトを最も直感的に理解できるのが、彼が行った9つの実験のうちの8番目、「逆行性記憶促進」です。

通常の記憶実験は、以下の手順で行われます。

単語リストの学習（原因）: 参加者に単語のリストを見せます。
一部の単語の練習: リストの一部の単語を練習させます。
記憶テスト（結果）: リスト全体の単語をどれだけ覚えているかテストします。

この場合、練習した単語の方がよく思い出せるのは当然です。

しかし、ベムはこの順序を逆転させました。

単語リストの学習: 参加者に単語のリストを見せます。
記憶テスト（結果）: 先に、どれだけ覚えているかテストします。
一部の単語の練習（原因）: テストの後で、コンピューターがランダムに選んだ単語を練習させます。

これはまるで「宿題をやる前にテストを受ける」ようなものです。常識的に考えれば、未来に行う練習が現在のテスト結果に影響を与えるはずがありません。しかし、ベムは「もし予知能力が存在するなら、未来に練習する単語を無意識に感じ取り、現在のテストの時点で、その単語をより多く思い出せるのではないか？」という仮説を立てました。

それでは、この独創的なアイデアに基づいた9つの具体的な実験を、一つずつ詳しく見ていきましょう。

2. 9つの予知実験：その巧妙な手法と仮説

ベムの9つの実験は、検証する心理現象によって大きく3つのグループに分けることができます。

グループA：本能的な「引き寄せ」と「回避」の実験

このグループは、人間が持つ本能的な反応が、未来の出来事によって影響を受けるかを検証しました。

実験1：魅力的な刺激の予知

仮説: 人々は、未来に提示される「性的な画像」を無意識に予知し、それを引き寄せようとするのではないか。
実験手順:
1. 参加者は、コンピューター画面の左右に表示された2つのカーテンのうち、絵が隠されていると思う方をクリックします。
2. 重要なのは、参加者がクリックを終えた後で、コンピューターがランダムにどちらかのカーテンの裏に性的な画像（または中立的な壁の画像）を配置することです。
3. もし予知能力があれば、偶然の確率である50%を超えて、性的な画像が配置されるカーテンを選択するはずです。

実験2：不快な刺激の予知的回避

仮説: 人々は、未来に提示される「不快な画像」を無意識に予知し、それを避けようとするのではないか。
実験手順:
1. 参加者は、左右に並んだ瓜二つの画像（鏡像）のうち、好ましいと思う方をボタンで選びます。
2. 参加者が選択を終えた後、コンピューターがランダムにどちらかを「ターゲット」として決定します。
3. もし参加者が未来の「ターゲット」を選んでいれば、ご褒美としてポジティブな画像（例：美しい夕焼け）がサブリミナル（意識に上らないほど一瞬）で表示されます。外れていれば、罰としてネガティブな画像（例：獰猛な犬）が表示されます。
4. もし予知能力があれば、罰を避けるために、未来の「ターゲット」を偶然の確率（50%）以上に選択するはずです。

本能的な反応だけでなく、ベムはより高度な認知プロセスである「プライミング」にも未来からの影響が及ぶのかを検証しました。

グループB：「未来からのヒント」で反応は変わるか？ (プライミング実験)

まず、「プライミング効果」とは、先に見聞きした情報（プライム）が、その後の判断や行動に無意識のうちに影響を与える心理現象です。例えば、「美しい」という単語を見た後では、美しい夕焼けの写真を「ポジティブだ」と判断するスピードが速くなります。

実験3 & 4：逆行性プライミング

仮説: 未来に表示される「プライム単語」が、現在の画像の判断速度に影響を与えるのではないか。
実験手順:
1. 参加者は、画面に表示された画像（例：美しい夕焼け、獰猛な犬）がポジティブかネガティブかを、できるだけ速くボタンを押して判断します。
2. 参加者が判断を終えた後で、コンピューターがランダムにポジティブな単語（例：beautiful）またはネガティブな単語（例：ugly）をプライムとして表示します。
3. もし予知能力があれば、未来に表示されるプライム単語と画像の感情が一致する場合（例：夕焼けの画像 → beautifulという未来の単語）、反応時間が速くなるはずです。

反応速度だけでなく、未来の出来事が現在の「好み」にまで影響を与える可能性について、ベムはさらに探求を進めました。

グループC：未来の「慣れ」と「飽き」が現在の好みを左右する (ハビチュエーション実験)

次に、「ハビチュエーション（慣れ）」の効果です。これは、何度も見たり聞いたりしたものに対して、人は親しみを覚え、好意を抱きやすくなる心理現象です。

実験5, 6, 7：逆行性ハビチュエーションと退屈

仮説: 未来に何度も繰り返し見せられる画像に対して、現在時点ですでに「慣れ」を感じ、より好ましく思うのではないか。また、性的な画像の場合は未来の反復提示で「飽き」が生じ、逆に好ましく思わなくなるのではないか。
実験手順:
1. 参加者は、2枚の似た画像のうち、どちらが好きかを選択します。
2. 参加者が選択を終えた後で、コンピューターがランダムにどちらか一方を「ターゲット」として選びます。
3. その後、選ばれた「ターゲット」画像が何度も繰り返し（サブリミナル、または意識的に）表示されます。
4. 中立的またはネガティブな画像（実験5）の場合、未来に繰り返し表示される方の画像を、現在時点ですでに「慣れ親しんだ」ものとして好みやすく（偶然の50%以上に）なると予測されました。
5. ‌‌性的な画像（実験6）‌‌では、新たな仮説が検証されました。未来の反復提示による「飽き」を予知し、逆にその画像を好まなくなる（選択率が50%以下に）のではないか、というものです。この効果が実際に観測されたため、続く実験7では「退屈」という現象に焦点を当てたデザインが採用されました。

これら独創的な9つの実験は、一体どのような結果をもたらしたのでしょうか。その衝撃的なデータを見ていきましょう。

3. 結果：偶然では説明できない驚異的なデータ

実験結果は、驚くべきものでした。9つの実験のうち、実に8つで、「統計的に有意な」結果、つまり偶然とは考えにくい明確な効果が確認されたのです。

ここで、「統計的有意性」や「p値」という概念を簡単に説明します。

p値: ある実験結果が「単なる偶然」によって得られる確率を示します。
p値が0.05未満: 科学の世界では、これが「統計的に有意」と判断される一つの基準です。これは、「その結果が偶然である可能性が5%未満（20回に1回未満）であること」を意味します。

ベムの実験では、8つの実験がこの基準をクリアしました。しかし、本当に衝撃的なのは、9つの実験結果をすべて統合した際の数値です。

評価指標	結果	説明
p値	2.6 x 10⁻¹¹	結果が偶然である確率は、約370億分の1 という天文学的な低さでした。
シグマレベル	6.5シグマ	これは、素粒子物理学で新粒子発見の「ゴールドスタンダード」とされる「5シグマ」（偶然である確率が約170万分の1）をはるかに超える驚異的な数値です。

物理学の世界であれば、新粒子の発見を宣言できるレベルを遥かに超える強力な証拠が、心理学の実験で示されたのです。

これほど強力なデータが示されたにもかかわらず、科学界はこれを素直に受け入れることができませんでした。なぜなら、それは常識を根底から覆すものだったからです。

4. 科学界の衝撃と論争、そして「再現」の重要性

ベムの論文が発表されると、懐疑的な科学者たちから強烈な拒否反応が巻き起こりました。ある著名な心理学者は「狂気の沙汰だ、純然たる狂気だ」と述べ、別の科学者は論文を読んで「気分が悪くなった」とさえ語っています。

この結果は、既存の科学的世界観に対するあまりに大きな挑戦であったため、単に「予知は存在しない」と結論づけるのではなく、「科学（という手法）自体が根本的に壊れているに違いない」と主張する記事まで現れるほど、議論は過熱しました。

このような反応の中で、科学が最も重要視する原則が前面に押し出されます。それは「再現性」です。一つの研究だけでは結論は出せません。他の独立した研究者が同じ実験を追試して、同じ結果が得られることが不可欠なのです。

しかし、この再現性の議論において、しばしば見過ごされがちな重要な事実があります。それは、超心理学の研究者たちが、主流の科学者たちよりも‌‌ methodologicalな基準を高く設定する傾向がある‌‌という点です。例えば、結果が出なかった研究が公表されず、成功例だけが世に出る「ファイル・ドロワー効果（出版バイアス）」という問題に対し、超心理学の学術誌は早くから「有意な結果が出なかった研究」も積極的に掲載してきました。また、実験計画を事前に公開する「事前登録」といった不正を防ぐ手法も、他の分野に先駆けて取り入れています。

この「再現性」という科学の根幹をなす要求に、世界中の研究者たちが応えました。そして数年後、その集大成となる衝撃的な報告が発表されたのです。2016年に発表されたメタ分析（複数の研究結果を統合して分析する手法）です。

その結果は、再び科学界を驚かせました。

対象: 14カ国、33の研究室で行われた90の実験
結果: ベムの最初の実験結果は再現され、全体として6シグマを超える非常に強力な効果が確認された。（p値は1.2 x 10⁻¹⁰、偶然である確率は約100億分の1）
洞察: ベムの元々の実験データを除外し、再現実験の結果だけを分析しても、ベイジアン統計という手法において「決定的証拠」とされる基準（ベイズファクター100）をはるかに超える3,853という値が示された。

このメタ分析は、ベムが見出した現象が、彼一人の研究室だけの特殊な結果ではないことを強力に示唆しました。

5. まとめ：私たちは皆、無意識に未来を感じているのかもしれない

この記事で解説したダリル・ベムの研究は、巧妙にデザインされた実験手法によって、ごく普通の大学生の中に、まだ起きていない「未来からの影響」が存在することを示唆しました。

この研究の最も重要な洞察は、超能力者のような特殊な人物が特別な能力を発揮するのではなく、「ごく普通の人間が、日常生活の中で無意識に予知能力を使っている」可能性を示した点にあります。未来に魅力的な出来事があればそれを引き寄せ、不快な出来事があればそれを避けようとする。未来のヒントを無意識に得て現在の判断を速め、未来に慣れ親しむものに現在から好意を抱く。これらは、私たちが日々経験していることかもしれません。

ベムの実験は、時間と意識についての私たちの理解に、大きな問いを投げかけています。もしかしたら、私たちが感じる「直感」や「虫の知らせ」には、まだ科学が解明しきれていない、未来からの微かなシグナルが含まれているのかもしれません。

科学を揺るがした「予知実験」から学ぶ、科学的思考のレッスン

導入：科学の世界を震撼させた論文

2011年、著名な心理学誌『Journal of Personality and Social Psychology』に掲載された一編の論文が、科学界に大きな衝撃を与えました。コーネル大学の社会心理学者ダリル・ベムが発表したその論文のタイトルは「Feeling the Future」。内容は驚くべきもので、「人間には未来の出来事を無意識に感知する予知能力がある」ことを、一連の実験によって証明したと主張するものでした。

さらに科学者たちを驚かせたのは、その実験結果が偶然である確率が約370億分の1という、統計的に極めて強力な証拠を伴っていたことです。常識を覆す主張と、無視できないほどの強力なデータ。この論文は、科学の正しさを測る物差しとは何か、そして科学はどうあるべきかという、根源的な問いを投げかけ、科学界を揺るがす大論争へと発展していきました。

1. 「未来を感じる」実験とは？

この論争の中心となった実験は、どのようなものだったのでしょうか。実験を主導したダリル・ベムは、もともと超心理学の専門家ではなく、コーネル大学で教鞭をとる主流の社会心理学者でした。彼が尊敬される主流の研究者であったからこそ、その論文は科学界にこれほどの衝撃を与えたのです。

彼の実験のアイデアは、シンプルかつ革新的でした。通常の心理学実験では、「刺激（原因）が反応（結果）に先行する」のが大前提です。しかしベムは、この時間的な順序を意図的に逆転させました。つまり、「未来の出来事（原因）が、現在の行動（結果）に影響を与えるか」を検証しようとしたのです。これは「逆行因果（retro-causation）」と呼ばれる仮説のテストでした。

このアイデアを最もよく示すのが、9つある実験のうちの8番目、「単語記憶テスト」です。通常のテストとベムの実験の違いを見てみましょう。

通常の記憶テストの手順	ベムの予知実験の手順
1. 単語リストを見る	1. 単語リストを見る
2. ランダムに選ばれた半分の単語を練習する	2. 記憶している単語を回答する
3. 記憶している単語を回答する	3. 回答後に、ランダムに選ばれた半分の単語を練習する

ベムの実験は、まるで「宿題をやる前にテストを受ける」ようなものです。ここでの核心的な問いは、「未来で行う練習（宿題）が、現在のテストの成績に影響を与えるのか？」というものでした。もし、未来で練習することになる単語を、現在のテストでより多く思い出せるなら、それは予知能力の証拠ではないかと考えたのです。

では、ベムはこの驚くべき仮説を、どのように「科学的」に検証したのでしょうか？ここで重要になるのが、統計の力です。

2. 「偶然」と「本物」を見分ける科学の物差し

ベムの主張がなぜこれほどまでに衝撃的だったのかを理解するには、科学者が結果の信頼性を判断するために用いる2つの基本的な統計概念を知る必要があります。

2.1. p値：偶然である確率を示す指標

‌‌p値（p-value）‌‌とは、非常にシンプルに言えば「実験で得られた結果が、単なる偶然によって生じる確率」を示す指標です。

科学の世界では、一般的に p値が0.05未満（5%未満）であることが、結果が「偶然ではない」と判断する一つの基準とされています。
この基準を満たした結果は、「統計的に有意である」と呼ばれ、意味のある発見である可能性が高いと見なされます。

ベムの研究では、9つの実験結果を統合したp値が 2.6 × 10の-11乗という驚異的な数値でした。これは、彼が示した予知効果が偶然の産物である確率が、わずか約370億分の1しかないことを意味します。この圧倒的な数値があったからこそ、科学者たちは「ありえない」と一蹴するだけでなく、真剣に向き合わざるを得なかったのです。

2.2. シグマ（σ）：結果の確からしさのレベル

シグマ（σ）どれだけ「特別」で「ありえない」結果かを示す指標です。シグマの値が大きいほど、その結果が偶然である確率は低くなります。

物理学の例: 素粒子物理学の世界では、新しい粒子を発見したと宣言するための「ゴールドスタンダード」として5シグマという非常に厳しい基準が設けられています。これは、結果が偶然である確率が約170万分の1という、極めて高い確信度を要求するものです。
ベムの研究結果: ベムの実験結果をシグマに換算すると、6.5シグマに達しました。これは、科学の中でも最も厳格な分野とされる物理学の発見基準さえも上回るものでした。

これほど強力な統計的証拠にもかかわらず、なぜ多くの科学者はベムの研究に懐疑的だったのでしょうか。その理由は、科学が単一の実験結果だけでなく、より厳格な検証プロセスを重視するからです。

3. 科学的探究の核心：反論から学ぶ重要概念

ベムの研究に対する批判や議論は、単なる揚げ足取りではありませんでした。それは、科学コミュニティが驚異的な主張を検証するために用いる、多角的な「思考の道具箱」そのものだったのです。これらの批判を通じて、科学的思考に不可欠な4つの重要概念を学んでいきましょう。

3.1. 再現性：科学の再現ドラマ

科学における発見が「本物」であるかどうかを確かめる最も基本的な方法、それが‌‌再現性（Reproducibility）‌‌です。ある研究者の発見が本物ならば、「他の独立した研究者が、同じ手順で実験（追試）を行ったときに、同じ結果が得られる」はずです。これが確認されて初めて、その発見は客観的な事実として認められます。

ベム自身も再現性の重要性を理解しており、誰でも追試ができるよう、実験に用いたプログラムや手順を公開し、再現を奨励しました。そして2016年、ベムの研究の追試結果をまとめたメタ分析（複数の研究結果を統合して分析する手法）が発表されました。世界14カ国、33の研究室で行われた90の実験データを統合した結果は、驚くべきものでした。ベムの元々の実験結果は再現され、全体として6シグマを超える非常に強力な結果が再び示されたのです。そのp値は1.2 × 10の-10乗、すなわち偶然である確率はわずか約83億分の1でした。

3.2. ファイルドロワー効果：公表されなかった研究たちの叫び

ファイルドロワー効果（File Drawer Effect）、または出版バイアス（Publication Bias）とは、科学界が抱える根深い問題の一つです。肯定的な結果（効果があった）が出た研究は論文として出版されやすい一方で、否定的な結果（効果がなかった）が出た研究は注目されず、研究者の「ファイル引き出し（File Drawer）」に眠ってしまう傾向があります。このバイアスにより、世に出ている論文だけを見ると、実際よりも効果が大きく見えてしまう危険性があるのです。

この批判に対し、2016年のメタ分析は次のように反論しました。統計的な計算により、再現実験の有意な結果を覆すには、「結果が出なかった未発表の研究が、少なくとも544本もファイル引き出しに眠っている必要がある」と算出されました。しかし、この分野の研究者の総数を考えると、これほど多くの未発表研究が存在するとは考えにくく、ファイルドロワー効果だけでは結果を説明できないと結論づけられました。

3.3. ベイズ統計 vs. 頻度論的統計：信念を更新する考え方

ベムが当初用いた統計手法（p値など）は「頻度論的統計」と呼ばれます。これに対し、批判者の一部は「ベイズ統計」を用いるべきだと主張しました。

頻度論的統計 (Frequentist Statistics): 実験結果が「偶然で起こる確率（p値）」を計算する、伝統的で客観性を重んじるアプローチです。法廷での証拠だけを見て「この証拠が偶然に生じる確率は？」と問う裁判官に似ています。
ベイズ統計 (Bayesian Statistics): 研究者が持つ「事前の信念（この仮説は、どれくらい確からしいか）」を、新しいデータ（実験結果）によって更新していくアプローチです。「容疑者が犯人であるという当初の確信度は？」という事前の信念を持ち、新しい証拠が見つかるたびにその確信度を更新していく探偵のような考え方と言えるでしょう。

ベムの研究をベイズ統計で再分析した結果も、予知仮説を強力に支持するものでした。ベイズ統計では「ベイズ因子」という指標が使われ、この値が‌‌100を超えると「決定的証拠」‌‌と見なされます。2016年のメタ分析では、以下の驚くべき結果が示されました。

ベムの元々の研究と追試研究をすべて統合したベイズ因子は、51億 (5.1 × 10の9乗) という天文学的な数値に達しました。
さらに重要なことに、ベム自身の元々の9つの実験を除外して追試研究だけを分析しても、ベイズ因子は3,853となり、依然として「決定的証拠」の基準をはるかに上回ったのです。

3.4. 探索的研究と検証的研究：仮説探しか、仮説検証か

科学研究には、大きく分けて2つのフェーズがあります。「探索的研究」は新しい仮説を生み出すための自由度の高い研究、「検証的研究」は特定の仮説が正しいかどうかを厳密にテストするための研究です。

ベムの初期の研究は、この2つの区別が曖昧だったのではないか、という批判がありました。この議論は、単にベム個人への批判に留まらず、心理学という分野全体が抱える構造的な問題、いわゆる「再現性の危機」を浮き彫りにする転換点となりました。この論争をきっかけに、現代の科学では‌‌事前登録（Pre-registration）‌‌の重要性が広く認識されるようになったのです。事前登録とは、実験を始める前に研究計画（仮説、手順、分析方法など）を公開のデータベースに登録する制度で、研究の透明性を高め、信頼性を向上させるための重要な仕組みです。

これらの激しい論争は、単に予知能力の有無を巡るものだけではありませんでした。それは、科学という営みそのものを、より良くするための重要な問いを私たちに投げかけたのです。

4. まとめ：ベム論争が科学に教えてくれたこと

このガイドでは、ダリル・ベムの予知実験を巡る一連の論争を通じて、科学の世界における重要な概念を学んできました。

学んだ重要概念の再確認:
- p値: 結果が偶然である確率。
- シグマ（σ）: 結果の確からしさのレベル。
- 再現性: 他の研究者が同じ結果を再現できること。科学の客観性の礎。
- ファイルドロワー効果: 公表されない研究が結果を歪める可能性。
- ベイズ統計: 事前の信念をデータで更新していく統計的アプローチ。

この一連の出来事から私たちが得られる最も重要な教訓は、以下の2つです。

科学とは、自己修正していく厳格な「プロセス」であることこの一連の騒動が示す科学の力とは、信じがたい一つの「結果」そのものではなく、その後に続く、時に激しい懐疑、検証、そして改良を巡る厳格な「プロセス」にこそ宿っているということです。これこそが科学の進歩を支える原動力であり、予知能力の有無を信じるか否かよりも、このプロセスを理解することの方がはるかに重要なのです。
常識を疑う研究が、科学の方法論を前進させることベムの研究は多くの科学者から「ありえない」と一蹴されました。しかし、その挑戦的な研究と無視できないデータがあったからこそ、心理学の研究手法そのものを見直す「再現性の危機」という大きなきっかけが生まれました。常識に挑戦する研究は、たとえその仮説が最終的に受け入れられなくても、科学の方法論を洗練させ、分野全体を前進させる力を持つのです。

結局のところ、この事例が示すのは、科学とは完成された知識の体系ではなく、常に健全な懐疑と厳密な検証を通じて真理に一歩ずつ近づこうとする「終わりのない探究の物語」である、ということなのです。そしてこの物語は、私たち一人ひとりが、日々の情報に対して健全な懐疑心と探究心を持つことの大切さを教えてくれます。

予知実験が暴いた科学の脆さ：ダリル・ベム論文が心理学に残した教訓

導入：一流学術誌に掲載された「ありえない」論文

2011年、著名な社会心理学者ダリル・ベムが、権威ある学術誌『Journal of Personality and Social Psychology』に一篇の論文を発表し、科学界に激震が走った。その論文「Feeling the Future」は、人間が未来の出来事を無意識に感知する能力、すなわち「予知能力」の実在を示唆する9つの実験結果を報告していたのである。この出来事は、単に超常現象の証明を巡る論争に留まらず、科学が自らの正当性を担保するための方法論そのものの健全性に対する、深刻な問いを投げかける「リトマス試験紙」となった。それは、科学と疑似科学を分かつ境界線（demarcation problem）はどこにあるのか、という科学哲学の中心的な問いを、予期せぬ形で突きつけたのだ。

ベムの論文がこれほどまでの論争を巻き起こした核心には、一つの根深い緊張関係が存在する。それは、「常識的にはありえない」とされる疑似科学的な主張が、心理学の標準的な実験手続きと統計分析という「堅固に見える」鎧をまとって提示されたという事実である。この論文は、正統な科学的手法を用いて「ありえない」結論に到達したことで、懐疑派が依拠してきた方法論的境界線そのものを揺るがしたのである。

本稿では、このベムの予知実験を巡る論争を、超心理学というトピックの真偽を問うこと自体を目的とするのではなく、科学の自己修正プロセスがいかに機能し、そして時に機能不全に陥るのかを明らかにする重要なケーススタディとして考察する。この論争は、結果として心理学研究のあり方を大きく変えるきっかけとなり、現代の科学が直面する方法論的危機とその克服の物語を我々に教えてくれるのだ。

1. 「未来を感じる」実験の巧妙な設計と衝撃的な結果

ダリル・ベムの実験がなぜ多くの科学者を当惑させたのかを理解するためには、まずその独創的な方法論と、統計的に圧倒的とされる結果を分析することが不可欠である。彼の実験の巧妙さは、全く新しい実験パラダイムを創造したのではなく、既存の確立された心理学実験の構造を巧みに利用した点にあった。

ベムの実験設計の核心は、既存の心理学実験における原因と結果の連鎖を、時間的に「逆転」させるという極めて単純なアイデアに基づいている。通常の実験では、「刺激（原因）」が被験者の「反応（結果）」に先行する。しかしベムは、被験者の反応を先に記録し、その後にランダムに決定される刺激を提示することで、未来の刺激が現在の反応に影響を与える、すなわち「遡及的影響（retroactive influence）」を検証しようと試みたのだ。

具体例として、彼の論文における‌‌実験8「遡及的リコール促進（retroactive facilitation of recall）」‌‌を見てみましょう。この実験の手順は以下の通りです。

被験者はまず、単語リスト（例えば48単語）を提示される。
次に、記憶している単語をできるだけ多く書き出す「リコールテスト」を受ける。
テスト終了後、コンピューターがリストの中からランダムに半分の単語を選択し、被験者はその単語をタイピングするなどの「練習」を行う。

ここでの直観に反する仮説は、「テストを受けた後に練習することになる単語は、そうでない単語よりも、テスト時点での正答率が統計的に高くなる」というものである。まるで、未来に行う宿題が、過去に受けたテストの成績を向上させるかのようだ。

ベムはこのような設計の実験を9つ実施し、そのうち8つで統計的に有意な結果（p < 0.05）を得たと報告した。さらに衝撃的だったのは、9つの実験結果を統合したメタ分析の結果である。その統計的確率は、偶然では到底説明できないレベルに達しており、素粒子物理学で新粒子の発見を宣言する際の「ゴールドスタンダード」とされる5シグマ（約170万分の1の確率）をはるかに上回る6.5シグマ（偶然である確率が約370億分の1）に相当するとされたのだ。

この結果は、懐疑派にとって単なる「計算間違い」や「実験の不備」として片付けられるものではありませんでした。なぜなら、ベムが用いた手法は、当時の心理学研究で広く受け入れられていた標準的なものだったからです。もしこの手法で「ありえない」結論が導き出されるのであれば、問題は予知能力の存在そのものよりも、心理学という学問分野全体が依拠してきた研究手法の根幹にあるのではないか——。この深刻な問いが、科学界を揺るがす大論争の幕開けとなったのです。

2. 懐疑論の噴出：科学的方法論への全面的な問い直し

ベム論文が突きつけた難問に対し、科学界は現象そのものを検討するのではなく、圧倒的多数が方法論の解剖を選択した。その反応は迅速かつ激烈であり、著名な懐疑派レイ・ハイマンが「正気の沙汰ではない、全くの狂気だ」と断じ、ある心理学者が論文を読んで「物理的に気分が悪くなった」と告白するほど、感情的な拒絶反応を引き起こした。しかし、その批判は単なる結果への不信にとどまらず、心理学研究で当時、標準とされていた研究慣行そのものへの、より本質的で広範な批判へと発展していった。

2.1. P値への過信と統計的有意性の罠

ベムの論文は、科学界におけるp値への過度な依存という問題を浮き彫りにした。p値が0.05未満であれば「統計的に有意」と見なす慣行は、「pハッキング」と呼ばれる問題行動を助長する。これは、分析方法を少し変えたり、被験者を追加したりすることで、意図せずともp値を恣意的に操作できてしまう危険性である。科学社会学の観点から見れば、これは「発表か死か（publish or perish）」という制度的圧力が、ロバート・マートンが提唱した「組織化された懐疑主義」といった科学の規範（ノーム）を侵食し、望ましい結果を出すための逆機能的な慣行を生み出した事例と分析できる。ベムの驚異的な結果は、こうした柔軟な研究慣行が積み重なった結果ではないか、という疑念を生んだのだ。

2.2. 探索的研究と検証的研究の混同

当時の研究報告の慣行として、仮説を生み出すための自由な「探索的研究」と、事前に立てた仮説を厳密にテストする「検証的研究」の境界が曖昧であったことも問題視された。研究者はしばしば、データ収集後に様々な分析を試み、最も「興味深い」結果を見つけ出し、あたかもそれが最初から検証しようとしていた仮説であったかのように論文を執筆することがあった。この慣行は、結果論的に仮説を構築する「HARKing (Hypothesizing After the Results are Known)」として批判された。

2.3. ベイズ統計学の視点

ベムが用いた伝統的な頻度論的統計学に対し、懐疑派からはベイズ統計学の観点からの批判が提出された。この批判の核心は、「異常な主張には異常な証拠が必要（extraordinary claims require extraordinary evidence）」という原則にある。ベイズ統計学では、ある仮説のもっともらしさ（事前確率）を考慮して証拠を評価する。懐疑派にとって、予知の事前確率は限りなくゼロに近かった。そのため、彼らの世界観では、標準的な心理学実験から得られるいかなるデータも、この極端に低い事前確率を覆すには不十分だったのである。

しかし、議論は単純ではなかった。後にベムと彼の協力者である統計学者が行ったベイズ分析では、彼らのデータは予知仮説を支持する「決定的証拠」にあたるという結果が示されたのだ。ベイズ統計で証拠の強さを示すベイズ因子は、100を超えると「決定的証拠」と見なされるが、彼らの分析結果は実に13,669という驚異的な値に達した。これにより、論争は単なる統計手法の選択問題ではなく、根源的な世界観の対立であることがより鮮明になった。

2.4. 「ファイルドロワー問題」という名の出版バイアス

科学研究における最も根深い問題の一つが、「ファイルドロワー（書類棚）問題」である。これは、肯定的な結果（有意差あり）が出た研究は学術誌に掲載されやすいのに対し、否定的な結果（有意差なし）に終わった研究は未発表のまま研究者の書類棚にしまい込まれてしまうという出版バイアスのことである。仮にベムの実験と同じような追試が多数行われ、そのほとんどが失敗に終わっていたとしても、それらが公表されなければ、世の中には成功例だけが存在するように見えてしまう。このバイアスは、分野全体として誤った結論を強化してしまう深刻な危険性をはらんでいる。

これらの批判は、ベム個人への攻撃というよりも、当時の心理学が抱えていた構造的な問題点を白日の下に晒すものでした。そして、科学的主張の正当性を担保する最後の砦、すなわち「再現性」の検証へと、論争の焦点は必然的に移っていくことになります。

3. 再現性の検証：論争の最終的な審判

いかなる科学的主張も、それが真実であるならば、他の独立した研究者によって再現可能でなければならない。この再現性の原則こそが、科学を単なる思い込みや偶然から区別する根幹である。ベムの実験を巡る論争は、この原則の重要性を改めて浮き彫りにし、後に心理学全体を巻き込む「再現性の危機」の象徴的な事例となった。

ベム自身は再現研究を強く奨励しており、論文発表当初から実験に用いたソフトウェアや資料一式を希望者に無償で提供した。彼のこのオープンな姿勢に応じ、世界中の多くの研究室が追試に乗り出した。その後の数年間で、追試の結果はまさに賛否両論となり、ベムの結果を再現できたとする研究もあれば、完全に失敗したとする研究も現れ、論争は一筋縄ではいかない様相を呈した。

この混乱に一つの審判を下すべく、2016年にベム自身を含む研究チームが、それまでに行われた追試研究を網羅的に分析したメタ分析の結果を発表した。その内容は、多くの人々にとって再び衝撃的なものであった。

対象: 世界14カ国、33の研究室によって行われた90の実験データが統合された。
全体の結果: ベム自身の元実験（9実験）を含めて分析すると、効果は6シグマを超え（p = 1.2 x 10⁻¹⁰）、依然として非常に強力なものであった。
ベムの実験を除外した場合: 最も重要なのは、ベムの元実験データを除外し、追試研究だけを分析した場合の結果である。それでもなお、p値は極めて低く（p = 1.1 x 10⁻⁵）、偶然である確率は約10万分の1だった。さらにベイズ統計学による分析でも、ベイズ因子は3,853に達し、「決定的証拠」とされる基準値100を大幅に上回っていた。
ファイルドロワー問題への耐性: このメタ分析の結果が、未発表の失敗研究（ファイルドロワー問題）によって覆される可能性も計算された。その結果、この効果を統計的に無意味なレベルまで引き下げるには、544もの未発表の失敗研究が存在する必要があると算出された。これは、この分野の研究者の数を考えると非現実的な数字である。

このメタ分析の結果は、論争に終止符を打つどころか、むしろ火に油を注ぐ形となった。再現研究の統合結果ですら、これほど強力な証拠を示してしまったのである。もはや問題は、個々の実験の不備や統計の誤用ではなく、「科学は『異常な主張』に対して、一体どのような種類の、そしてどれほどの強度の証拠を求めるべきなのか」という、より深く、より哲学的な問いへと昇華されたのだ。

4. 論争の遺産：科学の自己修正プロセスへの貢献

予知能力が実在するかどうかという根源的な問いとは別に、ダリル・ベムの論文を巡る一連の騒動が、結果として科学コミュニティ、とりわけ心理学分野の自己改革を力強く促す重要な触媒となったことは、歴史の逆説と言えるだろう。この論争は、科学が内包する脆弱性を露呈させると同時に、その困難を乗り越えようとする自己修正能力の高さを証明する機会ともなったのだ。

事前登録（Pre-registration）の推進: 論争の核心にあった「探索的研究と検証的研究の混同」や「pハッキング」への対策として、研究計画（仮説、実験手続き、分析計画）をデータ収集前に公的な登録機関に登録する「事前登録」の文化が急速に広まった。これにより、研究者は後から都合の良い仮説や分析手法に切り替えることができなくなり、研究の透明性と信頼性が飛躍的に向上した。
オープンサイエンス運動の加速: 皮肉なことに、この改革の精神を先取りしていたのは、ベム自身であった。彼が当初から実験資料を公開し、再現を積極的に促した姿勢は、結果的にオープンサイエンスの理念を体現するものだった。彼の論文が引き起こした論争を通じて、研究データ、実験資料、分析コードなどを共有することの重要性が広く認識され、第三者による検証が容易になり、科学的知見の頑健性をコミュニティ全体で担保する体制が強化されつつある。
統計手法への健全な懐疑: 単一のp値（特にp < 0.05という恣意的な閾値）に依存する研究評価の危険性が、研究者コミュニティに広く浸透した。p値だけでなく、効果の大きさを示す「効果量」やその不確実性を示す「信頼区間」、そして「ベイズ統計学」といった、より多角的で豊かな証拠評価の方法論が重視されるようになった。
再現研究の価値向上: 従来、学術界では新規性のある「発見」が高く評価され、地道な再現研究は軽視されがちであった。しかし、ベム論争とそれに続く「再現性の危機」は、科学的知見の信頼性を確保する上で、再現研究がいかに不可欠であるかを痛感させた。今日では、再現研究を専門とする学術誌が創刊されるなど、その価値は再評価されている。

最終的に、ダリル・ベムの論文は、その結論の真偽がどうであれ、一つの「失敗」や「スキャンダル」として片付けられるべきものではない。むしろ、それは科学という営みが持つ構造的な脆弱性と、その脆弱性を認識し、より良い実践を目指して自らを改革しようとする力強い自己修正能力の両方を、これ以上ないほど劇的に描き出した一幕として記憶されるべきなのである。

結論：科学を進歩させる「健全な危機」

ダリル・ベムの予知実験を巡る一連の論争は、超常現象という刺激的なトピックの皮を一枚めくると、科学的知識がいかにして構築され、検証され、そして時に修正されるのかという、科学という営みの根幹をなすプロセスそのものに光を当てるものであった。それは、整然とした教科書の世界とは異なる、生々しく、人間的で、そしてダイナミックな科学の現場を我々に見せてくれた。

この一件から得られる最も重要な教訓は、科学の進歩が、順風満帆な発見の連続によってのみもたらされるわけではない、という事実である。むしろ、科学は時として、ベム論文が引き起こしたような深刻な「方法論的危機」や激しい論争を経験し、それを乗り越える過程で、より厳格で、より透明性の高い基準を自らに課すことによって、真に前進するのだ。

ベムの論文は、我々に厄介な問いを突きつける。もし我々の方法が健全で、統計が頑健で、結果が再現可能であるにもかかわらず、その結論が「信じがたい」ものであり続けるならば、その欠陥は結論にあるのか、それとも我々が「科学的証拠」と見なすものの哲学的定義そのものにあるのか。この論争が残した真の遺産は、より良い方法論だけでなく、科学という営みに不可欠な、継続的な認識論的謙虚さ（epistemological humility）なのである。

研究論文レビュー：ダリル・ベムの『Feeling the Future』が提起した予知能力の科学的証拠とその論争

序論

2011年、著名な社会心理学者ダリル・ベムは、心理学界に衝撃を与える論文『Feeling the Future: Experimental Evidence for Anomalous Retroactive Influences on Cognition and Affect』を発表しました。予知（precognition）という、通常は超常現象として扱われるテーマに対し、査読付きの一流学術誌である『Journal of Personality and Social Psychology』で統計的に極めて有意な証拠を提示したのです。この論文は、未来の出来事が現在の我々の認知や感情に影響を及ぼす可能性を示唆し、科学界全体、特に懐疑論者たちの間で激しい論争を巻き起こしました。

本レビューは、この画期的な論文を批判的に分析し、その意義と論争点を多角的に考察することを目的とします。まず、ベムが採用した独創的な実験設計「時間反転アプローチ」を解説し、続いて9つの実験の概要とその驚くべき統計的証拠を検証します。さらに、科学界からの主要な批判点を整理し、それに対する反論と、後に行われた大規模な追試研究（メタ分析）の結果を詳述します。最後に、この一連の「ベム論争」が、単なる予知の存否をめぐる議論に留まらず、心理学の研究方法論そのものに与えた深遠な影響を明らかにします。

1. 研究の背景と方法論的革新

ダリル・ベムは、コーネル大学の著名な社会心理学者であり、そのキャリアの初期は主流心理学の研究に専念していました。しかし、1990年代に超心理学の研究（特にガンツフェルト実験のメタ分析）に関するコンサルティングを依頼されたことをきっかけに、この分野への関心を深め、自ら実験的研究に着手するに至りました。ベムが主流科学の第一線で活躍してきたという経歴は、彼の超心理学研究に一定の学術的信頼性を与える上で重要な役割を果たしました。

ベムの研究における最大の方法論的革新は、「時間反転（time-reversal）」アプローチの採用にあります。通常の心理学実験では、「刺激（原因）が反応（結果）に先行する」という因果律に従って設計されます。しかしベムは、この順序を意図的に逆転させました。彼の実験では、まず被験者の「反応」を測定し、その後に、その反応に対応する「刺激」をランダムに提示するという手続きを取りました。この設計により、未来に起こる出来事（刺激）が現在の行動（反応）に影響を与える、すなわち「逆行性因果（retro-causation）」が存在する可能性を検証しようとしたのです。

この方法論は戦略的に極めて巧妙でした。ベムは、実験の基盤として、心理学において既に確立され、広く受け入れられている実験パラダイム（プライミング効果や記憶テストなど）のみを用いました。また、統計分析もt検定のような「極めて単純で広く受け入れられている」手法に限定しました。これにより、実験手続きや複雑な統計手法の妥当性をめぐる無用な議論を避け、批評家たちが結果そのものと向き合わざるを得ない状況を作り出し、純粋に「未来の出来事が現在に影響を及ぼすか」という現象そのものに焦点を当てることに成功したのです。

この独創的な方法論に基づき、ベムは9つの具体的な実験を実施しました。次に、それらの実験デザインを概観します。

2. 9つの予知実験の概説

ベムが論文で報告した9つの実験は、単一の現象を多角的に検証するために、それぞれ異なる心理学の原理（誘引、回避、プライミング、習慣化、記憶）を応用している点で特筆に値します。これにより、特定の実験デザインに固有の欠陥による結果ではないことを示そうとしました。実験は、その目的と設計に基づき、以下の3つのカテゴリーに大別できます。

感情・行動に基づく実験（実験1, 2）
- 実験1（エロティックな刺激の予知的検出）: 参加者の前に2つのカーテンが提示され、片方の後ろには性的魅力のある画像、もう片方には何もない壁が表示されます。参加者はどちらのカーテンの後ろに画像があると思うかを選択しますが、実際には画像の位置は参加者が選択した後にランダムに決定されます。仮説は、参加者が未来に提示される魅惑的な画像を「予知」し、偶然（50%）を上回る確率で正しいカーテンを選択するというものです。
- 実験2（ネガティブな刺激の予知的回避）: 参加者は、左右反転した2つのほぼ同一の画像から、好ましい方を直感で選びます。参加者の選択が、後でランダムに決定される「ターゲット」と一致した場合、ポジティブな画像がサブリミナル（意識に上らないレベル）で提示され、一致しなかった場合は「襲いかかろうとする凶暴な犬の絵」のような不快な画像が提示されます。仮説は、参加者が未来の不快な刺激を無意識に回避し、ターゲットを選択する傾向があるというものです。
認知プロセスに基づく実験（実験3, 4, 5, 6, 7）
- 実験3, 4（逆行性プライミング）: 参加者は、表示された画像が「快い」か「不快」かを素早く判断します。判断を下した後、判断内容と一致するか不一致なプライミング単語がランダムに提示されます。例えば、果物のバスケットの絵に対し、「美味しそう（luscious）」という一致する単語か、「苦い（bitter）」という不一致な単語が提示されます。仮説は、未来に提示される単語が現在の判断速度に影響を与え、一致する場合の方が不一致な場合よりも反応が速くなるというものです。
- 実験5, 6（逆行性習慣化）: 人は見慣れたものに好意を抱きやすい（習慣化）という原理を応用。参加者は「リンゴの絵」のような2つの画像から好ましい方を選びます。選択後、選ばれた方の画像がターゲットとしてランダムに決定され、サブリミナルで繰り返し提示されます。仮説は、未来で繰り返し提示される画像を、現在においてより「親しみやすい」と感じ、選択しやすくなるというものです。
- 実験7（逆行性退屈誘導）: 実験5, 6の変形版。未来で提示されるターゲット画像が、サブリミナルではなく意識的に知覚できる長さで繰り返し提示されます。仮説は、セクシーな画像などは繰り返し見ると「飽き」が生じるため、未来で繰り返し提示される画像への好みが、現在において低下するというものです。
記憶プロセスに基づく実験（実験8, 9）
- 実験8, 9（想起の逆行性促進）: ベム自身が「宿題をやる前にテストを受ける」と比喩した実験。参加者は単語リストを学習した後、記憶している単語をできるだけ多く書き出す想起テストを受けます。テストが終了した後、リスト中の単語の半分がランダムに選ばれ、それを練習する機会が与えられます。仮説は、未来で練習することになる単語は、テストの時点ですでに、練習しない単語よりも想起成績が高くなっているというものです。

これらの多様な実験から得られた結果は、統計的に非常に強力なものであり、科学界に大きな波紋を広げることになりました。

3. 主要な実験結果と統計的意義

ベムの研究が与えた衝撃の大きさは、その実験対象に起因する部分もあります。被験者は特殊な能力を持つとされる人々ではなく、ごく普通のコーネル大学の学生でした。にもかかわらず、9つの実験は一貫して、彼らが無意識のうちに予知能力を行使していることを示唆する結果となったのです。

得られた統計的結果は、以下の通り要約できます。

統計的有意性: 9つの実験のうち、8つで統計的に有意な結果（p < 0.05）が得られました。これは、結果が偶然によるものである確率が5%未満であることを意味します。唯一有意でなかった実験7（逆行性退屈誘導）のp値も0.096であり、偶然とは言い切れないレベルでした。
総合的な確率: 9つの実験結果全体を統合して分析した際のp値は 2.6 x 10⁻¹¹ という驚異的なものでした。これが意味するのは、ベムが得た一連の結果がすべて偶然によって生じた確率が、およそ370億分の1に過ぎないということです。

この結果が持つ科学的インパクトは、他の分野の基準と比較することでより明確になります。素粒子物理学の世界では、新しい粒子を発見したと宣言するための「ゴールドスタンダード」として5シグマという基準が用いられます。これは、結果が偶然である確率が約170万分の1という非常に厳しい水準です。ベムの研究結果が示した統計的確度は6.5シグマに相当し、物理学の基準であれば「もしこれが物理学の実験であったなら、ベムが見つけたものに基づいて新粒子の発見を宣言できたであろう」レベルの強力な証拠だったのです。

これほど強力な統計的証拠にもかかわらず、研究結果は科学界、特に懐疑論者から激しい批判にさらされました。

4. 科学界の反応と主要な批判

ベムの論文が発表されると、科学界は騒然となりました。著名な懐疑論者であるレイ・ハイマンは「狂気の沙汰、純粋な狂気だ」と断じ、ある心理学者のヤン・ヴァーヘンマッカースは、論文を「読むと体調が悪くなった」と述べるほどの強い拒絶反応を示しました。この反応は、単なる方法論上の欠陥の指摘というレベルを超え、物理法則や因果律といった既存の科学的パラダイムとの根本的な衝突から生じるものでした。

懐疑論者たちから提起された主要な批判点は、以下の4つに整理できます。

「探索的研究」と「確証的研究」の混同ベムが、特に研究の初期段階において、仮説を生成するための自由度の高い「探索的研究」と、定められた手順で仮説を厳密に検証する「確証的研究」の区別を曖昧にしていたのではないかという批判です。もし確証的研究の途中で手順を変更していた場合、意図せずとも肯定的な結果が出やすいようにデータを操作してしまう危険性が指摘されました。
統計手法の問題（ベイズ統計の不使用）ベムが結果の分析に伝統的な頻度論的統計学（p値を用いる手法）を用いたことに対する批判です。懐疑論者たちは、予知のように事前確率が極めて低い（ありえそうにない）現象を検証する際には、研究者の「事前信念」を確率モデルに組み込むベイズ統計学を用いるべきだと主張しました。
「ファイルドロワー問題」（出版バイアス）科学研究の世界では、統計的に有意な結果が出た研究は論文として出版されやすい一方、有意でなかった（効果がなかった）研究は未発表のまま研究者のファイルキャビネットに眠ってしまう傾向があります。この「ファイルドロワー問題」により、公表されている研究だけを見ると、実際よりも効果が大きく見えてしまう可能性が指摘されました。
追試の必要性どのような画期的な発見であっても、それが科学的知見として確立されるためには、元の研究者とは独立した第三者による追試（replication）によって再現されることが不可欠である、という科学の最も基本的な要請です。

これらの批判に対し、ベムとその擁護者たちは、データに基づいた具体的な反論と、さらなる検証を行いました。

5. 反論と追試（メタ分析）の結果

ベムは批判の一部（特に初期の実験における探索的研究と確証的研究の区別の曖昧さ）を認めつつも、最終的な判断は追試の結果に委ねるという科学的な姿勢を貫きました。彼は他の研究者による追試を積極的に奨励し、実験に用いたソフトウェアやマニュアルを無償で公開するという、科学界では異例の対応を取りました。

前セクションで挙げられた主要な批判点に対し、その後の再分析や追試研究は以下のような結果を示しました。

ベイズ統計による再分析: ベムは統計学者のジェシカ・アッツ、ウェスリー・ジョンソンと共同で、自身のデータをベイズ統計学を用いて再分析しました。一部の批評家が予知現象に対する事前確率として「100京対1」という極端な値を提案し、いかなる証拠があっても受け入れない姿勢を示したのに対し、ベムらはより客観的な事前確率を設定しました。その結果、予知仮説を支持する度合いを示すベイズ因子は 13,669 と算出されました。ベイズ統計において、この値が100を超えると「決定的証拠（decisive evidence）」と見なされるため、この数値は予知仮説を圧倒的に支持するものでした。
「ファイルドロワー問題」への反論: 後述する2016年のメタ分析において、追試で確認された予知効果を統計的に無意味なレベルまで引き下げるには、一体何件の未発表の失敗研究がファイルドロワーに眠っている必要があるかが計算されました。その結果は 544件であり、超心理学分野の研究者の総数を考えると、これほど多くの未発表研究が存在するとは考えにくく、ファイルドロワー問題だけでは結果を説明できないことが示されました。
2016年のメタ分析の結果: ベムの論文発表後、世界中の研究者によって追試が行われました。2016年に発表されたメタ分析では、14カ国33の研究室で実施された90の実験の結果が統合されました。その結果は衝撃的なものでした。
- 頻度論的統計では、全実験を統合した結果は6シグマを超える統計的有意性を示しました（p = 1.2 x 10⁻¹⁰）。
- ベイズ統計による分析では、ベイズ因子が51億という、仮説を支持する上で考えうる限り最も強力な値を示しました。
- さらに、ベム自身による最初の9実験を除外し、独立した追試研究のみを分析した場合でも、p値は 1.1 x 10⁻⁵、ベイズ因子は 3,853 となり、依然として「決定的証拠」の基準をはるかに上回る結果が維持されました。

これらの結果は、独立した多数の研究者による厳密な追試を経てもなお、ベムが最初に報告した現象の統計的有意性は揺るがなかったという事実を強く示唆しています。この一連の論争は、単に予知の存在をめぐる議論に留まらず、心理学研究の方法論そのものに大きな影響を及ぼしました。

6. 考察：心理学における「ベム論争」の意義

この一連の論争は、単に予知の存在をめぐる議論に留まらず、心理学という学問分野の方法論そのものに大きな影響を及ぼしました。皮肉なことに、ベムの研究結果があまりにも多くの科学者にとって「受け入れがたい」ものであったがゆえに、かえって心理学研究全体が抱える方法論的な脆弱性が浮き彫りになったのです。ある専門家は、ベムの論文の発表を「心理学における方法論的・統計的危機の引き金となった転換点」と評しています。

「ベム論争」が心理学界にもたらした具体的な変化や提起した課題は、以下の点に集約されます。

「再現性の危機」への警鐘: ベムの研究の追試をめぐる議論は、心理学の他の分野でも多くの研究結果が追試で再現できないという「再現性の危機（replication crisis）」への関心を一気に高めるきっかけの一つとなりました。
研究手法の透明性向上: 「ファイルドロワー問題」や「探索的/確証的研究」の混同といった問題意識から、研究を実施する前にその計画（仮説、実験手順、分析方法）を公開リポジトリに登録する「事前登録（pre-registration）」制度の重要性が広く認識されるようになりました。これにより、研究の透明性が高まり、恣意的な分析を防ぐことができます。
超心理学の科学的厳密性: 常に懐疑的な視線にさらされている超心理学の研究者たちは、その批判に対抗する必要性から、方法論的に非の打ち所がない研究を行うことを余儀なくされてきました。その結果、出版バイアスを避けるために統計的に有意でなかった研究（帰無仮説を支持する結果）を積極的に公表したり、「事前登録」のような先進的な手法を他の分野に先駆けて導入したりするなど、他の科学分野よりも厳密な基準を自らに課している場合があるという逆説的な状況が生まれています。

最後に、これまでの分析を総括し、本レビューの結論を述べます。

7. 結論

社会心理学者ダリル・ベムの『Feeling the Future』は、確立された心理学の実験パラダイムを巧妙に時間反転させることで、予知という現象の存在を示唆する極めて強力な統計的証拠を提示しました。この研究は、その結論の過激さから科学界、特に懐疑論者からの激しい批判に直面しました。

しかし、その後の展開は、この研究が単なる偶然や方法論的欠陥の産物ではなかった可能性を強く示唆しています。世界中の独立した研究室による大規模な追試研究を統合したメタ分析によっても、ベムが最初に報告した効果は揺るぎない統計的有意性をもって支持されたのです。

最終的に、「ベム論争」の真の意義は、予知の存在証明という当初の目的以上に、心理学という学問分野そのものに大きな問いを突きつけた点にあります。この論争の核心は、優れた科学と粗悪な科学の対立ではなく、当時の基準では方法論的に健全な研究と、物理的に不可能と思われる結論との間の衝突でした。その最大の遺産は、心理学の標準的な研究手法が、極めて高い統計的信頼度で「ありえない」結果を生み出しうるという事実を学界に突きつけ、それによって「再現性の危機」への認識を深め、より頑健な研究実践への移行を加速させたことにあると言えるでしょう。

実験の概要と背景

ダリル・ベムの『未来を感じる（Feeling the Future）』予知実験のより大きな文脈において、提供されたソースは、この研究の概要と背景について以下のように述べています。

実験の背景と文脈

この実験は、社会心理学者であり、コーネル大学の元教授であるダリル・ベム氏（1938年生まれ、現在87歳）によって実施されました。彼は元々、超心理学とは無関係の‌‌主流な心理学‌‌を専門としていましたが、1990年代までには、ガンツフェルト実験のメタ分析などの超心理学的な問題についてコンサルティングを依頼されるようになりました。このコンサルティングがきっかけでベム氏は超心理学に興味を持ち、独自の超心理学実験を行うに至りました。

最も有名な研究は、2011年に「未来を感じる：認知と感情に対する異常な遡及的影響の実験的証拠（Feel in the Future, Experimental Evidence for Anomalous Retroactive Influences on Cognition and Affect）」という論文として発表されました。この論文は、‌‌一流で非常に尊敬されている主流な心理学の専門誌‌‌である『パーソナリティ・社会心理学ジャーナル（Journal of Personality and Social Psychology）』に掲載されたため、超心理学の分野を超えて大きな影響を与えました。

この論文は2011年に公表されるやいなや‌‌科学界に衝撃を与え‌‌ました。懐疑的な科学者たちは驚き、他の科学者に実験の追試を要求しました。ベム氏の結果が再現されたにもかかわらず、懐疑論者たちはその結果を受け入れることに非常に消極的で、‌‌「科学は根本的に壊れているに違いない」‌‌と主張する記事まで現れました。この研究は、ジンバルドーのスタンフォード監獄実験やミルグラムの電気ショック実験といった悪名高い研究と並んで、‌‌これまで発表された中で最も物議を醸した10の心理学研究‌‌の一つに数えられています。

実験の概要と方法論

ベム氏の実験は、人間が無意識のうちに将来に関する予知的な情報を得ることができるかどうかを調査するために実施されました。

‌‌中核となる方法論：‌‌ ベム氏らは、実験の‌‌確実性をできる限り高める‌‌ために、心理学において慣習的で広く受け入れられている原理に基づいた実験を設計しました。使用された統計手法も、一般的に受け入れられている‌‌非常に単純なもの‌‌でした。

実験の核心は、‌‌通常の心理学実験の手順の順番を入れ替える‌‌ことでした。

‌‌通常の手順（順行因果）：‌‌ 刺激（原因）を与え、その後、参加者の反応（結果）を測定します。
‌‌ベム氏の予知テスト（逆行因果）：‌‌ まず参加者の‌‌反応（結果）を測定し‌‌、‌‌その後で刺激（原因）を与える‌‌という順序に逆転させました。これにより、原因が結果に先行するのではなく、‌‌結果が原因に先行する‌‌「逆行因果（retro-causation）」が起こるかどうかを検証しました。

この論文では、ベム氏らが実施した‌‌9つの実験の結果‌‌が報告されています。

‌‌実験の被験者：‌‌ これらの実験の参加者は、‌‌超能力を持つ才能のある人物‌‌ではなく、コーネル大学の学生など、‌‌ごく普通の人々‌‌でした。彼らは超能力（ESP）の使い方について訓練を受けておらず、実験を受ける前は実験がどのように機能するかさえ知りませんでした。彼らは将来の出来事を意識的に知ろうとはしていませんでした。

‌‌主な実験の種類（例）：‌‌ ベム氏は、様々な確立された心理学的原理を用いて、予知が起こるかどうかをテストしました。

‌‌実験1（好ましい刺激の検出）：‌‌ 参加者は、後ろに画像が隠された2枚のカーテンのどちらかをクリックして画像を当てるよう求められました。画像がエロティックな場合、人々がそれを予知的に感知し、‌‌50%以上の確率で画像を意図的に探し出す‌‌かどうかを検証しました。しかし、画像の配置は参加者が推測を記録した‌‌後‌‌にランダムに決定されました。
‌‌実験8と9（想起の促進）：‌‌ これは「遡及的想起促進」に関する一対の実験でした。参加者に単語リストを見せ、その後、‌‌いくつの単語を思い出せるか尋ね‌‌、‌‌それから‌‌コンピューターが単語の半分をランダムに選び、‌‌練習させる‌‌という手順でした。将来練習するであろう単語の方が、練習しない単語よりもよく思い出せるかを検証しました。

‌‌結果の概要：‌‌ 結果として、9つの実験のうち‌‌8つで統計的に有意な結果‌‌が得られました。実験全体を結合したP値は$2.6 \times 10^-11$乗であり、結果が偶然によるものである確率は‌‌約370億分の1‌‌でした。これは、物理学において新粒子発見のために必要な‌‌5シグマの基準を大幅に超える‌‌ものでした（6.5シグマ）。

ベム氏の発見は、「ごく普通の人々が、環境をナビゲートするために、‌‌意識することなく常に予知能力を使っている‌‌」ことを示唆しています。

予知実験の具体例

ダリル・ベムの『未来を感じる（Feeling the Future）』予知実験のより大きな文脈において、提供されたソースは、彼が実施した9つの実験の具体例とその方法論について、詳細かつ包括的に説明しています。これらの実験は、‌‌標準的な心理学的手順のステップの順序を逆転させる‌‌ことで、予知（precognition）または逆行因果（retro-causation）を検証するように設計されました。

以下に、9つの実験の具体的な概要を、ソースに基づいて説明します。

実験の設計の核心

ベム氏の実験は、通常の手順（原因が結果に先行する）とは異なり、‌‌まず参加者の反応（結果）を測定し、その後に刺激（原因）を与える‌‌という順序に意図的に逆転させました。これにより、原因が結果に先行するのではなく、‌‌結果が原因に先行する‌‌「逆行因果」が起こるかどうかをテストしました。

参加者はコーネル大学の学生など‌‌ごく普通の人々‌‌であり、超能力者（サイキック）ではなく、予知能力を使う訓練も受けていませんでした。また、彼らは実験の仕組みを知らされておらず、将来の出来事を‌‌意識的に‌‌感知しようとはしていませんでした。

9つの予知実験の具体例

ベム氏によって実施された9つの実験の概要は以下の通りです。

1. 実験1：好ましい刺激の予知的な検出（Precognitive Detection of Erotic Stimuli）

‌‌目的:‌‌ 人々が、見たいと思う画像（特にセクシーな画像）を予知的に感知し、意図的にそれを探し出すかどうかを検証する。
‌‌手順:‌‌
1. 参加者には、2枚のカーテンのうちどちらの裏に画像が隠されているかをクリックして推測するよう求められました。
2. 画像は、性的に露骨なエロティックな画像を含むことが警告されました。
3. しかし実際には、‌‌参加者が推測を記録した後‌‌に、画像自体と、それが左か右のどちらのカーテンの後ろに配置されるか（50%の確率）がランダムに決定されました。
‌‌仮説:‌‌ エロティックな画像に惹かれる場合、人々がそれを予知的に感知できれば、ランダムな偶然（50%）を超えて、画像を‌‌より多く選択する‌‌だろう。

2. 実験2：否定的な刺激の予知的な回避（Precognitive Avoidance of Negative Stimuli）

‌‌目的:‌‌ 人々が、見たくない否定的な画像（ネガティブな刺激）を予知的に回避できるかどうかを検証する。
‌‌手順:‌‌
1. 参加者には、感情的に中立な2枚の画像（例：リンゴとその鏡像）が同時に提示され、どちらを好むかボタンを押して選択するよう求められました。
2. 参加者が知らされないうちに、‌‌意識的に気づくには速すぎる‌‌（サブリミナルな）追加画像が画面にフラッシュされました。
3. コンピュータは、参加者が好みを表明した‌‌後で‌‌、ターゲットとなる画像（例：左のリンゴ）をランダムに決定しました。
4. 参加者がターゲット（例：左のリンゴ）を選んだ場合、ポジティブな画像（例：夕焼け）がサブリミナルにフラッシュされ、ターゲットでなかった画像を選んだ場合、ネガティブな画像（例：獰猛な犬）が罰としてサブリミナルにフラッシュされました。
‌‌仮説:‌‌ 人間が潜在意識下で、ターゲットとなる画像を予測し、‌‌獰猛な犬のような否定的な刺激を回避できる‌‌かどうかを検証しました。

3. 実験3：遡及的プライミング（Retroactive Priming）

‌‌目的:‌‌ プライミング効果（先行する刺激がその後の反応速度に影響を与える現象）が、‌‌未来から逆行して‌‌起こるかどうかを検証する。
‌‌標準的なプライミング:‌‌ 通常、ポジティブな単語（プライム）を見た後でポジティブな画像（ターゲット）を見ると、反応が速くなります。
‌‌ベム氏の手順（逆行）:‌‌
1. まず、参加者に画像（ターゲット）が提示されます。
2. 次に、画像が‌‌ポジティブかネガティブか‌‌をできるだけ早く判断するよう求められます。
3. ‌‌その後で‌‌、コンピュータがランダムにポジティブまたはネガティブな単語（プライム）を参加者に見せました。
‌‌仮説:‌‌ 予知が存在する場合、参加者は将来提示されるプライム（単語）を考慮に入れて現在の画像を判断するため、‌‌将来のプライムと現在の画像が一致しない場合（不一致な場合）、判断に時間がかかる‌‌だろう。

4. 実験4：遡及的プライミング（複製と改良）

‌‌目的:‌‌ 実験3の複製であり、わずかな改良を加えることで、遡及的プライミング効果を再検証する。
‌‌改良点:‌‌ 実験3ではプライムがランダムに選ばれていましたが、実験4では、各画像（ターゲット）に対して、意味的に関連性の高い‌‌固定された‌‌ポジティブなプライムとネガティブなプライムが事前に割り当てられました（例：果物のバスケットには「おいしい（luscious）」と「苦い（bitter）」が対として割り当てられた）。
‌‌手順:‌‌ 参加者が画像の評価を応答した後で、コンピュータはランダムにその対のいずれか一方をプライムとして提示しました（遡及的プライミング）。

5. 実験5：遡及的馴化（Retroactive Habituation）

‌‌目的:‌‌ 馴化（Habituation）の概念（以前に触れたものに好意を感じる傾向）が、‌‌未来のサブリミナルな露出‌‌によって引き起こされるかどうかを検証する。
‌‌手順:‌‌
1. 参加者には、2枚の類似した画像が提示され、‌‌どちらの画像がより好きか‌‌を選択するよう求められました。
2. 次に、コンピュータがランダムにそのうちの1枚をターゲットとして選択し、‌‌その後‌‌、このターゲット画像をサブリミナル（意識の閾値以下）で複数回フラッシュしました。
3. この実験では、感情的に中立な画像（例：リンゴ）と感情的にネガティブな画像（例：獰猛な犬）の両方がテストされました。
‌‌仮説:‌‌ 参加者は、将来サブリミナルに露出される画像を予知的に感知することで、それが‌‌より馴染み深く感じられ‌‌、より好むと回答するだろう。ネガティブな画像をペアで使用した場合でも、人々は将来より多く露出される画像を50%以上の確率で好むだろうと仮定されました。

6. 実験6：遡及的馴化（セクシー画像の追加）

‌‌目的:‌‌ 実験5の複製であり、セクシーな画像ペアを新たなカテゴリーとして追加し、馴化がどのように起こるかを検証する。
‌‌手順:‌‌ 実験5と同じですが、画像カテゴリーに‌‌中立‌‌、‌‌ネガティブ‌‌、そして‌‌セクシーな‌‌画像の3種類が含まれました。
‌‌セクシー画像に関する新たな仮説:‌‌ セクシーな画像は、何度も見ると飽きられる（飽和する）傾向があるため、参加者は将来より多く露出される画像よりも、‌‌露出される回数が50%未満の画像を好む‌‌だろうと仮定されました。

7. 実験7：遡及的な飽きの誘導（Retroactive Induction of Boredom）

‌‌目的:‌‌ 実験6の過程で、セクシーな画像が将来10回以上サブリミナルに提示されると、参加者がその画像を嫌うようになるという現象が観察されたため、‌‌意識的に視認できる露出‌‌を用いて「遡及的な飽き」を検証するために設計されました。
‌‌手順:‌‌
1. 使用されたのは‌‌中立な画像のみ‌‌でした。
2. 参加者は2枚の中立な画像のうち、より好きな方を選択しました。
3. その後、コンピュータがランダムに選んだターゲット画像を、サブリミナルではなく、‌‌意識的に見ることができる十分な長さ‌‌で繰り返しフラッシュしました。
‌‌結果:‌‌ この実験は、9つのうち唯一、‌‌統計的に有意な結果が得られませんでした‌‌ (P値 0.096)。

8. 実験8：想起の遡及的促進（Retroactive Facilitation of Recall）

‌‌目的:‌‌ 記憶テストにおける練習効果が、‌‌時間を遡って‌‌記憶を促進するかどうかを検証する（遡及的想起促進）。
‌‌手順:‌‌
1. まず、参加者に単語の‌‌完全なリスト‌‌を見せます。
2. 次に、彼らが‌‌いくつの単語を思い出せるか‌‌を尋ね、それを記録します。
3. ‌‌その後で‌‌、コンピュータが単語リストの半分をランダムに選び、参加者にそれを練習させます（宿題をする前にテストを受けるようなもの）。
‌‌仮説:‌‌ 予知が存在する場合、参加者は将来練習する単語を、将来練習しない単語よりも‌‌よく思い出すことができる‌‌だろう。

9. 実験9：想起の遡及的促進（複製と追加練習）

‌‌目的:‌‌ 実験8の複製であり、遡及的想起促進を再検証する。
‌‌改良点:‌‌ 実験8と同じ手順でしたが、参加者が思い出せる単語をすべてリストした後、‌‌追加の種類の練習‌‌が導入され、半分以上の単語に対して少し多くの練習が行われました。

実験結果の全体像

これらの9つの実験の結果、‌‌8つの実験で統計的に有意な結果‌‌が得られました。すべての実験結果を統合したP値は$2.6 \times 10^-11$であり、この結果が偶然によるものである確率は約370億分の1でした。これは、物理学で新粒子発見に要求される5シグマの基準を大きく超える‌‌6.5シグマ‌‌に相当する結果でした。この結果は、「ごく普通の人々が、環境をナビゲートするために、‌‌意識することなく常に予知能力を使っている‌‌」可能性を示唆しています。

実験結果と統計的意義

ダリル・ベムの『未来を感じる（Feeling the Future）』予知実験のより大きな文脈において、提供されたソースは、‌‌実験結果の強固さ‌‌と‌‌統計的意義‌‌について非常に詳細かつ驚異的な主張を展開しています。

実験結果の概要

ベム氏の研究は、人間が無意識のうちに将来に関する予知的な情報を得ているかどうかを調査したものであり、テスト対象者において‌‌予知の確固たる証拠‌‌を発見しました。

結果の全体的な傾向として、予知の効果量（effect size）は小さかったものの、‌‌偶然の範囲を超えて‌‌おり、統計的に有意な結果を生み出しました。

統計的意義

この研究の最も衝撃的な点は、結果が偶然によるものではないことを示す‌‌統計的意義のレベル‌‌にあります。

1. 個々の実験の有意性

ベム氏が行った9つの実験のうち、‌‌8つの実験‌‌で統計的に有意な結果が得られました。統計的有意性の主な基準はP値が0.05（結果が偶然によるものである確率が20分の1、つまり5%）以下であることです。

‌‌有意な結果:‌‌ 8つの実験がP値0.05以下を達成しました。
- 例えば、実験1のP値は0.01（偶然の確率が1%）でした。
- 実験2のP値は0.009（偶然の確率が1%未満）でした。
‌‌有意ではなかった結果:‌‌ 唯一、実験7（遡及的な飽きの誘導）だけが統計的に有意ではありませんでした。このP値は0.096であり、結果が偶然によるものである確率は約10%でした。

2. 全実験の統合結果

9つの実験すべての結果を統合して分析したところ、その統計的な強度は極めて高くなりました。

‌‌P値（Frequentist Statistics）:‌‌ 統合されたP値は、$2.6 \times 10^-11$乗でした。
‌‌偶然による確率:‌‌ これは、結果がランダムな偶然によるものである確率が‌‌約370億分の1‌‌（1000億分の2.7）であることを意味します。
‌‌シグマ値:‌‌ このP値は、‌‌6.5シグマ‌‌に相当します。

物理学において新粒子の発見を発表するために必要な‌‌ゴールドスタンダード‌‌とされる閾値は‌‌5シグマ‌‌です。ベム氏の結果は、この5シグマの閾値を大幅に上回っており、もしこれが物理学の実験であれば、この発見に基づいて新しい粒子を発表できるほどの確証レベルに達しています。

3. ベイズ分析による評価

ベム氏らは、頻度主義統計（Frequentist statistics、P値を使用）だけでなく、ベイズ統計（Bayesian statistics）を用いて結果を再分析しました。ベイズ統計では、仮説を支持する証拠の強さを‌‌ベイズファクター‌‌で示し、通常、100を超える値が「決定的な証拠」と見なされます。

ベム氏が収集したデータに基づき、「予知が存在する」というモデルのベイズファクターは‌‌13,669‌‌と計算されました。
‌‌13,669‌‌という値は、決定的な証拠とされる閾値100をはるかに超えており、ベイズ統計の観点からも予知の存在を強く支持するものでした。
ベイズ分析によると、ベム氏の収集データが与えられた場合、予知が現実でない確率はおよそ‌‌1400分の1‌‌（$7.3 \times 10^-5$）であると結論づけられました。

科学界の反応と論争の背景

これらの結果が、主流の心理学専門誌である『パーソナリティ・社会心理学ジャーナル（Journal of Personality and Social Psychology）』に掲載されたことで、科学界に大きな衝撃を与えました。

‌‌懐疑論者の反応:‌‌ ハードコアな懐疑論者の科学者たちは、「信じられないほどおかしい、純粋に狂っている」と反応し、結果を受け入れることに非常に消極的でした。
‌‌示唆されること:‌‌ ベム氏の研究は、予知が存在するだけでなく、‌‌ごく普通の人々‌‌（コーネル大学の学生など）が、‌‌意識的に試みることなく‌‌、環境をナビゲートするために‌‌常に‌‌予知能力を使っている可能性を示唆しました。
‌‌論争の激化:‌‌ ベム氏の仮説（予知の存在、普通の人々による利用、無意識の使用）は、懐疑論者にとっては「狂気が狂気の上に積み重なっている」ように聞こえました。結果として、懐疑論者は実験の再現を要求し、ベム氏の研究は「これまで発表された中で最も物議を醸した10の心理学研究」の一つに数えられることになりました。

再現性の結果（メタ分析）

ベム氏の結果が、偶然によるものではないこと、そして統計的に信頼できることを証明するための最終的な手段は、‌‌再現‌‌でした。

ベム氏の研究材料と手順は、独立した研究者が再現しやすいように意図的にシンプルに設計されており、無料で公開されました。
2016年に行われた‌‌メタ分析‌‌では、14か国の33の研究室による90の再現実験の結果が統合されました。
‌‌統合された結果（再確認）:‌‌
- ベム氏の元の実験を含めた場合、統合されたP値は$1.2 \times 10^-10$乗であり、結果が偶然による確率は‌‌約100億分の1‌‌でした。これは‌‌6シグマ超‌‌の効果量を示しました。
- ベム氏の‌‌元の9つの実験を除外‌‌した場合でも、P値は$1.1 \times 10^-5$乗（偶然による確率が約10万分の1）であり、ベイズファクターは‌‌3853‌‌となり、‌‌決定的な証拠‌‌（閾値100）を大幅に超えていました。

再現の試み（成功したものと失敗したもの両方を含む）が行われた後も、ベム氏の結論は強く支持されました。さらに、出版バイアス（ファイルドロワー効果）を考慮した分析でも、この有意性を打ち消すには‌‌544件‌‌もの未発表の研究が必要であると計算されましたが、これは超心理学分野の研究者の総数や出版傾向を考慮すると非現実的であると結論づけられています。

科学界の論争と反応

ダリル・ベムの『未来を感じる（Feeling the Future）』予知実験のより大きな文脈において、提供されたソースは、‌‌科学界がこの研究に対して抱いた衝撃、懐疑的な反応、そしてそれに続く論争‌‌について詳細に述べています。

科学界への衝撃と論文掲載の重要性

ベム氏の論文は2011年に「未来を感じる：認知と感情に対する異常な遡及的影響の実験的証拠（Feel in the Future, Experimental Evidence for Anomalous Retroactive Influences on Cognition and Affect）」というタイトルで発表されました。

この論文が‌‌科学界に衝撃を与えた‌‌主な要因は、その掲載された場所でした。この研究は、超心理学専門のジャーナルではなく、‌‌一流で、非常に尊敬されている主流な心理学の専門誌‌‌である『パーソナリティ・社会心理学ジャーナル（Journal of Personality and Social Psychology）』に掲載されました。この掲載により、この研究は超心理学の分野を超えて、より広範な影響力を持つことになりました。そのため、懐疑的な科学者たちは「パニックに陥った」（freaked out）と表現されています。

懐疑論者の強烈な反応と拒絶

ベム氏の実験は、予知が存在することを示唆し、結果が偶然によるものである確率は‌‌数十億分の1‌‌であるという結果を示しました。

これに対し、懐疑的な科学者（ハードコアな懐疑論者）たちは、その結果を受け入れることに非常に消極的でした。

‌‌否定的な見解:‌‌ ニューヨーク・タイムズの記事は、この決定が「いわゆる超常現象の信奉者を喜ばせるかもしれないが、科学者たちを‌‌すでに当惑させている‌‌」と指摘しました。
‌‌「純粋な狂気」:‌‌ 高名な懐疑論者であるオレゴン大学のレイ・ハイマン名誉教授（心理学）は、「‌‌狂気だ、純粋な狂気だ‌‌。主要なジャーナルがこの研究を許可していることが信じられない」と述べました。
‌‌体調不良:‌‌ 心理学者のヤン・ヴァーヘマーカー（Jan Wagemaker）は、ベム氏の論文を読む経験について「何度も（読むのを）やめなければならなかった。読むと‌‌身体的に体調が悪くなった‌‌」と語っています。
‌‌不正行為の示唆:‌‌ トロント大学の心理学者であるユリ・シェマク（Yuli Shemak）は、「私はこれらの結果を信じる必要はない。なぜなら、それらは‌‌明らかに捏造されている‌‌からだ」と断言しました。

ベム氏が提案した仮説、特に「予知が存在する」ことに加えて、「‌‌ごく普通の一般人‌‌（コーネル大学の学生など）がそれを持っており、‌‌意識的に努力することなく常にそれを使っている‌‌」という3つの主張は、懐疑論者にとっては「狂気の上に狂気が積み重なっている」ように聞こえました。彼らにとって、このプロジェクト全体は「全くもってバット・グアノ（コウモリの糞）のように狂っている」と見なされました。

科学の危機と方法論的論争

ベム氏の結果があまりにも驚異的であったため、懐疑論者たちは、その結果を受け入れる代わりに、「‌‌科学が根本的に壊れているに違いない‌‌」と発表する記事さえ現れました。この論文の出版は、心理学における「方法論的および統計学的危機」を引き起こす‌‌決定的な転換点（the tipping point）‌‌として特定されました。

この研究は、ジンバルドーのスタンフォード監獄実験やミルグラムの電気ショック実験といった悪名高い研究と並び、‌‌これまで発表された中で最も物議を醸した10の心理学研究‌‌の一つに数えられています。

懐疑論者たちは、この結果を打ち消すために、科学的な観点からいくつかの主要な批判を展開しました。これらの批判の多くは、ベム氏が当時の‌‌標準的な研究慣行‌‌（standard research practices）に従っていたことを認めつつも、その‌‌標準慣行自体を改善する必要がある‌‌という主張に焦点を当てていました。

‌‌主な批判点と論争の焦点:‌‌

‌‌探索的実験と確証的実験の区別:‌‌ ベム氏が、仮説を生成するための探索的研究と、仮説をテストするための確証的研究とを適切に区別していないのではないかという指摘がありました。ベム氏自身も、初期の実験では緩い態度であったことを認めました。
‌‌統計手法:‌‌ ベム氏が使用した伝統的な頻度主義統計（P値に基づくもの）ではなく、‌‌ベイズ統計‌‌を使用すべきだったという批判がありました。ベム氏と共同研究者は、この批判に応えてベイズ分析を行い、予知が存在するモデルのベイズファクターは13,669であり、「決定的な証拠」の基準である100を大きく超える結果が得られたと発表しました。
‌‌ファイルドロワー効果（出版バイアス）:‌‌ 統計的に有意な結果が出なかった未発表の研究（ファイルドロワーにしまわれた研究）が多数存在し、それがベム氏の統計的有意性を歪めているのではないかという批判です。ベム氏側は、後に実施されたメタ分析において、有意性を無効にするには‌‌544件‌‌もの未発表の研究が必要であると計算し、これは非現実的であるとしてこの批判の妥当性を退けました。
‌‌実験の再現:‌‌ 最も重要な点として、懐疑論者たちは‌‌実験の追試（replication）‌‌を強く要求しました。

追試（再現）と結果の確証

ベム氏自身は、研究材料と手順をシンプルかつ透明性の高いものにすることで、独立した研究者による追試を奨励しました。

その結果、2016年に発表されたメタ分析では、14か国の33の研究室による90の再現実験の結果が統合されました。この統合された分析は、ベム氏の結論を強く支持しました。

ベム氏の元の実験を含めた場合、統合されたP値は$1.2 \times 10^-10$乗であり、‌‌6シグマ超‌‌の効果が示されました。
ベム氏の元の9つの実験を除外した場合でも、P値は$1.1 \times 10^-5$乗（約10万分の1の偶然の確率）であり、ベイズファクターは‌‌3853‌‌となり、「決定的な証拠」の基準を依然として上回りました。

結局のところ、ベム氏の結論は、再現の試み（成功したものと失敗したもの両方を含む）が行われた後も強く支持されることとなり、「提案された批判のどれもが妥当性を持っているようには見えず、ベム氏の結論は再現に耐え得た」と結論づけられました。

再現性

ダリル・ベムの『未来を感じる（Feeling the Future）』予知実験のより大きな文脈において、提供されたソースは、‌‌再現性（Replication）が論争の中心的な焦点であった‌‌こと、そして‌‌再現性の結果がベム氏の結論を強く支持した‌‌ことを強調しています。

1. 再現性の要求とベム氏の計画

ベム氏の論文が2011年に発表され、数十億分の1の確率で予知が存在する可能性を示唆する驚異的な統計結果（6.5シグマ）が示されると、‌‌懐疑的な科学者たちはショックを受け‌‌ました。

‌‌追試の要求:‌‌ 懐疑論者たちは、結果を受け入れることに非常に消極的であったため、‌‌他の科学者たちに実験の追試（replicate）を要求しました‌‌。
‌‌ベム氏の意図的な設計:‌‌ ベム氏は、当初から再現を強く望んでおり、そのために実験の設計を行いました。彼は、独立した研究者によって追試が可能になるよう、「心理学で慣習的で広く受け入れられている原理」に基づいて、‌‌可能な限りシンプルで透明性の高い‌‌方法論を採用しました。
‌‌資料の無料公開:‌‌ ベム氏は、追試を奨励するために、実験に使用したテストパッケージと手順書を‌‌無料で利用可能にしました‌‌。これにより、どの研究者でもプログラムを自分のコンピューターにインストールし、追試研究を開始することができました。

2. 再現の実行と結果の確証

ベム氏の公開戦略の結果、世界中の研究者が追試を行いました。

‌‌結果の分化:‌‌ 追試の中には、ベム氏の結果を‌‌確認したもの‌‌もあれば、‌‌再現に失敗したもの‌‌もありました。特に懐疑的な報道機関は、少数の失敗した再現をすぐに大きく報じました。中には、参加者が集中していない可能性があるオンラインでテストを実施するなど、‌‌方法論に根本的な変更を加えた‌‌不正確な追試もありました。
‌‌メタ分析の実施（2016年）:‌‌ 2016年、ベム氏と4人の同僚は、ベム氏の元の実験と、それまでに行われた追試研究を統合した‌‌メタ分析‌‌の結果を発表しました。
- このメタ分析には、‌‌14か国の33の研究室‌‌による‌‌90の実験‌‌の結果が含まれていました。

3. メタ分析による圧倒的な統計的確証

メタ分析の結果は、追試を含めてもなお、ベム氏の結論を強力に支持するものでした。

ベム氏のオリジナル実験を含めた場合

‌‌P値とシグマ:‌‌ 統合された全体効果は、‌‌6シグマ超‌‌であり、P値は$1.2 \times 10^-10$乗でした。これは、結果がランダムな偶然による確率が‌‌約100億分の1‌‌であることを意味します。
‌‌ベイズファクター:‌‌ ベイズ分析でも、ベイズファクターは$5.1 \times 10^9$（‌‌50億‌‌）に達しました。これは、仮説に対する決定的な証拠とされる基準値‌‌100‌‌をはるかに超えています。

ベム氏のオリジナル実験を除外した場合

ベム氏の元の9つの実験を分析から除外して、独立した追試の結果のみを統合した場合でも、結果は強力な有意性を示しました。

‌‌P値:‌‌ P値は$1.1 \times 10^-5$乗でした。これは、結果が偶然による確率が‌‌約10万分の1‌‌であることを意味します。
‌‌ベイズファクター:‌‌ ベイズファクターは‌‌3853‌‌となり、これも決定的な証拠の基準値100を大きく超える値でした。

この結果は、「‌‌ベム氏の結論が再現に耐え得た‌‌」ことを強く示しています。

4. ファイルドロワー効果への対処（Fail-Safe計算）

再現性の論争に関連して、懐疑論者から提起された主要な批判の一つに、‌‌ファイルドロワー効果‌‌（有意でない未発表の研究が結果を歪めている可能性）がありました。

‌‌計算の実施:‌‌ メタ分析の著者たちは、この出版バイアスを制御するための「フェイルセーフ計算」を実施しました。
‌‌結論:‌‌ この研究の統計的有意性を‌‌些細なレベルにまで引き下げる‌‌ためには、ファイルドロワーの中に‌‌544件‌‌もの未発表の研究が存在しなければならないと計算されました。
‌‌現実との比較:‌‌ 超心理学の分野で実際に研究を行っている研究者の数が非常に少ないことを考えると、この544件という数字は‌‌あり得ない‌‌と結論づけられました。また、超心理学のジャーナルは、他の分野のジャーナルとは異なり、‌‌有意でない結果の研究（null results）も出版する傾向にある‌‌ため、ファイルドロワー効果の可能性はさらに低いとされました。

総合的に見て、再現性の努力（成功・失敗の両方を含む）はベム氏の研究に対する科学的な‌‌最終的な検証‌‌として機能し、その結果は‌‌予知が存在する強力な証拠を依然として提供している‌‌と結論づけられています。

情報源

動画(1:18:35)

Feeling the Future (Precognition, Daryl Bem, Psi Research) - Jimmy Akin's Mysterious World

7,700 views 2025/08/08 Jimmy Akin's Mysterious World

文字起こし（話者識別）

展開

（以下は "Feeling the Future (Precognition,Daryl Bem,Psi Research) - Jimmy Akin's Mysterious World" と題された podcast の文字起こしです。話者識別ずみ。）

[Dom Bettinelli] : Jimmy Akin's Mysterious World is brought to you by the StarQuest Production Network, and is made possible by our many generous patrons. If you'd like to support the podcast, please visit sqpn.com slash give. You're listening to episode 377 of Jimmy Akin's Mysterious World, where we look at mysteries from the twin perspectives of faith and reason. In this episode, we're talking about a famous set of precognition experiments by psychologist Daryl Bem. I'm Dom Bettinelli, and joining me today is Jimmy Akin. Hey, Jimmy. (00:00:36)

[Jimmy Akin] : Howdy, Dom. (00:00:37)

[Dom Bettinelli] : Folks, be sure to stick around to the end of the episode. We'll have your feedback on our recent episode on Facilitated Communication and Spelling to Communicate. But first... In 2011, psychologist Daryl Bem rocked the scientific world by publishing a paper called Feeling the Future. The paper reported the results of a series of experiments Bem had done on whether humans have the ability to subconsciously get precognitive information about the future. The experiments strongly suggested that they did, and that the odds were billions to one against this being due to random chance. (00:01:15)

[Dom Bettinelli] : Skeptical scientists were shocked. They demanded that other scientists replicate the experiments. And when other scientists did just that, Bem's results held up. But skeptics were so unwilling to accept the results that articles began appearing that announced science must be fundamentally broken. So what were Daryl Bem's Feeling the Future experiments? What were the results they found? And what has happened with the controversy over them? That's what we'll be talking about on this episode of Jimmy Akin's Mysterious World. So, Jimmy, why did you want to cover the Feeling the Future experiments on Mysterious World? (00:01:51)

[Jimmy Akin] : Well, I try to strike a balance between episodes that are story-driven and episodes that are data-driven. Most people prefer stories because, well, stories are stories. They have plots, they have dramatic tension, and they have resolution, and humans are wired for stories. So I try to include stories whenever I can, but we also need to look at data. One of the subjects that we discuss on Mysterious World is parapsychology, which involves psychic functioning. We discussed the field of parapsychology itself in episode 360, but there I could only survey the concepts that are used in the field. (00:02:26)

[Jimmy Akin] : I couldn't dig into the data supporting them. In other episodes, like when I've introduced and interviewed former Defense Department psychic spies or told you stories about poltergeists, I've focused on stories involving psychic phenomena, and I could tell you story after story involving that. But if we never get around to talking about the data that has been uncovered in parapsychological experiments, the stories themselves won't be as convincing, especially to skeptics. I've found that the most effective way to present a case for something is to both provide stories about the phenomenon in action and to provide statistical data. (00:03:05)

[Jimmy Akin] : So today, we're going to be looking at a set of high-quality experiments and what they uncovered about psychic functioning, and we'll have a dose of story, too, because today's episode has the story of a scientific controversy in it. (00:03:18)

[Dom Bettinelli] : Then let's start with a simple question. To begin today's mystery, who is Darrell Bem? (00:03:23)

[Jimmy Akin] : He's a social psychologist and a former professor at Cornell University. He was born in 1938, and he's still alive today at age 87. He originally didn't have anything to do with parapsychology, just mainstream psychology. But by the 1990s, he was asked to start consulting on some parapsychological matters. For example, the parapsychologist Charles Onerton asked him to help with a meta-analysis of Gonsfeld experiments. We can discuss the Gonsfeld technique in future episodes, but basically it's a kind of telepathy test, and it's produced statistically significant results that have held up over time. Well, Bem got interested in parapsychology as a result of his consultations, and he ended up doing his own parapsychological experiments. (00:04:11)

[Jimmy Akin] : The most famous of these were published in 2011 in an article titled, Feel in the Future, Experimental Evidence for Anomalous Retroactive Influences on Cognition and Affect. And it wasn't just published in a hole-in-the-wall journal. It was published in the Journal of Personality and Social Psychology, which is an elite, highly respected, mainstream psychological journal. And that only made the study have an even broader impact outside of parapsychology, which is why the skeptics freaked out. (00:04:43)

[Dom Bettinelli] : Then let's look more closely at the Feel in the Future paper. What does it contain? (00:04:48)

[Jimmy Akin] : It basically reports the results of nine experiments that Bem and his associates did. To make these experiments as bulletproof as possible, they were based on conventional, well-accepted principles in psychology. And the tests also involved only super-simple statistics that were well-accepted, so the results shouldn't be controversial. Because all Bem did was switch the order of a couple of steps to make the experiments precognitive. The way a typical psychology experiment works, you conduct trials in which you have participants go through a sequence of steps. The first step is typically that you give the test participant a stimulus of some kind, and then, as a second step, you measure the participant's response to that stimulus. (00:05:38)

[Jimmy Akin] : These experiments followed normal causation, where the cause, the stimulus, precedes the effect or the response. But Bem wanted to force precognition into the experiment, so in his trials he reversed the order of the two steps. So in Bem's trials, first, you measured the participant's response, and second, you gave them the stimulus. Thus, he was testing whether causation could flow backwards in time, retro-causation, so that the effect preceded the cause instead of following it. (00:06:14)

[Dom Bettinelli] : We're going to want some examples of how these experiments work to make this concept clear. In fact, we're going to go through each of the nine experiments he did, but can you start by giving us an easy-to-understand example of an experiment? (00:06:27)

[Jimmy Akin] : Sure. In fact, I'll give you my favorite one, which is experiment number eight. Now, in a conventional psychological experiment, you could do a memory test. One way this could work is that, first, you show a person a list of words and make sure that they read each word. For example, you might give them the list aardvark, pizza, fireman, sweatshirt, iguana, spaghetti, nurse, and pants. Then, you have them practice half the words. For example, you might have them practice aardvark, pizza, fireman, and sweatshirt. (00:07:04)

[Jimmy Akin] : And they could write out those words. They could go over them several times. They get some practice. Then, after they've practiced half the list, you ask them to name as many of the words as they can remember. Now, I've simplified this test for purposes of clarity. In reality, you'd want to test a person with a lot more than eight words because some people can just remember eight words. So, you'd want to give the participants more words than a normal person could remember. (00:07:31)

[Jimmy Akin] : You'd also want to randomly select which words they get to practice instead of just picking the first four the way I did. But that gives you an idea of how this word recall experiment would work. What you'd expect in this experiment is that the participant would be able to recall more of the words that they practiced because they got to spend more time thinking about these words. They did those words as homework, so they should remember them better. (00:07:58)

[Jimmy Akin] : But Bim wanted to make this a test of precognition. So, what he did was switch the order of the second two steps. In his version, first, you showed the participants the complete list of words. Second, you asked the person how many words they can recall. And third, you let them practice half the words. It's like taking your test before you do your homework. And if the person can still remember the words that they haven't yet practiced but will practice better than the words that they will not practice, then that's evidence the using precognition. That the homework they will do in the future is helping them better remember the words that they will practice on the test in the present, which is pretty mind-blowing if that works. (00:08:51)

[Jimmy Akin] : But that's an example of how just switching the order of two of the steps can turn an ordinary memory test into a precognitive memory test. (00:09:00)

[Dom Bettinelli] : If that's how you can turn an ordinary test into a precognitive test by switching the order of the steps, then let's go through each of the nine experiments Bem did. How did the first one work? (00:09:11)

[Jimmy Akin] : As we go through the nine experiments, I'm going to leave out some details for purposes of simplicity so that I can make the concepts clearer and so we can get through more quickly. But we'll have a link to the original paper so you can read that for the full details. Well, one of the basic facts about human psychology is that men usually find women attractive and women usually find men attractive. There are exceptions, but that's how the human race keeps going. (00:09:38)

[Jimmy Akin] : The fact the two find each other attractive is what causes men and women to get together and get busy. And there's an evolutionary reason for that. Any species that reproduces sexually would die out if the two sexes didn't find each other attractive. So the two sexes are programmed on the biological level to seek each other out and mate. And from a Christian perspective, that's by God's design. So Bem wanted to design an experiment that relied on this kind of attractiveness. (00:10:07)

[Jimmy Akin] : Here's how Bem explained it in Feel in the Future. (00:10:10)

[narator] : 100 Cornell undergraduates, 50 women and 50 men, were recruited for this experiment through the psychology department's automated online sign-up system. They either receive one point of experimental credit in a psychology course offering that option or were paid $5 for their participation. Both the recruiting announcement and the introductory explanation given to participants upon entering the laboratory inform them that this is an experiment that tests for ESP. It takes about 20 minutes and is run completely by computer. (00:10:41)

[narator] : First you will answer a couple of brief questions. Then on each trial of the experiment, pictures of two curtains will appear on the screen side by side. One of them has a picture behind it. The other has a blank wall behind it. Your task is to click on the curtain that you feel has the picture behind it. The curtain will then open, permitting you to see if you selected the correct curtain. There will be 36 trials in all. (00:11:07)

[narator] : Several of the pictures contain explicit erotic images. For example, couples engaged in non-violent but explicit consensual sexual acts. If you object to seeing such images, you should not participate in this experiment. The participant then signed a consent form and was seated in front of the computer. After responding to two individual difference items discussed below, the participant had a three-minute relaxation period during which the screen displayed a slowly moving Hubble photograph of the starry sky while peaceful New Age music played through stereo speakers. The 36 trials began immediately after the relaxation period. (00:11:49)

[Jimmy Akin] : But there was something that the test participants didn't know. Bim explains. (00:11:54)

[narator] : From the participant's point of view, this procedure appears to test for clairvoyance. That is, participants were told that a picture was hidden behind one of the curtains, and their challenge was to guess correctly which curtain concealed the picture. In fact, however, neither the picture itself nor its left-right position was determined until after the participant recorded his or her guess, making the procedure a test of detecting a future event, i.e., a test of precognition. (00:12:25)

[Jimmy Akin] : So, despite the fact that the participants were asked to guess where the picture was, what they were actually guessing was where the picture would be, and the placement of the picture was determined randomly after they made their guess. Now, the picture would either be randomly placed behind the curtain on the left or the curtain on the right, with a 50% probability of it being in either place. So, you'd expect that by random chance alone, the participant would guess where the picture would be 50% of the time. And the other 50% of the time, they'd see a blank wall when the curtain opened. (00:13:01)

[Jimmy Akin] : The hypothesis was that if people are attracted to sexy images and can precognitively sense where they'll be, then they would end up seeing the picture more than 50% of the time. In other words, that the college students would seek out the sexy pictures. Personally, this is a test I would not have participated in, since I practice custody of the eyes, so I would have exercised the option not to participate. But, I'm not a typical Cornell College student, and the test would still show whether people can precognitively seek out images that they'd like to see. (00:13:35)

[Jimmy Akin] : So, the first test dealt with precognitive detection of erotic stimuli. (00:13:40)

[Dom Bettinelli] : We'll cover the results they got with the test later, but if that's how the first experiment worked, what was the second? (00:13:47)

[Jimmy Akin] : This was sort of the reverse of the first experiment. The first experiment was designed to see if people could precognitively seek out images that they wanted to see. So, the second experiment was designed to see if they could precognitively avoid images that they didn't want to see. So, it was a test that dealt with precognitive avoidance of negative stimuli. What they did in this one was show a person two pictures on a computer screen at the same time, and they asked the person to press one button if they preferred the image on the left, and another button if they preferred the image on the right. But the thing was, the two images were mirror images of each other that were otherwise identical. (00:14:30)

[Jimmy Akin] : The pictures also were emotionally neutral. So, you might see a picture of an apple on the right next to a mirror image of the same apple. So, there wasn't much of a basis for preferring one picture over its mirror image. What the participants didn't know is that they would also be shown additional images that would be flashed on the screen too quickly for them to consciously notice. So, these were being subliminally flashed, and the participants were being subliminally exposed to these additional pictures. (00:15:02)

[Jimmy Akin] : Some of these pictures were emotionally positive, like say the a picture of a pretty sunset, while other pictures were emotionally negative, like say a picture of a vicious dog about to attack. What controlled whether you subliminally saw the positive picture or the negative picture was determined by the choice you made. You see, the computer randomly selected either the left or the right image that you saw consciously as the target. So, if you were looking at an apple and its mirror image, the computer would select either the left apple or the right apple as the target. (00:15:38)

[Jimmy Akin] : Let's say that unknown to you, it selected the left apple. If you also selected the left apple, then it would reward you by subliminally flashing a positive image like the pretty sunset. On the other hand, if the target was the left hand apple and you selected the right hand apple, then the computer would punish you by showing you a negative image like a vicious dog. To make this a precognitive test, the computer wouldn't determine which image was the target until after you expressed your preference. So, first you expressed which picture you liked more, then the computer decides which image was the target, and then it subliminally rewards or punishes you based on which picture you chose. (00:16:24)

[Jimmy Akin] : The purpose of this test was thus to see whether humans could subconsciously predict which image would be the target and thus avoid negative stimuli like the picture of the vicious dog. It thus involved the precognitive avoidance of negative stimuli. (00:16:39)

[Dom Bettinelli] : If that's the second test involved, how did the third work? (00:16:43)

[Jimmy Akin] : The third test involves a concept that's known in psychology as priming. Bam explains. (00:16:49)

[narator] : In recent years, priming experiments have become a staple of cognitive and cognitive social psychology. In a typical affective priming experiment, participants are asked to judge as quickly as they can whether a pleasant or unpleasant, and their response time is measured. Just before the picture appears, a positive or negative word, for example, beautiful, ugly, is flashed briefly on the screen. This word is called the prime. Individuals typically respond more quickly when the valences of the prime and the picture are congruent, both are positive or both are negative, than when they are incongruent. (00:17:28)

[Jimmy Akin] : For example, let's suppose that the experimenter shows you a positive picture like a beautiful sunset or a negative picture like a vicious dog, and they ask you to assess as quickly as possible whether the picture is positive or negative. So you press one key if it's a positive picture, and you press another key if it's a negative picture. But in a standard psychology experiment, they first prime you by showing you a positive word like beautiful or a negative word like ugly. What priming experience has found is that people are able to answer the question about the picture more quickly if the prime and word and the picture are consistent, and they answer the question more slowly if the prime and the picture are in conflict. (00:18:17)

[Jimmy Akin] : So for example, if you're primed with the word beautiful and you see a beautiful sunset, then you'll answer more quickly than if you see the word ugly and a beautiful sunset. Similarly, if you're primed with the word beautiful and you see a picture of a vicious dog, you'll respond more slowly than if you see the word ugly before the vicious dog. In other words, you can answer the question, is this a positive or a negative image more quickly if the prime and word and the image agree than if they disagree? Because you have to think harder to ignore the prime and word and just focus on the picture. (00:18:58)

[Jimmy Akin] : So what BIM did to make this a precognitive experiment was to show the participants the prime and word after they'd already made their determination of whether the picture was pleasant or unpleasant. In other words, first the person is shown a picture, then they are asked to decide if it's positive or negative, and only then does the computer randomly show them a positive or negative word as a prime. The hypothesis was that if precognition does not exist, then it won't matter whether the prime and word that the person will be shown after they make their choice is positive or negative. (00:19:37)

[Jimmy Akin] : The future prime and word won't make any difference. People will determine whether the image is positive or negative at the same speed regardless of whether the computer later chooses a positive or a negative word. But if precognition does exist, then the person may see the current picture in light of the word that will eventually be chosen. So it will take them longer to determine if the image is positive or negative based on whether the prime and word that they will see in the future is also positive or negative. In other words, are you subconsciously peeking into the future for help in determining whether the picture is that you're seeing is positive or negative? (00:20:19)

[Jimmy Akin] : This was thus the first study in whether retroactive priming happens. If that was the third experiment, what about the fourth? It was essentially a replication of the third experiment, but with a minor change. Bim explains. (00:20:35)

[narator] : In the original study, the prime paired with each picture was randomly selected on each trial from the list of unused positive and negative primes. In the current replication, one fixed positive prime and one fixed negative prime were assigned to each picture prior to the experiment. These were selected to be semantically relevant to the picture. For example, a picture of a basket of fruit was paired with the positive prime of luscious and the negative prime of bitter. And a picture of a menacing pitbull was paired with the positive prime of friendly and the negative prime of threatening. (00:21:12)

[narator] : The computer then randomly selected either the positive or the negative prime before the picture was presented in the forward priming procedure or after the participant had responded to the picture in the retroactive priming procedure. (00:21:26)

[Jimmy Akin] : So it was essentially the same as the previous experiment, but with some refinements. This was thus a retroactive priming experiment. (00:21:35)

[Dom Bettinelli] : What happened with the fifth experiment? (00:21:37)

[Jimmy Akin] : This is one of two experiments that's based on the concept of habituation. Psychologists have found that we tend to like things that we're more familiar with. That is to say, things that we've been habituated to. That's something I noticed years ago. I often don't like an album, a book, a TV show, or a movie as much the first time I listen to or watch it as I like it when I re-listen to or re-watch it. And I've always said to myself, you know, the first time I consume a piece of media, I'm just learning it. (00:22:08)

[Jimmy Akin] : I don't know what to expect. But when I re-consume it, it's like now I see what the artist was going for and I like it better. Psychologists have done studies of how the fact that you've been previously exposed to something increases how positively you feel about it. And it turns out that this happens even when the exposure that you've had is subliminal or below the threshold of conscious awareness. So a typical psychological experiment works like this. (00:22:36)

[Jimmy Akin] : First, a picture is selected as the target. Let's say a picture of an apple. Then you're repeatedly exposed to this target as it's flashed on the screen multiple times subliminally. Then the target image is paired with a very similar picture. And then you're the two pictures you like better. Even though you've only seen the target picture subliminally, people still tend to select the picture that their subconscious has seen before. So that's how a normal experiment works. (00:23:10)

[Jimmy Akin] : But to make this a precognitive test, BIM changed the order of the steps. First, you're shown two similar pictures. Then you pick which one you like. Then the computer randomly selects one of the images as the target. And then it subliminally shows you this image several times. The idea is to see whether you'll say that you like the image that the computer will choose to subliminally expose you to in the future more than the image you will not be subliminally exposed to in the future. In other words, can you precognitively sense what the picture will be so that it feels more familiar to you and thus you like it more? (00:23:55)

[Jimmy Akin] : This was thus a retroactive habituation experiment. When they did this experiment, they tested it with both emotionally neutral pictures, like the apple I just mentioned, and with emotionally negative pictures, like the vicious dog. And the hypothesis was that when they used a pair of negative pictures, people would prefer the image that they would be exposed to more than 50% of the time. (00:24:20)

[Dom Bettinelli] : If that was experiment 5, what was experiment 6? (00:24:24)

[Jimmy Akin] : Basically, it was a replication of experiment 5. So it was also a retroactive habituation experiment, except this time they added a new class of pictures, namely sexy ones. So now there were three classes of pictures the participants might see. They might see a pair of neutral images, they might see a pair of negative images, or they might see a pair of sexy images. And the new class of pictures led to a second hypothesis that they were testing. Apparently, people who see a sexy picture over and over again get bored with it, and so they start preferring that picture less. (00:25:02)

[Jimmy Akin] : So the new hypothesis was that when people were shown a pair of sexy pictures, they would prefer the one that they would be exposed to less than 50% of the time compared to the picture that they would not be exposed to in the future, since exposure to the target would make that picture more boring for them. (00:25:23)

[Dom Bettinelli] : If that was experiment 6, what happened in experiment 7? (00:25:27)

[Jimmy Akin] : Well, when they were conducting experiment 6, they noticed that people did get bored with the sexy picture they would see in the future. Specifically, if the picture would be more than 10 times subliminally, they would like that picture less. So they decided to design an experiment about retroactive induction of boredom. What they did for this experiment was focus exclusively on neutral pictures. They didn't use any negative ones or any sexy ones. They also decided to try flashing the target picture on the screen for long enough that you could see it with your conscious mind. (00:26:03)

[Jimmy Akin] : But otherwise, it worked just like experiments 5 and 6. First, you get shown a pair of two neutral images. Then you select which one you like better. Then the computer randomly selects which of the two images will be the target. And then that image is flashed repeatedly, only the flashes are long enough that you can consciously see the image. (00:26:26)

[Dom Bettinelli] : You've already mentioned experiment 8, which was the one that involved remembering words. Can you quickly recap that one for us? (00:26:33)

[Jimmy Akin] : Sure, this one was part of a pair of experiments that involved retroactive facilitation of recall. Basically, they showed you a list of words. Then they asked you how many words you could remember. And then the computer randomly selected half of the words and let you practice those. The hypothesis was that if precognition exists, you might be able to recall the words that you would practice in the future better than the words that you would not practice in the future. (00:27:01)

[Dom Bettinelli] : Okay, we're now at the last experiment. If that was experiment 8, what was experiment 9? (00:27:08)

[Jimmy Akin] : It was basically a replication of experiment 8, so it also involved retroactive facilitation of recall. The only difference was that after you listed all the words you could remember, they introduced an extra kind of practice. So you got a little more practice with half the words than you would in experiment 8. (00:27:27)

[Dom Bettinelli] : But first, I'm sure you've all precognitively become aware of what I'm about to do. We'd like to take a moment to thank our patrons who make this show possible, including Tomo G, Sharon P, Joel I, Edward R, and Derek M. Their generous donations at sqpn.com slash give make it possible for us to continue Jimmy Yakin's Mysterious World and all the shows at StarQuest. More than 90% of our financial support comes from individual donors like you, and so you can join them by visiting sqpn.com slash give. (00:28:02)

[Dom Bettinelli] : Jimmy Yakin's Mysterious World is also brought to you by DeliverContacts.com, offering top brand contact lenses at always low prices. With free delivery, visit DeliverContacts.com. And by... The debut novel Holy Terror by John R. Daugherty. Holy Terror is a gripping supernatural thriller where horrific crimes reveal an invisible war between good and evil. As justice is delivered through mysterious otherworldly means, detectives and agents uncover a force far greater than themselves. Holy Terror is a bold blend of crime, faith, and the paranormal. (00:28:41)

[Dom Bettinelli] : You can find author John R. Daugherty at johnrdaugherty.com. And for those who enjoy audiobooks, Holy Terror is also available on audible.com. Subscribers to John's email list can receive a free copy of the Bible Study Guide Companion at authorjohnrdaugherty.com slash subscribe. You can find both links in the show notes for this episode on our website. Jimmy, before we reveal the results that Darrell Bem got with his experiments, let's take a moment to reflect on what he was proposing. Wouldn't many people just think that the hypotheses he was testing were just absolutely insane? (00:29:20)

[Jimmy Akin] : For scientists who are hardcore skeptics that don't believe that psychic functioning is possible, yes, Bem's hypotheses sound ludicrous and totally crazy. The idea that you could do your homework after you take a test so that you can better remember the words that you will practice better than the words you won't practice just sounds nuts. So does the idea that you'll like one image better than another because you will see it subliminally in the future more than the other image also sounds nuts, as do the other ideas that Bem was testing. So the whole project sounds absolutely crazy to someone who just thinks that psychic functioning is impossible. (00:30:03)

[Jimmy Akin] : Worse yet, the participants in these experiments weren't even gifted psychics. They were ordinary people, just students at Cornell University, and they hadn't been trained in how to use ESP. And they didn't even know how the experiments worked before they took them. For example, they didn't know that after they did the word recall test that they'd be given a chance to practice some of the words. They didn't know that there would be any future homework, so they weren't consciously trying to tap into that homework. (00:30:34)

[Jimmy Akin] : Similarly, with the habituation test, they didn't know that one of the images would be randomly chosen and flashed on the screen too fast for them to see. Except in experiment 7, they were never consciously aware of those flashes at all, so they weren't consciously trying to perceive anything in the future. To a hardcore skeptic, this would just sound like craziness piled on top of craziness. Bem was proposing that not only does precognition exist, which is crazy element number one, but that perfectly ordinary people are using it, which is crazy element number two, and that they're using it without even trying to, which would suggest that they're always using that in life, which is crazy element number three. (00:31:18)

[Jimmy Akin] : So to hardcore skeptics, this just sounded total bat guano crazy. (00:31:23)

[Dom Bettinelli] : You referred to materialist scientists who are hardcore skeptics and believe that psychic functioning is impossible. Do all scientists go into that category? (00:31:32)

[Jimmy Akin] : Well, skeptical organizations would certainly like us to believe that. That's why they sometimes make bald, unqualified statements like, there is no scientific evidence for ESP or ESP is impossible. But the actual answer is no, there is actually considerable openness to the possibility of ESP among scientists. And it varies depending on which field of science you're talking about. For example, physicists tend to be more open to ESP, perhaps because physicists actually study things like whether time can move forwards and backwards. While ironically, psychologists tend to be among the more skeptical, but the attitude towards ESP in science is quite a bit more open than the skeptics let on. (00:32:16)

[Dom Bettinelli] : Then let's look at the results of BEM's nine experiments. What results did he get? (00:32:21)

[Jimmy Akin] : In general terms, keeping statistics out of it for the moment, BEM found that there was robust evidence for precognition in the test subjects. This doesn't mean that the precognition itself was robust. Like in many ESP tests on ordinary people, like the Cornell University students he used, the effect size of the ESP was small, but it went beyond random chance and produced statistically significant results. (00:32:46)

[Dom Bettinelli] : Statistical significance is a concept not all listeners may be familiar with. What is it? (00:32:53)

[Jimmy Akin] : It's a concept that is used to measure how likely it is that the results you get could be due to random chance. In science, this is often expressed with what's known as a p-value. You can think of the p in p-value as standing for the probability that your results are just due to random chance and there's no real effect you're measuring. And p-values are expressed as a number that's between 1 and 0. If you had a p-value of 1, it would mean that your results have a 100% probability of being due to random chance. (00:33:29)

[Jimmy Akin] : And if you had a p-value of 0, it would mean that your results have a 0% probability of being due to random chance. But in science, you almost never have a 100% or a 0% chance of anything. So p-values are almost always decibel numbers between 1 and 0. The question is, where do you draw the line between what's considered a significant probability and a non-significant one? Different scientists use different thresholds, but the main threshold is a p-value of 0.05 or a 1 in 20 probability that the results were due to random chance. (00:34:11)

[Jimmy Akin] : This number goes back to the 20th century statistician Ronald Fisher. He picked the value more or less arbitrarily, but it's become the standard one for most purposes in science. So if you hear someone say that the results of the experiment were statistically significant, that means it had a p-value of 0.05 or less. In other words, that there is a better than 19 in 20 chances that the results were a real phenomenon and less than a 1 in 20 chance that the results were just due to random chance. (00:34:48)

[Dom Bettinelli] : Another term from statistics that we're going to encounter is sigma. What do scientists mean when they talk about sigmas? (00:34:55)

[Jimmy Akin] : Sigma is one of the letters of the Greek alphabet. It's the Greek equivalent of our letter S. And in statistics, it refers to a standard deviation. So if something has a value of one sigma, that means it's one standard deviation away from the mean or average value. If it has a value of two sigma, that means it's two standard deviations away from the mean. If it's three sigma, that's three standard deviations away from the mean and so forth. I won't walk us through all that in detail today, but the bottom line for our purposes is that you can use sigmas as a measure of probability. (00:35:30)

[Jimmy Akin] : And the higher the sigma value is, the lower the probability is. In particle physics, the gold standard probability threshold that you need to announce the discovery of a new particle is five sigma. Five sigma corresponds to a p-value of 0.00006, or a chance of only one in 1.7 million that the results you got are due to random chance. So if you have a result that's five standard deviations away from the average, the odds against your results being due to random chance are so big that you can usually announce the discovery of a new particle. You'll also want to keep the five sigma threshold in mind because it's going to come back into our discussion. (00:36:21)

[Dom Bettinelli] : Okay, then with that background, let's put some numbers on Daryl Bem's nine experiments. Let's start with statistical significance. How many of the experiments had that? (00:36:31)

[Jimmy Akin] : Eight of the nine did. All but one of the experiments had p-values less than 0.05. The one experiment that didn't was experiment seven, or the retroactive induction of boredom. It had a p-value of 0.096, which meant that there was only about a 10% chance that the result was due to random chance, but a 10% chance is still above the 5% threshold that's normally used for statistical significance. So this one experiment's result was not considered statistically significant. (00:37:08)

[Jimmy Akin] : All the other eight experiments, though, produced results that were statistically significant. For example, experiment one had a p-value of 0.01, or only a 1% probability that the results were due to random chance. Experiment two had a p-value of 0.009, which is just less than a 1% probability of it being due to random chance, and so forth. But we can go further, because we can combine the results of the nine experiments to see what the overall case they make for precognition is. When you do that, including the one experiment that didn't achieve statistical significance, it turns out that the combined p-value is 2.6 times 10 to the negative 11th power, which means that the odds against the results of the nine experiments being due to random chance are about 2.7 out of 100 billion, or about 1 in 37 billion. (00:38:09)

[Jimmy Akin] : That corresponds to a 6.5 result, so it well exceeds the 5 sigma threshold that is needed to announce a new particle in physics. In other words, if this had been a physics experiment, you could announce a new particle based on what BEM found. He's thus exceeded the gold standard of proof used in one of the most rigorous branches of science. So, based on these findings, it's looking not just like precognition really exists, but that it really most sincerely exists. (00:38:41)

[Dom Bettinelli] : How did the hardcore skeptics react when Daryl BEM's article appeared? (00:38:46)

[Jimmy Akin] : Well, they freaked out, as you'd imagine. Remember, BEM was proposing what for them sounded like three kinds of craziness stacked on top of each other. First, that precognition exists, and second, that ordinary people have it, and third, that they're apparently using it all the time without even trying to. So, they just couldn't accept these results. Writing in the SciEncyclopedia, Chris Rowe summarized the situation. (00:39:11)

[narator] : The scientific community's reaction to the article's publication was mainly negative. A New York Times article noted that the decision may delight believers in so-called paranormal events, but it is already mortifying scientists. And quotes the emeritus professor of psychology at the University of Oregon, Ray Hyman, a high-profile skeptic, It's craziness, pure craziness. I can't believe a major journal is allowing this work. Jarrett included BEM's study among the ten most controversial psychological studies ever published, alongside notorious research such as Zimbardo's Stanford Prison Experiment and Milgram's Shock Experiments. Chambers identified the publication of BEM's article as the tipping point that triggered a methodological and statistical crisis in psychology, rather than the dramatic high-profile revelations of fraud perpetrated by psychologists such as Diedrich Staple or Jem Smearsters, the dismal failure of the Open Science Replication Project, or the discovery that many psychologists admitted to questionable research practices, which are all considered in detail later on in his book. (00:40:20)

[narator] : Engberg described the research as both methodologically sound and logically insane. And quotes Wagemaker's experience of reading BEM's ESP paper, I had to put it away several times. Reading it made me physically unwell. In the same article, Yuli Shemak, a psychologist at the University of Toronto, asserted, I don't have to believe any of these results because they're clearly fudged. (00:40:46)

[Jimmy Akin] : So skeptics couldn't accept these results, and many just dismissed them out of hand. I particularly like Jan Wagemaker's statement that he repeatedly had to stop reading the paper because it was making him physically unwell. However, some critics of the paper sought to make more significant objections from a scientific point of view. (00:41:09)

[Dom Bettinelli] : Then let's look at those. Before we go further, can you give us a basic list of the objections that we'll be covering? (00:41:15)

[Jimmy Akin] : Well, one of the things that more serious critics agreed on was that BEM's paper followed standard research practices of the time. They generally weren't arguing that he did anything wrong under the research practices of the day in psychology. So instead, they argued that the standard psychological research practices needed to be upgraded. One argument that was made was that BEM needed to more sharply define between exploratory and confirmatory experiments. Another was that he used the wrong kind of statistics in analyzing his results. (00:41:51)

[Jimmy Akin] : He should have used what are known as Bayesian statistics rather than the frequentist statistics he did use. Also, that his results could be explained by what's known as the file drawer effect. And finally, that his experiments needed to be replicated. There were also other more technical objections, usually of a statistical nature, and BEM and his defenders had replies for these objections. But they get us way too far into the weeds of statistics, so we won't be covering them today. Though we will have links to the relevant articles so that if you're a statistics geek, you can read them and evaluate the arguments and the replies for yourself. (00:42:31)

[Dom Bettinelli] : Then let's stick with the four major criticisms you've identified. Let's look at the first. What is the issue involving exploratory and confirmatory experiments? (00:42:41)

[Jimmy Akin] : Basically, exploratory research is a kind of open-ended, free-form style of research that scientists use to generate hypotheses that they can then test. By contrast, confirmatory research is what you're supposed to do once you have a hypothesis to test. A difference between them is that you're allowed to change things up when you're doing exploratory research. Like, you can change up the protocols that you're using and create new hypotheses during the exploratory phase. But once you're doing confirmatory research, you're not allowed to do that. (00:43:13)

[Jimmy Akin] : You're supposed to follow the protocol you started the confirmatory study with. Because if you're allowed to change the procedures once your study to confirm is underway, then you could tweak it to make it say all kinds of things. It wasn't clear to Bim's critics that he always properly distinguished between these two different modes of doing research. And the way journal articles were written back then, they made it sound like they didn't change their methodology in mid-experiment, even though they sometimes did. Things are changing on that front, though, and today a solution for this is known as pre-registration. (00:43:50)

[Jimmy Akin] : Pre-registration involves publishing your research plan that you're going to use before you even do the experiment. You publish it in what's known as a registry. So, it's publicly on record before you even start. And this makes it clear whether you're doing exploratory or confirmatory research, which helps enhance transparency in research, reduce bias, and promote integrity. I'm personally a big fan of pre-registration, though it isn't universally used even today. And it certainly wasn't in general use back when Bim did his experiments. (00:44:27)

[Dom Bettinelli] : How did Bim respond to the charge that he didn't adequately distinguish between exploratory and confirmatory research? (00:44:34)

[Jimmy Akin] : Well, he published his paper in 2011, but he'd completed all his research by 2010, and the research itself took place over the decade leading up to 2010. He acknowledged that when he was doing his very first experiments like 10 years ago, he had been looser than he was later, which was not uncommon among people doing this type of research. So, he acknowledged that there was at least partial validity to this claim. Fortunately, there is a cure available, which is replication. (00:45:06)

[Jimmy Akin] : If other people did replication studies to confirm his findings, that would show whether his hypotheses held up or not. (00:45:14)

[Dom Bettinelli] : And we'll cover the replication issue in a bit, but for now, what about the second criticism that should have used Bayesian statistics rather than Frequentist statistics? What's the difference between the two? (00:45:27)

[Jimmy Akin] : Frequentist statistics are the kind that we've been discussing thus far in today's episode. They involve things like the p-value that we discussed earlier, and they're the classic way of doing statistics. So, they're the normal type of statistics that scientists have generally used. However, in recent years, another type of statistics has been gaining in popularity. These are based on the work of English clergyman Thomas Bayes, who lived back in the 1700s. A key aspect of Bayesian statistics is updating your beliefs based on new evidence you get. (00:46:00)

[Jimmy Akin] : So, before you run an experiment, you already have an idea of how likely you think a particular hypothesis is. This belongs to your prior beliefs or priors, as they're called. Then, you run an experiment and you get new evidence. And then, you modify your priors based on the new evidence you got, and that gives you a new estimate of what's likely the case. For example, suppose that you think psychic functioning is overwhelmingly unlikely to exist. But then, you do an experiment and you get a little bit of evidence suggesting it does exist. (00:46:37)

[Jimmy Akin] : Well, that little bit of evidence that's new isn't going to completely overcome your prior belief that it's overwhelmingly unlikely, but it will modify your prior belief a little bit. So, now you'll think that psychic functioning is still unlikely, but a bit less unlikely than you did before you ran the experiment. One of the bad things about Bayesian statistics is that it's got a big element of subjectivity in it. That's one of the reasons people often use frequentist statistics. (00:47:04)

[Jimmy Akin] : People can have radically different priors on the same subject. And, if you're determined that a conclusion must be true or must be false, then you can just set your priors so that you get the conclusion you want. But conversely, one of the good things about Bayesian statistics is that it forces you to state your prior beliefs about a matter, so other people can look at your priors and see whether they think they're reasonable or not. (00:47:33)

[Dom Bettinelli] : What do you make of the criticism that Bem should have used Bayesian statistics in his paper rather than the frequentist statistics he did use? (00:47:41)

[Jimmy Akin] : I think both approaches have their uses. And Bem acknowledges that, too. In fact, in his original 2011 paper, he acknowledged that there were other statistical techniques that he could have applied, but he said he was only using super simple frequentist statistics because most psychologists weren't familiar with the fancier techniques, and he wanted other psychologists to replicate what he did. Well, it's certainly true that they didn't know a lot about other fancier techniques. Even today, most scientific papers still use frequentist statistics, and many scientists don't really have a detailed grasp of how to use Bayesian statistics. (00:48:23)

[Jimmy Akin] : They're only gaining in popularity right now, and they haven't yet become dominant. And they may never become dominant for some tasks that are just better handled in a frequentist way. But Bem had no problem subjecting his results to a Bayesian analysis. In the same year that he originally published back in 2011, Bem and two colleagues, Jessica Utz and Wesley Johnson, both of whom are statisticians, published a reply to some of Bem's critics. The critics had expressed their priors by saying that the odds against psychic functioning were a number very, very close to zero, and they proposed what was basically the number 100 quintillion to one as the odds against psychic functioning. (00:49:09)

[Jimmy Akin] : More specifically, they said that it was 99 quintillion, 999 quadrillion, 999 trillion, 999 billion, 999 million, 999 thousand, and 999 to one. But if you round that off to 100 quintillion to one, it's a one followed by 20 zeros, which is just another way of saying it doesn't matter what evidence you produce. We're just never going to accept this. They just picked a number out of the air to get the result they wanted. In fact, they admitted that this was an arbitrary number that they were using for purposes of illustration because it was so close to zero. (00:49:52)

[Jimmy Akin] : So I don't consider that a serious argument from a statistical point of view, but it illustrates the advantage I mentioned for Bayesian statistics, which is that it forces you to state your priors so that others can look at them and see if they're reasonable or not. Bim, Utz, and Johnson also reanalyzed Bim's own results in terms of Bayesian statistics using priors based on other known psychological test results rather than just numbers plucked out of the air. And based on this number, they concluded that the chance of psychic functioning not being real, given the data that Bim had collected, was 7.3 times 10 to the negative 5th, which works out to about a 1 in 1400 chance on Bayesian analysis. (00:50:39)

[Jimmy Akin] : They also calculated what is known as a Bayes factor for the two models, that precognition exists and that it doesn't exist based on the data Bim got. A Bayes factor is a concept that's too technical for us to go into here, but we can cover the implications for Bayes factors of different strengths. Basically, if a Bayes factor is between 1 and 3, it's scarcely worth mentioning. If it's between 3 and 10, there is substantial evidence for the hypothesis. (00:51:09)

[Jimmy Akin] : If it's between 10 and 30, there is strong evidence for the hypothesis. If it's between 30 and 100, there is very strong evidence for the hypothesis. And if it's over 100, it's commonly regarded as decisive evidence for the hypothesis. Well, based on the results Bim got in the experiments, he and his two statistician colleagues calculated that they were looking at a Bayes factor of 13,669 in favor of precognition existent. Well, 13,669 is much greater than the 100 that would ordinarily be considered decisive evidence in favor of precognition by that reckoning. (00:51:53)

[Dom Bettinelli] : Then let's look at the third criticism you identified, which concerned the file drawer effect. What is the file drawer effect and how could that be playing a role here? (00:52:04)

[Jimmy Akin] : The file drawer effect is a form of what's known as publication bias in science. The basic idea is that scientific journals like to publish studies that have statistically significant results. They don't like publishing studies that have non-significant results. And so, if a researcher does a study and the results are not significant, he's not likely to get it published. Instead, it'll just go into his file drawer and no one will ever see it. The result is that when you scan the published literature, if you do a literature review, you'll get a distorted picture of what the research actually says. (00:52:41)

[Jimmy Akin] : For example, let's suppose someone has been testing a new drug. We'll call it drug X. And when they run a test, they get statistically significant results that drug X helps people and they publish their results. So, there's now one study in print that says drug X helps people. But let's say four other researchers test drug X and their studies don't show statistically significant evidence that it helps people. They show that it doesn't work any better than a sugar pill. (00:53:13)

[Jimmy Akin] : And since their results aren't statistically significant, they don't get published. Their studies just go into the researchers' file drawers. The truth is that we have one study suggesting that drug X helps people and four studies suggesting it doesn't. So, it looks like that one study is likely an outlier that should not be relied upon and drug X doesn't really help people. However, because of the way publication works, if you do a literature review, all you'll turn up is the one study saying that it does work. And so, you'll conclude that this is likely the case. (00:53:48)

[Jimmy Akin] : The publication bias or file drawer effect has given you an inaccurate picture of what the research says about drug X. Well, that doesn't just happen in medical testing. It happens all over science. Fortunately, parapsychologists are on top of this issue and have been leading the way. Just like parapsychologists have embraced pre-registration more than their colleagues in other fields have, they also hold themselves to higher standards in terms of publication bias. And so, parapsychological journals do publish studies that came back with null results, ones that weren't statistically significant. (00:54:26)

[Jimmy Akin] : For just one example, I'd point you to the July 2023 issue of the Journal for the Society of Psychical Research, which published the results of a study on whether people could tell whether someone was listening to them on the telephone. There has been previous work that has shown that people can tell better than random chance when someone is watching them or not watching them. All those old movies you've seen where someone says, I feel like we're being watched, that's a real phenomenon. And it's been studied, and it turns out people can better than random chance tell when someone's watching them. (00:55:02)

[Jimmy Akin] : So, the researchers wondered, would the same thing happen with sound as with sight? So, could you tell whether someone's listening to you talk on the phone, or could you tell when no one is listening to you talk on the phone? Well, they did their study, but the results of the study were not statistically significant. But the parapsychologist who did the research and the journal still published a research note on it so that the data wouldn't be hidden in file drawers and future researchers could access it. It would be out there as part of the public record. (00:55:38)

[Jimmy Akin] : And this is another example of how parapsychologists tend to hold themselves to higher standards than scientists in other fields. (00:55:45)

[Dom Bettinelli] : So, back in 2011, some of Daryl Bem's critics wondered how many studies on precognition might be sitting in people's file drawers that didn't show precognition existed. How did he respond to this? (00:55:58)

[Jimmy Akin] : Oh, he was straightforward about the subject. In fact, in the 2011 paper, he openly discussed and named several exploratory studies that he had done precisely so they wouldn't be in the file drawer, so to speak. But there are also ways you can control for the file drawer problem. For example, you can estimate well how many unpublished studies there would have to be to wipe out the statistical significance or the effect size found in the published ones. And a 2016 meta-analysis of replications that had been done of Bem's initial experiments did just that. (00:56:35)

[Dom Bettinelli] : And that brings us to the subject of replication, which is always important in science. Do the results of a study hold up when other people try the same experiment? What happened with replication and Daryl Bem's experiments? (00:56:49)

[Jimmy Akin] : Well, replication was something that he very much wanted. In fact, he'd planned for it from the beginning. And to encourage it, he even did something that almost nobody in science does. (00:56:59)

[narator] : As he explained in his 2011 paper, The experiments have been designed to be as simple and transparent as possible, drawing participants from the general population, requiring no instrumentation beyond a desktop computer, taking less than 30 minutes per session, and requiring statistical analyses no more complex than a t-test across sessions or participants. As noted in the introduction, the goal in this research program was to develop well-controlled demonstrations of psi that could be replicated by independent investigators. The research strategy was to design experiments and employ statistical procedures that were as simple, as transparent, and as familiar as possible. (00:57:44)

[Jimmy Akin] : Consequently, Bem made the test packages and the instructions on how to use them freely available. Any researcher who wanted could install the programs on his computer and start replication studies. In fact, because Bem had been sharing his results throughout the 10 years that he worked on the project, some parapsychologists were already doing this even before the 2011 paper came out. When the paper did come out, additional researchers also started replication studies, and some were skeptics who failed to replicate his results. In some cases, this was because they didn't do true replications but made fundamental changes to the methodology, like administering the tests online where you have no guarantee that the participant is giving their undivided attention to the test or doing something else at the same time. (00:58:33)

[Jimmy Akin] : Skeptical members of the press were quick to trumpet that a few of these replications failed, however others confirmed Bem's results. And in 2016, Bem and four other colleagues published a meta-analysis that combined the results of Bem's test with the replication studies. The abstract of that meta-analysis summarizes it this way. (00:58:55)

[narator] : In 2011, one of the authors, Daryl J. Bem, published a report of nine experiments in the Journal of Personality and Social Psychology purporting to demonstrate that an individual's cognitive and affective responses can be influenced by randomly selected stimulus events that do not occur until after his or her responses have already been made and recorded. A generalized variant of the phenomenon traditionally denoted by the term precognition. To encourage replications, all materials needed to conduct them were made available on request. We here report a meta-analysis of 90 experiments from 33 laboratories in 14 countries which yielded an overall effect greater than 6 sigma, p equals 1.2 times 10 to the power of negative 10. (00:59:49)

[Jimmy Akin] : So, using frequented statistics, the p-value of Bem's experiments plus the replications, including ones that failed, was still 1.2 times 10 to the power of negative 10, which means that there would be about a 1 in 10 billion chance that the results were just due to random chance and this is still an effect that is greater than 6 sigma. The authors also did a Bayesian analysis and the abstract continues. (01:00:22)

[narator] : A Bayesian analysis yielded a Bayes factor of 5.1 times 10 to the 9th, greatly exceeding the criterion value of 100 for decisive evidence in support of the experimental hypothesis. (01:00:36)

[Jimmy Akin] : 5.1 times 10 to the power of 9 is basically a Bayes factor of 5 billion, so that well exceeds the threshold of 100 for decisive evidence in support of the hypothesis that precognition exists. But, because these figures include Bem's original test results, you might wonder what would happen if we pull those 9 experiments out. And the abstract continues. (01:01:02)

[narator] : When Darrell J. Bem's original experiments are excluded, p equals 1.1 times 10 to the negative 5 and the Bayes factor value is 3853, again exceeding the criterion for decisive evidence. (01:01:20)

[Jimmy Akin] : So, even without Bem's original experiments and including replications that didn't work, the frequentist p-value is still 1.1 times 10 to the negative 5th, which means a 1 in 100,000 chance that the results were due to just random chance. And the Bayes factor was still almost 3900, well exceeding the value of 100 needed for decisive evidence. The results upon replication, taking into account both the attempts that didn't work and those that did, still strongly supported Bem's conclusion. And the authors of the meta analysis did what are known as fail-safe calculations to see whether other things like the file drawer effect could be responsible. (01:02:05)

[Jimmy Akin] : One of their conclusions was... So, there would have to be another 544 unpublished studies sitting in file drawers in order to bring down the significance of this study to a trivial level. And that's not plausible because there are very few people working in the field of parapsychology. 544 is multiple times the number of people actually doing parapsychological research. So, each parapsychological researcher in the world would have to have done multiple studies on precognition and then not publish them, which is not plausible, especially given the fact that parapsychological journals do publish studies with null results. The authors also did additional fail-safe tests and concluded... (01:03:08)

[Jimmy Akin] : So, basically, none of the proposed criticisms looked like they had merit, and Bem's conclusions stood up to replication. (01:03:32)

[Dom Bettinelli] : And now, what can we say about Bem's Feeling the Future article from the faith perspective? (01:03:38)

[Jimmy Akin] : Well, it hinges on whether precognition exists. And that's something that the Catholic understanding of the Christian faith doesn't have a problem with. There have been doctors of the church who believed in precognition as a natural human ability. For example, St. Gregory the Great held that... (01:03:56)

[narator] : Sometimes it is through a subtle power of their own that souls can foresee the future. At other times, the future is made known to them through revelation shortly before death. But there is evidence to show that sometimes the soul, by its own subtle powers, has knowledge of the future. (01:04:13)

[Jimmy Akin] : Similarly, another doctor of the church, St. Thomas Aquinas, held that precognition is a natural human ability. He referred to it as natural prophecy to distinguish it from the supernatural prophecy that God gives. (01:04:26)

[narator] : And he said... This natural prophecy differs in three ways from that supernatural prophecy about which we are now speaking. It differs first in this, that the prophecy of which we speak gets its foreknowledge of future things immediately from God, although an angel can be an intermediary inasmuch as he acts in virtue of the divine light. But natural prophecy is due to the proper activity of second causes. Second, it differs in this, that natural prophecy extends only to those future events which have determinate causes in nature, but the supernatural prophecy of which we speak relates indifferently to all things. Third, they differ in this, that natural prophecy does not foresee infallibly, but predicts those things which are true for the most part, whereas the prophecy which is a gift of the Holy Spirit foresees the future infallibly. (01:05:22)

[Jimmy Akin] : So, in Aquinas' view, natural prophecy or precognition won't let you know the future infallibly, only God can do that, but it will give you a probabilistic description of the future, which corresponds to the findings of modern parapsychology about precognition, according to which precognition is not infallible. In any event, I don't see any fundamental contradiction between Darrell Bim's findings in his Feel in the Future study and the teachings of the Christian faith. (01:05:50)

[Dom Bettinelli] : And so, Jimmy, that brings us to the bottom line. (01:05:53)

[Jimmy Akin] : What is the bottom line? Darrell Bim rocked the scientific world in 2011 when he published his Feel in the Future article. It reported the results of nine precognition experiments done on ordinary people. The experiments used established psychological principles from other experiments, they just time-reversed them to force precognition into the picture. And eight of the nine experiments showed statistically significant results. They suggested that ordinary people are using precognition all the time to navigate their environments without even being aware of it. And they reported a 6.5 sigma result with odds of billions to one against the results being due to random chance. (01:06:35)

[Jimmy Akin] : Hardcore skeptics naturally freaked out. They made a few points that Bim agreed had merit, like he could have more clearly distinguished between exploratory and confirmatory research in his very first experiments. But both parties agreed that the real proof or disproof would be found in replication efforts. Bim went out of his way to let other scientists make replication efforts. And by 2016, a meta-analysis of more than 90 experiments showed that Bim's results had held up. And we do indeed have strong evidence that precognition exists. (01:07:09)

[Jimmy Akin] : And, Jimmy, what further resources can we offer the listeners and viewers? We'll have links to information both about Darrell Bim and about the Feel in the Future experiments from the SPR's Psi Encyclopedia. We'll have a link to Darrell Bim's original 2011 Feel in the Future paper, a critique from 2011 of Feel in the Future, also the follow-up from 2011 where Bim and two statisticians responded to the criticism, and the 2016 replication article. We'll also have a link to Slate Magazine, which ran a piece on how this all showed what Bim had done had showed how science is broken. (01:07:51)

[Jimmy Akin] : We'll have a piece by medical doctor and parapsychologist Larry Dossi called Why Are Scientists Afraid of Darrell Bim? We'll have an article by Kate Neusenbaum on Bim and the broken science article. And we'll have a link to Rational Wiki and what they have to say about the Feel in the Future study. And I note that even though Rational Wiki tends to be really snarky, their page on this one is unusually subdued. Very good. (01:08:22)

[Dom Bettinelli] : And now we turn to our mysterious feedback brought to us by our feedback coordinator Rob Leonardi. And this feedback comes from our episode 370 on facilitated communications. Our first feedback comes from Alistair Rebello 9013 on YouTube, who writes, I'm a father of a severely nonverbal autistic preschooler boy. This was tough to watch. A real rollercoaster of emotions on the facilitated communication part. I had to stop multiple times. It was brutal to grieve along with the parents as their and my hopes were raised and then crushed. (01:08:55)

[Dom Bettinelli] : By the time I got to the end, I've given up any hopes of my own son being so fortunate as the young man and girl. We simply don't have the time, energy, monetary assistance or hope to try. I can't bring myself to share the video with my wife, even though there is at least the benefit of knowing to avoid facilitated communication. I'll keep watch over that myself. All I have left is my faith in God. Nothing else. (01:09:20)

[Jimmy Akin] : Alistair, I'm so sorry to hear about your situation. And I knew that this episode would be challenging for folks who have to deal with the more severe forms of autism. And I tried to be as sensitive as I could in it. I did think it was important to let people know about the problems with facilitated communication and similar techniques. I wouldn't give up hope, you know, and there's always new stuff being discovered. I'm glad that you have faith in God. And that's, you know, that's the most important thing from an eternal perspective, because our lives are finite and eternity is not. (01:09:55)

[Jimmy Akin] : But I do want to offer you what hope is available, that there may be things out there that could help your son, or there may be things that will be developed that could help your son. And so don't give up hope and keep looking. (01:10:10)

[Dom Bettinelli] : Then Ted Cibaran on YouTube writes, I'm autistic. Communication is key. For me, due to the fact that I'm high functioning, Sesame Street and Cat in the Hat worked. I've had a long love of literacy. Tanner's autism sufferers are the ones who are usually mute. Having said that, there are many autistics who are lower functioning, who still have savant abilities. Using a savant ability to learn to communicate can be helpful if you can discover a savant ability or obsession. My father has aphasia after a stroke. (01:10:44)

[Dom Bettinelli] : We got him a picture board computer. (01:10:46)

[Jimmy Akin] : Thank you, Ted. I'm sorry to hear about your father, but I'm glad you've been able to get him some help. And I'm glad that you also have been able to have a lifelong love of literacy. That's wonderful. And it's an illustration of how other folks on the autism spectrum do not have the same kind of problems as some of the children we discussed in the episode on facilitated communication. (01:11:08)

[Dom Bettinelli] : Kayla J741 on YouTube writes, I work with autistic teens teaching them math and they communicate with spelling boards. The intense help with communication in the beginning is to help them overcome their mobility issues and to learn how to use the speller. Once they overcome those issues, they will spell things and I've seen it without any encouragement or prompting. When I tutored a 20 year old who was nonverbal, I would ask him questions about fractions and ask his helper not to help him. I just wanted him to do the math in his head and give the final answer. (01:11:42)

[Dom Bettinelli] : He consistently got them right again without prompting, but with spelling. At times he did make mistakes, but they were the exact same answers I would assume neurotypical students to make, given a specific misunderstanding of the concept. (01:11:55)

[Jimmy Akin] : Yeah, I'd have to, you know, of course, see the exact procedure that you're describing to make an evaluation of it. But as I pointed out in the episode, there are people who use spelling to communicate who have graduated to a point where they can do it without someone holding the board and without anybody, you know, in physical contact with them. And that really narrows down the possibilities of how any influence might be happening. I mean, you still have to be careful that it's not happening with verbal prompts and so forth. (01:12:30)

[Jimmy Akin] : But there is a range of abilities here. And I'm not at all dismissive that some people who are otherwise nonverbal may be able to benefit from these techniques. (01:12:42)

[Dom Bettinelli] : Rob C sent in an email, says, I thought today's episode was excellent. It's a subject I haven't heard of, but I was fascinated throughout. Before you suggested idiomotor effect and presented the evidence that it likely influenced messages from facilitated spelling, I had thought that the clever Hans effect might be the cause, where the speller is trying to please the facilitator by reading cues given from the facilitator rather than generating new information. I would think that even when the facilitator is not in physical contact with the speller or the board, like in spelling to communicate, clever Hans effect could still be relevant. (01:13:18)

[Dom Bettinelli] : And that constant verbal prompting spelling to communicate adds another medium for cues to be given, detected and used to influence the content of typewritten messages. I think you're right that double blinding the test would detect if clever Hans effect were allowing information to pass from the facilitator to speller. Thanks for another great episode. (01:13:37)

[Jimmy Akin] : Thanks very much, Rob. And for folks who may not be aware, I'll explain what the clever Hans effect is. Hans was the name of a horse in the 19th century who was famed for apparently being able to do math problems. So you could go to clever Hans. He was called clever Hans because he could do the math problems. And so you'd go to the horse and say, you know, what's seven plus five? And he would then beat with his hoof 12 times. (01:14:05)

[Jimmy Akin] : And this was regarded as a remarkable illustration of animal intelligence. But there were skeptics and the skeptics observed carefully what clever Hans was doing with his handler, because he had an owner who was there who would be holding his reins or whatever when he was doing one of these math problems. And what they noticed was that the handler would be nodding his head along with the hoof beats. And so if you say what's five plus seven, the the handler would be nodding along and he'd get up to 12 and then he'd pause and clever Hans would then realize, oh, now is when I need to stop. (01:14:49)

[Jimmy Akin] : So instead of actually doing a math problem, what clever Hans was doing was just beating his hoof until he got the cue from his handler to stop. And the handler was totally innocent in this. It was just like in the facilitated communication procedures that we talked about. In fact, I could have cited clever Hans in that episode, but it was basically the same thing. It wasn't true authorship on the part of the horse. The horse was just responded to the cues of his handler. (01:15:18)

[Dom Bettinelli] : All right. And now it's time to hear from you. What are your theories about Daryl Bem's Feeling the Future experiments and what they suggested about precognition? You can let us know by visiting sqpn.com or the Jimmy Yakin's Mysterious World Facebook page, sending us an email to feedback at mysterious.fm, sending a tweet to at mys underscore world, visiting the Starquest Discord community at sqpn.com slash discord, or calling our mysterious feedback line at 619-738-4515. That's 619-738-4515. (01:15:58)

[Jimmy Akin] : And I want to say a special word of thanks to Oasis Studio 7 for the video and animation work in this episode. Check it out by going to my YouTube channel, youtube.com slash Jimmy Akin. I have loads of videos there for you, mysterious world and all the other stuff I do on YouTube. While you're there, be sure and like, comment, share the videos and subscribe and hit the bell notification so you always get notified whenever I have a new video. (01:16:23)

[Dom Bettinelli] : I also want to thank Rob Mady for his voiceover work on this episode. So, Jimmy, what are we going to be talking about next time? (01:16:30)

[Jimmy Akin] : Next week, we're going to southwest Oklahoma to a town called Honobi, and we're going to be discussing Bigfoot. Southwest Oklahoma is a hotspot for Bigfoot reports. And back in the year 2000, there was an event known as the Honobi Bigfoot Siege. (01:16:49)

[Dom Bettinelli] : Interesting. Folks, be sure to share the podcast with your friends, write a review with Apple Podcasts or wherever you get your podcasts from. That helps us grow our community and reach more listeners. Be sure to get your very own Mysterious World t-shirt, mug and more in our merchandise shop at sqpn.com slash merch. You'll find links to Jimmy's resources from our discussion on our show notes at mysterious.fm slash 377. And remember to help us continue to produce the podcast, please visit sqpn.com slash give. (01:17:22)

[Dom Bettinelli] : Jimmy Akin's Mysterious World is also brought to you in part by Rosary Army, featuring award-winning Catholic podcasts, rosary resources, videos, and the School of Mary online community, prayer and learning platform. Learn how to make them, pray them, and give them away while growing in your faith at rosaryarmy.com and schoolofmary.com. And by The Grady Group, a Catholic company bringing financial clarity to their clients across the United States, using safe money options to produce reasonable rates of return for their clients. Learn more at gradygroupinc.com. (01:18:01)

[Dom Bettinelli] : Until next time, Jimmy Akin, thank you for exploring with us our mysterious world. (01:18:08)

[Jimmy Akin] : Thanks, Dom. (01:18:08)

[Dom Bettinelli] : And once again, I'm Dom Bettinelli, thank you for listening to Jimmy Akin's Mysterious World on StarQuest. (01:18:16)

(2025-10-29)

前置き​

要旨​

目次​

ダリル・ベムの予知実験と科学的論争に関するブリーフィング​

エグゼクティブ・サマリー​

1. 序論：ダリル・ベムと『Feeling the Future』論文​

主な発見：​

2. 9つの予知実験の概要​

3. 科学界の反応と主要な批判​

4. 批判への反論と再現研究の結果​

ベム側の応答​

再現性の検証：2016年のメタ分析​

5. 信仰の観点からの考察​

6. 結論​

「未来を感じる」— 心理学者ダリル・ベムの驚くべき9つの予知実験を徹底解説​

導入：科学界を揺るがした「予知」の実験​

1. ベム実験の核心的アイデア：時間を逆転させる​

2. 9つの予知実験：その巧妙な手法と仮説​

グループA：本能的な「引き寄せ」と「回避」の実験​

実験1：魅力的な刺激の予知​

実験2：不快な刺激の予知的回避​

グループB：「未来からのヒント」で反応は変わるか？ (プライミング実験)​

実験3 & 4：逆行性プライミング​

実験5, 6, 7：逆行性ハビチュエーションと退屈​

3. 結果：偶然では説明できない驚異的なデータ​

4. 科学界の衝撃と論争、そして「再現」の重要性​

5. まとめ：私たちは皆、無意識に未来を感じているのかもしれない​

科学を揺るがした「予知実験」から学ぶ、科学的思考のレッスン​

導入：科学の世界を震撼させた論文​

1. 「未来を感じる」実験とは？​

2. 「偶然」と「本物」を見分ける科学の物差し​

2.1. p値：偶然である確率を示す指標​

2.2. シグマ（σ）：結果の確からしさのレベル​

3. 科学的探究の核心：反論から学ぶ重要概念​

3.1. 再現性：科学の再現ドラマ​

3.2. ファイルドロワー効果：公表されなかった研究たちの叫び​

3.3. ベイズ統計 vs. 頻度論的統計：信念を更新する考え方​

3.4. 探索的研究と検証的研究：仮説探しか、仮説検証か​

4. まとめ：ベム論争が科学に教えてくれたこと​

予知実験が暴いた科学の脆さ：ダリル・ベム論文が心理学に残した教訓​

導入：一流学術誌に掲載された「ありえない」論文​

1. 「未来を感じる」実験の巧妙な設計と衝撃的な結果​

2. 懐疑論の噴出：科学的方法論への全面的な問い直し​

2.1. P値への過信と統計的有意性の罠​

2.2. 探索的研究と検証的研究の混同​

2.3. ベイズ統計学の視点​

2.4. 「ファイルドロワー問題」という名の出版バイアス​

3. 再現性の検証：論争の最終的な審判​

4. 論争の遺産：科学の自己修正プロセスへの貢献​

結論：科学を進歩させる「健全な危機」​

研究論文レビュー：ダリル・ベムの『Feeling the Future』が提起した予知能力の科学的証拠とその論争​

序論​

1. 研究の背景と方法論的革新​

2. 9つの予知実験の概説​

3. 主要な実験結果と統計的意義​

4. 科学界の反応と主要な批判​

5. 反論と追試（メタ分析）の結果​

6. 考察：心理学における「ベム論争」の意義​

7. 結論​

実験の概要と背景​

実験の背景と文脈​

実験の概要と方法論​

予知実験の具体例​

実験の設計の核心​

9つの予知実験の具体例​

1. 実験1：好ましい刺激の予知的な検出（Precognitive Detection of Erotic Stimuli）​

2. 実験2：否定的な刺激の予知的な回避（Precognitive Avoidance of Negative Stimuli）​

3. 実験3：遡及的プライミング（Retroactive Priming）​

4. 実験4：遡及的プライミング（複製と改良）​

5. 実験5：遡及的馴化（Retroactive Habituation）​

6. 実験6：遡及的馴化（セクシー画像の追加）​

7. 実験7：遡及的な飽きの誘導（Retroactive Induction of Boredom）​

8. 実験8：想起の遡及的促進（Retroactive Facilitation of Recall）​

9. 実験9：想起の遡及的促進（複製と追加練習）​

実験結果の全体像​

実験結果と統計的意義​

実験結果の概要​

統計的意義​

1. 個々の実験の有意性​

2. 全実験の統合結果​

前置き

要旨

目次