383 94 1MB
English Pages xviii+101 [120] Year 2015
Automatic Detection of Verbal Deception
Synthesis Lectures on Human Language Technologies d⁸tor
Graeme Hirst, University of Toronto
Synthesis Lectures on Human Language Technologies ⁸s ⁴d⁸t⁴d by ra⁴m⁴ ⁸rst o⁵ t⁷⁴ Un⁸v⁴rs⁸ty o⁵ Toronto. ⁴ s⁴r⁸⁴s cons⁸sts o⁵ 0- to 0-pa⁶⁴ mono⁶rap⁷s on top⁸cs r⁴lat⁸n⁶ to natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶, computat⁸onal l⁸n⁶u⁸st⁸cs, ⁸n⁵ormat⁸on r⁴tr⁸⁴val, and spok⁴n lan⁶ua⁶⁴ und⁴rstand⁸n⁶. mp⁷as⁸s ⁸s on ⁸mportant n⁴w t⁴c⁷n⁸qu⁴s, on n⁴w appl⁸cat⁸ons, and on top⁸cs t⁷at comb⁸n⁴ two or mor⁴ LT subfi⁴lds.
Automat⁸c ⁴t⁴ct⁸on o⁵ V⁴rbal ⁴c⁴pt⁸on
⁸l⁴⁴n ⁸tzpatr⁸ck, Joan ac⁷⁴nko, and Tommaso ornac⁸ar⁸ 0
S⁴mant⁸c S⁸m⁸lar⁸ty ⁵rom Natural Lan⁶ua⁶⁴ and Ontolo⁶y Analys⁸s Sébast⁸⁴n ar⁸sp⁴, Sylv⁸⁴ Ranw⁴z, St⁴⁵an Janaq⁸, and Jacky Montma⁸n 0
L⁴arn⁸n⁶ to Rank ⁵or n⁵ormat⁸on R⁴tr⁸⁴val and Natural Lan⁶ua⁶⁴ Proc⁴ss⁸n⁶, S⁴cond d⁸t⁸on an⁶ L⁸ 0
Ontolo⁶y-as⁴d nt⁴rpr⁴tat⁸on o⁵ Natural Lan⁶ua⁶⁴ P⁷⁸l⁸pp ⁸m⁸ano, ⁷r⁸st⁸na Un⁶⁴r, and Jo⁷n Mcra⁴ 0
Automat⁴d rammat⁸cal rror ⁴t⁴ct⁸on ⁵or Lan⁶ua⁶⁴ L⁴arn⁴rs, S⁴cond d⁸t⁸on laud⁸a L⁴acock, Mart⁸n ⁷odorow, M⁸c⁷a⁴l amon, and Jo⁴l T⁴tr⁴ault 0
W⁴b orpus onstruct⁸on
Roland Sc⁷ä⁵⁴r and ⁴l⁸x ⁸ld⁷au⁴r 0
R⁴co⁶n⁸z⁸n⁶ T⁴xtual nta⁸lm⁴nt Mod⁴ls and Appl⁸cat⁸ons
do a⁶an, an Rot⁷, Mark Sammons, and ab⁸o Mass⁸mo Zanzotto 0
iv
L⁸n⁶u⁸st⁸c undam⁴ntals ⁵or Natural Lan⁶ua⁶⁴ Proc⁴ss⁸n⁶ 00 ss⁴nt⁸als ⁵rom Morp⁷olo⁶y and Syntax m⁸ly M. ⁴nd⁴r 0
S⁴m⁸-Sup⁴rv⁸s⁴d L⁴arn⁸n⁶ and oma⁸n Adaptat⁸on ⁸n Natural Lan⁶ua⁶⁴ Proc⁴ss⁸n⁶ And⁴rs Sø⁶aard 0
S⁴mant⁸c R⁴lat⁸ons ⁴tw⁴⁴n Nom⁸nals
V⁸v⁸ Nastas⁴, Pr⁴slav Nakov, ⁸armu⁸d Ó Séa⁶⁷d⁷a, and Stan Szpakow⁸cz 0
omputat⁸onal Mod⁴l⁸n⁶ o⁵ Narrat⁸v⁴ nd⁴r⁹⁴⁴t Man⁸ 0
Natural Lan⁶ua⁶⁴ Proc⁴ss⁸n⁶ ⁵or ⁸stor⁸cal T⁴xts M⁸c⁷a⁴l P⁸otrowsk⁸ 0
S⁴nt⁸m⁴nt Analys⁸s and Op⁸n⁸on M⁸n⁸n⁶ ⁸n⁶ L⁸u 0
⁸scours⁴ Proc⁴ss⁸n⁶ Man⁵r⁴d St⁴d⁴ 0
⁸t⁴xt Al⁸⁶nm⁴nt Jör⁶ T⁸⁴d⁴mann 0
L⁸n⁶u⁸st⁸c Structur⁴ Pr⁴d⁸ct⁸on Noa⁷ A. Sm⁸t⁷ 0
L⁴arn⁸n⁶ to Rank ⁵or n⁵ormat⁸on R⁴tr⁸⁴val and Natural Lan⁶ua⁶⁴ Proc⁴ss⁸n⁶ an⁶ L⁸ 0
omputat⁸onal Mod⁴l⁸n⁶ o⁵ uman Lan⁶ua⁶⁴ Acqu⁸s⁸t⁸on A⁵ra Al⁸s⁷a⁷⁸ 00
ntroduct⁸on to Arab⁸c Natural Lan⁶ua⁶⁴ Proc⁴ss⁸n⁶ N⁸zar Y. abas⁷ 00
ross-Lan⁶ua⁶⁴ n⁵ormat⁸on R⁴tr⁸⁴val J⁸an-Yun N⁸⁴ 00
Automat⁴d rammat⁸cal rror ⁴t⁴ct⁸on ⁵or Lan⁶ua⁶⁴ L⁴arn⁴rs
laud⁸a L⁴acock, Mart⁸n ⁷odorow, M⁸c⁷a⁴l amon, and Jo⁴l T⁴tr⁴ault 00
ata-nt⁴ns⁸v⁴ T⁴xt Proc⁴ss⁸n⁶ w⁸t⁷ MapR⁴duc⁴ J⁸mmy L⁸n and ⁷r⁸s y⁴r 00
S⁴mant⁸c Rol⁴ Lab⁴l⁸n⁶
Mart⁷a Palm⁴r, an⁸⁴l ⁸ld⁴a, and N⁸anw⁴n Xu⁴ 00
Spok⁴n ⁸alo⁶u⁴ Syst⁴ms
Kr⁸st⁸⁸na Jok⁸n⁴n and M⁸c⁷a⁴l McT⁴ar 00
ntroduct⁸on to ⁷⁸n⁴s⁴ Natural Lan⁶ua⁶⁴ Proc⁴ss⁸n⁶
Kam-a⁸ Won⁶, W⁴n⁹⁸⁴ L⁸, Ru⁸⁵⁴n⁶ Xu, and Z⁷⁴n⁶-s⁷⁴n⁶ Z⁷an⁶ 00
ntroduct⁸on to L⁸n⁶u⁸st⁸c Annotat⁸on and T⁴xt Analyt⁸cs ra⁷am W⁸lcock 00
⁴p⁴nd⁴ncy Pars⁸n⁶
Sandra Kübl⁴r, Ryan Mconald, and Joak⁸m N⁸vr⁴ 00
Stat⁸st⁸cal Lan⁶ua⁶⁴ Mod⁴ls ⁵or n⁵ormat⁸on R⁴tr⁸⁴val ⁷⁴n⁶X⁸an⁶ Z⁷a⁸ 00
v
opyr⁸⁶⁷t © 0 by Mor⁶an & laypool All r⁸⁶⁷ts r⁴s⁴rv⁴d. No part o⁵ t⁷⁸s publ⁸cat⁸on may b⁴ r⁴produc⁴d, stor⁴d ⁸n a r⁴tr⁸⁴val syst⁴m, or transm⁸tt⁴d ⁸n any ⁵orm or by any m⁴ans—⁴l⁴ctron⁸c, m⁴c⁷an⁸cal, p⁷otocopy, r⁴cord⁸n⁶, or any ot⁷⁴r ⁴xc⁴pt ⁵or br⁸⁴⁵ quotat⁸ons ⁸n pr⁸nt⁴d r⁴v⁸⁴ws, w⁸t⁷out t⁷⁴ pr⁸or p⁴rm⁸ss⁸on o⁵ t⁷⁴ publ⁸s⁷⁴r. Automat⁸c ⁴t⁴ct⁸on o⁵ V⁴rbal ⁴c⁴pt⁸on ⁸l⁴⁴n ⁸tzpatr⁸ck, Joan ac⁷⁴nko, and Tommaso ornac⁸ar⁸ www.morganclaypool.com
SN 0 SN 0
pap⁴rback ⁴book
O 0.00/S00V0Y00LT0 A Publ⁸cat⁸on ⁸n t⁷⁴ Mor⁶an & laypool Publ⁸s⁷⁴rs s⁴r⁸⁴s SYNTHESIS LECTURES ON HUMAN LANGUAGE TECHNOLOGIES L⁴ctur⁴ # S⁴r⁸⁴s d⁸tor ra⁴m⁴ ⁸rst, University of Toronto S⁴r⁸⁴s SSN Pr⁸nt -00 l⁴ctron⁸c -0
Automatic Detection of Verbal Deception ⁸l⁴⁴n ⁸tzpatr⁸ck
Montcla⁸r Stat⁴ Un⁸v⁴rs⁸ty
Joan ac⁷⁴nko L⁸n⁶u⁸st⁴c⁷ LL
Tommaso ornac⁸ar⁸ tal⁸an Nat⁸onal Pol⁸c⁴
SYNTHESIS LECTURES ON HUMAN LANGUAGE TECHNOLOGIES #29
M &C
Morgan
& cLaypool publishers
ABSTRACT
⁴ att⁴mpt to spot d⁴c⁴pt⁸on t⁷rou⁶⁷ ⁸ts corr⁴lat⁴s ⁸n ⁷uman b⁴⁷av⁸or ⁷as a lon⁶ ⁷⁸story. Unt⁸l r⁴c⁴ntly, t⁷⁴s⁴ ⁴fforts ⁷av⁴ conc⁴ntrat⁴d on ⁸d⁴nt⁸⁵y⁸n⁶ ⁸nd⁸v⁸dual “cu⁴s” t⁷at m⁸⁶⁷t occur w⁸t⁷ d⁴c⁴pt⁸on. ow⁴v⁴r, w⁸t⁷ t⁷⁴ adv⁴nt o⁵ computat⁸onal m⁴ans to analyz⁴ lan⁶ua⁶⁴ and ot⁷⁴r ⁷uman b⁴⁷av⁸or, w⁴ now ⁷av⁴ t⁷⁴ ab⁸l⁸ty to d⁴t⁴rm⁸n⁴ w⁷⁴t⁷⁴r t⁷⁴r⁴ ar⁴ cons⁸st⁴nt clust⁴rs o⁵ d⁸ff⁴r⁴nc⁴s ⁸n b⁴⁷av⁸or t⁷at m⁸⁶⁷t b⁴ assoc⁸at⁴d w⁸t⁷ a ⁵als⁴ stat⁴m⁴nt as oppos⁴d to a tru⁴ on⁴. W⁷⁸l⁴ ⁸ts ⁵ocus ⁸s on v⁴rbal b⁴⁷av⁸or, t⁷⁸s book d⁴scr⁸b⁴s a ran⁶⁴ o⁵ b⁴⁷av⁸ors—p⁷ys⁸olo⁶⁸cal, ⁶⁴stural as w⁴ll as v⁴rbal—t⁷at ⁷av⁴ b⁴⁴n propos⁴d as ⁸nd⁸cators o⁵ d⁴c⁴pt⁸on. An ov⁴rv⁸⁴w o⁵ t⁷⁴ pr⁸mary psyc⁷olo⁶⁸cal and co⁶n⁸t⁸v⁴ t⁷⁴or⁸⁴s t⁷at ⁷av⁴ b⁴⁴n off⁴r⁴d as ⁴xplanat⁸ons o⁵ d⁴c⁴pt⁸v⁴ b⁴⁷av⁸ors ⁶⁸v⁴s cont⁴xt ⁵or t⁷⁴ d⁴scr⁸pt⁸on o⁵ sp⁴c⁸fic b⁴⁷av⁸ors. ⁴ book also addr⁴ss⁴s t⁷⁴ d⁸ff⁴r⁴nc⁴s b⁴tw⁴⁴n data coll⁴ct⁴d ⁸n a laboratory and “r⁴al-world” data w⁸t⁷ r⁴sp⁴ct to t⁷⁴ ⁴mot⁸onal and co⁶n⁸t⁸v⁴ stat⁴ o⁵ t⁷⁴ l⁸ar. t d⁸scuss⁴s sourc⁴s o⁵ r⁴al-world data and probl⁴mat⁸c ⁸ssu⁴s ⁸n ⁸ts coll⁴ct⁸on and ⁸d⁴nt⁸fi⁴s t⁷⁴ pr⁸mary ar⁴as ⁸n w⁷⁸c⁷ appl⁸⁴d stud⁸⁴s bas⁴d on r⁴al-world data ar⁴ cr⁸t⁸cal, ⁸nclud⁸n⁶ pol⁸c⁴, s⁴cur⁸ty, bord⁴r cross⁸n⁶, customs, and asylum ⁸nt⁴rv⁸⁴ws con⁶r⁴ss⁸onal ⁷⁴ar⁸n⁶s financ⁸al r⁴port⁸n⁶ l⁴⁶al d⁴pos⁸t⁸ons ⁷uman r⁴sourc⁴ ⁴valuat⁸on pr⁴datory commun⁸cat⁸ons t⁷at ⁸nclud⁴ nt⁴rn⁴t scams, ⁸d⁴nt⁸ty t⁷⁴⁵t, and ⁵raud and ⁵als⁴ product r⁴v⁸⁴ws. av⁸n⁶ ⁴stabl⁸s⁷⁴d t⁷⁴ back⁶round, t⁷⁸s book conc⁴ntrat⁴s on computat⁸onal analys⁴s o⁵ d⁴c⁴pt⁸v⁴ v⁴rbal b⁴⁷av⁸or t⁷at ⁷av⁴ ⁴nabl⁴d t⁷⁴ fi⁴ld o⁵ d⁴c⁴pt⁸on stud⁸⁴s to mov⁴ ⁵rom ⁸nd⁸v⁸dual cu⁴s to ov⁴rall d⁸ff⁴r⁴nc⁴s ⁸n b⁴⁷av⁸or. ⁴ computat⁸onal work ⁸s or⁶an⁸z⁴d around t⁷⁴ ⁵⁴atur⁴s us⁴d ⁵or class⁸ficat⁸on ⁵rom n-⁶ram t⁷rou⁶⁷ syntax to pr⁴d⁸cat⁴-ar⁶um⁴nt and r⁷⁴tor⁸cal structur⁴. ⁴ book conclud⁴s w⁸t⁷ a s⁴t o⁵ op⁴n qu⁴st⁸ons t⁷at t⁷⁴ computat⁸onal work ⁷as ⁶⁴n⁴rat⁴d.
KEYWORDS
cr⁴d⁸b⁸l⁸ty ass⁴ssm⁴nt, d⁴c⁴pt⁸on d⁴t⁴ct⁸on, ⁵actual lan⁶ua⁶⁴, ⁵or⁴ns⁸c l⁸n⁶u⁸st⁸cs, ⁶old-standard data, ⁶round trut⁷, ⁷⁸⁶⁷-stak⁴s sc⁴nar⁸os, ⁸ma⁶⁸nat⁸v⁴ lan⁶ua⁶⁴, r⁴alworld data, stylom⁴try, t⁴xt class⁸ficat⁸on
ix
Eileen Fitzpatrick: To my husband, Ralph Grishman Joan Bachenko: In memory of my mother, Claire Joan Baumgartner Tommaso Fornaciari: In memory of my father, Alberto Fornaciari
xi
Contents Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
. .
.
2
ntroduct⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . V⁴rbal u⁴s to ⁴c⁴pt⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. L⁸n⁶u⁸st⁸c ⁴atur⁴s Us⁴d ⁸n d⁴nt⁸⁵y⁸n⁶ ⁴c⁴pt⁸on . . . . . . . . . . . . . . . . . . .. ff⁴ct⁸v⁴n⁴ss o⁵ L⁸n⁶u⁸st⁸c u⁴s to ⁴c⁴pt⁸on . . . . . . . . . . . . . . . . . . . . . . .. V⁴rbal u⁴s to round Trut⁷ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . W⁷at’s A⁷⁴ad . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
e Background Literature on Behavioral Cues to Deception . . . . . . . . . . . . . . . 7
. .
.
.
.
ntroduct⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Nonv⁴rbal u⁴s to ⁴c⁴pt⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Poly⁶rap⁷y . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Vo⁸c⁴ Analys⁸s VSA and LVA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 .. ⁴rmo⁶rap⁷y . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. ra⁸n Scan and MR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Vocal u⁴s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. ody and ac⁸al Mov⁴m⁴nts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ⁴ Psyc⁷olo⁶y L⁸t⁴ratur⁴ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. ⁴Paulo ⁴t al.’s Study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Vr⁸⁹’s Stud⁸⁴s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ⁴ or⁴ns⁸c L⁸t⁴ratur⁴ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Stat⁴m⁴nt Analys⁸s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Stat⁴m⁴nt Val⁸d⁸ty Analys⁸s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. R⁴al⁸ty Mon⁸tor⁸n⁶ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . or⁴ns⁸c mpl⁴m⁴ntat⁸ons o⁵ t⁷⁴ L⁸t⁴ratur⁴ . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. SAN as an nv⁴st⁸⁶at⁸v⁴ Tool and Tra⁸n⁸n⁶ Pro⁶ram . . . . . . . . . . . . . . .. valuat⁸ons o⁵ SAN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
xii
3
Data Sources . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 . .
.
4
e Language of Deception: Computational Approaches . . . . . . . . . . . . . . . . . . 53 .
.
.
.
5
ntroduct⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . stabl⁸s⁷⁸n⁶ round Trut⁷ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. or⁴ns⁸c ata Sourc⁴s Spok⁴n and Wr⁸tt⁴n . . . . . . . . . . . . . . . . . . . . . . .. ⁸nanc⁸al R⁴ports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Mass M⁴d⁸a ommun⁸cat⁸ons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . R⁸sks w⁸t⁷ round Trut⁷ Sourc⁴s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. L⁴⁶al and or⁴ns⁸c nt⁴rv⁸⁴ws and Stat⁴m⁴nts . . . . . . . . . . . . . . . . . . . . .. ⁸nanc⁸al R⁴ports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 .. Mass M⁴d⁸a ommun⁸cat⁸ons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . omputat⁸onal Approac⁷⁴s to V⁴rbal ⁴c⁴pt⁸on . . . . . . . . . . . . . . . . . . . . . . . . .. stabl⁸s⁷⁸n⁶ omparat⁸v⁴ M⁴asur⁴s o⁵ Syst⁴m P⁴r⁵ormanc⁴ . . . . . . . . . .. lass⁸ficat⁸on and Rank⁸n⁶ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Tra⁸n⁸n⁶ and T⁴st⁸n⁶ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Syst⁴m valuat⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Pr⁴pp⁸n⁶ t⁷⁴ ata . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ons⁸d⁴rat⁸ons Sp⁴c⁸fic to ⁴c⁴pt⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. ata Typ⁴s Am⁴nabl⁴ to ⁴c⁴pt⁸on R⁴s⁴arc⁷ . . . . . . . . . . . . . . . . . . . . . .. Un⁸t o⁵ Analys⁸s ⁴ L⁸ar or t⁷⁴ L⁸⁴ . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. L⁸⁴s o⁵ Om⁸ss⁸on and omm⁸ss⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. L⁴v⁴l o⁵ ata Us⁴d ⁵or Mod⁴l⁸n⁶ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Tra⁸n⁸n⁶ ata and round Trut⁷ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ⁴ urr⁴nt Syst⁴ms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. ⁷aract⁴rs and n-⁶rams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 .. ⁴atur⁴s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Stud⁸⁴s Look⁸n⁶ abov⁴ t⁷⁴ L⁴x⁸cal L⁴v⁴l . . . . . . . . . . . . . . . . . . . . . . . . . onclus⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0
Open Questions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 . . . . .
ntroduct⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . mpact o⁵ ont⁴xtual actors on ⁴c⁴pt⁸v⁴ Narrat⁸v⁴ . . . . . . . . . . . . . . . . . . . . ⁴c⁴pt⁸v⁴ Lan⁶ua⁶⁴ and ma⁶⁸nat⁸v⁴ Lan⁶ua⁶⁴ . . . . . . . . . . . . . . . . . . . . . . . . . M⁴asur⁸n⁶ t⁷⁴ ⁸stanc⁴ b⁴tw⁴⁴n ⁸v⁴rs⁴ Narrat⁸v⁴s . . . . . . . . . . . . . . . . . . . . . round Trut⁷ Annotat⁸on ⁴ S⁴arc⁷ ⁵or old-Standard ata . . . . . . . . . . . .
. . . .
xiii
A ommon ata S⁴t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . u⁴ lust⁴r⁸n⁶ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . orr⁴lat⁸on o⁵ V⁴rbal w⁸t⁷ Nonv⁴rbal u⁴s . . . . . . . . . . . . . . . . . . . . . . . . . . . . onclus⁸on . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 Authors’ Biographies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
xv
Preface ⁴r⁴ ar⁴ many v⁴nu⁴s w⁷⁴r⁴ t⁷⁴ ab⁸l⁸ty to spot t⁷⁴ l⁸⁴ ⁸s an ⁸mportant, o⁵t⁴n cr⁸t⁸cal, sk⁸ll. t ⁸s n⁴c⁴ssary ⁸n pol⁸c⁴, s⁴cur⁸ty, bord⁴r cross⁸n⁶, customs, and asylum ⁸nt⁴rv⁸⁴ws ⁸n con⁶r⁴ss⁸onal ⁷⁴ar⁸n⁶s ⁸n financ⁸al r⁴port⁸n⁶ ⁸n l⁴⁶al d⁴pos⁸t⁸ons ⁸n ⁷uman r⁴sourc⁴ ⁴valuat⁸on ⁸n pr⁴datory commun⁸cat⁸ons, ⁸nclud⁸n⁶ nt⁴rn⁴t scams, ⁸d⁴nt⁸ty t⁷⁴⁵t, and ⁵raud and ⁸n ⁵als⁴ adv⁴rt⁸s⁸n⁶. ⁸scov⁴r⁸n⁶ l⁸⁴s can t⁷wart s⁴r⁸ous ⁸mm⁴d⁸at⁴ t⁷r⁴ats, prov⁸d⁴ product⁸v⁴ d⁸r⁴ct⁸ons ⁸n t⁷⁴ ⁸nv⁴st⁸⁶at⁸on o⁵ past ⁴v⁴nts, and ass⁸st ⁸n t⁷⁴ accurat⁴ pr⁴d⁸ct⁸on o⁵ l⁸k⁴ly ⁵utur⁴ ⁴v⁴nts. or muc⁷ o⁵ t⁷⁴ 0t⁷ c⁴ntury, t⁷⁴ fi⁴lds o⁵ psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴ ⁷av⁴ stud⁸⁴d t⁷⁴ b⁴⁷av⁸ors t⁷at m⁸⁶⁷t b⁴ assoc⁸at⁴d, d⁸r⁴ctly or ⁸nd⁸r⁴ctly, w⁸t⁷ d⁴c⁴pt⁸on. r⁴⁴ typ⁴s o⁵ b⁴⁷av⁸or ⁷av⁴ b⁴⁴n ⁴xam⁸n⁴d p⁷ys⁸olo⁶⁸cal b⁴⁷av⁸or vocal b⁴⁷av⁸or, ⁸nclud⁸n⁶ prosod⁸c ⁵⁴atur⁴s and v⁴rbal b⁴⁷av⁸or, ⁸nclud⁸n⁶ t⁷⁴ words and structur⁴s t⁷at m⁸⁶⁷t corr⁴lat⁴ w⁸t⁷ d⁴c⁴pt⁸on. ⁴ study o⁵ v⁴rbal b⁴⁷av⁸or ⁸n d⁴c⁴pt⁸on ⁸s r⁴lat⁸v⁴ly n⁴w and t⁷⁴ att⁴nt⁸on t⁷at natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶ ⁷as pa⁸d to d⁸scr⁸m⁸nat⁸n⁶ tru⁴ ⁵rom ⁵als⁴ cla⁸ms ⁸s ⁴v⁴n n⁴w⁴r, w⁸t⁷ most o⁵ t⁷⁴ work don⁴ ⁸n t⁷⁴ last 0 y⁴ars as class⁸ficat⁸on t⁴c⁷n⁸qu⁴s ⁷av⁴ ⁸mprov⁴d. Now ⁸s a ⁶ood t⁸m⁴ to r⁴v⁸⁴w t⁷⁴ pr⁸or l⁸t⁴ratur⁴ on d⁴c⁴pt⁸on and cons⁸d⁴r t⁷⁴ NLP approac⁷⁴s t⁷at ⁷av⁴ b⁴⁴n tr⁸⁴d. Know⁸n⁶ t⁷⁴ ⁵oundat⁸ons and tr⁴nds ⁸n work on d⁴c⁴pt⁸on, bot⁷ t⁷⁴or⁴t⁸cal and appl⁸⁴d, w⁸ll ⁴nabl⁴ us to mov⁴ ⁵orward product⁸v⁴ly. S⁴v⁴ral ar⁴as o⁵ NLP ar⁴ r⁸p⁴ to addr⁴ss t⁷⁴ vocal and v⁴rbal ⁵⁴atur⁴s t⁷at m⁸⁶⁷t b⁴ assoc⁸at⁴d w⁸t⁷ d⁴c⁴pt⁸on and n⁴w approac⁷⁴s may w⁴ll comb⁸n⁴ ⁸n⁵ormat⁸on ⁵rom t⁷⁴s⁴ w⁸t⁷ t⁷⁴ ⁵ac⁸al and body mov⁴m⁴nts assoc⁸at⁴d w⁸t⁷ d⁴c⁴pt⁸on. A spat⁴ o⁵ r⁴c⁴nt NLP pap⁴rs on t⁷⁴ class⁸ficat⁸on o⁵ narrat⁸v⁴s as trut⁷⁵ul or d⁴c⁴pt⁸v⁴ su⁶⁶⁴sts t⁷at t⁷⁴ fi⁴ld ⁸s r⁴ady to op⁴n up to t⁷⁸s prom⁸s⁸n⁶ ar⁴a. ⁴ ⁶⁴n⁴s⁸s o⁵ t⁷⁸s t⁴xt was a works⁷op on d⁴c⁴pt⁸on d⁴t⁴ct⁸on t⁷at took plac⁴ as part o⁵ t⁷⁴ urop⁴an M⁴⁴t⁸n⁶ o⁵ t⁷⁴ Assoc⁸at⁸on ⁵or omputat⁸onal L⁸n⁶u⁸st⁸cs ⁸n Av⁸⁶non ⁸n t⁷⁴ spr⁸n⁶ o⁵ 0. ⁴ works⁷op brou⁶⁷t to⁶⁴t⁷⁴r coll⁴a⁶u⁴s and off⁴r⁴d pr⁴s⁴ntat⁸ons on work t⁷at ran⁶⁴d ⁵rom annotat⁸on tools and corpus bu⁸ld⁸n⁶ ⁵or d⁴c⁴pt⁸on to cross-l⁸n⁶u⁸st⁸c class⁸ficat⁸on o⁵ d⁴c⁴pt⁸v⁴ narrat⁸v⁴. t also ⁶av⁴ t⁷⁴ aut⁷ors o⁵ t⁷⁸s t⁴xt an opportun⁸ty to work to⁶⁴t⁷⁴r on our common ⁸nt⁴r⁴st ⁸n t⁷⁴ us⁴ o⁵ “r⁴al-world” data, pr⁸mar⁸ly l⁴⁶al data, ⁸n NLP d⁴c⁴pt⁸on stud⁸⁴s. ollow⁸n⁶ t⁷⁴ m⁸ss⁸on o⁵ t⁷⁴ Synt⁷⁴s⁸s s⁴r⁸⁴s, t⁷⁴ pr⁴s⁴nt t⁴xt ⁸s d⁴s⁸⁶n⁴d to ⁶⁸v⁴ t⁷⁴ stud⁴nt or r⁴s⁴arc⁷⁴r ⁸n natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶ a back⁶round ⁸n t⁷⁴ ⁷⁸story o⁵ d⁴c⁴pt⁸on stud⁸⁴s conc⁴ntrat⁸n⁶ on t⁷⁴ b⁴⁷av⁸oral cu⁴s to d⁴c⁴pt⁸on t⁷at ⁷av⁴ b⁴⁴n support⁴d ⁸n t⁷⁴ psyc⁷olo⁶y, appl⁸⁴d psyc⁷olo⁶y, and cr⁸m⁸nal ⁹ust⁸c⁴ l⁸t⁴ratur⁴, a cons⁸d⁴rat⁸on o⁵ t⁷⁴ r⁴al-world data sourc⁴s ⁵or NLP work ⁸n d⁴c⁴pt⁸on, a r⁴v⁸⁴w o⁵ NLP work ⁸n d⁴c⁴pt⁸on or⁶an⁸z⁴d around t⁷⁴ ⁵⁴atur⁴s
xvi
PREFACE
us⁴d ⁵or class⁸ficat⁸on ⁵rom n⁶ram to r⁷⁴tor⁸cal structur⁴, and a look at ⁴xc⁸t⁸n⁶ qu⁴st⁸ons and ar⁴as t⁷at n⁴⁴d to b⁴ addr⁴ss⁴d ⁸n ord⁴r ⁵or t⁷⁴ fi⁴ld to pro⁶r⁴ss. ⁸l⁴⁴n ⁸tzpatr⁸ck, Joan ac⁷⁴nko, and Tommaso ornac⁸ar⁸ July 0
xvii
Acknowledgments W⁴ w⁸s⁷ to t⁷ank t⁷⁴ stud⁴nts w⁷o took part ⁸n a ⁶raduat⁴ s⁴m⁸nar on curr⁴nt ⁸ssu⁴s ⁸n NLP at Montcla⁸r Stat⁴ Un⁸v⁴rs⁸ty ⁸n w⁷⁸c⁷ a pr⁴v⁸ous dra⁵t o⁵ t⁷⁸s t⁴xt was us⁴d as t⁷⁴ pr⁸mary t⁴xt ⁵or t⁷⁴ cours⁴, ⁸nclud⁸n⁶ R⁸c⁷ard arr⁴tt, Jan⁴tt⁴ Mart⁸n⁴z, Matt⁷⁴w Mul⁷olland, and m⁸ly Ols⁷⁴⁵sk⁸. ⁴⁸r ab⁸l⁸ty to tak⁴ t⁷⁴ mat⁴r⁸al cov⁴r⁴d ⁸n t⁷⁴ t⁴xt to t⁷⁴ n⁴xt st⁴p ⁸n t⁷⁴⁸r own r⁴s⁴arc⁷ ⁶⁸v⁴s us confid⁴nc⁴ t⁷at w⁴ ar⁴ r⁴sp⁴ct⁸n⁶ t⁷⁴ m⁸ss⁸on o⁵ t⁷⁸s s⁴r⁸⁴s. W⁴ would also l⁸k⁴ to t⁷ank Ralp⁷ r⁸s⁷man and Mass⁸mo Po⁴s⁸o ⁵or ⁸nput, c⁷⁴ck⁸n⁶, and ⁵ru⁸t⁵ul d⁸scuss⁸ons r⁴lat⁴d to t⁷⁴ ⁸ssu⁴s ⁸n t⁷⁸s book. W⁴ ar⁴ ⁸nd⁴bt⁴d to Myl⁴ Ott and anot⁷⁴r, anonymous, r⁴v⁸⁴w⁴r w⁷o prov⁸d⁴d d⁴ta⁸l⁴d, t⁷ou⁶⁷t⁵ul ⁵⁴⁴dback ⁸n t⁷⁴⁸r r⁴v⁸⁴ws o⁵ a dra⁵t o⁵ t⁷⁴ book, and to t⁷⁴ s⁴r⁸⁴s ⁴d⁸tor ra⁴m⁴ ⁸rst w⁷o first su⁶⁶⁴st⁴d a t⁴xt on d⁴c⁴pt⁸on ⁸n Av⁸⁶non and ⁷as ⁶u⁸d⁴d ⁸t t⁷rou⁶⁷, w⁸t⁷ ⁶r⁴at pat⁸⁴nc⁴ and car⁴, to r⁴al⁸ty. W⁴ ar⁴ also ⁶rat⁴⁵ul to t⁷⁴ many aut⁷ors w⁷os⁴ work w⁴ ⁷av⁴ c⁸t⁴d ⁷⁴r⁴. ⁸l⁴⁴n ⁸tzpatr⁸ck, Joan ac⁷⁴nko, and Tommaso ornac⁸ar⁸ July 0
APTR
Introduction 1.1
INTRODUCTION
⁴c⁴pt⁸on occurs ⁵r⁴qu⁴ntly ⁸n ⁴v⁴ryday s⁸tuat⁸ons, ⁵rom ⁸ns⁸nc⁴r⁴ compl⁸m⁴nts — “You look ⁶r⁴at!” — to ⁵ac⁴ sav⁸n⁶ l⁸⁴s “an you l⁴nd m⁴ $ lost my ATM card.” Most o⁵ t⁷⁴s⁴ l⁸⁴s ar⁴ ⁸ncons⁴qu⁴nt⁸al som⁴ ⁴v⁴n ⁷av⁴ pos⁸t⁸v⁴ ⁴ff⁴cts. ⁸s book conc⁴ntrat⁴s on t⁷⁴ mor⁴ cons⁴qu⁴nt⁸al l⁸⁴s and on t⁷⁴ b⁴⁷av⁸ors t⁷at ar⁴ t⁷ou⁶⁷t to b⁴ assoc⁸at⁴d w⁸t⁷ ly⁸n⁶, part⁸cularly ly⁸n⁶ ⁸nvolv⁸n⁶ natural lan⁶ua⁶⁴. ⁸s book d⁸scuss⁴s ⁷ow t⁷⁴s⁴ b⁴⁷av⁸ors ar⁴ b⁴⁸n⁶ captur⁴d by appl⁸cat⁸ons ⁸n natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶. ⁴ study o⁵ d⁴c⁴pt⁸on ⁷as d⁴⁴p ⁸mpl⁸cat⁸ons ⁵or ⁷uman, and som⁴ an⁸mal, b⁴⁷av⁸or s⁸nc⁴ ⁸t d⁴monstrat⁴s an awar⁴n⁴ss o⁵ s⁴l⁵, ⁸n part⁸cular an awar⁴n⁴ss t⁷at on⁴’s own t⁷ou⁶⁷ts can d⁸ff⁴r ⁵rom t⁷os⁴ o⁵ ot⁷⁴rs [K⁴⁴nan ⁴t al., 00]. n t⁷⁴ broad⁴st s⁴ns⁴, d⁴c⁴pt⁸on ⁸nclud⁴s s⁴l⁵-d⁴c⁴pt⁸on, act⁸n⁶, and con⁹ur⁸n⁶. t also som⁴t⁸m⁴s cov⁴rs ⁵als⁴ stat⁴m⁴nts b⁴l⁸⁴v⁴d by t⁷⁴ t⁴ll⁴r to b⁴ tru⁴. ⁸s book, ⁷ow⁴v⁴r, ⁸s d⁴vot⁴d to appl⁸cat⁸ons w⁷⁴r⁴ som⁴t⁷⁸n⁶ ⁸mportant ⁸s at stak⁴ ⁸n commun⁸cat⁸on ⁸n pol⁸c⁴, s⁴cur⁸ty, bord⁴r cross⁸n⁶, customs, and asylum ⁸nt⁴rv⁸⁴ws ⁸n con⁶r⁴ss⁸onal ⁷⁴ar⁸n⁶s ⁸n financ⁸al r⁴port⁸n⁶ ⁸n l⁴⁶al d⁴pos⁸t⁸ons ⁸n ⁷uman r⁴sourc⁴ ⁴valuat⁸on ⁸n pr⁴datory commun⁸cat⁸ons, ⁸nclud⁸n⁶ nt⁴rn⁴t scams, ⁸d⁴nt⁸ty t⁷⁴⁵t, and ⁵raud and ⁸n ⁵als⁴ adv⁴rt⁸s⁸n⁶. or t⁷⁴ purpos⁴s o⁵ t⁷⁸s book, t⁷⁴n, w⁴ w⁸ll ⁵ollow t⁷⁴ d⁴fin⁸t⁸on o⁵ d⁴c⁴pt⁸on ⁶⁸v⁴n by Vr⁸⁹ (00), w⁷⁸c⁷ ⁴xclud⁴s s⁴l⁵-d⁴c⁴pt⁸on, act⁸n⁶, and ⁵als⁴ly ⁷⁴ld b⁴l⁸⁴⁵s. Vr⁸⁹ d⁴fin⁴s d⁴c⁴pt⁸on as a succ⁴ss⁵ul or unsucc⁴ss⁵ul d⁴l⁸b⁴rat⁴ att⁴mpt, w⁸t⁷out ⁵or⁴warn⁸n⁶, to cr⁴at⁴ ⁸n anot⁷⁴r a b⁴l⁸⁴⁵ w⁷⁸c⁷ t⁷⁴ commun⁸cator cons⁸d⁴rs to b⁴ untru⁴. ⁴ l⁸ar can carry out t⁷⁴ d⁴c⁴pt⁸on ⁸n d⁸ff⁴r⁴nt ways. ⁴ way t⁷at ⁸mm⁴d⁸at⁴ly com⁴s to m⁸nd ⁸s t⁷⁴ outr⁸⁶⁷t l⁸⁴ (“ was not r⁴qu⁸r⁴d to approv⁴ t⁷os⁴ transact⁸ons”),¹ but l⁸ars can b⁴ ⁴vas⁸v⁴ (“W⁴ll, t⁷⁴r⁴’s an ⁸ssu⁴ as to w⁷⁴t⁷⁴r was actually at a—t⁷⁴ part⁸cular m⁴⁴t⁸n⁶ t⁷at you’r⁴ talk⁸n⁶ about”), ⁴xa⁶⁶⁴rat⁴ or m⁸n⁸m⁸z⁴ an ⁸ssu⁴ (“n t⁷at m⁴⁴t⁸n⁶, t⁷⁴ pow⁴r ⁷ad ⁶on⁴ out, and as ⁴v⁴rybody r⁴m⁴mb⁴rs,...t⁷⁴ room was dark, qu⁸t⁴ ⁵rankly, and p⁴opl⁴ w⁴r⁴ walk⁸n⁶ ⁸n and out o⁵ t⁷⁴ m⁴⁴t⁸n⁶”) or om⁸t s⁸⁶n⁸ficant ⁵acts ⁵rom a story, as d⁸d Scott P⁴t⁴rson, conv⁸ct⁴d ⁵or t⁷⁴ murd⁴r o⁵ ⁷⁸s pr⁴⁶nant w⁸⁵⁴, ⁸n ⁷⁸s d⁴ta⁸l⁴d account o⁵ ⁷⁸s act⁸ons on t⁷⁴ day s⁷⁴ was r⁴port⁴d m⁸ss⁸n⁶. ⁴ att⁴mpt to spot d⁴c⁴pt⁸on ⁷as a lon⁶ ⁷⁸story, dat⁸n⁶ at l⁴ast ⁵rom t⁷⁴ r⁴⁴k p⁷ys⁸c⁸an ras⁸stratus (00–0 ..), w⁷o ⁵⁴lt t⁷⁴ puls⁴ o⁵ a susp⁴ct to d⁸st⁸n⁶u⁸s⁷ t⁷⁴ l⁸⁴ ⁵rom t⁷⁴ trut⁷ ¹⁴ d⁴c⁴pt⁸v⁴ quot⁴s ⁸n t⁷⁸s para⁶rap⁷ and t⁷⁴ n⁴xt ar⁴ ⁵rom t⁷⁴ t⁴st⁸mony o⁵ J⁴ffr⁴y Sk⁸ll⁸n⁶, O o⁵ nron, t⁷⁴ Am⁴r⁸can ⁴n⁴r⁶y company, to on⁶r⁴ss on ⁴bruary , 00 conc⁴rn⁸n⁶ t⁷⁴ account⁸n⁶ ⁵raud t⁷at ⁷⁸s company ⁷ad p⁴rp⁴trat⁴d.
1
2
1. INTRODUCTION
[Trov⁸llo, ], st⁸ll a m⁴asur⁴ us⁴d ⁸n mod⁴rn poly⁶rap⁷y. W⁷⁸l⁴ a ⁶ood d⁴al o⁵ r⁴s⁴arc⁷ ⁷as b⁴⁴n d⁴vot⁴d to pot⁴nt⁸al p⁷ys⁸olo⁶⁸cal cu⁴s to d⁴c⁴pt⁸on s⁸nc⁴ t⁷⁴ poly⁶rap⁷ was ⁸nv⁴nt⁴d ⁸n , ⁸nclud⁸n⁶ ⁸ma⁶⁸n⁶ t⁴c⁷nolo⁶⁸⁴s suc⁷ as ⁵unct⁸onal ma⁶n⁴t⁸c r⁴sonanc⁴ ⁸ma⁶⁸n⁶ (⁵MR) and t⁷⁴rmal ⁸ma⁶⁸n⁶, v⁸sual m⁴asur⁴s t⁷at ⁸nclud⁴ body mov⁴m⁴nts, and vocal m⁴asur⁴s suc⁷ as p⁸tc⁷ and sp⁴⁴c⁷ rat⁴ c⁷an⁶⁴s, t⁷⁴r⁴ ⁸s an ⁴qually robust l⁸t⁴ratur⁴ on t⁷⁴ c⁷aract⁴r⁸st⁸cs o⁵ lan⁶ua⁶⁴ t⁷at ar⁴ t⁷ou⁶⁷t to b⁴ assoc⁸at⁴d w⁸t⁷ d⁴c⁴pt⁸on. n t⁷⁴ psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴ l⁸t⁴ratur⁴ on d⁴c⁴pt⁸on, t⁷⁴s⁴ ran⁶⁴ ⁵rom d⁸scr⁴t⁴ cu⁴s suc⁷ as ⁷⁸⁶⁷⁴r rat⁴s o⁵ n⁴⁶at⁸v⁴ stat⁴m⁴nts and ⁴xtr⁴m⁴ d⁴scr⁸pt⁸ons (“absolut⁴ly, pos⁸t⁸v⁴ly no conn⁴ct⁸on”) to mor⁴ ⁶lobal ⁵⁴atur⁴s l⁸k⁴ l⁴x⁸cal d⁸v⁴rs⁸ty and story cons⁸st⁴ncy. To ⁶⁸v⁴ t⁷⁴ scop⁴ o⁵ work ⁸n d⁴c⁴pt⁸on d⁴t⁴ct⁸on, w⁴ prov⁸d⁴ ⁸n ⁷apt⁴r an ov⁴rv⁸⁴w o⁵ bot⁷ t⁷⁴ non-v⁴rbal and v⁴rbal stud⁸⁴s on d⁴c⁴pt⁸v⁴ b⁴⁷av⁸or. ⁴ computat⁸onal l⁸t⁴ratur⁴, w⁷⁸c⁷ ⁷as, to a ⁶r⁴at ⁴xt⁴nt, r⁴pl⁸cat⁴d t⁷⁴ find⁸n⁶s o⁵ t⁷⁴ psyc⁷olo⁶⁸cal ⁴xp⁴r⁸m⁴ntat⁸on, runs t⁷⁴ ⁶amut o⁵ compar⁸sons o⁵ tru⁴ v⁴rsus ⁵als⁴ stat⁴m⁴nts ⁵rom d⁸ff⁴r⁴nc⁴s ⁸n n-⁶ram occurr⁴nc⁴s to d⁸ff⁴r⁴nc⁴s ⁸n local ⁵⁴atur⁴s o⁵ t⁷⁴ narrat⁸v⁴ to d⁸st⁸nct⁸ons ⁸n r⁷⁴tor⁸cal structur⁴. n ⁸ts ⁴stabl⁸s⁷m⁴nt o⁵ a bas⁴l⁸n⁴ a⁶a⁸nst w⁷⁸c⁷ to m⁴asur⁴ succ⁴ss, ⁸ts us⁴ o⁵ class⁸ficat⁸on al⁶or⁸t⁷ms to s⁴parat⁴ tru⁴ ⁵rom ⁵als⁴ narrat⁸v⁴s, ⁸ts tra⁸n⁸n⁶ and t⁴st⁸n⁶ proc⁴dur⁴s, and ⁸ts r⁴l⁸anc⁴ on standard ⁴valuat⁸on m⁴asur⁴s to ⁴st⁸mat⁴ succ⁴ss, t⁷⁴ computat⁸onal work ⁷as t⁷⁴ c⁷aract⁴r⁸st⁸cs o⁵ muc⁷ work ⁸n appl⁸⁴d natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶. Unl⁸k⁴ many NLP v⁴ntur⁴s, t⁷ou⁶⁷, ⁸t ⁸s ⁷amp⁴r⁴d by data l⁸m⁸tat⁸ons ⁸t ⁸s ⁷ard to com⁴ by narrat⁸v⁴s t⁷⁴ propos⁸t⁸on(s) o⁵ w⁷⁸c⁷ ar⁴ known to b⁴ tru⁴ or ⁵als⁴. W⁴ d⁴vot⁴ ⁷apt⁴r to att⁴mpt⁴d solut⁸ons to t⁷⁸s probl⁴m. n add⁸t⁸on, computat⁸onal stud⁸⁴s o⁵ v⁴rbal d⁴c⁴pt⁸on d⁸ff⁴r ⁵rom work ⁸n ot⁷⁴r ar⁴as o⁵ natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶ ⁸n two ⁸mportant, r⁴lat⁴d r⁴sp⁴cts most NLP stud⁸⁴s r⁴⁶ard ⁷uman p⁴r⁵ormanc⁴ as t⁷⁴ ⁶old standard, w⁷⁴r⁴as typ⁸cal ⁷uman p⁴r⁵ormanc⁴ ⁸n d⁴t⁴ct⁸n⁶ d⁴c⁴pt⁸on runs at c⁷anc⁴ l⁴v⁴ls. ⁸s first d⁸ff⁴r⁴nc⁴ l⁴ads to a s⁴cond w⁷⁸l⁴ most ⁴valuat⁸ons ⁸n NLP t⁴st a⁶a⁸nst ⁷uman p⁴r⁵ormanc⁴, work ⁸n d⁴c⁴pt⁸on d⁴t⁴ct⁸on must t⁴st a⁶a⁸nst ⁶round trut⁷, w⁷⁸c⁷ ⁸s ⁴xt⁴rnal to t⁷⁴ v⁴rbal data—Was, ⁵or ⁴xampl⁴, nron’s J⁴ffr⁴y Sk⁸ll⁸n⁶ pr⁴s⁴nt at t⁷at cr⁸t⁸cal m⁴⁴t⁸n⁶ or was ⁷⁴ not
1.2
VERBAL CUES TO DECEPTION
Propos⁸t⁸onal commun⁸cat⁸on tak⁴s plac⁴ t⁷rou⁶⁷ words, and so do⁴s t⁷⁴ opportun⁸ty to m⁸sr⁴pr⁴s⁴nt r⁴al⁸ty. ⁴ most obv⁸ous way to d⁴t⁴ct d⁴c⁴pt⁸on ⁸n commun⁸cat⁸on, t⁷⁴n, would b⁴ to compar⁴ t⁷⁴ propos⁸t⁸ons w⁸t⁷ t⁷⁴ r⁴al⁸ty ⁸⁵ som⁴ d⁸ss⁸m⁸lar⁸ty ⁸s ⁵ound, and t⁷⁴ narrator’s awar⁴n⁴ss o⁵ t⁷at d⁸ss⁸m⁸lar⁸ty ⁸s (r⁴asonably) d⁴monstrat⁴d, t⁷⁴ commun⁸cat⁸on can b⁴ r⁴⁶ard⁴d as d⁴c⁴pt⁸v⁴. W⁷⁴n doubts about t⁷⁴ trut⁷⁵uln⁴ss o⁵ a commun⁸cat⁸on ar⁸s⁴, typ⁸cally a ⁶r⁴at d⁴al o⁵ ⁴ffort ⁸s d⁸r⁴ct⁴d at ⁴stabl⁸s⁷⁸n⁶ t⁷⁴ ⁶round trut⁷ ⁶⁴tt⁸n⁶ at t⁷⁸s trut⁷ ⁸s on⁴ o⁵ t⁷⁴ ma⁸n ⁶oals not only ⁸n court tr⁸als and ⁸n pol⁸c⁴ ⁸nv⁴st⁸⁶at⁸ons, but also ⁸n pr⁸vat⁴ d⁸sput⁴s. Un⁵ortunat⁴ly, d⁴t⁴rm⁸n⁸n⁶ t⁷⁴ ⁶round trut⁷ ⁸n most cas⁴s ⁸s d⁸fficult ⁸⁵ not ⁸mposs⁸bl⁴. n suc⁷ cas⁴s, ⁷av⁸n⁶
1.2. VERBAL CUES TO DECEPTION
robust cu⁴s to d⁴c⁴pt⁸on would at l⁴ast l⁴ad t⁷⁴ ⁸nqu⁸r⁴r toward commun⁸cat⁸ons t⁷at ar⁴ mor⁴ l⁸k⁴ly to b⁴ l⁸⁴s. A notabl⁴ ⁴xampl⁴ o⁵ t⁷⁴ us⁴ o⁵ l⁸n⁶u⁸st⁸c analys⁸s to uncov⁴r ⁶round trut⁷ dat⁴s back to t⁷⁴ t⁷ c⁴ntury, w⁷⁴n t⁷⁴ tal⁸an r⁷⁴tor⁸c⁸an Lor⁴nzo Valla prov⁴d t⁷⁴ ⁵or⁶⁴ry o⁵ t⁷⁴ onat⁸on o⁵ onstant⁸n⁴, t⁷⁴ ⁵ak⁴ ⁸mp⁴r⁸al d⁴cr⁴⁴ w⁷⁸c⁷ suppos⁴dly con⁵⁴rr⁴d on t⁷⁴ Pop⁴ aut⁷or⁸ty ov⁴r t⁷⁴ w⁴st⁴rn Roman mp⁸r⁴. n t⁷at cas⁴, Valla d⁴monstrat⁴d t⁷⁴ ⁵als⁸ty o⁵ t⁷⁴ docum⁴nt not only by d⁸scuss⁸n⁶ t⁷⁴ ⁷⁸stor⁸cal ⁸mplaus⁸b⁸l⁸ty o⁵ t⁷⁴ onat⁸on and addr⁴ss⁸n⁶ a cl⁴ar m⁸stak⁴ w⁸t⁷ r⁴sp⁴ct to t⁷⁴ onat⁸on’s dat⁴, w⁷⁸c⁷ r⁴⁵⁴rr⁴d to a t⁸m⁴ not compat⁸bl⁴ w⁸t⁷ t⁷⁴ cont⁴nt o⁵ t⁷⁴ docum⁴nt, but also by mak⁸n⁶ us⁴ o⁵ l⁸n⁶u⁸st⁸c ar⁶um⁴nts. n part⁸cular, ⁷⁴ ⁴mp⁷as⁸z⁴d t⁷at t⁷⁴ Lat⁸n o⁵ t⁷⁴ onat⁸on could not b⁴lon⁶ to t⁷⁴ ⁸mp⁴r⁸al p⁴r⁸od, but was typ⁸cal o⁵ t⁷⁴ ⁵ollow⁸n⁶ c⁴ntur⁸⁴s [Valla, 00]. ⁴ study o⁵ v⁴rbal cu⁴s to d⁴c⁴pt⁸on assum⁴s t⁷at t⁷⁴ styl⁴ o⁵ t⁷⁴ commun⁸cat⁸on ⁸s aff⁴ct⁴d not only by t⁷⁴ d⁴mo⁶rap⁷⁸c, soc⁸al, and cultural c⁷aract⁴r⁸st⁸cs o⁵ t⁷⁴ narrator, but also by ⁷⁸s stat⁴ o⁵ m⁸nd at t⁷⁴ mom⁴nt o⁵ t⁷⁴ product⁸on o⁵ t⁷⁴ commun⁸cat⁸on. n part⁸cular, t⁷⁴ bas⁸c assumpt⁸ons o⁵ t⁷⁸s approac⁷ ar⁴ as ⁵ollows • t⁷⁴ psyc⁷olo⁶⁸cal cond⁸t⁸on o⁵ t⁷⁴ sub⁹⁴ct aff⁴cts commun⁸cat⁸on styl⁴ • t⁷⁴ ⁴laborat⁸on o⁵ a l⁸⁴ and t⁷⁴ r⁴call o⁵ a m⁴mory ar⁴ d⁸ff⁴r⁴nt co⁶n⁸t⁸v⁴ proc⁴ss⁴s and • at l⁴ast ⁸n ⁷⁸⁶⁷ stak⁴s sc⁴nar⁸os, t⁷⁴ ⁴mot⁸onal c⁷ar⁶⁴ o⁵ a l⁸⁴ d⁸ff⁴rs ⁵rom t⁷at o⁵ a trut⁷⁵ul stat⁴m⁴nt. t assum⁴s, t⁷⁴n, t⁷at t⁷⁴ commun⁸cat⁸on styl⁴ o⁵ a l⁸ar may b⁴ d⁸ff⁴r⁴nt ⁵rom t⁷at o⁵ a trut⁷ t⁴ll⁴r. as⁸cally, t⁷⁸s ⁸s t⁷⁴ sam⁴ ⁸d⁴a t⁷at c⁷aract⁴r⁸z⁴s t⁷⁴ stud⁸⁴s o⁵ non-v⁴rbal b⁴⁷av⁸or and p⁷ys⁸olo⁶⁸cal var⁸abl⁴s w⁸t⁷ r⁴sp⁴ct to d⁴c⁴pt⁸on. ow⁴v⁴r, w⁷⁸l⁴ t⁷⁴s⁴ stud⁸⁴s b⁴lon⁶ to t⁷⁴ r⁴s⁴arc⁷ fi⁴lds o⁵ psyc⁷olo⁶y and p⁷ys⁸olo⁶y, t⁷⁴ t⁷⁴or⁴t⁸cal parad⁸⁶m ⁷⁴r⁴ com⁴s ⁵rom l⁸n⁶u⁸st⁸cs, and ⁸n part⁸cular ⁵rom t⁷⁴ structural⁸st sc⁷ool o⁵ t⁷ou⁶⁷t. ⁴ structural⁸st approac⁷ stud⁸⁴s t⁴xts t⁷rou⁶⁷ t⁷⁴ r⁴lat⁸ons ⁴x⁸st⁸n⁶ amon⁶ t⁷⁴⁸r s⁸n⁶l⁴ ⁴l⁴m⁴nts t⁷⁸s mod⁴ o⁵ r⁴ason⁸n⁶ ⁶av⁴ r⁸s⁴ to t⁷⁴ poss⁸b⁸l⁸ty o⁵ quant⁸tat⁸v⁴ and computat⁸onal analys⁴s o⁵ t⁷⁴ t⁴xts. ⁴ sc⁸⁴nt⁸fic study o⁵ d⁴c⁴pt⁸on ⁸n lan⁶ua⁶⁴ dat⁴s at l⁴ast ⁵rom Und⁴utsc⁷, w⁷o ⁷ypot⁷⁴s⁸z⁴d t⁷at ⁸t ⁸s “not t⁷⁴ v⁴rac⁸ty o⁵ t⁷⁴ r⁴port⁸n⁶ p⁴rson but t⁷⁴ trut⁷⁵uln⁴ss o⁵ t⁷⁴ stat⁴m⁴nt t⁷at matt⁴rs and t⁷⁴r⁴ ar⁴ c⁴rta⁸n r⁴lat⁸v⁴ly ⁴xact, d⁴finabl⁴, d⁴scr⁸pt⁸v⁴ cr⁸t⁴r⁸a t⁷at ⁵orm a k⁴y tool ⁵or t⁷⁴ d⁴t⁴rm⁸nat⁸on o⁵ t⁷⁴ trut⁷⁵uln⁴ss o⁵ stat⁴m⁴nts” [Und⁴utsc⁷, ]. n t⁷⁴ last t⁴n y⁴ars, mod⁴rn natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶ t⁴c⁷n⁸qu⁴s ⁷av⁴ b⁴⁴n appl⁸⁴d by many r⁴s⁴arc⁷⁴rs to t⁷⁴ d⁴t⁴ct⁸on o⁵ d⁴c⁴pt⁸on w⁸t⁷ prom⁸s⁸n⁶ r⁴sults. ⁸s r⁴s⁴arc⁷ ⁸s cov⁴r⁴d ⁸n ⁷apt⁴r . ⁴ ac⁷⁸⁴v⁴m⁴nt o⁵ t⁷⁴s⁴ stud⁸⁴s ⁸s not ⁸ns⁸⁶n⁸ficant, s⁸nc⁴ ⁸d⁴nt⁸⁵y⁸n⁶ d⁴c⁴pt⁸on by any m⁴ans ⁷as prov⁴n to b⁴ a v⁴ry d⁸fficult task, r⁴⁶ardl⁴ss o⁵ t⁷⁴ k⁸nd o⁵ cu⁴s ⁴mploy⁴d to ⁴l⁸c⁸t t⁷⁴ d⁴c⁴⁸t. As Ald⁴rt Vr⁸⁹, w⁷o ⁷as carr⁸⁴d out ⁴xt⁴ns⁸v⁴ stud⁸⁴s o⁵ d⁴c⁴pt⁸on w⁸t⁷⁸n soc⁸al psyc⁷olo⁶y, not⁴s ⁸n r⁴⁵⁴rr⁸n⁶ to t⁷⁴ ly⁸n⁶ prota⁶on⁸st o⁵ tal⁸an ⁵abl⁴ “a v⁴rbal cu⁴ un⁸qu⁴ly r⁴lat⁴d to d⁴c⁴pt⁸on, ak⁸n to P⁸nocc⁷⁸o’s ⁶row⁸n⁶ nos⁴, do⁴s not ⁴x⁸st” [Vr⁸⁹, 00, p. 0].
3
4
1. INTRODUCTION
⁸v⁴n t⁷⁸s s⁸tuat⁸on, t⁷⁴ ⁵a⁸r succ⁴ss o⁵ NLP analys⁴s carr⁸⁴d out t⁷rou⁶⁷ mac⁷⁸n⁴ l⁴arn⁸n⁶ t⁴c⁷n⁸qu⁴s ⁸s probably du⁴, at l⁴ast ⁸n part, to t⁷⁴ ⁵act t⁷at t⁷⁸s l⁸n⁴ o⁵ r⁴s⁴arc⁷ r⁴l⁸⁴s on clust⁴rs o⁵ cu⁴s, w⁷⁸c⁷ prov⁸d⁴ an ov⁴rall p⁸ctur⁴ o⁵ t⁷⁴ d⁴c⁴pt⁸v⁴ lan⁶ua⁶⁴. N⁴v⁴rt⁷⁴l⁴ss, t⁷⁴ ⁴ff⁴ct⁸v⁴n⁴ss o⁵ mac⁷⁸n⁴ l⁴arn⁸n⁶ t⁴c⁷n⁸qu⁴s d⁴p⁴nds ⁴qually on t⁷⁴ ⁴ff⁴ct⁸v⁴n⁴ss o⁵ t⁷⁴ ⁸nd⁸v⁸dual cu⁴s, or ⁵⁴atur⁴s, o⁵ d⁴c⁴pt⁸on t⁷at d⁸st⁸n⁶u⁸s⁷ t⁷⁴ narrat⁸v⁴s und⁴r analys⁸s. ⁴r⁴⁵or⁴ ⁸t ⁸s wort⁷ look⁸n⁶ clos⁴ly at t⁷⁴ l⁸n⁶u⁸st⁸c ⁵⁴atur⁴s ⁴mploy⁴d ⁸n d⁴t⁴ct⁸n⁶ d⁴c⁴pt⁸on ⁸n commun⁸cat⁸on.
1.2.1 LINGUISTIC FEATURES USED IN IDENTIFYING DECEPTION ⁴ ⁶oal ⁷⁴r⁴ ⁸s to d⁸st⁸n⁶u⁸s⁷ trut⁷⁵ul (T) ⁵rom ⁵als⁴ () narrat⁸v⁴s w⁸t⁷ a ⁷⁸⁶⁷ d⁴⁶r⁴⁴ o⁵ succ⁴ss. To t⁷at ⁴nd, NLP r⁴s⁴arc⁷ ⁷as borrow⁴d cu⁴s to ⁵als⁴ stat⁴m⁴nts ⁵rom t⁷⁴ psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴ l⁸t⁴ratur⁴ as w⁴ll as ⁵rom standard t⁴c⁷n⁸qu⁴s ⁸n natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶. ⁴ standard NLP approac⁷ us⁴s sur⁵ac⁴ l⁴v⁴l ⁴l⁴m⁴nts o⁵ t⁷⁴ narrat⁸v⁴—c⁷aract⁴rs, n⁶rams, part-o⁵-sp⁴⁴c⁷ ta⁶s, narrat⁸v⁴ l⁴n⁶t⁷, l⁴x⁸cal d⁸v⁴rs⁸ty—⁸n a pur⁴ class⁸ficat⁸on task. ⁴ pauc⁸ty o⁵ data ⁸n t⁷⁸s fi⁴ld constra⁸ns t⁷⁴ us⁴ o⁵ ot⁷⁴r ⁴l⁴m⁴nts commonly us⁴d ⁸n NLP, ⁵or ⁴xampl⁴, collocat⁸onal prop⁴rt⁸⁴s. ⁴ data constra⁸nts ⁷av⁴ dr⁸v⁴n many ⁸nv⁴st⁸⁶ators to ⁶⁴n⁴ral⁸z⁴ ov⁴r s⁴mant⁸cally r⁴lat⁴d l⁸n⁶u⁸st⁸c ⁴l⁴m⁴nts, ⁵or ⁴xampl⁴, s⁴l⁵-r⁴⁵⁴r⁴nc⁸n⁶ ⁸t⁴ms (I, me, my) as oppos⁴d to ⁸t⁴ms t⁷at r⁴⁵⁴r⁴nc⁴ ot⁷⁴rs (you, they, them). ⁸s approac⁷ also ⁷as t⁷⁴ advanta⁶⁴ o⁵ tapp⁸n⁶ ⁸nto psyc⁷olo⁶⁸cal mot⁸vat⁸on ⁵or t⁷⁴s⁴ ⁵⁴atur⁴s s⁴l⁵-r⁴⁵⁴r⁴nc⁴s ⁷av⁴ b⁴⁴n ⁵ound to app⁴ar muc⁷ l⁴ss ⁵r⁴qu⁴ntly ⁸n d⁴c⁴pt⁸v⁴ narrat⁸v⁴s t⁷an ⁸n trut⁷⁵ul on⁴s, w⁷⁸c⁷ mak⁴s s⁴ns⁴ ⁸⁵ on⁴ ⁸s try⁸n⁶ to d⁸stanc⁴ on⁴s⁴l⁵ ⁵rom t⁷⁴ narrat⁸v⁴. ⁴ most commonly us⁴d ⁵⁴atur⁴s ⁷⁴r⁴ ar⁴ t⁷os⁴ o⁵ t⁷⁴ L⁸n⁶u⁸st⁸c nqu⁸ry and Word ount so⁵twar⁴ [P⁴nn⁴bak⁴r ⁴t al., 00]. n a s⁸m⁸lar v⁴⁸n, stud⁸⁴s ⁷av⁴ d⁴v⁸s⁴d ob⁹⁴ct⁸v⁴ ways o⁵ c⁷aract⁴r⁸z⁸n⁶ c⁴rta⁸n o⁵ t⁷⁴ find⁸n⁶s ⁵rom t⁷⁴ psyc⁷olo⁶y l⁸t⁴ratur⁴. V⁴rbal and vocal ⁸mm⁴d⁸acy, ⁵or ⁴xampl⁴, ar⁴ ⁸d⁴nt⁸fi⁴d by many stud⁸⁴s as ⁷⁸⁶⁷ly d⁸scr⁸m⁸nat⁸n⁶ b⁴tw⁴⁴n T and narrat⁸v⁴s as ⁸nd⁸cat⁴d by ⁴Paulo ⁴t al. [00] and ⁷av⁴ b⁴⁴n m⁴asur⁴d ⁸n t⁷⁴ NLP l⁸t⁴ratur⁴ by pr⁴s⁴nc⁴ o⁵ act⁸v⁴ vo⁸c⁴, pr⁴s⁴nt t⁴ns⁴, and s⁴l⁵-r⁴⁵⁴r⁴nc⁸n⁶. Mor⁴ r⁴c⁴ntly, d⁴⁴p⁴r and/or mor⁴ ⁶lobal c⁷aract⁴r⁸st⁸cs o⁵ t⁷⁴ narrat⁸v⁴ ar⁴ b⁴⁸n⁶ ⁸nv⁴st⁸⁶at⁴d to ⁴n⁷anc⁴ t⁷⁴ class⁸ficat⁸on p⁴r⁵ormanc⁴. Amon⁶ t⁷⁴s⁴ ar⁴ pars⁴ tr⁴⁴ d⁸ff⁴r⁴nc⁴s b⁴tw⁴⁴n T and narrat⁸v⁴s. ⁴ coll⁴ct⁸on o⁵ syntax-r⁴lat⁴d ⁵⁴atur⁴s ⁸s mor⁴ compl⁴x t⁷an t⁷at o⁵ sur⁵ac⁴ ⁵⁴atur⁴s, s⁸nc⁴ ⁸t r⁴qu⁸r⁴s pars⁸n⁶ t⁷⁴ narrat⁸v⁴ ⁸n ord⁴r to ⁸d⁴nt⁸⁵y ⁸ts syntact⁸c structur⁴s, ⁴ncod⁴d as tr⁴⁴s—or parts o⁵ tr⁴⁴s—o⁵ syntact⁸c ⁴l⁴m⁴nts w⁷⁸c⁷ ar⁴ us⁴d as ⁵⁴atur⁴s o⁵ t⁷⁴ t⁴xts. n t⁷⁴ fi⁴ld o⁵ d⁴c⁴pt⁸on d⁴t⁴ct⁸on, t⁷⁸s approac⁷ was ⁵ollow⁴d by ⁴n⁶ ⁴t al. [0a], w⁷o ⁵ound t⁷at p⁴r⁵ormanc⁴ ⁸n t⁷⁴ class⁸ficat⁸on o⁵ narrat⁸v⁴s as trut⁷⁵ul or d⁴c⁴pt⁸v⁴ was notably b⁴tt⁴r w⁷⁴n d⁴⁴p syntact⁸c ⁵⁴atur⁴s w⁴r⁴ ⁴mploy⁴d ⁸nst⁴ad o⁵ s⁷allow syntact⁸c ⁵⁴atur⁴s suc⁷ as part o⁵ sp⁴⁴c⁷. Narrat⁸v⁴ co⁷⁴r⁴nc⁴ and d⁸scours⁴ r⁴lat⁸ons w⁸t⁷⁸n t⁷⁴ narrat⁸v⁴ ar⁴ also und⁴r ⁸nv⁴st⁸⁶at⁸on by Rub⁸n and Vas⁷c⁷⁸lko [0]. ⁸s ⁸s cons⁸st⁴nt w⁸t⁷ work by Susan Adams w⁸t⁷⁸n t⁷⁴ cr⁸m⁸nal ⁹ust⁸c⁴ fi⁴ld on p⁴rson-o⁵-⁸nt⁴r⁴st narrat⁸v⁴s wr⁸tt⁴n pr⁸or to pol⁸c⁴ ⁸nt⁴rv⁸⁴ws, w⁷⁸c⁷ not⁴s an ⁸mbalanc⁴ ⁸n d⁴c⁴pt⁸v⁴ narrat⁸v⁴s amon⁶ t⁷⁴ ⁸ntroduct⁸on, body, and conclus⁸on [Adams, ,
1.2. VERBAL CUES TO DECEPTION
00]. Anot⁷⁴r prom⁸s⁸n⁶ approac⁷ cr⁴at⁴s a profil⁴ r⁴pr⁴s⁴ntat⁸v⁴ o⁵ T narrat⁸v⁴s a⁶a⁸nst w⁷⁸c⁷ a t⁴st narrat⁸v⁴ can b⁴ m⁴asur⁴d ⁵or compat⁸b⁸l⁸ty [⁴n⁶ and ⁸rst, 0]. ⁷apt⁴r ⁸s d⁴vot⁴d to ⁶r⁴at⁴r cov⁴ra⁶⁴ o⁵ t⁷⁴ NLP work on d⁴c⁴pt⁸on.
1.2.2 EFFECTIVENESS OF LINGUISTIC CUES TO DECEPTION W⁴ w⁸ll s⁴⁴ t⁷at t⁷⁴ var⁸at⁸on ⁸n t⁷⁴ succ⁴ss o⁵ T/ class⁸ficat⁸on w⁸t⁷⁸n NLP d⁴p⁴nds lar⁶⁴ly on t⁷⁴ cont⁴xtual ⁵actors ⁸nvolv⁴d ⁸n ⁴ac⁷ study, ⁸nclud⁸n⁶ t⁷⁴ top⁸cs, ⁶⁴nr⁴s, r⁴⁶⁸st⁴rs, and mod⁴s o⁵ d⁴l⁸v⁴ry (⁵ac⁴-to-⁵ac⁴, wr⁸tt⁴n, ⁴l⁴ctron⁸c, ⁴tc.). ⁴p⁴nd⁸n⁶ on t⁷⁴s⁴ ⁵actors, class⁸ficat⁸on accuracy rat⁴s vary ⁵rom t⁷⁴ low 0% ran⁶⁴ to t⁷⁴ low 0% ran⁶⁴, w⁷⁸c⁷ ⁸s st⁸ll b⁴tt⁴r t⁷an t⁷⁴ random accuracy o⁵ most ⁷uman ⁹ud⁶m⁴nts. ⁵ w⁴ want to ac⁷⁸⁴v⁴ class⁸ficat⁸on syst⁴ms t⁷at can ⁶⁴n⁴ral⁸z⁴ across t⁷⁴s⁴ ⁵actors, t⁷⁴ fi⁴ld n⁴⁴ds to a⁸m ⁵or a common data s⁴t and s⁷ar⁴d task a⁶a⁸nst w⁷⁸c⁷ to t⁴st t⁷⁴ syst⁴ms w⁷⁸l⁴ st⁸ll ⁸d⁴nt⁸⁵y⁸n⁶ t⁷⁴ tasks t⁷at ar⁴ mor⁴ am⁴nabl⁴ to class⁸ficat⁸on. S⁴v⁴ral ⁶roups ar⁴ conc⁴rn⁴d w⁸t⁷ t⁷⁴s⁴ ⁸ssu⁴s, ⁸nclud⁸n⁶ Myl⁴ Ott and ⁷⁸s coll⁴a⁶u⁴s, as w⁴ll as ornac⁸ar⁸ and Po⁴s⁸o, Rub⁸n and onroy, and ⁸tzpatr⁸ck and ac⁷⁴nko. W⁴ w⁸ll cons⁸d⁴r t⁷⁴ ⁸ssu⁴s t⁷⁴y ra⁸s⁴ ⁸n ⁷apt⁴rs and . As d⁸scuss⁴d ⁸n t⁷⁴ pr⁴v⁸ous subs⁴ct⁸on, a qu⁸t⁴ w⁸d⁴ var⁸⁴ty o⁵ l⁸n⁶u⁸st⁸c var⁸abl⁴s can b⁴ us⁴d as d⁴c⁴pt⁸on ⁸nd⁸cators. v⁴n t⁷ou⁶⁷ non⁴ o⁵ t⁷⁴m can b⁴ cons⁸d⁴r⁴d a ⁷⁸⁶⁷ probab⁸l⁸ty ⁸nd⁸cator o⁵ d⁴c⁴pt⁸on l⁸k⁴ P⁸nocc⁷⁸o’s nos⁴, t⁷⁴ r⁴sults o⁵ t⁷⁴ stud⁸⁴s m⁴nt⁸on⁴d abov⁴ su⁶⁶⁴st t⁷at t⁷⁴⁸r comb⁸nat⁸on can b⁴ us⁴⁵ul ⁸n ⁸d⁴nt⁸⁵y⁸n⁶ d⁴c⁴pt⁸v⁴ commun⁸cat⁸ons. ow⁴v⁴r, w⁷⁸l⁴ t⁷⁴r⁴ ⁷av⁴ b⁴⁴n a lar⁶⁴ numb⁴r o⁵ stud⁸⁴s conc⁴rn⁸n⁶ nonv⁴rbal cu⁴s to d⁴c⁴pt⁸on, w⁷⁸c⁷ w⁴r⁴ ⁴v⁴n t⁷⁴ ma⁸n ⁵ocus o⁵ a w⁴ll-known Am⁴r⁸can t⁴l⁴v⁸s⁸on s⁴r⁸⁴s Lie to Me, t⁷⁴r⁴ ⁷as b⁴⁴n only a r⁴lat⁸v⁴ly small sc⁸⁴nt⁸fic commun⁸ty o⁵ l⁸n⁶u⁸sts, psyc⁷olo⁶⁸sts, and comput⁴r sc⁸⁴nt⁸sts d⁴al⁸n⁶ w⁸t⁷ v⁴rbal cu⁴s to d⁴c⁴pt⁸on. ⁸v⁴n t⁷at many asp⁴cts o⁵ sp⁴⁴c⁷ ⁴lud⁴ t⁷⁴ consc⁸ous control o⁵ t⁷⁴ narrator, suc⁷ as t⁷⁴ a⁵or⁴m⁴nt⁸on⁴d vocabulary r⁸c⁷n⁴ss, t⁷⁴ study o⁵ v⁴rbal cu⁴s to t⁷⁴ l⁸⁴ prom⁸s⁴s to prov⁸d⁴ valuabl⁴ support ⁸n t⁷⁴ d⁸fficult task o⁵ ⁸d⁴nt⁸⁵y⁸n⁶ d⁴c⁴pt⁸on ⁸n commun⁸cat⁸on. ⁴ stat⁴ o⁵ t⁷⁴ art ⁸n t⁷⁸s fi⁴ld, part⁸cularly ⁸n ⁸ts automat⁸on, ⁸s t⁷⁴ ob⁹⁴ct o⁵ t⁷⁸s t⁴xt. 1.2.3 VERBAL CUES TO GROUND TRUTH or stud⁸⁴s t⁷at us⁴ r⁴al-world data, t⁷⁴ ⁴stabl⁸s⁷m⁴nt o⁵ w⁷at ⁸s r⁴⁵⁴rr⁴d to as ‘⁶round trut⁷’ usually ⁸nvolv⁴s t⁷⁴ compar⁸son o⁵ a propos⁸t⁸on w⁸t⁷ ⁴xt⁴rnal data. J⁴ffr⁴y Sk⁸ll⁸n⁶, ⁵or ⁴xampl⁴, ⁸nd⁸cat⁴d t⁷at ⁷⁴ may not ⁷av⁴ ⁴v⁴n att⁴nd⁴d a cr⁸t⁸cal m⁴⁴t⁸n⁶ unt⁸l t⁷⁴ m⁸nut⁴s o⁵ t⁷⁴ m⁴⁴t⁸n⁶ s⁷ow⁸n⁶ Sk⁸ll⁸n⁶ as a part⁸c⁸pant w⁴r⁴ produc⁴d. ow⁴v⁴r, ⁶round trut⁷ can also b⁴ v⁴r⁸fi⁴d by attr⁸but⁴s o⁵ t⁷⁴ v⁴rbal narrat⁸v⁴ ⁸ts⁴l⁵, ⁸nclud⁸n⁶ t⁷⁴ ⁵ollow⁸n⁶. Consistency ⁸nvolv⁴s t⁷⁴ r⁴p⁴t⁸t⁸on o⁵ t⁷⁴ sam⁴ cont⁴nt ⁸n d⁸ff⁴r⁴nt stat⁴m⁴nts (⁸ssu⁴d by t⁷⁴ sam⁴ or d⁸ff⁴r⁴nt sub⁹⁴cts). Contradiction ⁸nvolv⁴s two cla⁸ms t⁷⁴ ⁵acts o⁵ w⁷⁸c⁷ ar⁴ at odds w⁸t⁷ ⁴ac⁷ ot⁷⁴r.
5
6
1. INTRODUCTION
To mak⁴ a d⁴c⁸s⁸on about t⁷⁴ trut⁷⁵uln⁴ss o⁵ a propos⁸t⁸on ⁸nvolv⁴s t⁷⁴ us⁴ o⁵ t⁷⁴ rul⁴s o⁵ lo⁶⁸c, pra⁶mat⁸cs, and probab⁸l⁸ty calculus. v⁴n t⁷ou⁶⁷ t⁷⁴ mod⁴rn ⁵ormal⁸zat⁸on o⁵ t⁷⁴s⁴ conc⁴pts ⁸s qu⁸t⁴ r⁴c⁴nt, ⁷⁸stor⁸cally t⁷⁴ appl⁸cat⁸on o⁵ t⁷⁴s⁴ tools to t⁷⁴ d⁴t⁴ct⁸on o⁵ d⁴c⁴pt⁸on ⁸s anc⁸⁴nt, and t⁴st⁸mon⁸⁴s can b⁴ ⁵ound ⁴v⁴n ⁸n t⁷⁴ ⁸bl⁴, ⁵or ⁴xampl⁴, ⁸n t⁷⁴ ook o⁵ an⁸⁴l (nd c⁴ntury ), w⁷⁴r⁴ t⁷⁴ ⁴p⁸sod⁴ o⁵ Susana ⁸s d⁴scr⁸b⁴d. ⁴r⁴ t⁷⁴ prop⁷⁴t an⁸⁴l unmasks t⁷⁴ d⁴c⁴pt⁸v⁴ accusat⁸ons o⁵ two old ⁹ud⁶⁴s a⁶a⁸nst t⁷⁴ woman (an⁸⁴l - Nova Vul⁶ata) by ⁸d⁴nt⁸⁵y⁸n⁶ an ⁸ncons⁸st⁴ncy ⁸n t⁷⁴ d⁸ff⁴r⁴nt stat⁴m⁴nts t⁷at ⁷⁴ ask⁴d t⁷⁴ two ⁹ud⁶⁴s to ⁸ssu⁴ s⁴parat⁴ly, r⁴⁶ard⁸n⁶ t⁷⁴ sam⁴ d⁴ta⁸ls.
1.3
WHAT’S AHEAD
⁸s book ⁸s d⁴s⁸⁶n⁴d to ⁶⁸v⁴ som⁴on⁴ w⁸t⁷ an ⁸ntroductory back⁶round ⁸n natural lan⁶ua⁶⁴ proc⁴ss⁸n⁶ and/or mac⁷⁸n⁴ l⁴arn⁸n⁶ an und⁴rstand⁸n⁶ o⁵ t⁷⁴ curr⁴nt approac⁷⁴s to t⁷⁴ automat⁸c d⁴t⁴ct⁸on o⁵ v⁴rbal d⁴c⁴pt⁸on. ⁸s subfi⁴ld o⁵ NLP ⁸s ⁸n ⁸ts ⁸n⁵ancy and so pr⁴s⁴nts an ⁴xc⁸t⁸n⁶ ar⁴a ⁸n w⁷⁸c⁷ to do ⁶roundbr⁴ak⁸n⁶ work. ⁴ purpos⁴ o⁵ t⁷⁸s book ⁸s to ⁴qu⁸p t⁷⁴ r⁴ad⁴r w⁸t⁷ t⁷⁴ ⁸n⁵ormat⁸on to do t⁷at. Muc⁷ o⁵ t⁷⁴ curr⁴nt r⁴s⁴arc⁷ ⁸n t⁷⁴ broad⁴r fi⁴ld o⁵ d⁴c⁴pt⁸on d⁴t⁴ct⁸on ⁸s bas⁴d on ⁴xp⁴r⁸m⁴ntal work t⁷at att⁴mpts to conn⁴ct sp⁴c⁸fic b⁴⁷av⁸ors w⁸t⁷ ly⁸n⁶. Som⁴ work t⁴sts conn⁴ct⁸ons b⁴tw⁴⁴n p⁷ys⁸olo⁶⁸cal m⁴asur⁴s and ly⁸n⁶, w⁷⁸l⁴ t⁷⁴ r⁴st ⁴xam⁸n⁴s b⁴⁷av⁸oral cu⁴s t⁴st⁴d w⁸t⁷⁸n t⁷⁴ fi⁴lds o⁵ psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴. W⁴ b⁴⁶⁸n ⁸n ⁷apt⁴r w⁸t⁷ a br⁸⁴⁵ r⁴v⁸⁴w o⁵ t⁷⁴ l⁸t⁴ratur⁴ on p⁷ys⁸olo⁶⁸cal cu⁴s to d⁴c⁴pt⁸on ⁵ollow⁴d by a lon⁶⁴r r⁴v⁸⁴w o⁵ t⁷⁴ psyc⁷olo⁶y l⁸t⁴ratur⁴ bas⁴d on two ov⁴rarc⁷⁸n⁶ works by ⁴lla ⁴Paulo and ⁷⁴r coll⁴a⁶u⁴s, ⁴Paulo ⁴t al. [00] and Ald⁴rt Vr⁸⁹ [00], as w⁴ll as work ⁸n appl⁸⁴d psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴. ⁷apt⁴r d⁴als w⁸t⁷ sourc⁴s o⁵ d⁴c⁴pt⁸v⁴ v⁴rbal b⁴⁷av⁸or, pr⁸mar⁸ly ⁸n t⁷⁴ “r⁴al world.” ⁴ ⁷⁴art o⁵ t⁷⁴ book, ⁷apt⁴r , cons⁸d⁴rs ⁸ssu⁴s ⁸nvolv⁴d ⁸n d⁴s⁸⁶n⁸n⁶ an NLP ⁴xp⁴r⁸m⁴nt to t⁴st a d⁴c⁴pt⁸on syst⁴m and ⁴xam⁸n⁴s t⁷⁴ curr⁴nt syst⁴ms t⁷at ⁷av⁴ b⁴⁴n bu⁸lt to d⁴t⁴ct d⁴c⁴pt⁸on, compar⁸n⁶ t⁷⁴ m⁴t⁷ods and t⁷⁴ r⁴sults o⁵ t⁷⁴s⁴ syst⁴ms. ⁷apt⁴r cons⁸d⁴rs op⁴n r⁴s⁴arc⁷ qu⁴st⁸ons and ⁵utur⁴ d⁸r⁴ct⁸ons.
APTR
e Background Literature on Behavioral Cues to Deception 2.1
INTRODUCTION
As m⁴nt⁸on⁴d ⁸n ⁷apt⁴r , t⁷⁴r⁴ ⁸s a lon⁶ ⁷⁸story o⁵ att⁴mpts to l⁸nk ly⁸n⁶ to m⁴asurabl⁴ ⁴⁵⁵⁴cts on t⁷⁴ l⁸ar. ⁴ ⁴ff⁴cts cons⁸d⁴r⁴d ⁷av⁴ b⁴⁴n p⁷ys⁸olo⁶⁸cal ⁸n natur⁴—⁵or ⁴xampl⁴, c⁷an⁶⁴s ⁸n blood pr⁴ssur⁴ or vocal p⁸tc⁷. ⁴y ⁷av⁴ also b⁴⁴n co⁶n⁸t⁸v⁴, as ar⁴ sp⁴⁴c⁷ d⁸sflu⁴nc⁸⁴s and r⁴p⁴t⁸t⁸ons. mot⁸onal ⁴ff⁴cts suc⁷ as n⁴⁶at⁸v⁴ aff⁴ct and v⁴rbal unc⁴rta⁸nty ⁷av⁴ also b⁴⁴n ⁴xam⁸n⁴d. ot⁷ t⁷⁴ co⁶n⁸t⁸v⁴ and ⁴mot⁸onal ⁴ff⁴cts can b⁴ conn⁴ct⁴d d⁸r⁴ctly to lan⁶ua⁶⁴, as ar⁴ t⁷⁴ ⁴xampl⁴s ⁶⁸v⁴n ⁷⁴r⁴. ⁴ p⁷ys⁸olo⁶⁸cal ⁴ff⁴cts ar⁴ at som⁴ r⁴mov⁴ ⁵rom lan⁶ua⁶⁴, t⁷ou⁶⁷ vocal c⁷an⁶⁴s ar⁴ mor⁴ clos⁴ly conn⁴ct⁴d to ⁸t. W⁴ ⁸nclud⁴ t⁷⁴m ⁷⁴r⁴ to prov⁸d⁴ a back⁶round to t⁷⁴ psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴ l⁸t⁴ratur⁴, w⁷⁸c⁷ ⁴xam⁸n⁴s all t⁷r⁴⁴ typ⁴s o⁵ ⁴ff⁴cts. ⁴r⁴ ar⁴ also curr⁴nt att⁴mpts to l⁸nk t⁷⁴ v⁴rbal ⁴ff⁴cts w⁸t⁷ t⁷⁴ p⁷ys⁸olo⁶⁸cal, w⁷⁸c⁷ w⁴ d⁸scuss br⁸⁴fly ⁸n S⁴ct⁸on . o⁵ ⁷apt⁴r . ⁸nally, t⁷⁴ d⁸fficult⁸⁴s conn⁴ct⁸n⁶ p⁷ys⁸olo⁶⁸cal b⁴⁷av⁸ors to d⁴c⁴pt⁸on d⁴monstrat⁴ t⁷at t⁷⁴ probl⁴m o⁵ ⁸d⁴nt⁸⁵y⁸n⁶ t⁷⁴ l⁸⁴, by any m⁴ans, ⁸s ⁵ar ⁵rom solv⁴d. As ⁵or t⁷⁴ co⁶n⁸t⁸v⁴ and ⁴mot⁸onal ⁴ff⁴cts, t⁷⁴r⁴ ⁸s a r⁸c⁷ trad⁸t⁸on, dat⁸n⁶ ⁵rom Und⁴utsc⁷, o⁵ t⁷⁴ study o⁵ d⁴c⁴pt⁸v⁴ b⁴⁷av⁸ors ⁸n ⁴xp⁴r⁸m⁴ntal psyc⁷olo⁶y, w⁷⁴r⁴ data ⁸s obta⁸n⁴d by ⁴xp⁴r⁸m⁴ntat⁸on w⁸t⁷ sub⁹⁴cts ⁸n laboratory s⁴tt⁸n⁶s. Anot⁷⁴r, mor⁴ r⁴c⁴nt, t⁷r⁴ad o⁵ stud⁸⁴s o⁵ b⁴⁷av⁸ors l⁸nk⁴d to d⁴c⁴pt⁸on com⁴s ⁵rom t⁷⁴ appl⁸⁴d psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴ l⁸t⁴ratur⁴, w⁷⁴r⁴ data ⁸s coll⁴ct⁴d post ⁷oc ⁵rom pol⁸c⁴ ⁸nt⁴rv⁸⁴ws, court t⁴st⁸mony, ⁸nt⁴rv⁸⁴ws, and t⁷⁴ l⁸k⁴. ⁴ ram⁸ficat⁸ons o⁵ ⁴ac⁷ typ⁴ o⁵ data coll⁴ct⁸on ar⁴ d⁸scuss⁴d ⁸n ⁷apt⁴r ⁷⁴r⁴ w⁴ ⁴xam⁸n⁴ t⁷⁴ l⁸t⁴ratur⁴ ⁸n t⁷⁴ t⁷r⁴⁴ trad⁸t⁸ons, r⁴v⁸⁴w⁸n⁶ t⁷⁴ typ⁴s o⁵ cu⁴s, w⁸t⁷ an ⁴mp⁷as⁸s on v⁴rbal cu⁴s, t⁷at ⁷av⁴ b⁴⁴n stud⁸⁴d.
2.2
NONVERBAL CUES TO DECEPTION
Nonv⁴rbal cu⁴s occur ⁸nd⁴p⁴nd⁴nt o⁵ lan⁶ua⁶⁴. ⁴y ⁸nclud⁴ p⁷ys⁸olo⁶⁸cal act⁸v⁸ty, vocal⁸zat⁸ons, and mov⁴m⁴nts o⁵ t⁷⁴ ⁵ac⁴ and body. ⁴ l⁸st ⁸n Tabl⁴ ., w⁷⁸l⁴ not ⁴x⁷aust⁸v⁴, ⁸d⁴nt⁸fi⁴s t⁷⁴ pr⁸mary nonv⁴rbal cu⁴s c⁸t⁴d ⁸n t⁷⁴ l⁸⁴ d⁴t⁴ct⁸on l⁸t⁴ratur⁴ and t⁷⁴ pr⁸mary mann⁴r o⁵ d⁴t⁴ct⁸on ⁵or ⁴ac⁷.¹ ¹S⁴⁴ t⁴xt ⁵or r⁴⁵⁴r⁴nc⁴s on r⁴sults ⁶r⁴at⁴r t⁷an c⁷anc⁴.
7
8
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
Table 2.1: L⁸st o⁵ pr⁸mary nonv⁴rbal cu⁴s
u⁴ Typ⁴
r⁴sp⁸rat⁸on, ⁴l⁴ctrod⁴rmal act⁸v⁸ty, blood pr⁴ssur⁴ laryn⁶⁴al ⁵r⁴qu⁴nc⁸⁴s p⁷ys⁸olo⁶⁸cal
vocal
⁵ac⁴/body mov⁴m⁴nts
unknown (propr⁸⁴tary) ⁵ac⁸al blood flow ⁴l⁴ctr⁸cal bra⁸n wav⁴s bra⁸n blood flow vo⁸c⁴ ⁵0, fill⁴d paus⁴, s⁸l⁴nt paus⁴, d⁸sflu⁴nc⁸⁴s, ⁴tc. m⁸cro⁴xpr⁴ss⁸ons, pup⁸l d⁸lat⁸on, fin⁶⁴r tapp⁸n⁶, ⁴tc.
u⁴ ⁴- P⁷ys⁸cal t⁴ct⁸on ontact poly⁶rap⁷ y⁴s
R⁴sults c⁷anc⁴ y⁴s
omm⁴rc⁸al⁸z⁴d
vo⁸c⁴ str⁴ss analyz⁴r lay⁴r⁴d vo⁸c⁴ analys⁸s t⁷⁴rmo⁶rap⁷y
no
no
y⁴s
no
no
y⁴s
no
y⁴s
no
y⁴s
y⁴s
y⁴s
⁵MR
y⁴s
y⁴s
y⁴s
p⁸tc⁷ analyz⁴r, sp⁴⁴c⁷ ⁴d⁸tor, manual analys⁸s
no
y⁴s
no
v⁸d⁴o r⁴cord⁴r, manual analys⁸s
no
y⁴s
no
y⁴s
2.2. NONVERBAL CUES TO DECEPTION
O⁵ t⁷⁴ cu⁴s ⁸n t⁷⁸s l⁸st, p⁷ys⁸olo⁶⁸cal ⁸nd⁸cators ar⁴ p⁴r⁷aps t⁷⁴ most ⁵am⁸l⁸ar b⁴caus⁴ o⁵ t⁷⁴⁸r r⁴l⁸anc⁴ on sp⁴c⁸al⁸z⁴d t⁴c⁷nolo⁶⁸⁴s—poly⁶rap⁷, ⁵MR, ⁴l⁴ctro⁴nc⁴p⁷alo⁶rap⁷y (), ⁴tc.—t⁷at ⁷av⁴ allow⁴d t⁷⁴ d⁴v⁴lopm⁴nt o⁵ syst⁴ms curr⁴ntly us⁴d ⁸n t⁷r⁴at ass⁴ssm⁴nt, cr⁸m⁸nal ⁸nv⁴st⁸⁶at⁸on, and ⁵⁴d⁴ral ⁴mploy⁴⁴ scr⁴⁴n⁸n⁶. Our r⁴v⁸⁴w b⁴⁶⁸ns w⁸t⁷ t⁷⁸s class o⁵ nonv⁴rbal cu⁴s. P⁷ys⁸olo⁶⁸cal cu⁴s and t⁴c⁷nolo⁶⁸⁴s ⁵all ⁸nto ⁵our ma⁸n cat⁴⁶or⁸⁴s poly⁶rap⁷y, vo⁸c⁴ analys⁸s, ⁵ac⁸al t⁷⁴rmo⁶rap⁷y and bra⁸n scans. All assum⁴ t⁷at ly⁸n⁶ ⁸s a str⁴ss⁵ul act⁸v⁸ty t⁷at tr⁸⁶⁶⁴rs m⁴asurabl⁴ c⁷an⁶⁴s ⁸n t⁷⁴ act⁸v⁸ty o⁵ som⁴ p⁷ys⁸olo⁶⁸cal syst⁴m. W⁸t⁷ t⁷⁴ ⁴xc⁴pt⁸on o⁵ vo⁸c⁴ str⁴ss and lay⁴r⁴d vo⁸c⁴ analys⁸s, t⁷⁴ propos⁴d p⁷ys⁸olo⁶⁸cal cu⁴s and r⁴lat⁴d t⁴c⁷nolo⁶⁸⁴s prov⁸d⁴ at l⁴ast w⁴ak support ⁵or t⁷⁸s ⁸d⁴a.
2.2.1 POLYGRAPHY Poly⁶rap⁷y ⁸s t⁷⁴ old⁴st and b⁴st ⁴stabl⁸s⁷⁴d t⁴c⁷nolo⁶y ⁵or assoc⁸at⁸n⁶ p⁷ys⁸olo⁶⁸cal act⁸v⁸ty w⁸t⁷ d⁴c⁴pt⁸on-⁸nduc⁴d str⁴ss. xam⁸n⁴⁴s ar⁴ attac⁷⁴d to at l⁴ast t⁷r⁴⁴ k⁸nds o⁵ p⁷ys⁸olo⁶⁸cal data s⁴nsors blood pr⁴ssur⁴ cuff, ⁴l⁴ctrod⁴rmal s⁴nsor, and r⁴sp⁸rat⁸on s⁴nsors pos⁸t⁸on⁴d on t⁷⁴ c⁷⁴st and abdom⁴n [Am⁴r⁸can Poly⁶rap⁷ Assoc⁸at⁸on, 00]. ⁴ t⁴st ⁸ts⁴l⁵ ⁸s usually ⁴mb⁴dd⁴d ⁸n a lon⁶⁴r ⁸nt⁴rv⁸⁴w t⁷at may last ⁵or as lon⁶ as ⁵our ⁷ours. At c⁴rta⁸n po⁸nts ⁸n t⁷⁴ ⁸nt⁴rv⁸⁴w t⁷⁴ poly⁶rap⁷⁴r w⁸ll ask a s⁴r⁸⁴s o⁵ y⁴s/no qu⁴st⁸ons. Som⁴ o⁵ t⁷⁴s⁴ ar⁴ ⁸nt⁴nd⁴d to ⁴l⁸c⁸t p⁷ys⁸olo⁶⁸cal stat⁴s t⁷at prov⁸d⁴ bas⁴l⁸n⁴ m⁴asur⁴m⁴nts, ot⁷⁴rs ar⁴ ⁸nt⁴nd⁴d to ⁴l⁸c⁸t d⁴partur⁴s ⁵rom t⁷⁴ bas⁴l⁸n⁴ t⁷at ⁸nd⁸cat⁴ an ⁴mot⁸onally arous⁴d stat⁴. Arous⁴d stat⁴s pr⁴sumably ⁴ncod⁴ a fl⁸⁶⁷t ⁸nst⁸nct ⁸nd⁸cat⁸v⁴ o⁵ d⁴c⁴pt⁸on. valuat⁸n⁶ t⁷⁴ poly⁶rap⁷’s p⁴r⁵ormanc⁴ d⁴p⁴nds on ⁸ssu⁴s t⁷at ⁷av⁴ l⁸ttl⁴ to do w⁸t⁷ poly⁶rap⁷ t⁴c⁷nolo⁶y. A commonly not⁴d compl⁸cat⁸on ⁸s t⁷at t⁷⁴ p⁷ys⁸olo⁶⁸cal stat⁴s t⁷at may ⁸nd⁸cat⁴ d⁴c⁴pt⁸on o⁵t⁴n ar⁸s⁴ w⁷⁴n d⁴c⁴pt⁸on ⁸s abs⁴nt [Nat⁸onal R⁴s⁴arc⁷ ounc⁸l, 00, Sax⁴ ⁴t al., ]. n add⁸t⁸on, t⁴st outcom⁴s, m⁴asur⁴d by succ⁴ss ⁸n ⁸d⁴nt⁸⁵y⁸n⁶ d⁴c⁴pt⁸v⁴ and trut⁷⁵ul sub⁹⁴cts, d⁴p⁴nd lar⁶⁴ly on t⁷⁴ sk⁸ll o⁵ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r, w⁷o us⁴s t⁷⁴ poly⁶rap⁷ as an ⁸nt⁴rro⁶at⁸on tool, and on c⁷aract⁴r⁸st⁸cs o⁵ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴⁴, w⁷o may b⁴ su⁶⁶⁴st⁸v⁴, anx⁸ous, and ⁸n⁴xp⁴r⁸⁴nc⁴d [Sax⁴ ⁴t al., ]. ⁸nally, poly⁶rap⁷s ar⁴ w⁴ll known to b⁴ vuln⁴rabl⁴ to count⁴rm⁴asur⁴s, t⁴c⁷n⁸qu⁴s t⁷⁴ ⁸nt⁴rv⁸⁴w⁴⁴ can us⁴ to d⁴l⁸b⁴rat⁴ly alt⁴r p⁷ys⁸olo⁶⁸cal stat⁴s, mak⁸n⁶ ⁸t poss⁸bl⁴ ⁵or a d⁴c⁴pt⁸v⁴ ⁸nt⁴rv⁸⁴w⁴⁴ to app⁴ar trut⁷⁵ul. Vr⁸⁹ [00] and t⁷⁴ Nat⁸onal R⁴s⁴arc⁷ ounc⁸l R⁴port [00] ra⁸s⁴ anot⁷⁴r conc⁴rn standard⁸z⁴d m⁴t⁷ods ⁵or r⁴pr⁴s⁴nt⁸n⁶ and scor⁸n⁶ poly⁶rap⁷ data ar⁴ d⁸fficult to ⁵ormulat⁴ and ⁷av⁴ y⁴t to b⁴ d⁴v⁴lop⁴d. ⁴nc⁴ t⁷⁴r⁴ ⁸s no cons⁸st⁴nt way to t⁴ll ⁸⁵ ⁵a⁸lur⁴ or succ⁴ss o⁵ a poly⁶rap⁷ t⁴st ⁸s du⁴ to p⁷ys⁸olo⁶⁸cal m⁴asur⁴m⁴nt or to t⁷⁴ ⁸mpr⁴ss⁸ons and ⁴xp⁴r⁸⁴nc⁴ o⁵ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r. Not surpr⁸s⁸n⁶ly, r⁴ports o⁵ poly⁶rap⁷ accuracy vary w⁸d⁴ly. ⁴ r⁴v⁸⁴w o⁵ poly⁶rap⁷ stud⁸⁴s by Sax⁴ ⁴t al. [] c⁸t⁴s r⁴sults o⁵ fi⁴ld stud⁸⁴s ⁸n w⁷⁸c⁷ corr⁴ct ⁶u⁸lty d⁴c⁸s⁸ons ran⁶⁴d ⁵rom 0.–.% and corr⁴ct ⁸nnoc⁴nt d⁴c⁸s⁸ons ran⁶⁴d ⁵rom .–%. ⁴ NR r⁴port conclud⁴s “⁸n populat⁸ons o⁵ ⁴xam⁸n⁴⁴s suc⁷ as t⁷os⁴ r⁴pr⁴s⁴nt⁴d ⁸n t⁷⁴ poly⁶rap⁷ r⁴s⁴arc⁷ l⁸t⁴ratur⁴, untra⁸n⁴d ⁸n count⁴rm⁴asur⁴s, sp⁴c⁸fic-⁸nc⁸d⁴nt poly⁶rap⁷ t⁴sts ⁵or ⁴v⁴nt-sp⁴c⁸fic ⁸nv⁴st⁸⁶at⁸ons can d⁸scr⁸m⁸-
9
10
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
nat⁴ ly⁸n⁶ ⁵rom trut⁷ t⁴ll⁸n⁶ at rat⁴s w⁴ll abov⁴ c⁷anc⁴, t⁷ou⁶⁷ w⁴ll b⁴low p⁴r⁵⁴ct⁸on” [Nat⁸onal R⁴s⁴arc⁷ ounc⁸l, 00, p. ]. ⁴sp⁸t⁴ t⁷⁴s⁴ cr⁸t⁸c⁸sms, t⁷⁴ abs⁴nc⁴ o⁵ ot⁷⁴r v⁸abl⁴ alt⁴rnat⁸v⁴s mak⁴s t⁷⁴ poly⁶rap⁷ a w⁸d⁴ly us⁴d t⁴c⁷n⁸qu⁴ ⁵or d⁴t⁴ct⁸n⁶ d⁴c⁴pt⁸on. R⁴c⁴nt c⁷an⁶⁴s ⁸n standards o⁵ ⁴v⁸d⁴nc⁴ ⁷av⁴ l⁴d s⁴v⁴ral stat⁴s to adm⁸t poly⁶rap⁷ r⁴sults ⁸nto ⁴v⁸d⁴nc⁴ [Am⁴r⁸can Poly⁶rap⁷ Assoc⁸at⁸on, 00].
2.2.2 VOICE ANALYSIS: VSA AND LVA Vo⁸c⁴ analys⁸s cu⁴s d⁴p⁴nd on d⁴t⁴ctabl⁴ ⁵r⁴qu⁴nc⁸⁴s produc⁴d by t⁷⁴ body dur⁸n⁶ sp⁴⁴c⁷. Two comp⁴t⁸n⁶ m⁴t⁷ods ⁷av⁴ b⁴⁴n ⁸mpl⁴m⁴nt⁴d ⁵or l⁸⁴ d⁴t⁴ct⁸on Vo⁸c⁴ Str⁴ss Analys⁸s (VSA) and Lay⁴r⁴d Vo⁸c⁴ Analys⁸s (LVA). ot⁷ ar⁴ ava⁸labl⁴ as comm⁴rc⁸al products t⁷at ar⁴ popular w⁸t⁷ law ⁴n⁵orc⁴m⁴nt pro⁵⁴ss⁸onals. VSA t⁴c⁷nolo⁶y ⁸s bas⁴d on t⁷⁴ t⁷⁴ory t⁷at all muscl⁴s ⁸n t⁷⁴ body, ⁸nclud⁸n⁶ t⁷os⁴ o⁵ t⁷⁴ larynx, v⁸brat⁴ at a rat⁴ o⁵ – z [L⁸ppold, ]. ⁴s⁴ ⁸naud⁸bl⁴ m⁸crotr⁴mors ar⁴ suppr⁴ss⁴d w⁷⁴n a sp⁴ak⁴r ⁴xp⁴r⁸⁴nc⁴s str⁴ss. VSA sp⁴c⁸al⁸sts and v⁴ndors cla⁸m t⁷at t⁷⁴⁸r t⁴c⁷nolo⁶y ⁸s capabl⁴ o⁵ d⁴t⁴ct⁸n⁶ and m⁴asur⁸n⁶ var⁸at⁸ons ⁸n laryn⁶⁴al m⁸crotr⁴mor ⁵r⁴qu⁴nc⁸⁴s. ⁴y ⁵urt⁷⁴r cla⁸m t⁷at t⁷⁴s⁴ var⁸at⁸ons ar⁴ assoc⁸at⁴d w⁸t⁷ arous⁴d stat⁴s t⁷at ⁸nd⁸cat⁴ d⁴c⁴pt⁸on. A VSA mac⁷⁸n⁴ ⁸s ⁴ss⁴nt⁸ally a comput⁴r w⁸t⁷ VSA so⁵twar⁴ t⁷at ost⁴ns⁸bly r⁴cords laryn⁶⁴al m⁸crotr⁴mor patt⁴rns. ⁴ VSA mac⁷⁸n⁴ may b⁴ us⁴d ⁸n a r⁴al-t⁸m⁴ ⁸nt⁴rv⁸⁴w or ⁸t may proc⁴ss pr⁴-r⁴cord⁴d sp⁴⁴c⁷. S⁴v⁴ral r⁴s⁴arc⁷⁴rs ⁷av⁴ ⁴valuat⁴d VSA d⁴v⁸c⁴s ⁸n laboratory ⁴xp⁴r⁸m⁴nts [addad ⁴t al., 00, orvat⁷, ] and fi⁴ld t⁴sts [amp⁷ouss⁴, 00]. ⁴s⁴ stud⁸⁴s ⁷av⁴ ⁵a⁸l⁴d to confirm t⁷at m⁸crotr⁴mors ⁴x⁸st or t⁷at VSA t⁴c⁷nolo⁶⁸⁴s can d⁴t⁴ct t⁷⁴m, alt⁷ou⁶⁷ t⁷⁴r⁴ ⁸s som⁴ a⁶r⁴⁴m⁴nt w⁸t⁷ addad’s [00, p. ] conclus⁸on t⁷at VSAs ar⁴ m⁴asur⁸n⁶ som⁴t⁷⁸n⁶, but not m⁸crotr⁴mors. Mor⁴ov⁴r, d⁴sp⁸t⁴ VSA’s popular⁸ty ⁸n law ⁴n⁵orc⁴m⁴nt or⁶an⁸zat⁸ons, t⁷⁴ t⁴sts o⁵ VSA syst⁴ms ⁷av⁴ ⁵a⁸l⁴d to s⁷ow t⁷at VSA d⁴v⁸c⁴s p⁴r⁵orm at a l⁴v⁴l abov⁴ c⁷anc⁴. Lay⁴r⁴d Vo⁸c⁴ Analys⁸s (LVA) ⁸s d⁴v⁴lop⁴d and mark⁴t⁴d by N⁴m⁴sysco. LVA do⁴s not us⁴ laryn⁶⁴al m⁸crotr⁴mors but r⁴l⁸⁴s ⁸nst⁴ad on an undocum⁴nt⁴d s⁸⁶nal proc⁴ss⁸n⁶ al⁶or⁸t⁷m t⁷at ⁴mploys a “propr⁸⁴tary s⁴t o⁵ vocal param⁴t⁴rs ... n⁴w to t⁷⁴ world o⁵ p⁷on⁴t⁸cs” (www.neme sysco.com). ⁴ d⁴scr⁸pt⁸on o⁵ LVA t⁴c⁷nolo⁶y ⁸s too ⁸nad⁴quat⁴ to support ⁴valuat⁸on o⁵ ⁸ts t⁷⁴or⁴t⁸cal bas⁸s. ⁸s l⁴av⁴s p⁴r⁵ormanc⁴ ⁴valuat⁸ons, w⁷⁸c⁷ ⁷av⁴ ⁵a⁸l⁴d to prov⁸d⁴ ⁴v⁸d⁴nc⁴ t⁷at LVA p⁴r⁵orms b⁴tt⁴r t⁷an c⁷anc⁴ ⁸n laboratory t⁴sts [arnsb⁴r⁶⁴r ⁴t al., 00] and fi⁴ld tr⁸als [orvat⁷ ⁴t al., 0]. ⁴sp⁸t⁴ t⁷⁴ poor p⁴r⁵ormanc⁴ r⁴sults, law ⁴n⁵orc⁴m⁴nt pro⁵⁴ss⁸onals ⁷av⁴ r⁴port⁴d ⁶r⁴at succ⁴ss ⁸n us⁸n⁶ LVA and VSA mac⁷⁸n⁴s to solv⁴ cr⁸m⁴s [addad ⁴t al., 00]. [orvat⁷ ⁴t al., 0, p. 0] sp⁴culat⁴ t⁷at t⁷⁴ r⁴port⁴d succ⁴ss by fi⁴ld pract⁸t⁸on⁴rs com⁴s not ⁵rom t⁷⁴ valu⁴ o⁵ t⁷⁴ LVA, but rat⁷⁴r ⁵rom op⁴rators’ ab⁸l⁸ty to “r⁴ad” t⁷⁴ cu⁴s ⁸n⁷⁴r⁴nt ⁸n an ⁸nt⁴rv⁸⁴w⁴⁴’s b⁴⁷av⁸or t⁷⁴⁸r ton⁴ o⁵ vo⁸c⁴, ass⁴rt⁸v⁴n⁴ss, d⁸r⁴ctn⁴ss, naturaln⁴ss, and so ⁵ort⁷. n ot⁷⁴r words, as w⁸t⁷ t⁷⁴ poly⁶rap⁷, t⁷⁴s⁴ d⁴v⁸c⁴s succ⁴⁴d not on t⁷⁴⁸r own but only w⁷⁴n us⁴d as support⁸n⁶ tools ⁸n t⁷⁴ ⁷ands o⁵ a sk⁸ll⁵ul and ⁴xp⁴r⁸⁴nc⁴d ⁸nt⁴rv⁸⁴w⁴r.
2.2. NONVERBAL CUES TO DECEPTION
11
2.2.3 THERMOGRAPHY ⁴rmal ⁸ma⁶⁸n⁶ works by us⁸n⁶ ⁷⁴at d⁴t⁴ct⁸n⁶ cam⁴ras to ⁸d⁴nt⁸⁵y warm⁸n⁶ patt⁴rns around a sub⁹⁴ct’s ⁴y⁴s. Warm⁸n⁶ patt⁴rns ar⁴ ⁵orm⁴d as a p⁷ys⁸olo⁶⁸cal r⁴spons⁴ to str⁴ss blood flow to t⁷⁴ ar⁴a around t⁷⁴ ⁴y⁴s ⁸s ⁸ncr⁴as⁴d, cr⁴at⁸n⁶ ⁸ncr⁴as⁴d warmt⁷. Pavl⁸d⁸s ⁴t al. [00] cla⁸m⁴d t⁷at ⁸t ⁸s poss⁸bl⁴ to ⁸d⁴nt⁸⁵y a “t⁷⁴rmal s⁸⁶natur⁴” cons⁸st⁸n⁶ o⁵ blood flow patt⁴rns ⁸nd⁸cat⁸v⁴ o⁵ d⁴c⁴pt⁸on and t⁷at t⁷⁴s⁴ patt⁴rns could b⁴ us⁴d to ⁸d⁴nt⁸⁵y d⁴c⁴pt⁸v⁴ sub⁹⁴cts w⁸t⁷ “an accuracy comparabl⁴ to t⁷at o⁵ poly⁶rap⁷ ⁴xam⁸nat⁸on.” ⁴ app⁴al o⁵ t⁷⁸s approac⁷ ⁸s t⁷at ⁸t off⁴rs t⁷⁴ poss⁸b⁸l⁸ty o⁵ ⁸d⁴nt⁸⁵y⁸n⁶ d⁴c⁴pt⁸on w⁸t⁷out t⁷⁴ n⁴⁴d ⁵or ⁸nt⁴rv⁸⁴ws or p⁷ys⁸cal contact. ⁴nc⁴, ⁸t would s⁴⁴m to ⁷old ⁶r⁴at prom⁸s⁴ ⁵or a⁸rport and bord⁴r cross⁸n⁶ appl⁸cat⁸ons as su⁶⁶⁴st⁴d by Warm⁴l⁸nk ⁴t al. [0] and Vr⁸⁹ ⁴t al. [00]. Laboratory stud⁸⁴s o⁵ t⁷⁴rmal ⁸ma⁶⁸n⁶ s⁷ow som⁴ support ⁵or ⁵ac⁸al t⁷⁴rmo⁶rap⁷y patt⁴rns as an ⁸nd⁸cator o⁵ d⁴c⁴pt⁸on. n t⁷⁴ 0 a⁸rport study by Warm⁴l⁸nk, t⁷⁴rmal ⁸ma⁶⁸n⁶ mana⁶⁴d to ⁸d⁴nt⁸⁵y l⁸ars % o⁵ t⁷⁴ t⁸m⁴ and trut⁷t⁴ll⁴rs % o⁵ t⁷⁴ t⁸m⁴, a rat⁴ t⁷at t⁷⁴y cla⁸m ⁸s too low ⁵or a⁸rport scr⁴⁴n⁸n⁶, ⁴sp⁴c⁸ally ⁶⁸v⁴n t⁷at ⁸nt⁴rv⁸⁴w⁴rs work⁸n⁶ w⁸t⁷out t⁷⁴rmo⁶rap⁷y p⁴r⁵orm⁴d s⁸⁶n⁸ficantly b⁴tt⁴r on t⁷⁴ d⁸scr⁸m⁸nat⁸on task. R⁴sults o⁵ t⁴sts by Poll⁸na ⁴t al. [00] su⁶⁶⁴st a stron⁶⁴r l⁸nk b⁴tw⁴⁴n ⁵ac⁸al ⁷⁴at d⁸splays and d⁴c⁴pt⁸on. ⁴y conclud⁴, ⁷ow⁴v⁴r, t⁷at t⁷⁴ status o⁵ t⁷⁴rmo⁶rap⁷y r⁴ma⁸ns uncl⁴ar “⁴ ⁴xt⁴nt to w⁷⁸c⁷ t⁷⁴rmo⁶rap⁷y w⁸ll ⁸ncr⁴as⁴ accuracy b⁴yond t⁷at w⁷⁸c⁷ ⁸s poss⁸bl⁴ us⁸n⁶ trad⁸t⁸onal poly⁶rap⁷ m⁴asur⁴s ⁸s not y⁴t known” (p. ). 2.2.4 BRAIN SCAN: EEG AND MRI l⁴ctro⁴nc⁴p⁷alo⁶rap⁷y () and ⁵unct⁸onal ma⁶n⁴t⁸c r⁴sonanc⁴ ⁸ma⁶⁸n⁶ (⁵MR) ar⁴ w⁴ll⁴stabl⁸s⁷⁴d t⁴c⁷nolo⁶⁸⁴s ⁵or t⁷⁴ m⁴asur⁴m⁴nt o⁵ bra⁸n act⁸v⁸ty. ⁴⁸r r⁴l⁴vanc⁴ to d⁴c⁴pt⁸on analys⁸s l⁸⁴s ⁸n t⁷⁴ d⁴monstrat⁴d ab⁸l⁸ty o⁵ and ⁵MR m⁴asur⁴m⁴nts to d⁸st⁸n⁶u⁸s⁷ t⁷⁴ bra⁸n’s r⁴spons⁴s to known ⁸n⁵ormat⁸on ⁵rom r⁴spons⁴s to nov⁴l ⁸n⁵ormat⁸on. ⁴nc⁴, t⁷⁴s⁴ t⁴c⁷nolo⁶⁸⁴s could b⁴ us⁴d to d⁴t⁴rm⁸n⁴, ⁵or ⁴xampl⁴, w⁷⁴t⁷⁴r a susp⁴ct ⁷as sp⁴c⁸al knowl⁴d⁶⁴ o⁵ a cr⁸m⁴ t⁷at only a ⁶u⁸lty p⁴rson would ⁷av⁴. ⁴r⁴ ⁴x⁸sts an ⁴xt⁴ns⁸v⁴ r⁴s⁴arc⁷ l⁸t⁴ratur⁴ on m⁴mory and /⁵MR as w⁴ll as s⁸⁶n⁸ficant comm⁴rc⁸al d⁴v⁴lopm⁴nt ⁸n t⁷⁴ Un⁸t⁴d Stat⁴s (www.brainwavescience.com) and nd⁸a (www.axxonet.com). s r⁴cord bra⁸n wav⁴s call⁴d ⁴v⁴nt r⁴lat⁴d pot⁴nt⁸als (RPs). ⁴ RP m⁴mory r⁴co⁶n⁸t⁸on r⁴spons⁴ ⁸s t⁷⁴ P00, so nam⁴d b⁴caus⁴ t⁷⁴ r⁴spons⁴ usually occurs 00–00 ms a⁵t⁴r ⁸n⁵ormat⁸on r⁴lat⁸n⁶ to t⁷⁴ m⁴mory ⁸s pr⁴s⁴nt⁴d. ⁴ P00 r⁴spons⁴ ⁵a⁸ls to occur ⁸⁵ t⁷⁴ ⁸n⁵ormat⁸on ⁸s un⁵am⁸l⁸ar and ⁷⁴nc⁴ may b⁴ v⁸⁴w⁴d as a d⁸a⁶nost⁸c to d⁴t⁴rm⁸n⁴ ⁸⁵ a sub⁹⁴ct ⁷as ⁴xp⁴r⁸⁴nt⁸al knowl⁴d⁶⁴ o⁵ som⁴ ⁴v⁴nt. ⁴ m⁴asur⁴m⁴nt o⁵ P00 r⁴spons⁴s ⁵orms t⁷⁴ ⁷⁴art o⁵ comm⁴rc⁸al⁸z⁴d /P00 syst⁴ms t⁷at cla⁸m to d⁴t⁴ct m⁴mor⁸⁴s ⁸nd⁸cat⁸n⁶ a susp⁴ct’s ⁶u⁸lt. Two o⁵ t⁷⁴ stron⁶⁴st c⁷all⁴n⁶⁴s to /P00 syst⁴ms com⁴ ⁵rom stud⁸⁴s o⁵ ⁵als⁴ m⁴mor⁸⁴s and count⁴rm⁴asur⁴s. All⁴n and M⁴rt⁴ns [00] ⁵ound t⁷at sub⁹⁴cts’ RP r⁴spons⁴s ⁵a⁸l⁴d to s⁷ow a d⁸st⁸nct⁸on b⁴tw⁴⁴n tru⁴ r⁴coll⁴ct⁸ons and ⁵als⁴ r⁴coll⁴ct⁸ons t⁷at w⁴r⁴ ⁸mplant⁴d by assoc⁸at⁸on w⁸t⁷ tru⁴ m⁴mor⁸⁴s, op⁴n⁸n⁶ up t⁷⁴ poss⁸b⁸l⁸ty o⁵ d⁴⁴m⁸n⁶ an ⁸nnoc⁴nt p⁴rson ⁶u⁸lty. n a study o⁵ count⁴rm⁴asur⁴s, ⁴r⁶ström
12
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
⁴t al. [0] w⁴r⁴ abl⁴ to d⁴monstrat⁴ t⁷at, contrary to pr⁴v⁸ous cla⁸ms, RP act⁸v⁸ty can b⁴ d⁴l⁸b⁴rat⁴ly suppr⁴ss⁴d. /P00 t⁴c⁷nolo⁶y, l⁸k⁴ t⁷⁴ poly⁶rap⁷, ⁸s t⁷us vuln⁴rabl⁴ to count⁴rm⁴asur⁴s t⁷at ⁶u⁸lty susp⁴cts may ⁴mploy ⁸n ord⁴r to app⁴ar ⁸nnoc⁴nt. ⁵MRs m⁴asur⁴ blood flow c⁷an⁶⁴s ⁸n t⁷⁴ bra⁸n. ⁴y ⁷av⁴ b⁴⁴n s⁷own to find act⁸v⁸ty ⁸n r⁴⁶⁸ons o⁵ t⁷⁴ bra⁸n t⁷at laboratory t⁴sts assoc⁸at⁴ w⁸t⁷ actual m⁴mor⁸⁴s. ⁴ tw⁴lv⁴ laboratory stud⁸⁴s r⁴v⁸⁴w⁴d by Vr⁸⁹ [00] all ⁵ound t⁷at “ar⁴as assoc⁸at⁴d w⁸t⁷ ⁸n⁷⁸b⁸t⁸on and confl⁸ct r⁴solut⁸on w⁴r⁴ act⁸vat⁴d w⁷⁴n ly⁸n⁶” (p. ). ow⁴v⁴r, Vr⁸⁹ ⁶o⁴s on to obs⁴rv⁴ t⁷at, ⁵or ⁴ac⁷ study, “a som⁴w⁷at d⁸ff⁴r⁴nt bra⁸n act⁸v⁸ty patt⁴rn ⁴m⁴r⁶⁴d as an ⁸nd⁸cator o⁵ d⁴c⁴⁸t” (p. ), su⁶⁶⁴st⁸n⁶ t⁷at a un⁸⁵orm ⁵MR-d⁴t⁴ctabl⁴ cu⁴ was non⁴x⁸st⁴nt ⁸n t⁷⁴ r⁴port⁴d data. abr⁸cat⁴d m⁴mor⁸⁴s cr⁴at⁴ add⁸t⁸onal c⁷all⁴n⁶⁴s. n t⁷⁴⁸r ⁴xp⁴r⁸m⁴nt on m⁴mory d⁴t⁴ct⁸on and ⁵ac⁴s, R⁸ssman ⁴t al. [00] ⁵ound t⁷at bra⁸n act⁸v⁸ty r⁴cord⁴d by ⁵MR ⁵a⁸l⁴d to s⁷ow d⁸ff⁴r⁴nc⁴s b⁴tw⁴⁴n tru⁴ and ⁵als⁴ ⁵ac⁸al m⁴mor⁸⁴s. ⁴sp⁸t⁴ t⁷⁴ doubts about ⁵MR cu⁴s to d⁴c⁴pt⁸on and t⁷⁴ lack o⁵ fi⁴ld ⁴v⁸d⁴nc⁴, ⁵MR ⁵or l⁸⁴ d⁴t⁴ct⁸on ⁷as b⁴⁴n comm⁴rc⁸al⁸z⁴d w⁸t⁷ cla⁸ms o⁵ accurat⁴ l⁸⁴ d⁴ct⁴ct⁸on ran⁶⁸n⁶ ⁵rom 0–% (www.noliemri.com).
2.2.5 VOCAL CUES ⁸s class o⁵ ⁸nd⁸cators compr⁸s⁴s sp⁴⁴c⁷ sounds t⁷at lack s⁴mant⁸c cont⁴nt. ⁴s⁴ sounds ⁸nclud⁴ fill⁴d paus⁴s, s⁸l⁴nt paus⁴s, d⁸sflu⁴nc⁸⁴s, and var⁸at⁸ons ⁸n rat⁴ and p⁸tc⁷. Stud⁸⁴s o⁵ vocal cu⁴s o⁵t⁴n r⁴port a w⁴ak corr⁴lat⁸on w⁸t⁷ trut⁷⁵ul vs. d⁴c⁴pt⁸v⁴ cond⁸t⁸ons but t⁷⁴y also t⁴nd to y⁸⁴ld confl⁸ct⁸n⁶ r⁴sults [⁴Paulo ⁴t al., 00, Vr⁸⁹, 00]. v⁸d⁴nc⁴ ⁵or a r⁴lat⁸on⁷⁸p b⁴tw⁴⁴n r⁸s⁸n⁶ p⁸tc⁷ and d⁴c⁴pt⁸on, ⁷ow⁴v⁴r, ⁸s cons⁸st⁴nt across r⁴ports. or ⁴xampl⁴, ⁸n t⁷⁴⁸r w⁸d⁴ly c⁸t⁴d nurs⁴s’ study, kman and r⁸⁴s⁴n ⁵ound a s⁸⁶n⁸ficant ⁸ncr⁴as⁴ ⁸n p⁸tc⁷ w⁷⁴n t⁷⁴ sub⁹⁴cts r⁴port⁴d ⁵⁴⁴l⁸n⁶ str⁴ss⁴d [kman and r⁸⁴s⁴n, ]. Str⁴⁴t⁴r ⁴t al. [] also docum⁴nts an ⁸ncr⁴as⁴ ⁸n p⁸tc⁷ and ampl⁸tud⁴ ⁵or laboratory sub⁹⁴cts ⁸n a d⁴c⁴pt⁸on task. rac⁸ar⁴na ⁴t al. [00] comb⁸n⁴d vocal cu⁴s, ⁸nclud⁸n⁶ acoust⁸c cu⁴s not usually ⁴xam⁸n⁴d ⁸n d⁴c⁴pt⁸on ⁴xp⁴r⁸m⁴nts, w⁸t⁷ m⁸x⁴d r⁴sults. T⁴sts w⁸t⁷ prosod⁸c ⁵⁴atur⁴s only—p⁸tc⁷, durat⁸on and ⁴n⁴r⁶y—and t⁴sts w⁸t⁷ sp⁴ctral ⁵⁴atur⁴s only y⁸⁴ld⁴d accurac⁸⁴s o⁵ .% and 0%, r⁴sp⁴ct⁸v⁴ly, ⁸n bot⁷ cas⁴s a ⁶a⁸n ov⁴r c⁷anc⁴. lk⁸ns ⁴t al. [0] off⁴r on⁴ o⁵ t⁷⁴ ⁵⁴w fi⁴ld stud⁸⁴s o⁵ vocal cu⁴ ⁴ff⁴ct⁸v⁴n⁴ss. P⁸tc⁷ var⁸at⁸ons and ⁴y⁴ ⁶az⁴ w⁴r⁴ r⁴cord⁴d as part o⁵ a bord⁴r cross⁸n⁶ v⁴nu⁴ ⁸n urop⁴. Alt⁷ou⁶⁷ d⁴ta⁸ls ar⁴ l⁴⁵t va⁶u⁴, lk⁸ns ⁴t al. ⁵ound a s⁸⁶n⁸ficant corr⁴lat⁸on b⁴tw⁴⁴n d⁴partur⁴s ⁵rom t⁷⁴ 0 bas⁴l⁸n⁴ and d⁴c⁴pt⁸on no ⁴ff⁴ct was not⁴d ⁵or ⁴y⁴ ⁶az⁴. Vr⁸⁹ [00, p. ] ar⁶u⁴s t⁷at d⁴c⁴pt⁸on r⁴s⁴arc⁷⁴rs ⁷av⁴ ⁵a⁸l⁴d to prov⁸d⁴ a s⁴t o⁵ “cons⁸st⁴nt and r⁴l⁸abl⁴ nonv⁴rbal cu⁴s,” ⁸n part b⁴caus⁴ t⁷⁴ r⁴s⁴arc⁷ commun⁸ty lacks suffic⁸⁴ntly d⁴ta⁸l⁴d m⁴t⁷ods ⁵or ⁸d⁴nt⁸⁵y⁸n⁶ and m⁴asur⁸n⁶ vocal and ot⁷⁴r nonv⁴rbal cu⁴s. Standard⁸zat⁸on o⁵ data d⁴scr⁸pt⁸ons and ⁴xp⁴r⁸m⁴ntal protocols ar⁴ n⁴⁴d⁴d to ⁴nsur⁴ pro⁶r⁴ss ⁸n t⁷⁸s ar⁴a. or ⁴xampl⁴, d⁸st⁸n⁶u⁸s⁷⁸n⁶ d⁸ff⁴r⁴nt class⁴s o⁵ fill⁴d paus⁴ (uh vs. um), d⁸sflu⁴ncy (word r⁴p⁴t⁸t⁸on, word ⁵ra⁶m⁴nt, ⁵als⁴ start and r⁴pa⁸r) and s⁸l⁴nt paus⁴ (durat⁸on and pos⁸t⁸on ⁸n s⁴nt⁴nc⁴) would undoubt⁴dly ⁷⁴lp to clar⁸⁵y t⁷⁴ contr⁸but⁸on o⁵ ⁴ac⁷ data typ⁴ ⁸n any ⁴xp⁴r⁸m⁴ntal parad⁸⁶m.
2.3. THE PSYCHOLOGY LITERATURE
13
2.2.6 BODY AND FACIAL MOVEMENTS n t⁷⁴⁸r r⁴v⁸⁴w o⁵ nonv⁴rbal d⁴c⁴pt⁸on cu⁴s, ⁴Paulo ⁴t al. ⁵ound pup⁸l d⁸lat⁸on to b⁴ t⁷⁴ most stron⁶ly support⁴d, w⁸t⁷ t⁷⁴ vocal cu⁴s o⁵ p⁸tc⁷ and d⁸sflu⁴ncy clos⁴ b⁴⁷⁸nd [⁴Paulo ⁴t al., 00]. kman [00] and kman and r⁸⁴s⁴n [] s⁷ow⁴d t⁷at bar⁴ly d⁴t⁴ctabl⁴ ⁵ac⁸al ⁴xpr⁴ss⁸ons— m⁸cro⁴xpr⁴s⁸sons—⁴ncod⁴ ⁸n⁵ormat⁸on about ⁴mot⁸onal stat⁴s t⁷at may ⁸nd⁸cat⁴ d⁴c⁴pt⁸on. ⁴s⁴ m⁸cro⁴xpr⁴ss⁸ons ⁷app⁴n qu⁸ckly and ar⁴ n⁴arly ⁸mposs⁸bl⁴ ⁵or p⁴opl⁴ to control, mak⁸n⁶ t⁷⁴m a l⁸k⁴ly sourc⁴ o⁵ val⁸d ⁸n⁵ormat⁸on about ⁴mot⁸onal stat⁴. Y⁴t, as w⁸t⁷ poly⁶rap⁷ and ot⁷⁴r ⁴mot⁸on d⁴t⁴ctors, ⁴v⁸d⁴nc⁴ o⁵ part⁸cular ⁴mot⁸ons may ⁷av⁴ many sourc⁴s ot⁷⁴r t⁷an d⁴c⁴pt⁸on. ata coll⁴ct⁸on ⁵or nonv⁴rbal cu⁴s ⁸s a d⁸fficult task. or ⁴xampl⁴, m⁸cro⁴xpr⁴ss⁸ons, w⁷⁸c⁷ ar⁴ fl⁴⁴t⁸n⁶ and subtl⁴, must b⁴ stud⁸⁴d w⁸t⁷ a ⁵ram⁴ by ⁵ram⁴ analys⁸s o⁵ v⁸d⁴o—a labor⁸ous and ⁴xp⁴ns⁸v⁴ pro⁹⁴ct. Unl⁸k⁴ ⁵MR and ot⁷⁴r approac⁷⁴s to d⁴c⁴pt⁸on stud⁸⁴s, t⁷⁴r⁴ ⁸s no ⁴x⁸st⁸n⁶ t⁴c⁷nolo⁶y t⁷at can p⁴r⁵orm t⁷⁴ analys⁸s o⁵ ⁵ac⁸al cu⁴s alt⁷ou⁶⁷ t⁷⁴r⁴ ar⁴ ⁴fforts to automat⁴ ⁵ac⁸al cod⁸n⁶ (⁴.⁶., ⁷u ⁴t al. [0]) t⁷at may ⁷av⁴ pro⁵ound ⁴ff⁴cts on t⁷⁸s ar⁴a o⁵ d⁴c⁴pt⁸on r⁴s⁴arc⁷. A s⁸m⁸lar obs⁴rvat⁸on m⁸⁶⁷t b⁴ mad⁴ ⁵or t⁷⁴ appl⁸cat⁸on o⁵ automat⁸c sp⁴⁴c⁷ r⁴co⁶n⁸t⁸on (ASR) t⁴c⁷nolo⁶y to r⁴s⁴arc⁷ on vocal and l⁸n⁶u⁸st⁸c cu⁴s.
2.3
THE PSYCHOLOGY LITERATURE
W⁸t⁷⁸n t⁷⁴ psyc⁷olo⁶y l⁸t⁴ratur⁴, w⁸d⁴ cov⁴ra⁶⁴ o⁵ t⁷⁴ b⁴⁷av⁸oral corr⁴lat⁴s o⁵ d⁴c⁴pt⁸on ⁷as b⁴⁴n ⁶⁸v⁴n by an ⁴xt⁴ns⁸v⁴ m⁴ta-analys⁸s o⁵ t⁷⁴ l⁸t⁴ratur⁴ don⁴ by ⁴Paulo ⁴t al. [00]. Ald⁴rt Vr⁸⁹ [00] prov⁸d⁴d anot⁷⁴r compr⁴⁷⁴ns⁸v⁴ r⁴v⁸⁴w o⁵ t⁷⁴ fi⁴ld. ⁴ ⁵ollow⁸n⁶ s⁴ct⁸on off⁴rs an ov⁴rv⁸⁴w o⁵ t⁷⁸s work. ot⁷ ⁴Paulo ⁴t al. and Vr⁸⁹, couc⁷⁴d w⁸t⁷⁸n t⁷⁴ ⁵ram⁴work o⁵ soc⁸al psyc⁷olo⁶y, ⁷av⁴ s⁴rv⁴d as r⁴⁵⁴r⁴nc⁴ po⁸nts ⁵or most o⁵ t⁷⁴ subs⁴qu⁴nt computat⁸onal work ⁸n d⁴c⁴pt⁸on t⁷⁴y not only contr⁸but⁴ to our knowl⁴d⁶⁴ o⁵ d⁴c⁴pt⁸on but can also ⁵unct⁸on as r⁴sourc⁴s ⁵or cr⁴at⁸v⁴ n⁴w approac⁷⁴s to ⁸ts d⁴t⁴ct⁸on. or t⁷⁸s r⁴ason, w⁴ ⁸nclud⁴ som⁴ d⁸scuss⁸on o⁵ t⁷⁴ non-v⁴rbal cu⁴s cons⁸d⁴r⁴d ⁸n t⁷⁴s⁴ works w⁷⁸l⁴ conc⁴ntrat⁸n⁶ on t⁷⁴ asp⁴cts t⁷at p⁴rta⁸n to v⁴rbal b⁴⁷av⁸or.
2.3.1 DEPAULO ET AL.’S STUDY ⁴Paulo ⁴t al. [00] d⁴fin⁴s d⁴c⁴pt⁸on as a “d⁴l⁸b⁴rat⁴ att⁴mpt to m⁸sl⁴ad ot⁷⁴rs,” ⁴xclud⁸n⁶ cas⁴s w⁷⁴r⁴ t⁷⁴ sub⁹⁴ct m⁸s⁸n⁵orms ⁸n ⁶ood ⁵a⁸t⁷. Mor⁴ ⁸nt⁴r⁴st⁸n⁶ly, t⁷⁴y only tak⁴ ⁸nto cons⁸d⁴rat⁸on t⁷⁴ cu⁴s w⁷⁸c⁷ “can b⁴ d⁸sc⁴rn⁴d by ⁷uman p⁴rc⁴⁸v⁴rs w⁸t⁷out t⁷⁴ a⁸d o⁵ any sp⁴c⁸al ⁴qu⁸pm⁴nt” [⁴Paulo ⁴t al., 00, p. ]. Suc⁷ a c⁷o⁸c⁴ ⁸mpl⁸⁴s t⁷at t⁷⁴ aut⁷ors ⁵ocus on cu⁴s o⁵ d⁴c⁴pt⁸on w⁷⁸c⁷ can b⁴ r⁴co⁶n⁸z⁴d ⁸n r⁴al t⁸m⁴. ⁴ final data s⁴t ⁵or t⁷⁴ m⁴ta-analys⁸s ⁸nclud⁴s 0 ⁸nd⁴p⁴nd⁴nt sampl⁴s ⁵rom stud⁸⁴s, and r⁴ports , ⁴st⁸mat⁴s o⁵ t⁷⁴ d⁸scr⁸m⁸natory ⁴ff⁴ct⁸v⁴n⁴ss o⁵ t⁷⁴ cu⁴s to d⁴c⁴pt⁸on cons⁸d⁴r⁴d. n do⁸n⁶ so, ⁸t cov⁴rs t⁷⁴ ma⁸n t⁷⁴or⁴t⁸cal approac⁷⁴s to d⁴c⁴pt⁸on d⁴t⁴ct⁸on o⁵ t⁷⁴ 0t⁷ c⁴ntury (t⁷⁴ l⁸t⁴ratur⁴ r⁴v⁸⁴w ⁴nd⁴d ⁸n 00). ⁴ aut⁷ors also ⁵ormulat⁴ and t⁴st t⁷⁴⁸r own ⁷ypot⁷⁴s⁴s, r⁴ly⁸n⁶ on t⁷⁴ vast amount o⁵ r⁴sults ava⁸labl⁴ t⁷rou⁶⁷ t⁷⁴⁸r r⁴v⁸⁴w.
14
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
n t⁷⁴ n⁴xt s⁴ct⁸on w⁴ ⁸ntroduc⁴ t⁷⁴ approac⁷⁴s to d⁴c⁴pt⁸on d⁴t⁴ct⁸on cons⁸d⁴r⁴d by ⁴Paulo ⁴t al. [00], ⁸nclud⁸n⁶ t⁷⁴⁸r own t⁷⁴or⁴t⁸cal ⁵ram⁴work, ⁵ollow⁴d by a pr⁴s⁴ntat⁸on o⁵ t⁷⁴ m⁴t⁷ods ⁴mploy⁴d ⁸n t⁷⁴ r⁴s⁴arc⁷, t⁷⁴ ma⁸n r⁴sults o⁵ t⁷⁴ l⁸t⁴ratur⁴ on cu⁴s o⁵ d⁴c⁴pt⁸on, and t⁷⁴ outcom⁴s o⁵ analys⁴s carr⁸⁴d out. e main approaches to deception detection ⁴Paulo ⁴t al. [00] d⁸scuss t⁷r⁴⁴ approac⁷⁴s to d⁴c⁴pt⁸on d⁴t⁴ct⁸on t⁷at ar⁴ consonant w⁸t⁷ t⁷⁴ approac⁷ o⁵ ⁴Paulo ⁴t al. and t⁷⁴⁸r m⁴t⁷odolo⁶⁸cal pr⁴m⁸s⁴s
• t⁷⁴ work o⁵ kman [kman, 00, kman and r⁸⁴s⁴n, , kman ⁴t al., , ] • t⁷⁴ work o⁵ Zuck⁴rman [Zuck⁴rman ⁴t al., , , Zuck⁴rman and r⁸v⁴r, ] and • t⁷⁴ work o⁵ ull⁴r [ull⁴r and ur⁶oon, , ull⁴r ⁴t al., ]. ⁴r⁴ w⁴ d⁸scuss t⁷⁴ work o⁵ ⁴ac⁷ ⁶roup. T
kman’s r⁴s⁴arc⁷ r⁴l⁸⁴s on t⁷⁴ ⁸d⁴a t⁷at stron⁶ ⁴mot⁸ons can act⁸vat⁴ ⁵ac⁸al muscl⁴s automat⁸cally, and t⁷at t⁷⁴ r⁴sult⁸n⁶ “m⁸cro-⁴xpr⁴ss⁸ons” could b⁴ cu⁴s to d⁴c⁴pt⁸on [kman, 00]. Mor⁴ pr⁴c⁸s⁴ly, ⁷⁴ d⁸st⁸n⁶u⁸s⁷⁴s two d⁸ff⁴r⁴nt k⁸nd o⁵ s⁸⁶nals t⁷at sub⁹⁴cts may d⁸splay. Leakage cues ar⁴ b⁴⁷av⁸oral ⁴xpr⁴ss⁸ons t⁷at t⁷⁴ l⁸ar ⁵a⁸ls to squ⁴lc⁷ and t⁷at ar⁴ stron⁶ and last⁸n⁶ ⁴nou⁶⁷ to r⁴v⁴al w⁷at t⁷⁴y unsucc⁴ss⁵ully try to conc⁴al t⁷⁴ trut⁷. Deception cues s⁷ar⁴ t⁷⁴ sam⁴ natur⁴ as l⁴aka⁶⁴ cu⁴s, su⁶⁶⁴st⁸n⁶ d⁴c⁴pt⁸on but w⁸t⁷out r⁴v⁴al⁸n⁶ t⁷⁴ trut⁷.
or ⁴xampl⁴, ⁸⁵ a sub⁹⁴ct tr⁸⁴s to d⁴ny an⁶⁴r, ⁷⁴ w⁸ll ⁷av⁴ to suppr⁴ss typ⁸cal s⁸⁶ns o⁵ an⁶⁴r, suc⁷ as narrow⁸n⁶ o⁵ t⁷⁴ l⁸ps, low⁴r⁸n⁶ o⁵ t⁷⁴ ⁴y⁴brows, and so on. ut t⁷⁸s task ⁸s d⁸fficult, s⁸nc⁴ ⁴mot⁸ons can ar⁸s⁴ sudd⁴nly. Accord⁸n⁶ to kman [00], sub⁹⁴cts can suppr⁴ss t⁷⁴⁸r ⁴xpr⁴ss⁸ons w⁸t⁷⁸n / o⁵ a s⁴cond, but t⁷⁸s laps⁴ o⁵ t⁸m⁴ ⁸s ⁴nou⁶⁷, ⁵or a tra⁸n⁴d obs⁴rv⁴r to d⁴t⁴ct suc⁷ ⁴xpr⁴ss⁸ons. S⁸m⁸larly, s⁴v⁴ral aut⁷ors [kman ⁴t al., , kman and O’Sull⁸van, 00, ⁴ss and Kl⁴ck, 0, ⁸ll and ra⁸⁶, 00] ⁵ound t⁷at spontan⁴ous and d⁴l⁸b⁴rat⁴ ⁴xpr⁴ss⁸ons ar⁴ d⁸ff⁴r⁴nt ⁸n lat⁴ncy t⁸m⁴, ov⁴rall durat⁸on, durat⁸on o⁵ p⁴ak ⁸nt⁴ns⁸ty, and ons⁴t and offs⁴t t⁸m⁴ (t⁷⁴ t⁸m⁴ ⁵rom t⁷⁴ start o⁵ t⁷⁴ ⁴xpr⁴ss⁸on to ⁸ts p⁴ak and ⁵rom t⁷⁴ p⁴ak to ⁸ts d⁸sapp⁴aranc⁴, r⁴sp⁴ct⁸v⁴ly). n ord⁴r to analyz⁴ t⁷os⁴ ⁴xpr⁴ss⁸ons, kman ⁴t al. [] and kman [00] d⁸st⁸n⁶u⁸s⁷ b⁴tw⁴⁴n d⁸ff⁴r⁴nt ⁴mot⁸ons assoc⁸at⁴d w⁸t⁷ d⁴c⁴pt⁸on t⁷at could d⁸ff⁴r⁴nt⁸at⁴ l⁸ars ⁵rom trut⁷ t⁴ll⁴rs. n part⁸cular, t⁷⁴y cons⁸d⁴r cu⁴s o⁵ t⁷⁴ ⁵ollow⁸n⁶. Fear. ⁸s ⁴mot⁸on ⁸s b⁴l⁸⁴v⁴d to d⁴p⁴nd on t⁷⁴ stak⁴s ⁸nvolv⁴d ⁸n t⁷⁴ t⁴ll⁸n⁶ o⁵ t⁷⁴ l⁸⁴ and may b⁴ ⁴xpr⁴ss⁴d by ⁷⁸⁶⁷⁴r p⁸tc⁷ o⁵ vo⁸c⁴, ⁵ast⁴r and loud⁴r sp⁴⁴c⁷, paus⁴s, and ⁴rrors [⁴Paulo ⁴t al., 00].
2.3. THE PSYCHOLOGY LITERATURE
15
Guilt. v⁴n t⁷ou⁶⁷ ⁸t s⁴⁴ms d⁸fficult to d⁴t⁴rm⁸n⁴ cu⁴s to ⁶u⁸lt, “t⁷⁴y could ⁸nclud⁴ cu⁴s to sadn⁴ss suc⁷ as low⁴r p⁸tc⁷, so⁵t⁴r and slow⁴r sp⁴⁴c⁷, and downward ⁶az⁸n⁶” [⁴Paulo ⁴t al., 00].
Delight. Lastly, ⁴v⁴n pos⁸t⁸v⁴ ⁴mot⁸ons may b⁴ assoc⁸at⁴d w⁸t⁷ t⁷⁴ act o⁵ t⁴ll⁸n⁶ l⁸⁴s, suc⁷ as ⁴xc⁸t⁴m⁴nt du⁴ to t⁷⁴ c⁷all⁴n⁶⁴ o⁵ t⁷⁴ task and ⁴up⁷or⁸a or pr⁸d⁴ w⁷⁴n t⁷⁴ l⁸⁴ ⁸s acc⁴pt⁴d as tru⁴, call⁴d by kman [00] “dup⁸n⁶ d⁴l⁸⁶⁷t.”
ow⁴v⁴r, kman ⁸s awar⁴ o⁵ t⁷⁴ amb⁸⁶uous natur⁴ o⁵ t⁷⁴ s⁸⁶ns o⁵ ⁴mot⁸on, and po⁸nts out t⁷at ⁸t ⁸s a m⁸stak⁴ “to pr⁴sum⁴ t⁷at conc⁴al⁴d ⁴mot⁸on ⁸s ⁴v⁸d⁴nc⁴ t⁷at a p⁴rson ⁸s ly⁸n⁶ about t⁷⁴ top⁸c o⁵ ⁸nt⁴r⁴st to t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r” [kman, 00]. ⁸s ⁸s w⁷at kman calls t⁷⁴ “Ot⁷⁴llo ⁴rror,” ⁸n r⁴⁵⁴r⁴nc⁴ to t⁷⁴ S⁷ak⁴sp⁴ar⁴an c⁷aract⁴r’s m⁸sr⁴ad⁸n⁶ o⁵ ⁷⁸s w⁸⁵⁴’s d⁸splay o⁵ ⁴mot⁸on [kman, 00]. n ⁵act, kman warns t⁷at “⁴mot⁸ons do not t⁴ll you t⁷⁴⁸r caus⁴” and cu⁴s o⁵ ⁵⁴ar ar⁴ t⁷⁴ sam⁴, w⁷⁴t⁷⁴r t⁷⁴ sp⁴ak⁴r ⁸s a⁵ra⁸d o⁵ b⁴⁸n⁶ cau⁶⁷t ⁸n a l⁸⁴ or d⁸sb⁴l⁸⁴v⁴d w⁷⁴n t⁴ll⁸n⁶ t⁷⁴ trut⁷ [kman, 00]. T
Z . Zuck⁴rman ⁴t al. [] also assum⁴ t⁷at b⁴⁷av⁸ors sp⁴c⁸fically r⁴lat⁴d to d⁴c⁴pt⁸on cannot b⁴ ⁵ound t⁷⁴r⁴⁵or⁴ t⁷⁴y try to ⁸d⁴nt⁸⁵y a patt⁴rn o⁵ b⁴⁷av⁸oral ⁴xpr⁴ss⁸ons, non-sp⁴c⁸fic ⁸n t⁷⁴ms⁴lv⁴s—or, mor⁴ pr⁴c⁸s⁴ly, d⁴r⁸v⁸n⁶ ⁵rom ⁶⁴n⁴ral co⁶n⁸t⁸v⁴ and ⁴mot⁸onal proc⁴ss⁴s—t⁷at ⁷av⁴ a ⁶r⁴at⁴r probab⁸l⁸ty o⁵ app⁴ar⁸n⁶ w⁷⁴n sub⁹⁴cts l⁸⁴ rat⁷⁴r t⁷an w⁷⁴n t⁷⁴y ar⁴ t⁴ll⁸n⁶ t⁷⁴ trut⁷. n part⁸cular, t⁷⁴y ⁷ypot⁷⁴s⁸z⁴ t⁷at cu⁴s to d⁴c⁴pt⁸on m⁸⁶⁷t b⁴ ⁵ound by ⁴xplor⁸n⁶ t⁷⁴ ⁵ollow⁸n⁶ ⁵actors
Emotional reactions. Zuck⁴rman ⁴t al. assum⁴ t⁷at ⁵⁴⁴l⁸n⁶s o⁵ ⁶u⁸lt and ⁵⁴ar o⁵ b⁴⁸n⁶ unmask⁴d may b⁴ assoc⁸at⁴d w⁸t⁷ t⁷⁴ act o⁵ ly⁸n⁶. ⁴r⁴⁵or⁴, l⁸ars could ⁴x⁷⁸b⁸t s⁸⁶ns o⁵ anx⁸⁴ty as w⁴ll as v⁴rbal and non-v⁴rbal b⁴⁷av⁸ors ⁸nd⁸cat⁸n⁶ an unconsc⁸ous att⁴mpt to put a d⁸stanc⁴ b⁴tw⁴⁴n t⁷⁴ms⁴lv⁴s and t⁷⁴⁸r d⁴c⁴pt⁸v⁴ commun⁸cat⁸ons. ⁴ aut⁷ors call suc⁷ d⁸stanc⁸n⁶ “non⁸mm⁴d⁸acy” and suppos⁴ t⁷at l⁸ars m⁸⁶⁷t s⁴⁴m ⁴vas⁸v⁴ and ⁸nd⁸r⁴ct. Cognitive effort. L⁸ars ⁷av⁴ to accompl⁸s⁷ s⁴v⁴ral co⁶n⁸t⁸v⁴ly d⁴mand⁸n⁶ tasks. ⁸rst, t⁷⁴y ⁷av⁴ to ⁵ormulat⁴ narrat⁸v⁴s d⁸ff⁴r⁴nt ⁵rom t⁷⁴ trut⁷ t⁷⁴y know. ⁴y also ⁷av⁴ to mon⁸tor t⁷⁴ plaus⁸b⁸l⁸ty o⁵ t⁷⁴⁸r stat⁴m⁴nts and avo⁸d contrad⁸ct⁸ons ⁸n t⁷⁴⁸r stor⁸⁴s. Last, but not l⁴ast, t⁷⁴y ⁷av⁴ to c⁷⁴ck t⁷⁴ r⁴act⁸ons o⁵ t⁷⁴⁸r ⁸nt⁴rlocutors, tun⁸n⁶ t⁷⁴⁸r b⁴⁷av⁸or accord⁸n⁶ly. As a cons⁴qu⁴nc⁴, Zuck⁴rman ⁴t al. ⁴xp⁴ct l⁸ars to s⁷ow mor⁴ ⁷⁴s⁸tat⁸ons, mor⁴ sp⁴⁴c⁷ lat⁴nc⁸⁴s and ⁵⁴w⁴r ⁶⁴stur⁴s o⁵ ⁸llustrat⁸on t⁷an trut⁷ t⁴ll⁴rs. Attempted behavioral control. n ord⁴r to b⁴ conv⁸nc⁸n⁶, l⁸ars ⁷av⁴ to mon⁸tor not only t⁷⁴ cont⁴nt t⁷⁴y ar⁴ conv⁴y⁸n⁶, but also t⁷⁴⁸r v⁴rbal and non-v⁴rbal b⁴⁷av⁸or. ⁸s task could b⁴ d⁸fficult, s⁸nc⁴ som⁴ bod⁸ly r⁴act⁸ons, suc⁷ as t⁷⁴ ton⁴ o⁵ vo⁸c⁴ [kman, ], ar⁴ normally b⁴yond voluntary control [kman, 00]. ⁴ aut⁷ors t⁷⁴r⁴⁵or⁴ ar⁶u⁴ t⁷at ⁸n d⁴c⁴pt⁸v⁴ commun⁸cat⁸on d⁸scr⁴panc⁸⁴s b⁴tw⁴⁴n v⁴rbal and non-v⁴rbal ⁴xpr⁴ss⁸ons, or b⁴tw⁴⁴n d⁸ff⁴r⁴nt non-v⁴rbal b⁴⁷av⁸ors, s⁷ould occur to a lar⁶⁴r ⁴xt⁴nt t⁷an ⁸n a trut⁷⁵ul on⁴.
16
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
Arousal. v⁴n t⁷ou⁶⁷ typ⁸cal s⁸⁶ns o⁵ ⁶⁴n⁴ral arousal, suc⁷ as ⁷⁸⁶⁷⁴r p⁸tc⁷, pup⁸l d⁸lat⁸on or ⁵r⁴qu⁴nt bl⁸nk⁸n⁶ m⁸⁶⁷t b⁴ ⁵ound dur⁸n⁶ d⁴c⁴pt⁸v⁴ commun⁸cat⁸on, Zuck⁴rman ⁴t al. adm⁸t t⁷at t⁷⁴y can b⁴ ⁴xpla⁸n⁴d by t⁷⁴ ⁵actors c⁸t⁴d abov⁴. n ⁵act, arousal and ⁴mot⁸onal r⁴act⁸ons d⁸splay as t⁷⁴ sam⁴ coll⁴ct⁸on o⁵ p⁷ys⁸cal p⁷⁴nom⁴na, and so ar⁴ not d⁸st⁸n⁶u⁸s⁷⁴d ⁸n r⁴s⁴arc⁷ act⁸v⁸t⁸⁴s, as not⁴d by Vr⁸⁹ [00].
W⁷⁸l⁴ t⁷⁴ stud⁸⁴s o⁵ kman and o⁵ Zuck⁴rman ⁴t al. ⁵ocus ⁴sp⁴c⁸ally on t⁷⁴ l⁸ars, and cons⁴qu⁴ntly on t⁷⁴ cu⁴s to d⁴c⁴pt⁸on t⁷⁴y may d⁸sclos⁴, ull⁴r and ur⁶oon [] and ull⁴r ⁴t al. [] put sp⁴c⁸al ⁴mp⁷as⁸s on t⁷⁴ ⁸nt⁴ract⁸on b⁴tw⁴⁴n sub⁹⁴cts, as t⁷⁴ d⁴t⁴rm⁸nant ⁵actor ⁵or t⁷⁴ ⁴xpr⁴ss⁸on o⁵ s⁸⁶ns t⁷at may r⁴v⁴al d⁴c⁴pt⁸on. ull⁴r and ur⁶oon cla⁸m t⁷at t⁷⁴ d⁸fficult⁸⁴s ⁸n ly⁸n⁶ m⁴nt⁸on⁴d abov⁴ m⁸⁶⁷t b⁴ not constant dur⁸n⁶ t⁷⁴ commun⁸cat⁸on rat⁷⁴r t⁷⁴y t⁴nd to d⁸ssolv⁴, as t⁷⁴ l⁸ar, mor⁴ and mor⁴, tak⁴s control o⁵ t⁷⁴ ⁸nt⁴ract⁸on bas⁴d on t⁷⁴ ⁵⁴⁴dback ⁷⁴ r⁴c⁴⁸v⁴s. ⁴ ⁵⁴⁴dback, ⁸n turn, d⁴p⁴nds on t⁷⁴ ⁴xp⁴ctat⁸ons, mot⁸vat⁸ons, and a⁸ms o⁵ t⁷⁴ l⁸ar as w⁴ll as t⁷⁴ l⁸ar’s r⁴lat⁸ons⁷⁸p to t⁷⁴ tar⁶⁴t. ⁸v⁴n t⁷⁸s pr⁴m⁸s⁴, ull⁴r and ur⁶oon ⁵ormulat⁴ t⁷⁴⁸r nt⁴rp⁴rsonal ⁴c⁴pt⁸on ⁴ory ⁵ocus⁸n⁶ on t⁷⁴ mot⁸vat⁸onal dynam⁸cs o⁵ t⁷⁴ commun⁸cat⁸on, and ⁸n part⁸cular ⁸d⁴nt⁸⁵y⁸n⁶ t⁷r⁴⁴ k⁸nds o⁵ ⁶oals ⁸n d⁴c⁴pt⁸v⁴ commun⁸cat⁸on Instrumental, t⁷at ⁸s, r⁴lat⁴d to t⁷⁴ pract⁸cal r⁴sults t⁷⁴ l⁸⁴s ar⁴ a⁸m⁴d to obta⁸n, suc⁷ as “⁴stabl⁸s⁷⁸n⁶, max⁸m⁸z⁸n⁶, and ma⁸nta⁸n⁸n⁶ pow⁴r or ⁸nflu⁴nc⁴ ov⁴r t⁷⁴ r⁴c⁴⁸v⁴r, acqu⁸r⁸n⁶ and prot⁴ct⁸n⁶ r⁴sourc⁴s” [ull⁴r and ur⁶oon, , p. ] Relational, w⁷⁸c⁷ r⁴⁵⁴rs to t⁷⁴ ⁴ff⁴ct t⁷⁴ l⁸ar pursu⁴s ⁸n r⁴⁶ulat⁸n⁶ t⁷⁴ r⁴lat⁸on w⁸t⁷ t⁷⁴ ⁸nt⁴rlocutor, ⁵or ⁴xampl⁴ “avo⁸d⁸n⁶ ⁸nt⁴rp⁴rsonal t⁴ns⁸on or confl⁸ct, ma⁸nta⁸n⁸n⁶ and r⁴d⁸r⁴ct⁸n⁶ soc⁸al ⁸nt⁴ract⁸on” [ull⁴r and ur⁶oon, , p. ] Identity ⁶oals, w⁷⁸c⁷ conc⁴rn t⁷⁴ l⁸ar’s ⁴fforts w⁸t⁷ r⁴sp⁴ct to “avo⁸d⁸n⁶ s⁷am⁴ or ⁴mbarrassm⁴nt, pro⁹⁴ct⁸n⁶ a mor⁴ ⁵avorabl⁴ ⁸ma⁶⁴, ⁴n⁷anc⁸n⁶ or prot⁴ct⁸n⁶ s⁴l⁵ ⁴st⁴⁴m” [ull⁴r and ur⁶oon, , p. ].
⁴ mot⁸vat⁸on to d⁴c⁴⁸v⁴ l⁴ads t⁷⁴ l⁸ar to carry out strategic behaviors, a⁸m⁴d at mana⁶⁸n⁶ t⁷⁴ cont⁴nt and styl⁴ o⁵ t⁷⁴ commun⁸cat⁸on to ma⁸nta⁸n t⁷⁴ cr⁴d⁸b⁸l⁸ty o⁵ t⁷⁴ m⁴ssa⁶⁴. ⁴ po⁸nt ⁸s t⁷at w⁷⁴n t⁷⁴ mana⁶⁴m⁴nt o⁵ t⁷⁴ b⁴⁷av⁸or ⁸s “carr⁸⁴d to ⁴xtr⁴m⁴s, ⁸t may r⁴sult ⁸n an ov⁴rcontroll⁴d or r⁸⁶⁸d pr⁴s⁴ntat⁸on, ⁸n⁴xpr⁴ss⁸v⁴n⁴ss, and r⁴duc⁴d spontan⁴⁸ty, w⁷⁸c⁷ qual⁸⁵y as nonstrategic behaviors” [ull⁴r and ur⁶oon, , p. ]. n ot⁷⁴r words, t⁷⁴ ⁸ns⁸⁶⁷t o⁵ ull⁴r and ur⁶oon cons⁸sts ⁸n t⁷⁴ ⁵act t⁷at “strat⁴⁶⁸c ⁸nt⁴nt⁸ons may produc⁴ nonstrat⁴⁶⁸c byproducts ⁸n t⁷⁴ ⁵orm o⁵ non⁸nvolv⁴m⁴nt and p⁴r⁵ormanc⁴ d⁴cr⁴m⁴nts” [ull⁴r and ur⁶oon, , p. ]. n part⁸cular, ull⁴r and ur⁶oon cla⁸m to ⁷av⁴ ⁵ound ⁴v⁸d⁴nc⁴ t⁷at t⁷⁴ d⁴c⁴⁸v⁴r’s ⁴ffort to man⁸pulat⁴ t⁷⁴ commun⁸cat⁸on aff⁴cts bot⁷ ⁸ts cont⁴nt and t⁷⁴ concurr⁴nt b⁴⁷av⁸or “⁴c⁴⁸v⁴rs ⁷av⁴ b⁴⁴n ⁵ound to mana⁶⁴ ⁸n⁵ormat⁸on ⁸n t⁷⁴ ⁵orm o⁵ v⁴rbal cont⁴nt t⁷at ⁸s ⁸ncompl⁴t⁴, nonv⁴r⁸d⁸cal, ⁸nd⁸r⁴ct, va⁶u⁴, unc⁴rta⁸n, ⁷⁴s⁸tant, br⁸⁴⁵, and d⁸sassoc⁸at⁴d ⁵rom t⁷⁴ s⁴nd⁴r to mana⁶⁴ b⁴⁷av⁸or t⁷rou⁶⁷ a subm⁸ss⁸v⁴ and ⁵ormal d⁴m⁴anor and r⁴duc⁴d nonv⁴rbal ⁸mm⁴d⁸acy and to
2.3. THE PSYCHOLOGY LITERATURE
17
mana⁶⁴ ⁸ma⁶⁴ t⁷rou⁶⁷ ⁸ncr⁴as⁴d pl⁴asantn⁴ss and r⁴laxat⁸on ov⁴r t⁸m⁴. ⁴y ⁷av⁴ s⁸multan⁴ously also l⁴ak⁴d n⁴rvousn⁴ss, arousal, and n⁴⁶at⁸v⁴ aff⁴ct (at l⁴ast ⁸n⁸t⁸ally) and suff⁴r⁴d p⁴r⁵ormanc⁴ ⁸mpa⁸rm⁴nts suc⁷ as nonflu⁴nc⁸⁴s and poor ⁸mpr⁴ss⁸ons” [ull⁴r and ur⁶oon, , p. ]. T
S -P P P . ⁴ S⁴l⁵-Pr⁴s⁴ntat⁸onal P⁴rsp⁴ct⁸v⁴ o⁵ ⁴Paulo ⁴t al. d⁴r⁸v⁴s ⁵rom t⁷⁴ att⁴mpt o⁵ t⁷⁴ aut⁷ors to synt⁷⁴s⁸z⁴ t⁷⁴ b⁴st ⁸ntu⁸t⁸ons o⁵ t⁷⁴ pr⁴v⁸ous approac⁷⁴s, w⁷⁸l⁴ try⁸n⁶ also to ov⁴rcom⁴ t⁷⁴⁸r w⁴akn⁴ss⁴s. ⁴ first po⁸nt t⁷⁴y und⁴rl⁸n⁴ ⁸s t⁷at ly⁸n⁶ ⁸s a common pract⁸c⁴ ⁸n da⁸ly l⁸⁵⁴ stud⁸⁴s w⁷⁴r⁴ sub⁹⁴cts not⁴ t⁷⁴⁸r l⁸⁴s s⁷ow t⁷at t⁷⁴y t⁴ll an av⁴ra⁶⁴ o⁵ on⁴ or two l⁸⁴s ⁴v⁴ry day. u⁴ to t⁷⁴⁸r ⁷⁸⁶⁷ ⁵r⁴qu⁴ncy, ⁴Paulo ⁴t al. b⁴l⁸⁴v⁴ t⁷at not all l⁸⁴s r⁴ally r⁴qu⁸r⁴ many co⁶n⁸t⁸v⁴ r⁴sourc⁴s or ⁸mply stron⁶ ⁴mot⁸onal ⁸nvolv⁴m⁴nt “L⁸⁴s bas⁴d on scr⁸pts or ⁵am⁸l⁸ar stor⁸⁴s ar⁴ unl⁸k⁴ly to b⁴ mark⁴d by t⁷⁴ s⁸⁶ns o⁵ m⁴ntal ⁴ffort” [⁴Paulo ⁴t al., 00, p. ]. t ⁸s assum⁴d t⁷at p⁴opl⁴ do not plan nor do t⁷⁴y ⁵⁴⁴l muc⁷ ⁶u⁸lt about suc⁷ l⁸⁴s. N⁴v⁴rt⁷⁴l⁴ss, ⁴Paulo’s analys⁸s r⁴sts on t⁷⁴ assumpt⁸on t⁷at t⁷⁴s⁴ l⁸⁴s can b⁴ d⁸st⁸n⁶u⁸s⁷⁴d ⁵rom trut⁷⁵ul stat⁴m⁴nts “cu⁴s to d⁴c⁴pt⁸on ord⁸nar⁸ly ar⁴ qu⁸t⁴ w⁴ak. ⁴r⁴ ar⁴, ⁷ow⁴v⁴r, cond⁸t⁸ons und⁴r w⁷⁸c⁷ cu⁴s ar⁴ mor⁴ appar⁴nt.” [⁴Paulo ⁴t al., 00, p. ]. ⁴Paulo ⁴t al. s⁴⁴k to ⁴xpla⁸n t⁷⁴ cu⁴s as com⁸n⁶ ⁵rom “p⁴opl⁴’s att⁴mpts to control t⁷⁴ ⁸mpr⁴ss⁸ons t⁷at ar⁴ ⁵orm⁴d o⁵ t⁷⁴m”—w⁷at ⁴Paulo ⁴t al. r⁴⁵⁴r to as “s⁴l⁵-pr⁴s⁴ntat⁸on”—b⁴caus⁴ o⁵ t⁷⁴ “pr⁴val⁴nc⁴ o⁵ s⁴l⁵-pr⁴s⁴ntat⁸onal t⁷⁴m⁴s ⁸n t⁷⁴ k⁸nds o⁵ l⁸⁴s t⁷at p⁴opl⁴ most o⁵t⁴n t⁴ll.” ⁴Paulo ⁴t al. s⁴⁴ l⁸ars as l⁴ss capabl⁴ o⁵ ⁴mbrac⁸n⁶ t⁷⁴⁸r l⁸⁴s, w⁷⁸c⁷ may r⁴sult ⁸n narrat⁸v⁴s w⁸t⁷ ⁵⁴w⁴r d⁴ta⁸ls t⁷an trut⁷⁵ul on⁴s. ⁴y also v⁸⁴w l⁸ars as ⁵⁴⁴l⁸n⁶ a ⁶r⁴at⁴r s⁴ns⁴ o⁵ d⁴l⁸b⁴rat⁴n⁴ss t⁷an trut⁷ t⁴ll⁴rs and t⁷⁴ ⁸mpr⁴ss⁸on o⁵ low ⁸nvolv⁴m⁴nt o⁵ t⁷⁴ l⁸ar ⁸n t⁷⁴ ⁸nt⁴ract⁸on. ⁴y may ⁵a⁸l ⁸n ⁴valuat⁸n⁶ t⁷⁴ ⁵⁴⁴dback o⁵ t⁷⁴ r⁴c⁴⁸v⁴rs, and t⁷⁸s aff⁴cts t⁷⁴ ⁴ff⁴ct⁸v⁴n⁴ss o⁵ t⁷⁴ att⁴mpt⁴d s⁴l⁵-r⁴⁶ulat⁸ons. Lastly, ⁵rom an ⁴mot⁸onal po⁸nt o⁵ v⁸⁴w, and consonant w⁸t⁷ t⁷⁴ l⁸t⁴ratur⁴ ⁸n t⁷⁸s fi⁴ld, ⁴Paulo ⁴t al. ⁴xp⁴ct t⁷at d⁴c⁴⁸v⁴rs may ⁴xp⁴r⁸⁴nc⁴ mor⁴ n⁴⁶at⁸v⁴ ⁴mot⁸ons t⁷an trut⁷t⁴ll⁴rs. ⁴s⁴ ⁴mot⁸ons can vary ⁵rom s⁷am⁴ and ⁶u⁸lt to ⁵⁴ar o⁵ b⁴⁸n⁶ unmask⁴d. ons⁴qu⁴ntly, d⁴c⁴⁸v⁴rs may s⁴⁴m “l⁴ss ⁵ort⁷com⁸n⁶, l⁴ss conv⁸nc⁸n⁶, l⁴ss pl⁴asant, and mor⁴ t⁴ns⁴” [⁴Paulo ⁴t al., 00, p. ]. On⁴ o⁵ t⁷⁴ nov⁴lt⁸⁴s o⁵ t⁷⁴ t⁷⁴or⁴t⁸cal p⁴rsp⁴ct⁸v⁴ o⁵ ⁴Paulo ⁴t al. ⁸s t⁷at t⁷⁴y str⁴ss t⁷⁴ ⁸d⁴a t⁷at ⁴v⁴n ⁷on⁴st actors usually try to mana⁶⁴ t⁷⁴ ⁸mpr⁴ss⁸on t⁷⁴y ⁶⁸v⁴ to ⁸nt⁴rlocutors. Non⁴t⁷⁴l⁴ss, t⁷⁴y pr⁴d⁸ct t⁷at som⁴ b⁴⁷av⁸oral ⁴xpr⁴ss⁸ons may b⁴ assoc⁸at⁴d w⁸t⁷ d⁴c⁴⁸v⁸n⁶ commun⁸cat⁸on. ⁴Paulo ⁴t al. cla⁸m t⁷⁴⁸r approac⁷ “⁷as t⁷⁴ advanta⁶⁴ t⁷at t⁷⁴ b⁴⁷av⁸ors o⁵ ⁸nt⁴r⁴st ar⁴ cl⁴arly d⁴fin⁴d and ob⁹⁴ct⁸v⁴ly m⁴asur⁴d” [⁴Paulo ⁴t al., 00, p. ]. rom t⁷⁴ po⁸nt o⁵ v⁸⁴w o⁵ t⁷⁴ so-call⁴d “⁷ard sc⁸⁴nc⁴s,” t⁷⁴s⁴ “advanta⁶⁴s” ar⁴ rat⁷⁴r m⁸n⁸mal r⁴qu⁸r⁴m⁴nts ⁵or any r⁴s⁴arc⁷ act⁸v⁸ty t⁷⁴ ⁵act t⁷at t⁷⁴ aut⁷ors pr⁴s⁴nt t⁷⁴m as suc⁷ ⁸s a cl⁴ar symptom o⁵ t⁷⁴ d⁸fficult⁸⁴s t⁷⁴ r⁴s⁴arc⁷⁴r ⁷as to ⁵ac⁴ ⁸n study⁸n⁶ d⁴c⁴pt⁸on. n ⁵act, t⁷⁴ poss⁸b⁸l⁸ty o⁵ pr⁴c⁸s⁴ly m⁴asur⁸n⁶ t⁷⁴ var⁸abl⁴s ⁸s a c⁴ntral ⁸ssu⁴ ⁸n sc⁸⁴nt⁸fic r⁴s⁴arc⁷ ⁷ow⁴v⁴r, ⁴Paulo ⁴t al. do not ⁴xclud⁴ t⁷⁴ poss⁸b⁸l⁸ty t⁷at soc⁸al actors, w⁷o r⁴ly ⁹ust on sub⁹⁴ct⁸v⁴ ⁸mpr⁴ss⁸ons, may b⁴ abl⁴ to d⁴t⁴ct d⁴c⁴pt⁸on as w⁴ll as, and poss⁸bly b⁴tt⁴r t⁷an, ob⁹⁴ct⁸v⁴ m⁴asur⁴m⁴nts. or t⁷⁸s
18
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
r⁴ason, t⁷⁴y also ⁸nclud⁴ sub⁹⁴ct⁸v⁴ m⁴asur⁴m⁴nt ⁸n t⁷⁴⁸r syst⁴mat⁸c r⁴v⁸⁴w o⁵ t⁷⁴ l⁸t⁴ratur⁴. W⁴ d⁸scuss t⁷⁴ m⁴t⁷ods and r⁴sults o⁵ t⁷⁸s r⁴v⁸⁴w ⁸n t⁷⁴ n⁴xt s⁴ct⁸ons. Methods of literature analysis ⁴ stud⁸⁴s ⁸n t⁷⁴ ⁴Paulo ⁴t al. r⁴v⁸⁴w ⁸nclud⁴ only t⁷os⁴ ⁸n w⁷⁸c⁷ t⁷⁴ b⁴⁷av⁸ors assoc⁸at⁴d w⁸t⁷ d⁴c⁴pt⁸v⁴ commun⁸cat⁸ons w⁴r⁴ compar⁴d w⁸t⁷ t⁷os⁴ assoc⁸at⁴d w⁸t⁷ trut⁷⁵ul commun⁸cat⁸ons and only t⁷os⁴ ⁸n w⁷⁸c⁷ t⁷⁴ cu⁴s stud⁸⁴d w⁴r⁴ bot⁷ ob⁹⁴ct⁸v⁴ly and sub⁹⁴ct⁸v⁴ly m⁴asur⁴d. ⁴y t⁷us ⁴xclud⁴ r⁴s⁴arc⁷ on p⁷ys⁸olo⁶⁸cal var⁸abl⁴s t⁷at ar⁴ not d⁸r⁴ctly d⁴t⁴ctabl⁴ by ⁷uman obs⁴rv⁴rs. ⁴ analys⁸s was also l⁸m⁸t⁴d to stud⁸⁴s ⁸nvolv⁸n⁶ adult, n⁶l⁸s⁷-sp⁴ak⁸n⁶ sub⁹⁴cts. Stud⁸⁴s ⁸n w⁷⁸c⁷ sub⁹⁴cts ⁷ad to part⁸c⁸pat⁴ ⁸n rol⁴-play⁸n⁶ w⁴r⁴ also d⁸scard⁴d t⁷⁴ aut⁷ors cons⁸d⁴r only ⁴xp⁴r⁸m⁴ntal d⁴s⁸⁶ns w⁷⁴r⁴ t⁷⁴ part⁸c⁸pants ⁷ad to ⁴xpr⁴ss ⁶⁴nu⁸n⁴ trut⁷s or l⁸⁴s.
ollow⁸n⁶ t⁷⁴s⁴ cr⁸t⁴r⁸a, ⁴Paulo ⁴t al. cons⁸d⁴r⁴d 0 ⁸nd⁴p⁴nd⁴nt ⁴xp⁴r⁸m⁴ntal sub⁹⁴ct sampl⁴s ⁵rom publ⁸s⁷⁴d stud⁸⁴s. rom t⁷⁴s⁴ stud⁸⁴s, t⁷⁴y ⁸d⁴nt⁸fi⁴d d⁸ff⁴r⁴nt b⁴⁷av⁸ors t⁷at w⁴r⁴ cons⁸d⁴r⁴d poss⁸bl⁴ cu⁴s o⁵ d⁴c⁴pt⁸on. ⁴y w⁴r⁴, ⁸n turn, d⁸v⁸d⁴d ⁸nto fiv⁴ macrocat⁴⁶or⁸⁴s, d⁴s⁸⁶n⁴d to t⁴st t⁷⁴ ⁷ypot⁷⁴s⁴s st⁴mm⁸n⁶ ⁵rom t⁷⁴ s⁴l⁵-pr⁴s⁴ntat⁸on t⁷⁴ory [⁴Paulo ⁴t al., 00, p. ], accord⁸n⁶ to w⁷⁸c⁷ t⁷⁴ l⁸ars w⁴r⁴ pr⁴d⁸ct⁴d to b⁴ t⁷⁴ ⁵ollow⁸n⁶. Less forthcoming. ⁴ cu⁴s us⁴d to t⁴st t⁷⁸s assumpt⁸on conc⁴rn t⁷⁴ l⁴n⁶t⁷ o⁵ t⁷⁴ narrat⁸v⁴, ⁸ts compl⁴x⁸ty and t⁷⁴ amount o⁵ ⁸n⁵ormat⁸on prov⁸d⁴d by ⁸t. Less compelling. ⁴ cu⁴s ⁷⁴r⁴ t⁴st t⁷⁴ plaus⁸b⁸l⁸ty o⁵ t⁷⁴ narrat⁸v⁴ and t⁷⁴ sp⁴ak⁴r’s c⁴rta⁸nty and d⁴⁶r⁴⁴ o⁵ ⁸nvolv⁴m⁴nt ⁸n ⁸t. u⁴s ⁸nvolv⁸n⁶ sp⁴ak⁴r ⁸mm⁴d⁸acy, flu⁴ncy and an⁸mat⁸on w⁴r⁴ also ⁸ns⁴rt⁴d ⁸nto t⁷⁸s cat⁴⁶ory. Less pleasant. ⁴s⁴ cu⁴s t⁴st ⁵or t⁷⁴ d⁴⁶r⁴⁴ o⁵ pos⁸t⁸v⁴ ⁵⁴⁴l⁸n⁶s and ⁴mot⁸ons. More tense. ⁸s ⁶roup ⁸nvolv⁴s t⁷⁴ b⁴⁷av⁸oral s⁸⁶ns typ⁸cally assoc⁸at⁴d w⁸t⁷ t⁴ns⁸on and anx⁸⁴ty. Less spontaneous. Accord⁸n⁶ to t⁷⁴ last cat⁴⁶ory o⁵ ⁴Paulo ⁴t al., l⁸⁴s s⁷ould app⁴ar l⁴ss a⁵⁵⁴ct⁴d by t⁷⁴ ⁸mp⁴r⁵⁴ct⁸ons t⁷at normally c⁷aract⁴r⁸z⁴ trut⁷⁵ul narrat⁸v⁴s. ⁴s⁴ ⁸nclud⁴ spontan⁴ous corr⁴ct⁸ons, sup⁴rfluous d⁴ta⁸ls, cont⁴xtual ⁴mb⁴dd⁸n⁶ and, ⁸n ⁶⁴n⁴ral, a m⁸x⁴d ba⁶ o⁵ narrat⁸v⁴ ⁸mp⁴r⁵⁴ct⁸ons.
As m⁴nt⁸on⁴d abov⁴, t⁷⁴ ob⁹⁴ct⁸v⁴ m⁴asur⁴m⁴nt o⁵ t⁷⁴ cu⁴s ⁸s cr⁸t⁸cal to t⁷⁴ ⁴valuat⁸on o⁵ t⁷⁴ r⁴lat⁸on b⁴tw⁴⁴n cu⁴s and d⁴c⁴pt⁸on. Un⁵ortunat⁴ly, t⁷⁸s ⁸s not an ⁴asy task. n ⁵act, t⁷⁴ cu⁴s w⁴r⁴ cons⁸d⁴r⁴d objectively m⁴asur⁴d w⁷⁴n t⁷⁴⁸r s⁸z⁴ was pr⁴c⁸s⁴ly known, and subjectively m⁴asur⁴d w⁷⁴n t⁷⁴ ⁴valuat⁸on r⁴l⁸⁴d s⁸mply on t⁷⁴ sub⁹⁴ct⁸v⁴ ⁸mpr⁴ss⁸ons o⁵ t⁷⁴ obs⁴rv⁴rs. ⁴ ⁴valuat⁸on o⁵ t⁷⁴ ⁴ff⁴ct o⁵ t⁷⁴ cu⁴s ⁵ollows ⁵rom t⁷⁸s d⁸st⁸nct⁸on. n t⁷⁴ first cas⁴, ⁸n ⁵act, t⁷⁴ ⁴ff⁴ct was precisely calculated as w⁴ll ⁸n t⁷⁴ s⁴cond ⁸t was poss⁸bl⁴ to d⁴t⁴rm⁸n⁴ only t⁷⁴ direction o⁵
2.3. THE PSYCHOLOGY LITERATURE
19
t⁷⁴ ⁴ff⁴ct. Lastly, som⁴ cu⁴s w⁴r⁴ ⁸nd⁸cat⁴d ⁸n t⁷⁴ r⁴ports as not s⁸⁶n⁸ficant ⁸n suc⁷ cas⁴s, not ⁴v⁴n t⁷⁴ d⁸r⁴ct⁸on o⁵ t⁷⁴ ⁴ff⁴ct was knowabl⁴.
⁴Paulo ⁴t al. d⁴fin⁴ t⁷⁴ ⁴ff⁴ct s⁸z⁴ d ⁵or ⁴ac⁷ cu⁴ as “t⁷⁴ m⁴an ⁵or t⁷⁴ d⁴c⁴pt⁸v⁴ cond⁸t⁸on (⁸.⁴., t⁷⁴ l⁸⁴s) m⁸nus t⁷⁴ m⁴an ⁵or t⁷⁴ trut⁷⁵ul cond⁸t⁸on (⁸.⁴., t⁷⁴ trut⁷s), d⁸v⁸d⁴d by t⁷⁴ m⁴an o⁵ t⁷⁴ standard d⁴v⁸at⁸ons ⁵or t⁷⁴ trut⁷s and t⁷⁴ l⁸⁴s [o⁷⁴n, ]” [⁴Paulo ⁴t al., 00, p. ]. ⁴r⁴⁵or⁴, w⁷⁴n d ⁸s pos⁸t⁸v⁴, t⁷⁴ b⁴⁷av⁸or app⁴ars mor⁴ ⁵r⁴qu⁴ntly ⁸n assoc⁸at⁸on w⁸t⁷ l⁸⁴s t⁷an w⁸t⁷ trut⁷ and t⁷⁴ ot⁷⁴r way around w⁷⁴n d ⁸s n⁴⁶at⁸v⁴. n som⁴ r⁴ports, m⁴an and standard d⁴v⁸at⁸on w⁴r⁴ not prov⁸d⁴d, but ot⁷⁴r stat⁸st⁸cal param⁴t⁴rs w⁴r⁴ ava⁸labl⁴ ⁸n suc⁷ cas⁴s, ⁴Paulo ⁴t al. ⁴mploy⁴d ot⁷⁴r r⁴l⁴vant m⁴t⁷ods ⁵or ⁴valuat⁸n⁶ t⁷⁴ ⁴ff⁴ct s⁸z⁴. W⁷⁴n t⁷⁴ ⁴ff⁴ct o⁵ a cu⁴ was ⁸nd⁸cat⁴d as not s⁸⁶n⁸ficant and no mor⁴ ⁸n⁵ormat⁸on was ⁶⁸v⁴n, t⁷⁴ aut⁷ors s⁴t d D 0 ⁸⁵ t⁷⁴ d⁸r⁴ct⁸on o⁵ t⁷⁴ ⁴ff⁴ct was known but not t⁷⁴ s⁸z⁴, t⁷⁴y cons⁴rvat⁸v⁴ly s⁴t d D C0:01 or d D 0:01. As a r⁴sult o⁵ t⁷⁴ analys⁸s, ⁴Paulo ⁴t al. ⁵ound ⁴ff⁴ct s⁸z⁴s, o⁵ w⁷⁸c⁷ “ could b⁴ ⁴st⁸mat⁴d pr⁴c⁸s⁴ly, w⁴r⁴ s⁴t to z⁴ro, and w⁴r⁴ ass⁸⁶n⁴d t⁷⁴ valu⁴s o⁵ ˙0:01” [⁴Paulo ⁴t al., 00, p. -0]. ⁴r⁴ w⁴r⁴ cu⁴s s⁷ow⁸n⁶ an ⁴ff⁴ct s⁸z⁴ ⁶r⁴at⁴r t⁷an ˙1:50. n ord⁴r to obta⁸n t⁷⁴ ⁴st⁸mat⁴ o⁵ t⁷⁴ ⁴ff⁴ct s⁸z⁴ o⁵ ⁴ac⁷ cu⁴, t⁷⁴ m⁴an was calculat⁴d ov⁴r all occurr⁴nc⁴s o⁵ t⁷⁴ cu⁴ p⁴r sp⁴ak⁴r and ov⁴r all occurr⁴nc⁴s o⁵ t⁷⁴ cu⁴ w⁸t⁷⁸n t⁷⁴ sampl⁴. ac⁷ ⁸nd⁴p⁴nd⁴nt sampl⁴ was count⁴d as a s⁸n⁶l⁴ m⁴asur⁴ o⁵ t⁷⁴ cu⁴ r⁴⁶ardl⁴ss o⁵ t⁷⁴ s⁸z⁴ o⁵ t⁷⁴ sampl⁴. Stud⁸⁴s t⁷at ⁸nvolv⁴d a ⁶r⁴at⁴r numb⁴r o⁵ sub⁹⁴cts, t⁷at ⁸s to say t⁷⁴ stud⁸⁴s w⁷⁸c⁷ prov⁸d⁴d t⁷⁴ most r⁴l⁸abl⁴ r⁴sults, w⁴r⁴ mor⁴ ⁷⁴av⁸ly w⁴⁸⁶⁷t⁴d. Lastly, ⁸n ord⁴r to asc⁴rta⁸n ⁸⁵ t⁷⁴ ⁴ff⁴ct s⁸z⁴s w⁴r⁴ s⁸⁶n⁸ficant across ⁸nd⁴p⁴nd⁴nt sampl⁴s, ⁴Paulo ⁴t al. ⁴mploy⁴d t⁷⁴ ⁷omo⁶⁴n⁴⁸ty stat⁸st⁸c Q [⁴d⁶⁴s and Olk⁸n, ], w⁷os⁴ p t⁷r⁴s⁷olds d⁴p⁴nd on t⁷⁴ l⁸k⁴l⁸⁷ood t⁷at t⁷⁴ var⁸anc⁴ ⁸n ⁴ff⁴ct s⁸z⁴s was du⁴ to sampl⁸n⁶ ⁴rrors.
M ⁴Paulo ⁴t al. pr⁴d⁸ct⁴d t⁷at c⁴rta⁸n ⁵actors w⁸ll mod⁴rat⁴ t⁷⁴ ⁴ff⁴ct s⁸z⁴ o⁵ t⁷⁴ cu⁴s. ⁴s⁴ ⁸nclud⁴ t⁷⁴ ⁵ollow⁸n⁶. Motivation to lie Identity-relevant incentives w⁷⁴r⁴ t⁷⁴ task conc⁴rn⁴d t⁷⁴ sub⁹⁴cts’ pro⁵⁴ss⁸onal sk⁸lls or som⁴ ot⁷⁴r valu⁴d comp⁴t⁴nc⁴, suc⁷ as t⁷⁴⁸r ⁸nt⁴ll⁸⁶⁴nc⁴. Instrumental incentives ⁸n w⁷⁸c⁷ t⁷⁴ sub⁹⁴cts r⁴c⁴⁸v⁴d som⁴ k⁸nd o⁵ ⁴conom⁸c or mat⁴r⁸al r⁴ward. Both. Lastly, t⁷⁴ aut⁷ors class⁸fi⁴d s⁴parat⁴ly t⁷⁴ stud⁸⁴s w⁷⁴r⁴ bot⁷ ⁸nduc⁴m⁴nts w⁴r⁴ prov⁸d⁴d to t⁷⁴ sub⁹⁴cts.
20
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
Opportunity to plan the narrative
Planned ⁸n w⁷⁸c⁷ sub⁹⁴cts w⁴r⁴ ⁶⁸v⁴n pr⁴parat⁸on t⁸m⁴, r⁴mov⁸n⁶ som⁴ o⁵ t⁷⁴ m⁴ntal ⁴ffort ⁸nvolv⁴d ⁸n ly⁸n⁶. L⁸ars could s⁷ow lon⁶⁴r r⁴spons⁴s w⁸t⁷ s⁷ort⁴r r⁴spons⁴ lat⁴nc⁸⁴s.
Unplanned ⁸n w⁷⁸c⁷ sub⁹⁴cts w⁴r⁴ r⁴qu⁸r⁴d to r⁴spond spontan⁴ously. ⁴Paulo ⁴t al. ⁷ypot⁷⁴s⁸z⁴ t⁷at ⁷⁴r⁴ l⁸⁴s ar⁴ mor⁴ l⁸k⁴ly to b⁴ s⁷ort⁴r, l⁴ss cons⁸st⁴nt, and s⁷ow d⁸sflu⁴nc⁸⁴s and ot⁷⁴r s⁸⁶ns o⁵ m⁴ntal ⁴ffort. Duration of messages
Lon⁶⁴r m⁴ssa⁶⁴s may ⁸mpos⁴ “⁶r⁴at⁴r co⁶n⁸t⁸v⁴ burd⁴ns” on t⁷⁴ l⁸ar, mak⁸n⁶ cu⁴s to d⁴c⁴pt⁸on “cl⁴ar⁴r and mor⁴ num⁴rous.”
⁴ stud⁸⁴s w⁴r⁴ cons⁸d⁴r⁴d within subject ⁸⁵ t⁷⁴ sam⁴ actors ⁷ad to ⁶⁸v⁴ tru⁴ and ⁵als⁴ stat⁴m⁴nts between subjects ⁸⁵ trut⁷⁵ul and untrut⁷⁵ul narrat⁸v⁴s w⁴r⁴ ⁸ssu⁴d by d⁸ff⁴r⁴nt sub⁹⁴cts. ⁴ stud⁸⁴s, ⁸n turn, w⁴r⁴ d⁸v⁸d⁴d accord⁸n⁶ to t⁷⁴ parad⁸⁶m o⁵ t⁷⁴⁸r ⁴xp⁴r⁸m⁴ntal d⁴s⁸⁶ns and r⁴cod⁴d ⁸nto two cat⁴⁶or⁸⁴s l⁸⁴s ⁸nvolv⁸n⁶ trans⁶r⁴ss⁸ons, w⁷⁴r⁴ ⁴Paulo ⁴t al. ⁷ypot⁷⁴s⁸z⁴d t⁷at t⁷⁴ cu⁴s would b⁴ cl⁴ar⁴r, and t⁷os⁴ not about trans⁶r⁴ss⁸ons. ⁴ ⁴xp⁴r⁸m⁴ntal parad⁸⁶ms w⁴r⁴ cat⁴⁶or⁸z⁴d as ⁵ollows w⁴ ⁸nclud⁴ t⁷⁴ numb⁴r o⁵ stud⁸⁴s ⁸nvolv⁸n⁶ ⁴ac⁷ parad⁸⁶m. Transgressions Mock crime ( stud⁸⁴s) ⁸n w⁷⁸c⁷ sub⁹⁴cts ⁷av⁴ to “st⁴al” som⁴ mon⁴y, and t⁷⁴n l⁸⁴ about t⁷⁴ mock t⁷⁴⁵t.
Cheating () ⁸n w⁷⁸c⁷ sub⁹⁴cts w⁴r⁴ r⁴qu⁸r⁴d to c⁷⁴at and t⁷⁴n l⁸⁴ about ⁸t.
Naturalistic () ⁸n w⁷⁸c⁷ sub⁹⁴cts l⁸⁴d “o⁵ t⁷⁴⁸r own accord” ⁸n r⁴al-l⁸⁵⁴ c⁸rcumstanc⁴s w⁸t⁷ t⁷⁴⁸r stat⁴m⁴nts lat⁴r d⁴t⁴rm⁸n⁴d to b⁴ l⁸⁴s. Non-Transgressions Beliefs () ⁸nvolv⁴d sc⁴nar⁸os w⁷⁴r⁴ sub⁹⁴cts w⁴r⁴ ⁴xp⁴ct⁴d to l⁸⁴ or t⁴ll t⁷⁴ trut⁷ about p⁴rsonal b⁴l⁸⁴⁵s or op⁸n⁸ons.
Image description () ⁸n w⁷⁸c⁷ sub⁹⁴cts w⁴r⁴ ask⁴d to v⁸⁴w ⁸ma⁶⁴s and l⁸⁴ or t⁴ll t⁷⁴ trut⁷ about w⁷at t⁷⁴y w⁴r⁴ s⁴⁴⁸n⁶. Guilty knowledge/Card test () ⁸n w⁷⁸c⁷ sub⁹⁴cts l⁸⁴ about som⁴t⁷⁸n⁶ t⁷⁴y know or w⁷at card t⁷⁴y ⁷av⁴ s⁴l⁴ct⁴d ⁵rom a d⁴ck.
Person description () ⁸n w⁷⁸c⁷ sub⁹⁴cts ⁶⁸v⁴ trut⁷⁵ul and ⁵als⁴ d⁴scr⁸pt⁸ons o⁵ t⁷⁴⁸r ⁵⁴⁴l⁸n⁶s r⁴⁶ard⁸n⁶ p⁴opl⁴ t⁷⁴y know.
Job interview () ⁸n w⁷⁸c⁷ typ⁸cally t⁷⁴ sub⁹⁴ct ⁷as to conv⁸nc⁴ an ⁸nt⁴rv⁸⁴w⁴r o⁵ qual⁸ficat⁸ons t⁷⁴y do not actually poss⁴ss.
2.3. THE PSYCHOLOGY LITERATURE
21
Personality scale () ⁸n w⁷⁸c⁷ sub⁹⁴cts l⁸⁴ or t⁴ll t⁷⁴ trut⁷ about r⁴spons⁴s t⁷⁴y ⁷av⁴ ⁶⁸v⁴n on a p⁴rsonal⁸ty scal⁴. Pain experience () ⁸n w⁷⁸c⁷ t⁷⁴ d⁴c⁴pt⁸on conc⁴rn⁴d som⁴ pa⁸n r⁴ally ⁵⁴lt but d⁸ss⁸mulat⁴d or, by contrast, s⁸mulat⁴d but not r⁴ally ⁵⁴lt.
⁴ d⁴⁶r⁴⁴ o⁵ ⁸nt⁴ract⁸on ⁸n t⁷⁴ ⁴xp⁴r⁸m⁴ntal cond⁸t⁸ons was also ass⁴ss⁴d by t⁷⁴ aut⁷ors, w⁷o d⁸st⁸n⁶u⁸s⁷⁴d ⁵our s⁸tuat⁸ons. Full interaction ⁸n w⁷⁸c⁷ t⁷⁴ part⁸c⁸pants ⁸nt⁴ract⁴d ⁵r⁴⁴ly and ⁴xpr⁴ss⁴d t⁷⁴ms⁴lv⁴s w⁸t⁷out any constra⁸nt d⁴t⁴rm⁸n⁴d by t⁷⁴ ⁴xp⁴r⁸m⁴ntal d⁴s⁸⁶n. Partial interaction ⁸nvolv⁴d som⁴ k⁸nd o⁵ structur⁴ t⁷at was ⁸mpos⁴d on t⁷⁴ commun⁸cat⁸on, typ⁸cally a s⁴t o⁵ pr⁴-ord⁴r⁴d qu⁴st⁸ons. Non-direct interaction ⁸n w⁷⁸c⁷ t⁷⁴ part⁸c⁸pants w⁴r⁴ ⁸n t⁷⁴ sam⁴ ⁴nv⁸ronm⁴nt but d⁸d not ⁸nt⁴ract w⁸t⁷ ⁴ac⁷ ot⁷⁴r. n t⁷⁸s cas⁴ t⁷⁴r⁴ ⁸s no d⁸r⁴ct commun⁸cat⁸on, and t⁷⁴ r⁴c⁸procal ⁸nflu⁴nc⁴ o⁵ t⁷⁴ actors ⁸s l⁸m⁸t⁴d to t⁷at o⁵ b⁴⁸n⁶ pr⁴s⁴nt ⁸n t⁷⁴ sam⁴ ⁴nv⁸ronm⁴nt. Absence of interaction ⁸n w⁷⁸c⁷ t⁷⁴ sub⁹⁴cts w⁴r⁴ l⁴⁵t compl⁴t⁴ly alon⁴ w⁷⁴n carry⁸n⁶ out t⁷⁴ ⁴xp⁴r⁸m⁴ntal task.
Lastly, ⁴Paulo ⁴t al. cat⁴⁶or⁸z⁴d t⁷⁴ cu⁴s as b⁴⁸n⁶ ass⁴ss⁴d ob⁹⁴ct⁸v⁴ly or sub⁹⁴ct⁸v⁴ly ⁸n ⁴ac⁷ study s⁸nc⁴ t⁷⁴⁸r s⁴l⁵-pr⁴s⁴ntat⁸on mod⁴l pr⁴d⁸cts t⁷at sub⁹⁴ct⁸v⁴ ⁸mpr⁴ss⁸ons would b⁴ mor⁴ pow⁴r⁵ul d⁸scr⁸m⁸nators o⁵ tru⁴ vs. ⁵als⁴. nd⁴⁴d, around 0% o⁵ ⁴st⁸mat⁸ons o⁵ t⁷⁴ cu⁴s o⁵ d⁴c⁴pt⁸on cam⁴ ⁵rom sub⁹⁴ct⁸v⁴ ⁸mpr⁴ss⁸ons o⁵ untra⁸n⁴d rat⁴rs [⁴Paulo ⁴t al., 00] A
⁴ cod⁸n⁶ o⁵ t⁷⁴ study c⁷aract⁴r⁸st⁸cs was ⁸mpl⁴m⁴nt⁴d by t⁷r⁴⁴ annotators, w⁷o w⁴r⁴ tra⁸n⁴d b⁴⁵or⁴ t⁷⁴ task and w⁷o ⁴ac⁷ cod⁴d two-t⁷⁸rds o⁵ t⁷⁴ stud⁸⁴s. ⁴ d⁸scr⁴panc⁸⁴s b⁴tw⁴⁴n cod⁴rs w⁴r⁴ t⁷⁴n d⁸scuss⁴d and r⁴solv⁴d ⁸n con⁵⁴r⁴nc⁴. ⁴ w⁷ol⁴ corpus o⁵ stud⁸⁴s was also cod⁴d by ⁴lla M. ⁴Paulo, w⁷o d⁸scuss⁴d t⁷⁴ r⁴ma⁸n⁸n⁶ d⁸scr⁴panc⁸⁴s w⁸t⁷ a ⁵ourt⁷ cod⁴r. Un⁵ortunat⁴ly, ⁴Paulo ⁴t al. d⁸d not prov⁸d⁴ any m⁴asur⁴ o⁵ t⁷⁴ a⁶r⁴⁴m⁴nt b⁴tw⁴⁴n cod⁴rs, w⁷⁸c⁷ would ⁷av⁴ b⁴⁴n us⁴⁵ul ⁸n ⁴valuat⁸n⁶ t⁷⁴ r⁴l⁸ab⁸l⁸ty o⁵ t⁷⁴⁸r class⁸ficat⁸on. ow⁴v⁴r, t⁷⁴y cla⁸m t⁷at t⁷⁴ a⁶r⁴⁴m⁴nt was n⁴arly p⁴r⁵⁴ct ⁵or t⁷⁴ ⁴valuat⁸on o⁵ t⁷⁴ l⁴ast amb⁸⁶uous var⁸abl⁴s, as w⁴ll as ⁵or t⁷⁴ d⁸m⁴ns⁸on “trans⁶r⁴ss⁸on vs. non-trans⁶r⁴ss⁸on” [⁴Paulo ⁴t al., 00, p. ]. Report Statistics ⁴Paulo ⁴t al. prov⁸d⁴ stat⁸st⁸cs ⁵or t⁷⁴ r⁴ports t⁷⁴y analyz⁴d. n o⁵ t⁷⁴ 0 ⁸nd⁴p⁴nd⁴nt sampl⁴s cons⁸d⁴r⁴d, t⁷⁴ sub⁹⁴cts r⁴c⁴⁸v⁴d som⁴ ⁸nc⁴nt⁸v⁴ to l⁸⁴. An ⁸nt⁴r⁴st⁸n⁶ d⁴ta⁸l conc⁴rns t⁷⁴ durat⁸on o⁵ t⁷⁴ trut⁷⁵ul/d⁴c⁴pt⁸v⁴ m⁴ssa⁶⁴s only stud⁸⁴s r⁴port⁴d durat⁸on, and ⁸n o⁵ t⁷⁴m t⁷⁴ durat⁸on was 1 mi n: ⁴ ⁵act t⁷at s⁴v⁴ral cu⁴s o⁵ d⁴c⁴pt⁸on w⁴r⁴ produc⁴d ⁸n suc⁷ a s⁷ort span o⁵ t⁸m⁴ l⁴av⁴s op⁴n t⁷⁴ qu⁴st⁸on o⁵ ⁷ow t⁷⁴y m⁸⁶⁷t ⁴volv⁴ ov⁴r t⁸m⁴ ⁸n a lon⁶ narrat⁸v⁴.
22
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
Lastly, w⁸t⁷ r⁴sp⁴ct to t⁷⁴ ⁸nt⁴r-rat⁴r a⁶r⁴⁴m⁴nt on t⁷⁴ cu⁴s, “O⁵ t⁷⁴ ⁴st⁸mat⁴s o⁵ t⁷⁴ cu⁴s to d⁴c⁴pt⁸on, (0%) w⁴r⁴ bas⁴d on t⁷⁴ sub⁹⁴ct⁸v⁴ ⁸mpr⁴ss⁸ons o⁵ untra⁸n⁴d rat⁴rs” [⁴Paulo ⁴t al., 00, p. 0]. ⁴ ⁵act t⁷at t⁷⁴ rat⁴ o⁵ ⁴st⁸mat⁴s w⁷⁸c⁷ do not r⁴ly on ob⁹⁴ct⁸v⁴ m⁴asur⁴s do⁴s not ⁴xc⁴⁴d 0% ⁸s ⁶ood. ⁴ r⁴l⁸ab⁸l⁸ty o⁵ rat⁴r ⁹ud⁶m⁴nts and t⁷⁴ a⁶r⁴⁴m⁴nt b⁴tw⁴⁴n ⁷uman cod⁴rs, ⁸n ⁵act, ⁸s a t⁷orny ⁸ssu⁴ ⁸n sc⁸⁴nt⁸fic r⁴s⁴arc⁷. W⁸t⁷ r⁴sp⁴ct to t⁷⁸s, ⁸t ⁸s wort⁷ quot⁸n⁶ Mass⁸mo Po⁴s⁸o on ⁷⁸s work w⁸t⁷ anap⁷ora r⁴solut⁸on “Sub⁹⁴cts do not ⁴v⁴n a⁶r⁴⁴ on w⁷⁴t⁷⁴r a pronoun ⁸s t⁷⁴r⁴ or not.” No n⁴⁴d to say t⁷at t⁷⁴ pr⁴s⁴nc⁴ or abs⁴nc⁴ o⁵ a pronoun s⁷ould b⁴ unamb⁸⁶uous ⁴v⁴n so, rat⁴r op⁸n⁸ons can b⁴ qu⁸t⁴ ⁸ncons⁸st⁴nt. A fortiori, t⁷⁸s probl⁴m can b⁴ ⁴xp⁴ct⁴d w⁷⁴n ⁸t ⁸s a matt⁴r o⁵ ⁴valuat⁸n⁶ muc⁷ mor⁴ s⁷ad⁴d ⁸t⁴ms. ⁴r⁴⁵or⁴, t⁷⁴ ⁵act t⁷at most ⁴st⁸mat⁴s w⁴r⁴ ob⁹⁴ct⁸v⁴ly m⁴asur⁴d ⁸s a r⁴markabl⁴ po⁸nt w⁷⁸c⁷ supports t⁷⁴ r⁴l⁸ab⁸l⁸ty o⁵ t⁷⁴ r⁴sults.
Results n pr⁴s⁴nt⁸n⁶ t⁷⁴ r⁴sults, ⁴Paulo ⁴t al. ⁵ollow t⁷⁴ sc⁷⁴m⁴ s⁷own ⁸n S⁴ct⁸on .., d⁸scuss⁸n⁶ ⁴ac⁷ o⁵ t⁷⁴ ⁷ypot⁷⁴s⁴s t⁷at ⁵ollows ⁵rom t⁷⁴⁸r s⁴l⁵-pr⁴s⁴ntat⁸on p⁴rsp⁴ct⁸v⁴ on d⁴c⁴pt⁸on. ⁴ sam⁴ approac⁷ ⁸s adopt⁴d ⁷⁴r⁴. Unl⁴ss ot⁷⁴rw⁸s⁴ not⁴d, w⁴ r⁴port ⁷⁴r⁴ only t⁷⁴ s⁸⁶n⁸ficant r⁴sults. ⁴Paulo ⁴t al. r⁴port as pos⁸t⁸v⁴ d⁸ff⁴r⁴nc⁴s (Cd ) b⁴⁷av⁸ors assoc⁸at⁴d w⁸t⁷ ly⁸n⁶ and as n⁴⁶at⁸v⁴ d⁸ff⁴r⁴nc⁴s t⁷os⁴ assoc⁸at⁴d w⁸t⁷ trut⁷-t⁴ll⁸n⁶.
⁴ aut⁷ors ⁴valuat⁴d t⁷⁴ l⁴n⁶t⁷ o⁵ t⁷⁴ r⁴spons⁴ as t⁷⁴ first ⁸nd⁴x o⁵ t⁷⁴ approac⁷ab⁸l⁸ty o⁵ t⁷⁴ sub⁹⁴cts. ⁴y cla⁸m t⁷at t⁷⁸s ⁸s t⁷⁴ ⁵⁴atur⁴ o⁵ t⁷⁴ m⁴ssa⁶⁴s most o⁵t⁴n r⁴port⁴d ⁸n t⁷⁴ l⁸t⁴ratur⁴ non⁴t⁷⁴l⁴ss t⁷⁴y ⁵ound only a “t⁸ny and nons⁸⁶n⁸ficant ⁴ff⁴ct ⁸n t⁷⁴ pr⁴d⁸ct⁴d d⁸r⁴ct⁸on (d D 0:03)” [⁴Paulo ⁴t al., 00, p. ] d⁴c⁴pt⁸v⁴ m⁴ssa⁶⁴s would b⁴ s⁷ort⁴r t⁷an trut⁷⁵ul on⁴s. l⁴ar⁴r outcom⁴s, ⁷ow⁴v⁴r, w⁴r⁴ ⁵ound r⁴⁶ard⁸n⁶ t⁷⁴ durat⁸on o⁵ talk t⁸m⁴ and o⁵ t⁷⁴ ⁸nt⁴ract⁸on l⁸ars talk⁴d l⁴ss t⁷an trut⁷-t⁴ll⁴rs (d D 0:35) and ⁸nt⁴ract⁸ons w⁴r⁴ s⁷ort⁴r w⁷⁴n a l⁸ar was pr⁴s⁴nt (d D 0:20). L⁸ars also suppl⁸⁴d ⁵⁴w⁴r d⁴ta⁸ls t⁷an trut⁷-t⁴ll⁴rs (d D 0:30), a find⁸n⁶ w⁷⁸c⁷ ⁸s consonant w⁸t⁷ t⁷⁴ ⁷ypot⁷⁴s⁴s o⁵ t⁷⁴ aut⁷ors. ollow⁸n⁶ t⁷⁴ assump⁸ons o⁵ t⁷⁴ r⁴al⁸ty mon⁸tor⁸n⁶ t⁷⁴ory, d⁸scuss⁴d ⁸n S⁴ct⁸on . t⁷⁴y also ⁴xp⁴ct⁴d t⁷at d⁴c⁴pt⁸v⁴ m⁴ssa⁶⁴s would conta⁸n l⁴ss s⁴nsory-r⁴lat⁴d ⁸n⁵ormat⁸on, but t⁷⁴y ⁵ound only “a nons⁸⁶n⁸ficant tr⁴nd ⁸n t⁷at d⁸r⁴ct⁸on (d D 0:17)” [⁴Paulo ⁴t al., 00, p. ]. W⁸t⁷ r⁴sp⁴ct to non-v⁴rbal b⁴⁷av⁸or, t⁷⁴ only cu⁴ t⁷at was s⁸⁶n⁸ficant conc⁴rn⁴d t⁷⁴ t⁴nd⁴ncy o⁵ l⁸ars to pr⁴ss t⁷⁴ l⁸ps to⁶⁴t⁷⁴r mor⁴ t⁷an trut⁷-t⁴ll⁴rs (d D C0:16) ⁸n t⁷⁸s cas⁴ too t⁷⁴ pr⁴d⁸ct⁸ons o⁵ t⁷⁴ aut⁷ors, w⁷o ⁴xp⁴ct⁴d l⁴ss ⁵r⁸⁴ndly b⁴⁷av⁸ors ⁵rom l⁸ars, w⁴r⁴ support⁴d.
⁴ ⁴xp⁴r⁸m⁴ntal r⁴sults support⁴d t⁷⁴ s⁴cond ⁷ypot⁷⁴s⁸s o⁵ t⁷⁴ aut⁷ors as w⁴ll. n ⁵act, l⁸ars’ narrat⁸v⁴s w⁴r⁴ ⁹ud⁶⁴d to b⁴ s⁸⁶n⁸ficantly “l⁴ss plaus⁸bl⁴ (d D 0:23) l⁴ss l⁸k⁴ly to b⁴ structur⁴d ⁸n a lo⁶⁸cal, s⁴ns⁸bl⁴ way (d D 0:25) and mor⁴ l⁸k⁴ly to b⁴ ⁸nt⁴rnally d⁸scr⁴pant or to conv⁴y amb⁸val⁴nc⁴ (d D 0:34)” [⁴Paulo ⁴t al., 00, p. ]. L⁸ars w⁴r⁴ also s⁸⁶n⁸ficantly “l⁴ss
2.3. THE PSYCHOLOGY LITERATURE
23
⁸nvolv⁴d v⁴rbally and vocally ⁸n t⁷⁴⁸r s⁴l⁵-pr⁴s⁴ntat⁸ons” (d D 0:21) and, as ⁵ar as non-v⁴rbal b⁴⁷av⁸or ⁸s conc⁴rn⁴d, t⁷⁴y ⁴x⁷⁸b⁸t⁴d ⁵⁴w⁴r ⁸llustrat⁸v⁴ ⁶⁴stur⁴s dur⁸n⁶ t⁷⁴ narrat⁸v⁴ (d D 0:14). n ord⁴r to ⁴st⁸mat⁴ t⁷⁴ ⁸mm⁴d⁸acy o⁵ t⁷⁴ commun⁸cat⁸on, t⁷⁴ aut⁷ors mad⁴ us⁴ o⁵ a s⁴t o⁵ cu⁴s t⁷at ⁷ad b⁴⁴n t⁷⁴ ob⁹⁴ct o⁵ study o⁵ ot⁷⁴r aut⁷ors [l⁴m⁸n⁶, , M⁴⁷rab⁸an, ] and w⁷⁸c⁷ t⁷⁴y collaps⁴d ⁸nto t⁷r⁴⁴ compos⁸t⁴ m⁴asur⁴s.
Verbal immediacy. ⁴Paulo ⁴t al. ar⁴ caut⁸ous ⁸n t⁷⁴ us⁴ o⁵ t⁷⁸s cat⁴⁶ory, c⁸t⁸n⁶ pr⁴c⁸s⁴ ob⁹⁴ct⁸v⁴ m⁴asur⁴s o⁵ ⁸mm⁴d⁸acy suc⁷ as t⁷os⁴ d⁴scr⁸b⁴d by W⁸⁴n⁴r and M⁴⁷rab⁸an [], ⁸nclud⁸n⁶ “t⁷⁴ us⁴ o⁵ t⁷⁴ pass⁸v⁴ rat⁷⁴r t⁷an t⁷⁴ act⁸v⁴ vo⁸c⁴, t⁷⁴ us⁴ o⁵ n⁴⁶at⁸ons rat⁷⁴r t⁷an ass⁴rt⁸ons, and look⁸n⁶ away rat⁷⁴r t⁷an ma⁸nta⁸n⁸n⁶ ⁴y⁴ contact” [⁴Paulo ⁴t al., 00, p. -], but not⁸n⁶ t⁷at ot⁷⁴r cu⁴s ⁷av⁴ b⁴⁴n c⁸t⁴d ⁸n t⁷⁴ l⁸t⁴ratur⁴ and sk⁸ll⁴d obs⁴rv⁴rs o⁵ soc⁸al b⁴⁷av⁸or “can d⁸scr⁸m⁸nat⁴ trut⁷s ⁵rom l⁸⁴s by t⁷⁴⁸r sub⁹⁴ct⁸v⁴ ⁸mpr⁴ss⁸ons o⁵ t⁷⁴ constructs o⁵ ⁸nt⁴r⁴st (⁴.⁶., d⁸stanc⁸n⁶) ⁹ust as w⁴ll, ⁸⁵ not b⁴tt⁴r, t⁷an can ob⁹⁴ct⁸v⁴ cod⁸n⁶ syst⁴ms.” ⁴Paulo ⁴t al.’s analys⁸s s⁷ows t⁷at l⁸ars app⁴ar ⁶⁴n⁴rally l⁴ss ⁸mm⁴d⁸at⁴ t⁷an trut⁷-t⁴ll⁴rs by a s⁸⁶n⁸ficant mar⁶⁸n (d D 0:31). n part⁸cular, t⁷⁴y s⁴⁴m⁴d to ⁴mploy mor⁴ l⁸n⁶u⁸st⁸c ⁴xpr⁴ss⁸ons w⁷⁸c⁷ w⁴r⁴ ⁹ud⁶⁴d to put a d⁸stanc⁴ b⁴tw⁴⁴n t⁷⁴ms⁴lv⁴s and t⁷⁴ ⁸nt⁴rlocutors as w⁴ll as t⁷⁴ cont⁴nt o⁵ t⁷⁴⁸r stat⁴m⁴nts. ⁴y also s⁴⁴m⁴d to b⁴ “mor⁴ ⁴vas⁸v⁴, uncl⁴ar, and ⁸mp⁴rsonal” [⁴Paulo ⁴t al., 00, p. ]. Non-verbal immediacy was m⁴asur⁴d t⁷rou⁶⁷ t⁷⁴ ⁸mpr⁴ss⁸ons o⁵ t⁷⁴ obs⁴rv⁴rs and t⁷⁴ r⁴lat⁸on b⁴tw⁴⁴n cu⁴s and d⁴c⁴pt⁸on was non-s⁸⁶n⁸ficant, alt⁷ou⁶⁷ t⁷⁴ tr⁴nd was ⁸n t⁷⁴ ⁴xp⁴ct⁴d d⁸r⁴ct⁸on (d D 0:31). Verbal and vocal immediacy. As ⁸n t⁷⁴ pr⁴v⁸ous cas⁴, t⁷⁴s⁴ cu⁴s r⁴ly on sub⁹⁴ct⁸v⁴ ⁹ud⁶m⁴nts. ow⁴v⁴r, ⁸n assoc⁸at⁸on w⁸t⁷ t⁷⁴ v⁴rbal ⁸mm⁴d⁸acy compos⁸t⁴ m⁴asur⁴, t⁷⁴y s⁷ow⁴d a s⁸⁶n⁸ficant ⁴ff⁴ct (d D 0:55).
v⁴n t⁷ou⁶⁷ t⁷⁴ a⁶⁶r⁴⁶at⁸on o⁵ cu⁴s to ⁸mm⁴d⁸acy ⁶av⁴ appr⁴c⁸abl⁴ r⁴sults, t⁷⁴ ⁴ff⁴ct o⁵ most o⁵ t⁷⁴ ⁸nd⁸v⁸dual cu⁴s was ⁸ncons⁸st⁴nt. or ⁴xampl⁴, ⁸t ⁸s not⁴wort⁷y t⁷at ⁴y⁴ contact and ⁶az⁴ av⁴rs⁸on w⁴r⁴ not s⁸⁶n⁸ficant cu⁴s o⁵ d⁴c⁴pt⁸on. ⁸s ⁸s ⁸n contrast w⁸t⁷ w⁷at ⁸s commonly b⁴l⁸⁴v⁴d about d⁴c⁴pt⁸v⁴ b⁴⁷av⁸or. A poss⁸bl⁴ ⁴xplanat⁸on o⁵ t⁷⁸s r⁴sult may r⁴ly on t⁷⁴ ⁵act t⁷at soc⁸al actors ar⁴ ⁷⁸⁶⁷ly awar⁴ o⁵ t⁷⁴⁸r mana⁶⁴m⁴nt o⁵ ⁴y⁴ contact and ar⁴ abl⁴ to control ⁸t, ⁴x⁴rc⁸s⁸n⁶ count⁴rm⁴asur⁴s. ⁴ ⁴xpr⁴ss⁸on o⁵ t⁷⁸s k⁸nd o⁵ b⁴⁷av⁸or may also b⁴ r⁴⁶ulat⁴d ⁸n a mor⁴ compl⁴x way, as d⁸scuss⁴d ⁸n t⁷⁴ s⁴ct⁸on on Mod⁴rators b⁴low. A ⁵urt⁷⁴r r⁴l⁴vant param⁴t⁴r t⁷at aff⁴cts t⁷⁴ ⁴xt⁴nt to w⁷⁸c⁷ a sp⁴ak⁴r app⁴ars comp⁴ll⁸n⁶ ⁸s t⁷⁴ s⁴ns⁴ o⁵ unc⁴rta⁸nty w⁷⁸c⁷ accompan⁸⁴s t⁷⁴ narrat⁸v⁴. Accord⁸n⁶ to t⁷⁴ sub⁹⁴ct⁸v⁴ ⁴valuat⁸ons, l⁸ars s⁴⁴m⁴d to b⁴ mor⁴ unc⁴rta⁸n t⁷an trut⁷-t⁴ll⁴rs (d D C0:30). ow⁴v⁴r, t⁷⁸s ⁸mpr⁴ss⁸on do⁴s not carry ov⁴r to t⁷⁴ cat⁴⁶ory o⁵ sp⁴⁴c⁷ d⁸sflu⁴nc⁸⁴s. W⁸t⁷ r⁴sp⁴ct to t⁷⁸s, ⁴Paulo ⁴t al. ⁴mbrac⁴ t⁷⁴ d⁸st⁸nct⁸on propos⁴d by Kasl and Ma⁷l [] b⁴tw⁴⁴n t⁷⁴ so-call⁴d “ah” disturbances, w⁷⁸c⁷ usually fill t⁷⁴ paus⁴s w⁷⁴n t⁷⁴ cont⁴nt to b⁴ ⁴xpr⁴ss⁴d ⁸s part⁸cularly compl⁴x, and t⁷⁴ “non-ah” disturbances, w⁷⁸c⁷ cons⁸st ⁸n ⁸nt⁴rrupt⁸ons and c⁷an⁶⁴s ⁸n t⁷⁴ s⁴nt⁴nc⁴s, stutt⁴rs, om⁸ss⁸on o⁵ words, and sl⁸ps o⁵ ton⁶u⁴, w⁷⁸c⁷ ar⁴ cons⁸d⁴r⁴d a s⁸⁶n o⁵
24
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
anx⁸⁴ty. ⁴ most ⁵r⁴qu⁴ntly r⁴port⁴d cu⁴s o⁵ t⁷⁸s k⁸nd, comb⁸n⁴d w⁸t⁷ s⁸l⁴nt paus⁴s, w⁴r⁴ not s⁸⁶n⁸ficant, w⁷⁸l⁴ t⁷⁴ r⁴p⁴t⁸t⁸on o⁵ words and p⁷ras⁴s was a s⁸⁶n⁸ficant cu⁴ to d⁴c⁴pt⁸on (d D C0:21). Suc⁷ an outcom⁴ su⁶⁶⁴sts t⁷at t⁷⁴ compl⁴x⁸ty o⁵ t⁷⁴ task o⁵ ly⁸n⁶ ⁸mpl⁸⁴s an absorpt⁸on o⁵ co⁶n⁸t⁸v⁴ r⁴sourc⁴s, w⁷⁸c⁷ ⁷as t⁷⁴ p⁴cul⁸ar ⁴ff⁴ct o⁵ mak⁸n⁶ t⁷⁴ lan⁶ua⁶⁴ mor⁴ st⁴r⁴otyp⁴d, w⁷⁸c⁷ m⁸⁶⁷t b⁴ a product⁸v⁴ d⁸st⁸nct⁸on w⁸t⁷ r⁴sp⁴ct to l⁸n⁶u⁸st⁸c analys⁸s. As ⁵ar as t⁷⁴ non-v⁴rbal b⁴⁷av⁸or ⁸s conc⁴rn⁴d, t⁷⁴ aut⁷ors ⁵ound t⁷at l⁸ars t⁴nd to ra⁸s⁴ t⁷⁴ c⁷⁸n s⁸⁶n⁸ficantly mor⁴ o⁵t⁴n t⁷an trut⁷-t⁴ll⁴rs (d D C0:25). ⁸s ⁸s actually a qu⁸t⁴ surpr⁸s⁸n⁶ r⁴sult, as ra⁸s⁸n⁶ t⁷⁴ c⁷⁸n ⁸s usually cons⁸d⁴r⁴d a s⁸⁶n o⁵ dom⁸nanc⁴ ⁸n t⁷⁴ soc⁸al ⁸nt⁴ract⁸on and o⁵ c⁴rta⁸nty, w⁷⁸l⁴ t⁷⁴ l⁸ars s⁴⁴m⁴d to b⁴ mor⁴ unc⁴rta⁸n t⁷an s⁸nc⁴r⁴ actors. y contrast, ot⁷⁴r non-v⁴rbal b⁴⁷av⁸ors w⁷⁸c⁷ may conv⁴y som⁴ s⁴ns⁴ o⁵ unc⁴rta⁸nty, suc⁷ as postur⁴ s⁷⁸⁵ts, ⁷and mov⁴m⁴nts, and ⁵oot or l⁴⁶ mov⁴m⁴nts, d⁸d not s⁷ow any cl⁴ar r⁴lat⁸ons⁷⁸p w⁸t⁷ d⁴c⁴pt⁸on. ⁴r⁴⁵or⁴ t⁷⁴s⁴ outcom⁴s s⁴⁴m to su⁶⁶⁴st t⁷at t⁷⁴ non-⁸mm⁴d⁸acy, t⁷⁴ unc⁴rta⁸nty, and, ⁸n ⁶⁴n⁴ral, t⁷⁴ l⁴ss⁴r ⁸nvolv⁴m⁴nt w⁷⁸c⁷ c⁷aract⁴r⁸z⁴ t⁷⁴ v⁴rbal ⁴xpr⁴ss⁸ons o⁵ t⁷⁴ l⁸ars t⁴nd not to b⁴ corr⁴lat⁴d w⁸t⁷ analo⁶ous b⁴⁷av⁸ors on t⁷⁴ l⁴v⁴l o⁵ body lan⁶ua⁶⁴.
⁸s ⁷ypot⁷⁴s⁸s also r⁴c⁴⁸v⁴d som⁴ confirmat⁸on ⁵rom ⁴Paulo ⁴t al. L⁸ars turn⁴d out to ⁸ssu⁴ mor⁴ n⁴⁶at⁸v⁴ stat⁴m⁴nts and compla⁸nts t⁷an trut⁷-t⁴ll⁴rs (d D C0:21), and t⁷⁴y also w⁴r⁴ p⁴rc⁴⁸v⁴d as l⁴ss pl⁴asant (d D 0:12), and s⁸⁶n⁸ficantly l⁴ss coop⁴rat⁸v⁴ (d D 0:66). As ⁸n t⁷⁴ pr⁴v⁸ous cas⁴, l⁴ss cl⁴ar r⁴sults com⁴ ⁵rom t⁷⁴ non-v⁴rbal b⁴⁷av⁸or. or ⁴xampl⁴, “t⁷⁴ ⁴st⁸mat⁴s o⁵ sm⁸l⁸n⁶ produc⁴d a comb⁸n⁴d ⁴ff⁴ct s⁸z⁴ o⁵ ⁴xactly z⁴ro” [⁴Paulo ⁴t al., 00, p. ]. ow⁴v⁴r, t⁷⁴ aut⁷ors addr⁴ss t⁷⁴ ⁵act t⁷at, ⁸n t⁷⁴ stud⁸⁴s o⁵ sm⁸l⁸n⁶, only two d⁸st⁸n⁶u⁸s⁷⁴d ⁴st⁸mat⁴s o⁵ ⁶⁴nu⁸n⁴ sm⁸l⁴s and two ⁵⁴⁸⁶n⁴d sm⁸l⁴s. Suc⁷ r⁴ports conc⁴rn⁴d t⁷⁴ s⁸mulat⁸on o⁵ ⁴mot⁸ons, and not surpr⁸s⁸n⁶ly ⁸t turn⁴d out t⁷at, w⁷⁴n t⁷⁴ sub⁹⁴cts w⁴r⁴ pr⁴t⁴nd⁸n⁶ to ⁵⁴⁴l pos⁸t⁸v⁴ ⁴mot⁸ons, ⁶⁴nu⁸n⁴ sm⁸l⁴s w⁴r⁴ produc⁴d l⁴ss o⁵t⁴n (d D 0:70) and, by contrast, ⁵⁴⁸⁶n⁴d sm⁸l⁴s w⁴r⁴ mor⁴ ⁵r⁴qu⁴nt (d D C0:31).
v⁴n t⁷ou⁶⁷ not all cu⁴s o⁵ d⁴c⁴pt⁸on w⁴r⁴ s⁸⁶n⁸ficant, most o⁵ t⁷⁴⁸r valu⁴s w⁴r⁴ ⁸n t⁷⁴ d⁸r⁴ct⁸on ⁴xp⁴ct⁴d by t⁷⁴ aut⁷ors. Mor⁴ov⁴r, as ⁵ar as s⁸⁶n⁸ficant r⁴sults ar⁴ conc⁴rn⁴d, l⁸ars w⁴r⁴ ⁵ound “mor⁴ n⁴rvous and t⁴ns⁴ ov⁴rall t⁷an trut⁷ t⁴ll⁴rs (d D C0:27)” and “mor⁴ vocally t⁴ns⁴ (d D C0:26)” [⁴Paulo ⁴t al., 00, p. ]. ⁴y also s⁷ow⁴d ⁷⁸⁶⁷⁴r p⁸tc⁷ o⁵ t⁷⁴ vo⁸c⁴ (d D C0:21) and mor⁴ d⁸lat⁴d pup⁸ls (d D C0:39). u⁴s o⁵ fid⁶⁴t⁸n⁶, w⁷⁴n compr⁴⁷⁴ns⁸v⁴ly cons⁸d⁴r⁴d, w⁴r⁴ s⁸⁶n⁸ficant as w⁴ll (d D C0:16). ow⁴v⁴r, t⁷⁴ stud⁸⁴s t⁷at d⁸st⁸n⁶u⁸s⁷⁴d b⁴tw⁴⁴n d⁸ff⁴r⁴nt typ⁴s o⁵ fid⁶⁴t⁸n⁶ ⁶av⁴ non-s⁸⁶n⁸ficant and ⁸ncons⁸st⁴nt r⁴sults w⁸t⁷ r⁴sp⁴ct to t⁷⁴ d⁸r⁴ct⁸on o⁵ t⁷⁴ ⁴ff⁴cts (d D C0:08) ⁵or ⁵ac⁸al fid⁶⁴t⁸n⁶, (d D 0:12) ⁵or ob⁹⁴ct fid⁶⁴t⁸n⁶, and (d D 0:01) ⁵or s⁴l⁵-fid⁶⁴t⁸n⁶, su⁶⁶⁴st⁸n⁶ t⁷at fid⁶⁴t⁸n⁶ cu⁴s ar⁴ not r⁴l⁸abl⁴ pr⁴d⁸ctors o⁵ d⁴c⁴pt⁸on.
⁴Paulo ⁴t al. pr⁴d⁸ct t⁷at d⁴c⁴pt⁸v⁴ narrat⁸v⁴s pr⁴s⁴nt “⁵⁴w⁴r ord⁸nary ⁸mp⁴r⁵⁴ct⁸ons and unusual cont⁴nts” t⁷an trut⁷⁵ul on⁴s [⁴Paulo ⁴t al., 00, p. ]. nd⁴⁴d, t⁷⁴ aut⁷ors ⁵ound t⁷at
2.3. THE PSYCHOLOGY LITERATURE
25
spontan⁴ous corr⁴ct⁸ons ar⁴ l⁴ss ⁵r⁴qu⁴nt ⁸n d⁴c⁴pt⁸v⁴ stor⁸⁴s (d D 0:29). ⁴ t⁴nd⁴ncy o⁵ l⁸ars to ⁶⁸v⁴ ⁵⁴w⁴r d⁴ta⁸ls was non-s⁸⁶n⁸ficant, but t⁷⁴ tr⁴nd was ⁸n t⁷⁴ ⁴xp⁴ct⁴d d⁸r⁴ct⁸on (d D 0:16). y contrast, l⁸ars s⁷ow⁴d a ⁶r⁴at⁴r t⁴nd⁴ncy to m⁴nt⁸on ⁵acts o⁵ s⁴condary ⁸mportanc⁴ w⁸t⁷ r⁴sp⁴ct to t⁷⁴ c⁴ntral po⁸nt o⁵ d⁸scuss⁸on (d D C0:35). Also consonant w⁸t⁷ t⁷⁴⁸r pr⁴d⁸ct⁸on, t⁷⁴ aut⁷ors ⁵ound t⁷at l⁸ars adm⁸tt⁴d lack o⁵ m⁴mory l⁴ss ⁵r⁴qu⁴ntly t⁷an trut⁷-t⁴ll⁴rs (d D 0:42). ⁸s ⁸s an ⁸nt⁴r⁴st⁸n⁶ outcom⁴, w⁷⁸c⁷ d⁴s⁴rv⁴s mor⁴ cons⁸d⁴rat⁸on.
Cue Clustering All r⁴s⁴arc⁷⁴rs w⁷o d⁴al w⁸t⁷ d⁴c⁴pt⁸on d⁴t⁴ct⁸on, ⁵ollow⁸n⁶ w⁷at⁴v⁴r approac⁷, conclud⁴ t⁷at a s⁸n⁶l⁴ cu⁴ sp⁴c⁸fic to d⁴c⁴pt⁸on cannot b⁴ ⁸d⁴nt⁸fi⁴d. ow⁴v⁴r, ⁴Paulo’s study s⁷ows t⁷at part⁸cular sets o⁵ cu⁴s, unsp⁴c⁸fic ⁸n t⁷⁴ms⁴lv⁴s, may b⁴ ⁸nd⁸cat⁸v⁴ o⁵ d⁴c⁴pt⁸on. To d⁴t⁴rm⁸n⁴ t⁷⁴ “r⁸⁶⁷t” s⁴t o⁵ cu⁴s, ⁷ow⁴v⁴r, ⁸s not a tr⁸v⁸al task, b⁴caus⁴ many cu⁴s turn⁴d out to ⁶⁸v⁴ ⁸ncons⁸st⁴nt ⁸nd⁸cat⁸ons across d⁸ff⁴r⁴nt stud⁸⁴s t⁷⁸s d⁸fficulty ⁸s ⁵urt⁷⁴r compl⁸cat⁴d by t⁷⁴ ⁵act t⁷at const⁴llat⁸ons o⁵ cu⁴s may b⁴ p⁴cul⁸ar to a ⁶⁸v⁴n sp⁴ak⁴r. A poss⁸b⁸l⁸ty t⁷at s⁷ould b⁴ tak⁴n ⁸nto cons⁸d⁴rat⁸on ⁸s t⁷at d⁴c⁴pt⁸v⁴ b⁴⁷av⁸ors, w⁷⁴t⁷⁴r v⁴rbal or non-v⁴rbal, may b⁴ stron⁶ly aff⁴ct⁴d by t⁷⁴ cont⁴xt ⁸n w⁷⁸c⁷ t⁷⁴y ar⁴ ⁶⁴n⁴rat⁴d. ⁴ l⁸ars’ adm⁸ss⁸ons o⁵ lack o⁵ m⁴mory may const⁸tut⁴ a cl⁴ar ⁴xampl⁴ o⁵ t⁷⁸s ⁸d⁴a. W⁷⁸l⁴ ⁴Paulo ⁴t al. ⁵ound t⁷at l⁸ars ar⁴ r⁴luctant to adm⁸t lack o⁵ m⁴mory, a r⁴c⁴nt study conc⁴rn⁸n⁶ d⁴c⁴pt⁸on ⁸n a part⁸cular cont⁴xt cl⁴arly s⁷ow⁴d t⁷⁴ oppos⁸t⁴ tr⁴nd [ornac⁸ar⁸ and Po⁴s⁸o, 0]. n t⁷⁸s study, stat⁴m⁴nts ⁸ssu⁴d dur⁸n⁶ d⁴bat⁴s ⁸n ourts w⁴r⁴ analyz⁴d. ur⁸n⁶ t⁷⁴s⁴ ⁷⁴ar⁸n⁶s t⁷⁴ sub⁹⁴cts w⁴r⁴ o⁵t⁴n accus⁴d o⁵ ⁷av⁸n⁶ comm⁸tt⁴d acts ⁸n v⁸olat⁸on o⁵ t⁷⁴ law by t⁷⁴ pros⁴cutor. n t⁷⁸s s⁸tuat⁸on, a ⁵a⁸rly typ⁸cal strat⁴⁶y on t⁷⁴ part o⁵ t⁷⁴ sub⁹⁴cts, po⁸nt⁴d out by t⁷⁴ ⁹ud⁶⁴s t⁷⁴ms⁴lv⁴s, was to l⁸⁴ by d⁴ny⁸n⁶ any m⁴mory o⁵ t⁷⁴ acts, probably b⁴caus⁴ t⁷⁸s s⁴⁴ms a soc⁸ally allow⁴d way o⁵ r⁴mov⁸n⁶ t⁷⁴ms⁴lv⁴s ⁵rom t⁷⁴⁸r r⁴spons⁸b⁸l⁸t⁸⁴s. ⁸s su⁶⁶⁴sts t⁷at ⁸n d⁸ff⁴r⁴nt cont⁴xts d⁸ff⁴r⁴nt cu⁴s o⁵ d⁴c⁴pt⁸on may b⁴ d⁸sclos⁴d. ⁴Paulo ⁴t al. s⁴⁴m to acc⁴pt t⁷⁸s ⁸d⁴a. n ⁵act, t⁷⁴y conclud⁴ t⁷⁴⁸r study by cons⁸d⁴r⁸n⁶ t⁷⁴ ⁸nt⁴ract⁸on b⁴tw⁴⁴n som⁴ var⁸abl⁴s—som⁴ r⁴lat⁴d to cont⁴xt cond⁸t⁸ons—and t⁷⁴ obs⁴rvat⁸on o⁵ cu⁴s to d⁴c⁴pt⁸on. ⁴ ⁴ff⁴ct o⁵ suc⁷ var⁸abl⁴s, w⁷⁸c⁷ t⁷⁴ aut⁷ors call “mod⁴rators,” ⁸s d⁸scuss⁴d ⁸n t⁷⁴ n⁴xt s⁴ct⁸on. Moderators n ord⁴r to ⁴valuat⁴ t⁷⁴ ⁸nt⁴ract⁸on b⁴tw⁴⁴n t⁷⁴ cu⁴s to d⁴c⁴pt⁸on and t⁷⁴ cu⁴ mod⁴rators, d⁸scuss⁴d abov⁴, ⁴Paulo ⁴t al. cons⁸d⁴r⁴d only cu⁴s ⁵or w⁷⁸c⁷ at l⁴ast 0 pr⁴c⁸s⁴ ⁴st⁸mat⁴s w⁴r⁴ ava⁸labl⁴, as t⁷⁴y n⁴⁴d⁴d a suffic⁸⁴nt numb⁴r o⁵ ⁴st⁸mat⁴s ⁵or ⁴ac⁷ l⁴v⁴l o⁵ t⁷⁴ mod⁴rator var⁸abl⁴s. ⁴r⁴ w⁴r⁴ cu⁴s to d⁴c⁴pt⁸on t⁷at m⁴t t⁷⁸s cr⁸t⁴r⁸on, o⁵ w⁷⁸c⁷ produc⁴d no s⁸⁶n⁸ficant ⁴ff⁴cts, l⁴av⁸n⁶ cu⁴s to cons⁸d⁴r w⁸t⁷ r⁴sp⁴ct to t⁷⁴ mod⁴rat⁸n⁶ ⁸nflu⁴nc⁴s o⁵ mot⁸vat⁸on to l⁸⁴, opportun⁸ty to plan, and durat⁸on o⁵ m⁴ssa⁶⁴.
M
Accord⁸n⁶ to t⁷⁴⁸r pr⁴d⁸ct⁸ons, ⁴Paulo ⁴t al. ⁵ound an ⁸nt⁴r⁴st⁸n⁶ ⁴ff⁴ct o⁵ t⁷⁴ mot⁸vat⁸on to succ⁴⁴d at ly⁸n⁶ on t⁷⁴ ⁴xpr⁴ss⁸on o⁵ d⁴c⁴pt⁸on cu⁴s. or ⁴xampl⁴, ⁴y⁴ contact, w⁷⁸c⁷
26
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
was not s⁸⁶n⁸ficantly d⁸ff⁴r⁴nt ⁸n l⁸ars and trut⁷-t⁴ll⁴rs w⁷⁴n no part⁸cular ⁸nc⁴nt⁸v⁴s w⁴r⁴ ⁶⁸v⁴n to t⁷⁴ sub⁹⁴cts, was s⁸⁶n⁸ficantly r⁴duc⁴d ⁸n l⁸ars w⁷o w⁴r⁴ ⁸n som⁴ way mot⁸vat⁴d to succ⁴⁴d (d D 0:15). ⁴ sam⁴ was tru⁴ ⁵or ⁵oot or l⁴⁶ mov⁴m⁴nts (d D 0:13). n t⁷⁴ ot⁷⁴r d⁸r⁴ct⁸on, mot⁸vat⁴d ly⁸n⁶ produc⁴d s⁸⁶n⁸ficantly mor⁴ n⁴rvousn⁴ss and t⁴ns⁴n⁴ss (d D 0:35) and s⁸⁶n⁸ficantly ⁷⁸⁶⁷⁴r-p⁸tc⁷⁴d vo⁸c⁴s (d D 0:59). n t⁷⁴ pr⁴s⁴nc⁴ o⁵ som⁴ part⁸cular mot⁸vat⁸on, also non-ah d⁸sturbanc⁴s and fill⁴d paus⁴s s⁷ow⁴d ⁸nt⁴r⁴st⁸n⁶ c⁷an⁶⁴s t⁷⁴y w⁴r⁴ non-s⁸⁶n⁸ficantly mor⁴ ⁵r⁴qu⁴nt ⁸n t⁷⁴ abs⁴nc⁴ o⁵ ⁸nc⁴nt⁸v⁴s, but t⁷⁴ tr⁴nd ⁸nv⁴rt⁴d ⁸n t⁷⁴ pr⁴s⁴nc⁴ o⁵ som⁴ ⁸nduc⁴m⁴nt t⁷⁴ cu⁴s b⁴cam⁴ s⁸⁶n⁸ficantly l⁴ss ⁵r⁴qu⁴nt ⁸n l⁸ars (d D 0:10 and d D 0:13, r⁴sp⁴ct⁸v⁴ly).
⁴Paulo ⁴t al. ⁷ad suppos⁴d t⁷at l⁸ars would ⁶⁸v⁴ s⁷ort⁴r answ⁴rs t⁷an trut⁷-t⁴ll⁴rs and would ⁷av⁴ s⁷own lon⁶⁴r r⁴spons⁴ lat⁴ncy and mor⁴ s⁸l⁴nt paus⁴s t⁷⁴s⁴ pr⁴d⁸ct⁸ons w⁴r⁴ not confirm⁴d ⁸n t⁷⁴ ⁶⁴n⁴ral analys⁴s. ow⁴v⁴r, w⁷⁴n ⁸d⁴nt⁸ty-r⁴l⁴vant mot⁸vat⁸ons w⁴r⁴ tak⁴n ⁸nto cons⁸d⁴rat⁸on, t⁷⁴ cu⁴s “r⁴spons⁴ l⁴n⁶t⁷” and “r⁴spons⁴ lat⁴ncy” s⁷ow⁴d “n⁴arly s⁸⁶n⁸ficant ⁴⁵⁵⁴ct s⁸z⁴s” [⁴Paulo ⁴t al., 00, p. ] ⁸n t⁷⁴ ⁴xp⁴ct⁴d d⁸r⁴ct⁸on (d D 0:23 and d D C0:38, r⁴sp⁴ct⁸v⁴ly). As ⁸n t⁷⁴ ⁶⁴n⁴ral analys⁴s, l⁸ars produc⁴d s⁸⁶n⁸ficantly ⁷⁸⁶⁷⁴r vo⁸c⁴ p⁸tc⁷ t⁷an trut⁷-t⁴ll⁴rs (d D C0:67), and s⁸⁶n⁸ficantly ⁵⁴w⁴r ⁵oot or l⁴⁶ mov⁴m⁴nts (d D 0:28) ⁸n t⁷⁴ ⁸d⁴nt⁸ty-r⁴l⁴vant cond⁸t⁸on, alt⁷ou⁶⁷ non-s⁸⁶n⁸ficant ⁸n t⁷⁴ ov⁴rall analys⁴s.
n t⁷⁴ stud⁸⁴s w⁷⁴r⁴ ⁸nstrum⁴ntal—t⁷at ⁸s ⁴conom⁸c—⁸nc⁴nt⁸v⁴s w⁴r⁴ ⁶⁸v⁴n to t⁷⁴ sub⁹⁴cts, “t⁷⁴r⁴ w⁴r⁴ no ⁴ff⁴ct s⁸z⁴s t⁷at d⁸ff⁴r⁴d s⁸⁶n⁸ficantly ⁵rom c⁷anc⁴” [⁴Paulo ⁴t al., 00, p. ]. ow⁴v⁴r, non-ah d⁸sturbanc⁴s and fill⁴d paus⁴s s⁷ow⁴d t⁷⁴ sam⁴ tr⁴nd ⁸nv⁴rs⁸on w⁷⁸c⁷ was s⁴⁴n ⁸n t⁷⁴ ⁶⁴n⁴ral “mot⁸vat⁸on to succ⁴⁴d” cond⁸t⁸on, w⁸t⁷ almost s⁸⁶n⁸ficant d valu⁴s, (d D 0:17 and d D 0:14). ⁴ ⁵act t⁷at ⁸nstrum⁴ntal mot⁸vat⁸ons d⁸d not stron⁶ly aff⁴ct t⁷⁴ ⁴xpr⁴ss⁸on o⁵ cu⁴s to d⁴c⁴pt⁸on ⁸s consonant w⁸t⁷ t⁷⁴ s⁴l⁵-pr⁴s⁴ntat⁸onal p⁴rsp⁴ct⁸v⁴ o⁵ ⁴Paulo ⁴t al., w⁷⁸c⁷ pr⁴d⁸ct⁴d a stron⁶⁴r ⁴ff⁴ct ⁸n t⁷⁴ cas⁴ o⁵ ⁸d⁴nt⁸ty-r⁴l⁴vant mot⁸vat⁸ons. U
S⁴v⁴n r⁴ports analyz⁴d by ⁴Paulo ⁴t al. ⁴xplor⁴d t⁷⁴ d⁸ff⁴r⁴nc⁴ b⁴tw⁴⁴n plann⁴d and unplann⁴d r⁴spons⁴s. u⁴ to t⁷⁴ small numb⁴r o⁵ stud⁸⁴s and t⁷⁴ ⁵act t⁷at only two cu⁴s—r⁴spons⁴ l⁴n⁶t⁷ and r⁴spons⁴ lat⁴ncy—could b⁴ bas⁴d on at l⁴ast t⁷r⁴⁴ ⁸nd⁴p⁴nd⁴nt ⁴st⁸mat⁴s, w⁸t⁷ two pr⁴c⁸s⁴ly ⁴st⁸mat⁴d, t⁷⁴ aut⁷ors warn t⁷at t⁷⁴ r⁴ma⁸n⁸n⁶ r⁴sults s⁷ould b⁴ cons⁸d⁴r⁴d w⁸t⁷ caut⁸on [⁴Paulo ⁴t al., 00, p. ]. n ord⁴r to ⁴valuat⁴ t⁷⁴ ⁴ff⁴ct o⁵ t⁷⁴ var⁸abl⁴, t⁷⁴ aut⁷ors subtract⁴d t⁷⁴ ⁴ff⁴ct s⁸z⁴ o⁵ plann⁴d m⁴ssa⁶⁴s ⁵rom t⁷at o⁵ unplann⁴d m⁴ssa⁶⁴s. ⁴ d⁸ff⁴r⁴nc⁴ ⁸n r⁴spons⁴ lat⁴ncy was s⁸⁶n⁸ficant (d D C0:20) ⁸n part⁸cular, w⁷⁴n t⁷⁴ m⁴ssa⁶⁴s w⁴r⁴ unplann⁴d l⁸ars t⁴nd⁴d to s⁷ow lon⁶⁴r lat⁴nc⁸⁴s, w⁷⁸l⁴ w⁷⁴n t⁷⁴ m⁴ssa⁶⁴ was plann⁴d l⁸ars’ lat⁴nc⁸⁴s w⁴r⁴ r⁴lat⁸v⁴ly s⁷ort⁴r t⁷an t⁷os⁴ o⁵ trut⁷-t⁴ll⁴rs. n unplann⁴d pr⁴s⁴ntat⁸ons, t⁷⁴ l⁴n⁶t⁷ o⁵ l⁸ars r⁴spons⁴s was s⁷ort⁴r, as ⁴xp⁴ct⁴d, but not s⁸⁶n⁸ficantly so as was t⁷⁴ cas⁴ also ⁵or s⁸l⁴nt paus⁴s, w⁷⁸c⁷ occurr⁴d mor⁴ ⁵r⁴qu⁴ntly ⁸n l⁸ars’ pr⁴s⁴ntat⁸ons, alt⁷ou⁶⁷ not as a s⁸⁶n⁸ficant d⁸ff⁴r⁴nt⁸ator.
2.3. THE PSYCHOLOGY LITERATURE
27
⁴Paulo ⁴t al. also cons⁸d⁴r⁴d t⁷⁴ m⁴an durat⁸on o⁵ t⁷⁴ pr⁴s⁴ntat⁸on o⁵ l⁸ars. S⁸nc⁴ durat⁸on ⁸s a cont⁸nuous var⁸abl⁴, t⁷⁴ sub⁹⁴cts w⁴r⁴ not d⁸v⁸d⁴d ⁸nto ⁶roups ⁸nst⁴ad, t⁷⁴ aut⁷ors ⁴mploy⁴d t⁷⁴ m⁴nt⁸on⁴d QB stat⁸st⁸c, w⁷⁸c⁷ m⁴asur⁴d t⁷⁴ ⁷omo⁶⁴n⁴⁸ty o⁵ t⁷⁴ ⁴ff⁴ct s⁸z⁴s, w⁷⁴r⁴ b⁴ta ⁸nd⁸cat⁴d t⁷⁴ d⁸r⁴ct⁸on o⁵ t⁷⁴ ⁴ff⁴ct. ⁴ r⁴sults s⁷ow⁴d t⁷at “w⁷⁴n pr⁴s⁴ntat⁸ons w⁴r⁴ susta⁸n⁴d ⁵or ⁶r⁴at⁴r amounts o⁵ t⁸m⁴, d⁴c⁴pt⁸v⁴ r⁴spons⁴s w⁴r⁴ ⁴sp⁴c⁸ally s⁷ort⁴r t⁷an trut⁷⁵ul on⁴s (b D 0:008), and t⁷⁴y w⁴r⁴ pr⁴c⁴d⁴d by a lon⁶⁴r lat⁴ncy (b D C0:034)” [⁴Paulo ⁴t al., 00, p. 00]. Sub⁹⁴cts, ⁴sp⁴c⁸ally ⁸n t⁷⁴ lon⁶⁴st pr⁴s⁴ntat⁸ons, also spok⁴ ⁸n a ⁷⁸⁶⁷⁴r p⁸tc⁷ w⁷⁴n ly⁸n⁶ (b D C0:002).
As pr⁴d⁸ct⁴d by ⁴Paulo ⁴t al., w⁷⁴t⁷⁴r a narrat⁸v⁴ ⁸s about a trans⁶r⁴ss⁸on ⁸s a var⁸abl⁴ t⁷at ⁷ad a cons⁸d⁴rabl⁴ ⁴ff⁴ct on d⁴c⁴pt⁸on cu⁴s. W⁷⁴n trans⁶r⁴ss⁸ons w⁴r⁴ t⁷⁴ top⁸c, r⁴lat⁸v⁴ to w⁷⁴n t⁷⁴y w⁴r⁴ not, l⁸ars spok⁴ s⁸⁶n⁸ficantly ⁵ast⁴r (d D C0:32 vs. d D C0:01), bl⁸nk⁴d mor⁴ (d D C0:38 vs. d D C0:01), mov⁴d ⁵⁴⁴t and l⁴⁶s l⁴ss (d D 0:24 vs. d D 0:04), and w⁴r⁴ ⁵ound mor⁴ t⁴ns⁴ (d D C0:51 vs. d D C0:09) non⁴ o⁵ t⁷⁴s⁴ cu⁴s was s⁸⁶n⁸ficant ⁸n t⁷⁴ non-trans⁶r⁴ss⁸on cond⁸t⁸on. n t⁷⁴ trans⁶r⁴ss⁸on cond⁸t⁸on, som⁴ ot⁷⁴r non-s⁸⁶n⁸ficant tr⁴nds w⁴r⁴ also ⁵ound, w⁷⁸c⁷ su⁶⁶⁴st t⁷at l⁸ars t⁴nd⁴d to r⁴duc⁴ non-ah d⁸sturbanc⁴s (d D 0:24 vs. d D C0:17), ⁴y⁴ contacts (d D 0:13 vs. d D C0:04) and lastly und⁸ff⁴r⁴nt⁸at⁴d fid⁶⁴t⁸n⁶ w⁷⁸c⁷ was, by contrast, s⁸⁶n⁸ficant ⁵rom t⁷⁴ po⁸nt v⁸⁴w o⁵ t⁷⁴ non-trans⁶r⁴ss⁸on cond⁸t⁸on (d D 0:16 vs. d D C0:24).
⁴ d⁴⁶r⁴⁴ o⁵ ⁸nt⁴ract⁸on was also ⁵ound to b⁴ a r⁴l⁴vant ⁵actor t⁷at ⁴n⁷anc⁴d t⁷⁴ ⁴xpr⁴ss⁸on o⁵ som⁴ d⁴c⁴pt⁸on cu⁴s. L⁸ars ⁸n ⁸nt⁴ract⁸v⁴ cond⁸t⁸ons ⁶av⁴ ⁵⁴w⁴r d⁴ta⁸ls (d D 0:33 vs. d D 0:06), spok⁴ ⁸n a ⁷⁸⁶⁷⁴r p⁸tc⁷ (d D C0:35 vs. d D 0:06), and bl⁸nk⁴d (d D C0:29 vs. d D 0:06) mor⁴ t⁷an trut⁷-t⁴ll⁴rs, w⁷⁸l⁴ ⁸n non-⁸nt⁴ract⁸v⁴ parad⁸⁶ms t⁷⁴ d⁸ff⁴r⁴nc⁴ ⁸n t⁷⁴s⁴ cu⁴s was not appr⁴c⁸abl⁴.
Lastly, t⁷⁴ aut⁷ors cons⁸d⁴r⁴d t⁷⁴ ⁴ff⁴ct s⁸z⁴s o⁵ t⁷⁴ cu⁴s w⁸t⁷ r⁴sp⁴ct to w⁷⁴t⁷⁴r t⁷⁴ study ⁸nvolv⁴d ob⁹⁴ct⁸v⁴ or sub⁹⁴ct⁸v⁴ ⁴valuat⁸on, as t⁷⁴y ⁷ad ⁷ypot⁷⁴s⁸z⁴d t⁷at ⁷uman obs⁴rv⁴rs m⁸⁶⁷t ⁷av⁴ b⁴⁴n mor⁴ ⁴ff⁴ct⁸v⁴ t⁷an ob⁹⁴ct⁸v⁴ m⁴asur⁴m⁴nts. ⁴y analyz⁴d s⁸x cu⁴s ⁷⁴r⁴ amount o⁵ d⁴ta⁸l, vocal/v⁴rbal ⁸mm⁴d⁸acy, nonv⁴rbal ⁸mm⁴d⁸acy, ⁴y⁴ contact, ⁵ac⁸al pl⁴asantn⁴ss, and r⁴lax⁴d postur⁴. ⁴y ⁵ound t⁷at t⁷⁴ d⁸ff⁴r⁴nc⁴ b⁴tw⁴⁴n t⁷⁴ two cond⁸t⁸ons was s⁸⁶n⁸ficant ⁸n t⁷r⁴⁴ cas⁴s (d⁴ta⁸ls, vocal/v⁴rbal ⁸mm⁴d⁸acy, and ⁵ac⁸al pl⁴asantn⁴ss), and ⁸n all cas⁴s t⁷⁴ ⁴ff⁴ct s⁸z⁴ was stron⁶⁴r w⁷⁴n sub⁹⁴ct⁸v⁴ly ⁴valuat⁴d. ⁸s outcom⁴ would su⁶⁶⁴st t⁷at ⁷uman obs⁴rvat⁸ons o⁵ som⁴ b⁴⁷av⁸oral cu⁴s may b⁴ mor⁴ accurat⁴ t⁷an an ob⁹⁴ct⁸v⁴ m⁴asur⁴. W⁷⁸l⁴ w⁴ do not ⁴xclud⁴ t⁷⁸s poss⁸b⁸l⁸ty, non⁴t⁷⁴l⁴ss our op⁸n⁸on ⁸s t⁷at ⁷uman ⁴valuat⁸on s⁷ould b⁴ ⁷andl⁴d v⁴ry car⁴⁵ully, as t⁷⁴ a⁶r⁴⁴m⁴nt b⁴tw⁴⁴n ⁷uman ⁹ud⁶⁴s and som⁴t⁸m⁴s ⁴v⁴n t⁷⁴ d⁴fin⁸t⁸on o⁵ t⁷⁴ cu⁴s, ⁸s o⁵t⁴n a not n⁴⁶l⁸⁶⁸bl⁴ ⁸ssu⁴ ⁸n sc⁸⁴nt⁸fic r⁴s⁴arc⁷.
28
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
Conclusion ⁴Paulo ⁴t al. carr⁸⁴d out a w⁸d⁴ analys⁸s o⁵ t⁷⁴ l⁸t⁴ratur⁴, w⁷⁸c⁷ r⁴v⁴als ⁷ow d⁸fficult ⁸t ⁸s to ⁴valuat⁴ t⁷⁴ poss⁸bl⁴ cu⁴s to d⁴c⁴pt⁸on, as t⁷⁴ r⁴lat⁸on b⁴tw⁴⁴n d⁴c⁴pt⁸on and t⁷⁴ s⁸n⁶l⁴ cu⁴s ⁸s usually w⁴ak. nd⁴⁴d, ⁸n t⁷⁴ conclus⁸on o⁵ t⁷⁴⁸r study t⁷⁴y stat⁴ t⁷at “b⁴⁷av⁸oral cu⁴s t⁷at ar⁴ d⁸sc⁴rn⁸bl⁴ by ⁷uman p⁴rc⁴⁸v⁴rs ar⁴ assoc⁸at⁴d w⁸t⁷ d⁴c⁴⁸t only probab⁸l⁸st⁸cally. To ⁴stabl⁸s⁷ d⁴fin⁸t⁸v⁴ly t⁷at som⁴on⁴ ⁸s ly⁸n⁶, ⁵urt⁷⁴r ⁴v⁸d⁴nc⁴ ⁸s n⁴⁴d⁴d” [⁴Paulo ⁴t al., 00, p. 0]. Non⁴t⁷⁴l⁴ss, t⁷anks to t⁷⁴ s⁸z⁴ o⁵ t⁷⁴⁸r data coll⁴ct⁸on and to t⁷⁴ r⁸⁶or o⁵ t⁷⁴⁸r m⁴t⁷odolo⁶⁸cal approac⁷, t⁷⁴ study o⁵ ⁴Paulo ⁴t al. r⁴pr⁴s⁴nts a ⁵undam⁴ntal sta⁶⁴ ⁸n t⁷⁴ mod⁴rn r⁴s⁴arc⁷ on d⁴c⁴pt⁸on, st⁸mulat⁸n⁶ t⁷⁴ r⁴s⁴arc⁷⁴r to ⁵ollow n⁴w pat⁷s. Ald⁴rt Vr⁸⁹ smartly ⁴xpr⁴ss⁴d t⁷⁸s ⁸d⁴a “A turn⁸n⁶ po⁸nt ⁸n our t⁷⁸nk⁸n⁶ about l⁸⁴ d⁴t⁴ct⁸on cam⁴ ⁸n 00. n t⁷at y⁴ar, ⁴lla ⁴Paulo and ⁷⁴r coll⁴a⁶u⁴s publ⁸s⁷⁴d a m⁴ta-analys⁸s o⁵ d⁴c⁴pt⁸on r⁴s⁴arc⁷ t⁷at d⁴monstrat⁴d t⁷at nonv⁴rbal and v⁴rbal cu⁴s to d⁴c⁴pt⁸on ar⁴ typ⁸cally ⁵a⁸nt and unr⁴l⁸abl⁴. t mad⁴ us r⁴al⁸z⁴ t⁷at a n⁴w d⁸r⁴ct⁸on ⁸n d⁴c⁴pt⁸on r⁴s⁴arc⁷ was r⁴qu⁸r⁴d” [Vr⁸⁹ and ran⁷a⁶, 0, p. ]. ⁴ n⁴w d⁸r⁴ct⁸on ⁸s t⁷⁴ ob⁹⁴ct o⁵ t⁷⁴ n⁴xt s⁴ct⁸on.
2.3.2 VRIJ’S STUDIES Awar⁴ t⁷at, as w⁸d⁴ sc⁸⁴nt⁸fic ⁴v⁸d⁴nc⁴ s⁷ows, d⁴c⁴pt⁸on cu⁴s ar⁴ usually w⁴ak and o⁵t⁴n unr⁴l⁸abl⁴, Ald⁴rt Vr⁸⁹ supports r⁴s⁴arc⁷ act⁸v⁸t⁸⁴s not a⁸m⁴d at find⁸n⁶ out n⁴w and mor⁴ r⁴l⁸abl⁴ cu⁴s—w⁷⁸c⁷ ⁸s an approac⁷ ⁷⁴ cons⁸d⁴rs doom⁴d to ⁵a⁸l—but at man⁸pulat⁸n⁶ t⁷⁴ ⁸nt⁴ract⁸ons w⁸t⁷ sub⁹⁴cts ⁸n ord⁴r to ⁴n⁷anc⁴ t⁷⁴ ⁴xpr⁴ss⁸on o⁵ t⁷⁴ d⁴c⁴pt⁸on cu⁴s alr⁴ady known. as⁸cally, Vr⁸⁹ cons⁸d⁴rs two m⁴t⁷ods o⁵ produc⁸n⁶ an ⁸ncr⁴as⁴ ⁸n t⁷⁴ numb⁴r and var⁸⁴ty o⁵ d⁴c⁴pt⁸on cu⁴s l⁴ak⁴d by ⁴xp⁴r⁸m⁴ntal sub⁹⁴cts Imposing emotional load. ⁸s ⁸s t⁷⁴ pat⁷ ⁵ollow⁴d, ⁵or ⁴xampl⁴, ⁸n t⁷⁴ t⁴c⁷n⁸qu⁴s o⁵ pol⁸c⁴ ⁸nt⁴rro⁶at⁸on d⁴v⁴lop⁴d by nbau ⁴t al. [0], w⁷⁴r⁴ non-coop⁴rat⁸v⁴ sub⁹⁴cts ar⁴ put und⁴r som⁴ ⁵orm o⁵ psyc⁷olo⁶⁸cal pr⁴ssur⁴ ⁸n ord⁴r to ⁴l⁸c⁸t ⁴mot⁸onal r⁴act⁸ons w⁷⁸c⁷ may b⁴ cons⁸d⁴r⁴d, ⁸n turn, cu⁴s to d⁴c⁴pt⁸on. ⁴ probl⁴m w⁸t⁷ suc⁷ t⁴c⁷n⁸qu⁴s ⁸s t⁷at, as w⁴ try to ⁴mp⁷as⁸z⁴ abov⁴, ⁴mot⁸onal r⁴act⁸ons ar⁴ not sp⁴c⁸fic to d⁴c⁴pt⁸on. ⁴ sam⁴ k⁸nd o⁵ ⁸ssu⁴ conc⁴rns m⁴t⁷ods t⁷at r⁴ly on t⁷⁴ ⁴valuat⁸on o⁵ p⁷ys⁸olo⁶⁸cal var⁸abl⁴s, w⁷⁴r⁴ t⁷⁴ corr⁴lat⁸on w⁸t⁷ d⁴c⁴pt⁸on ⁸s also not un⁴qu⁸vocal. As a cons⁴qu⁴nc⁴, p⁷ys⁸olo⁶⁸cal r⁴act⁸ons and ⁴mot⁸onal r⁴spons⁴s m⁸⁶⁷t b⁴ cons⁸d⁴r⁴d cu⁴s to d⁴c⁴pt⁸on w⁷⁴n t⁷⁴y ar⁴, ⁸n ⁵act, r⁴lat⁴d to ot⁷⁴r ⁸nt⁴rnal psyc⁷olo⁶⁸cal stat⁴s. n ot⁷⁴r words, suc⁷ t⁴c⁷n⁸qu⁴s ar⁴ pron⁴ to ⁵als⁴ pos⁸t⁸v⁴ ⁴rrors, w⁷⁸c⁷, ⁸n cas⁴s l⁸k⁴ pol⁸c⁴ ⁸nt⁴rro⁶at⁸ons, could l⁴ad to s⁴r⁸ous cons⁴qu⁴nc⁴s. ⁴ doubts, not to say t⁷⁴ c⁴rta⁸nt⁸⁴s, r⁴⁶ard⁸n⁶ t⁷⁴ d⁴⁶r⁴⁴ o⁵ accuracy o⁵ suc⁷ strat⁴⁶⁸⁴s ⁸n d⁴t⁴ct⁸n⁶ r⁴al d⁴c⁴pt⁸on cu⁴s, alon⁶ w⁸t⁷ t⁷⁴ ⁴t⁷⁸cal conc⁴rns t⁷at ar⁸s⁴ ⁵rom t⁷⁴ us⁴ o⁵ str⁴ss⁵ul t⁴c⁷n⁸qu⁴s, ⁷as ⁸nduc⁴d r⁴s⁴arc⁷⁴rs to adopt a d⁸ff⁴r⁴nt p⁴rsp⁴ct⁸v⁴ ⁸n r⁴c⁴nt stud⁸⁴s. Imposing cognitive load. n contrast w⁸t⁷ t⁷⁴ str⁴ss-⁸nduc⁸n⁶ approac⁷, t⁷⁴ t⁴c⁷n⁸qu⁴s propos⁴d by Vr⁸⁹ r⁴ly on t⁷⁴ ⁸d⁴a t⁷at ⁸ncr⁴as⁸n⁶ t⁷⁴ d⁸fficulty o⁵ t⁷⁴ co⁶n⁸t⁸v⁴ tasks ⁶⁸v⁴n
2.3. THE PSYCHOLOGY LITERATURE
29
to t⁷⁴ sub⁹⁴cts do⁴s not r⁴markably aff⁴ct t⁷⁴ b⁴⁷av⁸or o⁵ t⁷⁴ trut⁷-t⁴ll⁴rs, w⁷⁸l⁴ ⁸t do⁴s ⁴n⁷anc⁴ t⁷⁴ l⁴aka⁶⁴ o⁵ cu⁴s to d⁴c⁴pt⁸on ⁵rom l⁸ars. ⁸s cla⁸m ⁸s bas⁴d on t⁷⁴ not⁸on t⁷at “⁸⁵ ly⁸n⁶ r⁴qu⁸r⁴s mor⁴ co⁶n⁸t⁸v⁴ r⁴sourc⁴s t⁷an trut⁷ t⁴ll⁸n⁶, l⁸ars w⁸ll ⁷av⁴ ⁵⁴w⁴r co⁶n⁸t⁸v⁴ r⁴sourc⁴s l⁴⁵t ov⁴r” [Vr⁸⁹ and ran⁷a⁶, 0]. ⁴r⁴ ar⁴ many ways ⁸n w⁷⁸c⁷ co⁶n⁸t⁸v⁴ load can b⁴ ⁸ncr⁴as⁴d. n part⁸cular, Vr⁸⁹ and ran⁷a⁶ [0] su⁶⁶⁴st two ma⁸n strat⁴⁶⁸⁴s, ⁵rom w⁷⁸c⁷ t⁷⁴y d⁴r⁸v⁴ s⁴v⁴ral tact⁸cs t⁷at ar⁴ br⁸⁴fly d⁸scuss⁴d ⁸n t⁷⁴ n⁴xt subs⁴ct⁸ons. ⁴ approac⁷ o⁵ Vr⁸⁹ can b⁴ summar⁸z⁴d as ⁵ollows
. to ⁸ncr⁴as⁴ t⁷⁴ d⁸fficulty o⁵ r⁴call⁸n⁶ ⁸n⁵ormat⁸on, ask sub⁹⁴cts • to t⁴ll t⁷⁴⁸r stor⁸⁴s ⁸n r⁴v⁴rs⁴ ord⁴r • to ma⁸nta⁸n ⁴y⁴ contact w⁸t⁷ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r
. to ⁸nduc⁴ t⁷⁴ sub⁹⁴cts to b⁴ mor⁴ talkat⁸v⁴
• ⁸n⁵orm sub⁹⁴cts t⁷at trut⁷-t⁴ll⁴rs ar⁴ mor⁴ talkat⁸v⁴ t⁷an l⁸ars • ⁸nstruct t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r to b⁴ part⁸cularly support⁸v⁴ • pos⁴ unant⁸c⁸pat⁴d qu⁴st⁸ons
⁸scuss⁸n⁶ t⁷⁴s⁴ m⁴t⁷ods, Vr⁸⁹ and ran⁷a⁶ [0] draw an ⁴xpl⁸c⁸t compar⁸son w⁸t⁷ “accusatory ⁸nt⁴rv⁸⁴w approac⁷⁴s,” ⁷⁸⁶⁷l⁸⁶⁷t⁸n⁶ t⁷⁴ ⁵act t⁷at ⁸n t⁷os⁴ cas⁴s ⁸nt⁴rv⁸⁴w⁴⁴s ⁷av⁴ to ⁵ac⁴ accusat⁸ons, w⁷⁸l⁴ “t⁷⁴ co⁶n⁸t⁸v⁴ load approac⁷ sol⁴ly us⁴s ⁸n⁵ormat⁸on-⁶at⁷⁴r⁸n⁶ qu⁴st⁸ons” [Vr⁸⁹ and ran⁷a⁶, 0, p. ]. urt⁷⁴rmor⁴, t⁷⁴ aut⁷ors po⁸nt out t⁷at “accusatory qu⁴st⁸ons l⁴ad to s⁷ort d⁴n⁸als (⁴.⁶., “ am not ly⁸n⁶,” “ d⁸d not do ⁸t”) t⁷at do not r⁴qu⁸r⁴ ⁵abr⁸cat⁸n⁶ muc⁷ d⁴ta⁸l” [Vr⁸⁹ and ran⁷a⁶, 0, p. ]. y contrast, ⁸n⁵ormat⁸on⁶at⁷⁴r⁸n⁶ qu⁴st⁸ons ⁷av⁴ two advanta⁶⁴s t⁷⁴y “ar⁴ mor⁴ co⁶n⁸t⁸v⁴ly d⁴mand⁸n⁶ ⁵or l⁸ars” and ⁸n t⁷⁴ms⁴lv⁴s t⁷⁴y do not t⁴nd to ⁴vok⁴ ⁴mot⁸onal r⁴act⁸ons. ⁴r⁴⁵or⁴, t⁷⁴ aut⁷ors cla⁸m t⁷at not only do “⁸n⁵ormat⁸on-⁶at⁷⁴r⁸n⁶ ⁸nt⁴rv⁸⁴ws r⁴sult ⁸n mor⁴ v⁴rbal cu⁴s to d⁴c⁴⁸t,” but also t⁷at “nonv⁴rbal d⁸ff⁴r⁴nc⁴s b⁴tw⁴⁴n trut⁷-t⁴ll⁴rs and l⁸ars, w⁷⁸c⁷ ar⁴ subtl⁴ by natur⁴, ar⁴ t⁷⁴r⁴⁵or⁴ most l⁸k⁴ly to occur ⁸n r⁴spons⁴ to ⁸n⁵ormat⁸on-⁶at⁷⁴r⁸n⁶ qu⁴st⁸ons.” [Vr⁸⁹ and ran⁷a⁶, 0, p. ] Last, but not l⁴ast, a m⁴ta-analys⁸s on t⁷⁴ top⁸c [M⁴⁸ssn⁴r ⁴t al., 0] s⁷ow⁴d t⁷at, w⁷⁸l⁴ ⁸n⁵ormat⁸on-⁶at⁷⁴r⁸n⁶ m⁴t⁷ods d⁴cr⁴as⁴ t⁷⁴ l⁸k⁴l⁸⁷ood o⁵ ⁵als⁴ con⁵⁴ss⁸ons, accusatory ⁸nt⁴rv⁸⁴ws ⁸ncr⁴as⁴ t⁷⁴ l⁸k⁴l⁸⁷ood o⁵ bot⁷ tru⁴ and ⁵als⁴ con⁵⁴ss⁸ons. ⁴ n⁴xt subs⁴ct⁸ons d⁸scuss ⁴xp⁴r⁸m⁴nts t⁷at ⁴mploy co⁶n⁸t⁸v⁴-load m⁴t⁷ods. Increasing the difficulty of the recollection
R
Vr⁸⁹ ⁴t al. [00] carr⁸⁴d out an ⁴xp⁴r⁸m⁴nt w⁷⁴r⁴ trut⁷-t⁴ll⁴rs and l⁸ars w⁴r⁴ ask⁴d to r⁴call an ⁴v⁴nt ⁸n c⁷ronolo⁶⁸cal and r⁴v⁴rs⁴ ord⁴r. ⁴y appl⁸⁴d t⁷⁴ typ⁸cal “mock t⁷⁴⁵t” parad⁸⁶m w⁷⁸l⁴
30
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
t⁷⁴ trut⁷-t⁴ll⁴rs ⁷ad to ⁷on⁴stly r⁴call an ⁴v⁴nt w⁷⁴r⁴ t⁷⁴y ⁷ad not comm⁸tt⁴d a mock t⁷⁴⁵t, t⁷⁴ l⁸ars ⁷ad to d⁴ny t⁷⁴⁸r r⁴spons⁸b⁸l⁸ty ⁵or t⁷⁴ mock t⁷⁴⁵t t⁷⁴y ⁷ad comm⁸tt⁴d. ot⁷ trut⁷-t⁴ll⁴rs and l⁸ars w⁴r⁴ told t⁷⁴y would b⁴ r⁴ward⁴d ⁸⁵ t⁷⁴y w⁴r⁴ succ⁴ss⁵ul ⁸n conv⁸nc⁸n⁶ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r o⁵ t⁷⁴⁸r ⁸nnoc⁴nc⁴, w⁷⁸l⁴ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r was bl⁸nd w⁸t⁷ r⁴sp⁴ct to t⁷⁴ cond⁸t⁸on o⁵ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴⁴. ow⁴v⁴r, ⁸n bot⁷ ⁶roups, ⁷al⁵ t⁷⁴ sub⁹⁴cts w⁴r⁴ ask⁴d to t⁴ll t⁷⁴ ⁴v⁴nt ⁸n c⁷ronolo⁶⁸cal ord⁴r, and ⁷al⁵ ⁷ad to r⁴call ⁸t ⁸n r⁴v⁴rs⁴ ord⁴r. ⁴ r⁴sults confirm⁴d t⁷⁴ ⁴xp⁴ctat⁸ons o⁵ t⁷⁴ aut⁷ors “part⁸c⁸pants ⁸n t⁷⁴ control cond⁸t⁸on s⁷ow⁴d only on⁴ cu⁴ o⁵ d⁴c⁴⁸t l⁸ars mov⁴d t⁷⁴⁸r ⁷ands and fin⁶⁴rs l⁴ss t⁷an trut⁷ t⁴ll⁴rs” [Vr⁸⁹ ⁴t al., 00, p. ]. y contrast, “n t⁷⁴ R⁴v⁴rs⁴ Ord⁴r cond⁸t⁸on l⁸ars ⁸nclud⁴d ⁵⁴w⁴r aud⁸tory d⁴ta⁸ls and cont⁴xtual ⁴mb⁴dd⁸n⁶ d⁴ta⁸ls and mor⁴ co⁶n⁸t⁸v⁴ op⁴rat⁸ons ⁸n t⁷⁴⁸r stor⁸⁴s t⁷an trut⁷ t⁴ll⁴rs, t⁷r⁴⁴ s⁸⁶ns o⁵ co⁶n⁸t⁸v⁴ load. urt⁷⁴rmor⁴, l⁸ars mad⁴ mor⁴ sp⁴⁴c⁷ ⁷⁴s⁸tat⁸ons, spok⁴ w⁸t⁷ a slow⁴r sp⁴⁴c⁷ rat⁴, and mad⁴ mor⁴ sp⁴⁴c⁷ ⁴rrors t⁷an trut⁷ t⁴ll⁴rs, w⁷⁸c⁷ ar⁴ t⁷r⁴⁴ mor⁴ s⁸⁶ns o⁵ co⁶n⁸t⁸v⁴ load. L⁸ars, ⁷ow⁴v⁴r, d⁸d not ⁹ust r⁴v⁴al mor⁴ s⁸⁶ns o⁵ co⁶n⁸t⁸v⁴ load t⁷an trut⁷ t⁴ll⁴rs. ⁴y also mad⁴ mor⁴ l⁴⁶ and ⁵oot mov⁴m⁴nts t⁷an trut⁷ t⁴ll⁴rs and bl⁸nk⁴d mor⁴. ⁴s⁴ ar⁴ s⁸⁶ns o⁵ n⁴rvousn⁴ss, rat⁷⁴r t⁷an s⁸⁶ns o⁵ co⁶n⁸t⁸v⁴ load” [Vr⁸⁹ ⁴t al., 00, p. ]. ⁴ purpos⁴ o⁵ t⁷⁴ r⁴s⁴arc⁷⁴rs, ⁷ow⁴v⁴r, was not s⁸mply to v⁴r⁸⁵y t⁷⁴ r⁴lat⁸on b⁴tw⁴⁴n co⁶n⁸t⁸v⁴ load and d⁴c⁴pt⁸on cu⁴s t⁷⁴y w⁴r⁴ also ⁸nt⁴r⁴st⁴d ⁸n ⁴st⁸mat⁸n⁶ t⁷⁴ ⁴xt⁴nt to w⁷⁸c⁷ t⁷⁴s⁴ cu⁴s to d⁴c⁴pt⁸on could b⁴ ⁴xplo⁸t⁴d by obs⁴rv⁴rs not sp⁴c⁸fically tra⁸n⁴d ⁵or t⁷⁴ task. ⁴y carr⁸⁴d out a s⁴cond ⁴xp⁴r⁸m⁴nt, w⁷⁴r⁴ pol⁸c⁴ offic⁴rs ⁷ad to ⁶⁸v⁴ a ⁹ud⁶m⁴nt r⁴⁶ard⁸n⁶ t⁷⁴ trut⁷⁵uln⁴ss o⁵ sub⁹⁴cts ⁸n t⁷⁴ first ⁴xp⁴r⁸m⁴nt, bot⁷ ⁸n control and ⁸n r⁴v⁴rs⁴ ord⁴r cond⁸t⁸ons. W⁷⁸l⁴ ⁸n t⁷⁴ first cond⁸t⁸on t⁷⁴ p⁴r⁵ormanc⁴ o⁵ t⁷⁴ pol⁸c⁴ ⁹ud⁶⁴s d⁸d not ⁴xc⁴⁴d t⁷⁴ c⁷anc⁴ l⁴v⁴l, ⁸n t⁷⁴ s⁴cond ⁸t d⁸d, as t⁷⁴ l⁸ars w⁴r⁴ ⁵ound to b⁴ mor⁴ n⁴rvous and d⁸splay⁴d a n⁴⁴d to t⁷⁸nk ⁷ard⁴r t⁷an trut⁷t⁴ll⁴rs. ⁸s s⁴cond r⁴sult ⁸s r⁴markabl⁴, as ⁸t su⁶⁶⁴sts t⁷at t⁷⁴ co⁶n⁸t⁸v⁴ load approac⁷ can b⁴ part⁸cularly us⁴⁵ul ⁸n r⁴al-l⁸⁵⁴ sc⁴nar⁸os. M
Anot⁷⁴r study by Vr⁸⁹ ⁴t al. [00] ⁵ollows t⁷⁴ sam⁴ sc⁷⁴m⁴ as t⁷⁴ pr⁴v⁸ous on⁴, w⁸t⁷ t⁷⁴ d⁸ff⁴r⁴nc⁴ t⁷at, ⁸n t⁷⁸s cas⁴, t⁷⁴ sub⁹⁴cts w⁴r⁴ ⁸nstruct⁴d to ma⁸nta⁸n ⁴y⁴ contact w⁸t⁷ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r w⁷⁸l⁴ r⁴call⁸n⁶ t⁷⁴ ⁴v⁴nt. n t⁷⁸s cas⁴ also, t⁷⁴ ⁴xp⁴r⁸m⁴ntal d⁴s⁸⁶n cons⁸st⁴d o⁵ t⁷⁴ mock t⁷⁴⁵t parad⁸⁶m, w⁷⁴r⁴ trut⁷-t⁴ll⁴rs and l⁸ars ⁷ad to r⁴call t⁷⁴ ⁴v⁴nt w⁸t⁷ and w⁸t⁷out constant ⁴y⁴ contact w⁸t⁷ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴r. As w⁸t⁷ t⁷⁴ r⁴v⁴rs⁴ ord⁴r s⁸tuat⁸on, t⁷⁴ ⁷ypot⁷⁴s⁸s was t⁷at ma⁸nta⁸n⁸n⁶ ⁴y⁴ contact ⁸s co⁶n⁸t⁸v⁴ly d⁴mand⁸n⁶, t⁷⁴r⁴by ⁸nduc⁸n⁶ mor⁴ cu⁴s to d⁴c⁴pt⁸on. v⁴n t⁷ou⁶⁷ t⁷⁴ d⁸r⁴ct⁸on o⁵ t⁷⁴ outcom⁴ matc⁷⁴d t⁷⁴ aut⁷ors’ ⁴xp⁴ctat⁸ons, t⁷⁴ r⁴sults w⁴r⁴ w⁴ak⁴r t⁷an ⁸n t⁷⁴ pr⁴v⁸ous study. n ⁵act, ⁸n t⁷⁴ ⁴y⁴ contact cond⁸t⁸on, bot⁷ l⁸ars and trut⁷ t⁴ll⁴rs m⁴nt⁸on⁴d ⁵⁴w⁴r aud⁸tory and t⁴mporal d⁴ta⁸ls, spok⁴ slow⁴r, and ⁸ncr⁴as⁴d ⁴y⁴ bl⁸nks and ⁷and/fin⁶⁴r mov⁴m⁴nts. ⁴s⁴ ar⁴, as t⁷⁴ r⁴s⁴arc⁷⁴rs t⁷⁴ms⁴lv⁴s und⁴rl⁸n⁴d, s⁸⁶ns o⁵ n⁴rvousn⁴ss rat⁷⁴r t⁷an co⁶n⁸t⁸v⁴ load, ⁸nd⁸cat⁸n⁶ t⁷at t⁷⁴ ⁴y⁴ contact cond⁸t⁸on may aff⁴ct bot⁷ co⁶n⁸t⁸v⁴ proc⁴ss⁸n⁶ and ⁴mot⁸onal r⁴act⁸ons. Non⁴t⁷⁴l⁴ss, ⁸n t⁷⁴ ⁴y⁴ contact cond⁸t⁸on, two var⁸abl⁴s s⁸⁶n⁸ficantly d⁸ff⁴r⁴nt⁸at⁴d trut⁷-t⁴ll⁴rs ⁵rom l⁸ars “L⁸ars ⁸nclud⁴d ⁵⁴w⁴r spat⁸al d⁴ta⁸ls ⁸nto t⁷⁴⁸r account t⁷an trut⁷ t⁴ll⁴rs and told t⁷⁴⁸r story ⁸n a mor⁴ c⁷ronolo⁶⁸cal ord⁴r” [Vr⁸⁹ ⁴t al., 00].
2.3. THE PSYCHOLOGY LITERATURE
31
n sp⁸t⁴ o⁵ t⁷⁴s⁴ not part⁸cularly ⁸mpr⁴ss⁸v⁴ r⁴sults, t⁷⁴ ⁴y⁴ contact cond⁸t⁸on ⁷⁴lp⁴d untra⁸n⁴d stud⁴nt obs⁴rv⁴rs to d⁴t⁴ct d⁴c⁴pt⁸on. ⁴ s⁴cond ⁴xp⁴r⁸m⁴nt o⁵ t⁷⁸s study, ⁸n ⁵act, s⁷ow⁴d t⁷at only ⁸n t⁷⁸s cond⁸t⁸on could t⁷⁴y d⁸st⁸n⁶u⁸s⁷ b⁴tw⁴⁴n l⁸⁴s and trut⁷s, ⁴v⁴n t⁷ou⁶⁷ ⁸n t⁷⁸s cas⁴ too t⁷⁴ ⁴ff⁴ct was not part⁸cularly stron⁶. n conclus⁸on, t⁷⁴ aut⁷ors adm⁸t t⁷at t⁷⁴ co⁶n⁸t⁸v⁴ load ⁸mpos⁴d by t⁷⁴ ⁴y⁴ contact cond⁸t⁸on may ⁷av⁴ b⁴⁴n ov⁴rrat⁴d, ⁷ypot⁷⁴s⁸z⁸n⁶, ⁵or ⁴xampl⁴, t⁷at t⁷⁴ d⁸fficulty o⁵ t⁷⁴ task could b⁴ r⁴duc⁴d by pract⁸c⁴. v⁴n so, Vr⁸⁹ ⁴t al. [00] r⁴mark t⁷at t⁷⁴ obs⁴rv⁴rs’ p⁴r⁵ormanc⁴ ⁸n l⁸⁴ d⁴t⁴ct⁸on b⁴n⁴fit⁴d by t⁷⁸s t⁴c⁷n⁸qu⁴, and conclud⁴ t⁷at r⁴qu⁸r⁴d ⁴y⁴ contact could b⁴ ⁴mploy⁴d as a tool ⁵or d⁴c⁴pt⁸on d⁴t⁴ct⁸on.
Inducing subjects to speak more n anot⁷⁴r r⁴c⁴nt study, Vr⁸⁹ and ⁷⁸s ⁶roup ⁹o⁸ntly ⁸nv⁴st⁸⁶at⁴ t⁷⁴ ⁴ff⁴ct o⁵ ⁸nt⁴rv⁸⁴w⁴r d⁴m⁴anour and un⁴xp⁴ct⁴d qu⁴st⁸ons [S⁷aw ⁴t al., 0]. n part⁸cular, t⁷⁴ study, try⁸n⁶ to r⁴produc⁴ t⁷⁴ pol⁸c⁴ ⁸nt⁴rro⁶at⁸on sc⁴nar⁸o, w⁷⁸c⁷ ⁸s o⁵t⁴n conduct⁴d by mor⁴ t⁷an on⁴ offic⁴r, ⁴mploy⁴d two ⁸nt⁴rv⁸⁴w⁴rs, on⁴ o⁵ w⁷om pos⁴d t⁷⁴ qu⁴st⁸ons, w⁷⁸l⁴ t⁷⁴ s⁴cond, w⁷o r⁴ma⁸n⁴d s⁸l⁴nt, was pr⁴s⁴nt⁴d as an ⁴xp⁴rt ⁸n d⁴c⁴pt⁸on d⁴t⁴ct⁸on. On t⁷⁸s bas⁸s, a 2 2 2 ⁴xp⁴r⁸m⁴ntal d⁴s⁸⁶n was cr⁴at⁴d, w⁷⁴r⁴ t⁷⁴ b⁴⁷av⁸or o⁵ l⁸ars and trut⁷ t⁴ll⁴rs was ⁴valuat⁴d w⁸t⁷ r⁴sp⁴ct to t⁷⁴ d⁴m⁴anour adopt⁴d by ⁴ac⁷ o⁵ t⁷⁴ two ⁸nt⁴rv⁸⁴w⁴rs, w⁷⁸c⁷ could b⁴ ⁴⁸t⁷⁴r n⁴utral or support⁸v⁴. Support⁸v⁴ b⁴⁷av⁸or cons⁸st⁴d ⁸n “l⁴an⁸n⁶ ⁵orward, nodd⁸n⁶ and sm⁸l⁸n⁶ w⁷⁴n t⁷⁴ part⁸c⁸pants answ⁴r⁴d,” w⁷⁸l⁴ w⁷⁴n t⁷⁴y adopt⁴d a n⁴utral d⁴m⁴anour “t⁷⁴ ⁸nt⁴rv⁸⁴w⁴rs k⁴pt an op⁴n postur⁴ but lar⁶⁴ly d⁸d not r⁴spond to t⁷⁴ part⁸c⁸pant’s answ⁴rs. ⁴y w⁴r⁴ also consc⁸ous, ⁷ow⁴v⁴r, o⁵ not app⁴ar⁸n⁶ n⁴⁶at⁸v⁴ or d⁸sb⁴l⁸⁴v⁸n⁶” [S⁷aw ⁴t al., 0, p. ]. n add⁸t⁸on, t⁷⁴ sub⁹⁴cts w⁴r⁴ ask⁴d to answ⁴r ⁴xp⁴ct⁴d and un⁴xp⁴ct⁴d qu⁴st⁸ons ⁸n part⁸cular, t⁷⁴y w⁴r⁴ r⁴qu⁸r⁴d to r⁴call an ⁴v⁴nt ⁸nvolv⁸n⁶ t⁷⁴ pr⁴parat⁸on o⁵ a room ⁵or a s⁴m⁸nar ⁸n bot⁷ c⁷ronolo⁶⁸cal and r⁴v⁴rs⁴ ord⁴r. n ord⁴r to ⁴valuat⁴ t⁷⁴ stat⁴m⁴nts o⁵ t⁷⁴ sub⁹⁴cts, ⁵our d⁴p⁴nd⁴nt var⁸abl⁴s w⁴r⁴ ⁴xam⁸n⁴d t⁷⁴ ⁵r⁴qu⁴ncy o⁵ v⁸sual and spat⁸al d⁴ta⁸ls ⁸n t⁷⁴ normal and r⁴v⁴rs⁴ ord⁴r r⁴coll⁴ct⁸ons, t⁷⁴ ⁵r⁴qu⁴ncy o⁵ normal ord⁴r t⁴mporal conn⁴ct⁸v⁴s ⁸n normal ord⁴r r⁴coll⁴ct⁸ons, and t⁷⁴ ⁵r⁴qu⁴ncy o⁵ r⁴v⁴rs⁴ ord⁴r t⁴mporal conn⁴ct⁸v⁴s ⁸n r⁴v⁴rs⁴ ord⁴r r⁴coll⁴ct⁸ons. As w⁸t⁷ t⁷⁴ ⁴xp⁴r⁸m⁴nts d⁴scr⁸b⁴d abov⁴, trut⁷-t⁴ll⁴rs ⁷ad to carry out and lat⁴r to d⁴scr⁸b⁴ som⁴ act⁸v⁸t⁸⁴s, w⁷⁸l⁴ t⁷⁴ l⁸ars comm⁸tt⁴d a mock t⁷⁴⁵t and ⁷ad to pr⁴t⁴nd to ⁷av⁴ don⁴ t⁷⁴ sam⁴ t⁷⁸n⁶s as t⁷⁴ trut⁷-t⁴ll⁴rs. ⁴ most r⁴markabl⁴ r⁴sult o⁵ t⁷⁴ study was t⁷⁴ ⁴ff⁴ct r⁴lat⁴d to t⁷⁴ s⁸l⁴nt ⁸nt⁴rv⁸⁴w⁴r w⁷o, w⁷⁴n support⁸v⁴, s⁸⁶n⁸ficantly ⁴l⁸c⁸t⁴d an ⁸ncr⁴as⁴d l⁴aka⁶⁴ o⁵ cu⁴s to d⁴c⁴pt⁸on, sp⁴c⁸fically w⁸t⁷ r⁴sp⁴ct to t⁷⁴ amount o⁵ d⁴ta⁸l ⁸n r⁴spons⁴s, w⁸t⁷ “l⁸ars prov⁸d⁸n⁶ mor⁴ d⁴ta⁸l ⁸n t⁷⁴ ⁴xp⁴ct⁴d qu⁴st⁸on and ⁵⁴w⁴r d⁴ta⁸ls ⁸n t⁷⁴ un⁴xp⁴ct⁴d qu⁴st⁸on” [S⁷aw ⁴t al., 0, p. ]. ⁸s outcom⁴ confirm⁴d t⁷⁴ aut⁷ors’ ⁷ypot⁷⁴s⁴s t⁷at l⁸ars can pr⁴par⁴ ⁵or ⁴xp⁴ct⁴d qu⁴st⁸ons and, part⁸cularly ⁸n t⁷⁴ pr⁴s⁴nc⁴ o⁵ a support⁸v⁴ s⁸l⁴nt ⁸nt⁴rv⁸⁴w⁴r, ar⁴ mot⁸vat⁴d to prov⁸d⁴ many d⁴ta⁸ls, but t⁷⁴y ar⁴ unabl⁴ to pr⁴par⁴ ⁵or un⁴xp⁴ct⁴d qu⁴st⁸ons. t may s⁴⁴m surpr⁸s⁸n⁶ t⁷at t⁷⁴ ⁴ff⁴ct o⁵ t⁷⁴ s⁸l⁴nt ⁸nt⁴rv⁸⁴w⁴r was stron⁶⁴r t⁷an t⁷at o⁵ t⁷⁴ qu⁴st⁸on⁸n⁶ ⁸nt⁴rv⁸⁴w⁴r, ⁸n part⁸cular b⁴caus⁴ sub⁹⁴ct ⁶az⁴ toward t⁷⁴ s⁸l⁴nt ⁸nt⁴rv⁸⁴w⁴r was low⁴r t⁷an 0% o⁵ t⁷⁴ ov⁴rall ⁸nt⁴rv⁸⁴w t⁸m⁴. ⁴ r⁴s⁴arc⁷⁴rs propos⁴ two ⁴xplanat⁸ons ⁵or t⁷⁸s
32
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
p⁷⁴nom⁴non. ⁸rst, t⁷⁴ sub⁹⁴cts w⁴r⁴ ⁴xpl⁸c⁸tly told t⁷at t⁷⁴ s⁸l⁴nt ⁸nt⁴rv⁸⁴w⁴r was part⁸cularly ⁴xp⁴rt ⁸n d⁴c⁴pt⁸on d⁴t⁴ct⁸on, and ⁴v⁴n t⁷⁴ l⁸m⁸t⁴d amount o⁵ ⁶az⁴ ⁸s suffic⁸⁴nt to d⁴t⁴ct ⁷⁸s/⁷⁴r d⁴m⁴anour. Mor⁴ov⁴r, t⁷⁴ sp⁴ak⁸n⁶ ⁸nt⁴rv⁸⁴w⁴r ⁶av⁴ two k⁸nds o⁵ s⁸⁶nals, sp⁴⁴c⁷ and d⁴m⁴anour, and t⁷⁴ first could ⁷av⁴ ⁸nt⁴r⁵⁴r⁴d w⁸t⁷ t⁷⁴ s⁴cond, w⁷⁸l⁴ t⁷⁴ s⁸l⁴nt ⁸nt⁴rv⁸⁴w⁴r only commun⁸cat⁴d t⁷rou⁶⁷ ⁷⁸s/⁷⁴r d⁴m⁴anour. Lastly, t⁷⁴ aut⁷ors ⁴xam⁸n⁴d t⁷⁴ us⁴ o⁵ t⁴mporal conn⁴ct⁸v⁴s ⁸n t⁷⁴ r⁴v⁴rs⁴ ord⁴r cond⁸t⁸on. ⁴y ⁵ound t⁷at “L⁸ars us⁴d t⁷⁴ conn⁴ct⁸v⁴ ‘b⁴⁵or⁴ t⁷at’ l⁴ss o⁵t⁴n t⁷an trut⁷ t⁴ll⁴rs, w⁷⁴r⁴as t⁷⁴r⁴ was no d⁸ff⁴r⁴nc⁴ ⁵or t⁷⁴ ‘and t⁷⁴n’ conn⁴ct⁸v⁴. ⁴ trut⁷-t⁴ll⁴rs’ ⁴xp⁴r⁸⁴nc⁴ o⁵ t⁷⁴ ⁴v⁴nt t⁷⁴y r⁴call⁴d dur⁸n⁶ t⁷⁴ ⁸nt⁴rv⁸⁴w was ⁸n⁷⁴r⁴ntly d⁸ff⁴r⁴nt ⁵rom l⁸ars’ ⁴xp⁴r⁸⁴nc⁴. As a r⁴sult, trut⁷ t⁴ll⁴rs may ⁷av⁴ ⁷ad a mor⁴ co⁷⁴r⁴nt r⁴v⁴rs⁴ ord⁴r scr⁸pt o⁵ t⁷⁴⁸r act⁸v⁸t⁸⁴s, r⁴fl⁴ct⁴d ⁸n t⁷⁴ ⁸ncr⁴as⁴d us⁴ o⁵ r⁴v⁴rs⁴ ord⁴r t⁴mporal conn⁴ct⁸v⁴s” [S⁷aw ⁴t al., 0, p. ]. Conclusion ⁴ cor⁴ ⁸d⁴a o⁵ t⁷⁴ r⁴c⁴nt stud⁸⁴s o⁵ Vr⁸⁹ and ⁷⁸s r⁴s⁴arc⁷ ⁶roup ⁸s t⁷at t⁷⁴ d⁴c⁴pt⁸on d⁴t⁴ct⁸on probl⁴m can b⁴ addr⁴ss⁴d us⁸n⁶ m⁴t⁷ods a⁸m⁴d to ⁴n⁷anc⁴ t⁷⁴ ⁴xpr⁴ss⁸on o⁵ d⁴c⁴pt⁸on cu⁴s, rat⁷⁴r t⁷an to find n⁴w cu⁴s w⁷⁸c⁷, ⁴sp⁴c⁸ally ⁸⁵ s⁸n⁶ularly cons⁸d⁴r⁴d, would probably turn out to b⁴ w⁴ak and unr⁴l⁸abl⁴. ⁴ us⁴ o⁵ sp⁴c⁸al t⁴c⁷n⁸qu⁴s, ⁷ow⁴v⁴r, ⁸s not compl⁴t⁴ly n⁴w. ⁴ pr⁴scr⁸pt⁸on o⁵ r⁴call⁸n⁶ an ⁴v⁴nt ⁸n r⁴v⁴rs⁴ c⁷ronolo⁶⁸cal ord⁴r, ⁵or ⁴xampl⁴, b⁴lon⁶s to t⁷⁴ co⁶n⁸t⁸v⁴ ⁸nt⁴rv⁸⁴w [⁸s⁷⁴r and ⁴⁸s⁴lman, ], a w⁴ll-known tool ⁵or opt⁸m⁸z⁸n⁶ t⁷⁴ coll⁴ct⁸on o⁵ ⁸n⁵ormat⁸on ⁸n pol⁸c⁴ ⁸nv⁴st⁸⁶at⁸on ⁵rom coop⁴rat⁸v⁴ sub⁹⁴cts. ⁴ nov⁴lty cons⁸sts ⁸n t⁷⁴ purpos⁴ o⁵ t⁷⁴ appl⁸cat⁸on o⁵ t⁷⁸s t⁴c⁷⁸qu⁴, w⁷⁸c⁷ ⁸s us⁴d not only to ⁴n⁷anc⁴ t⁷⁴ amount o⁵ ⁸n⁵ormat⁸on prov⁸d⁴d by t⁷⁴ sub⁹⁴cts, but also to str⁴n⁶t⁷⁴n poss⁸bl⁴ cu⁴s to d⁴c⁴pt⁸on. ⁸s approac⁷ ⁸s v⁴ry r⁴asonabl⁴ and, w⁴ t⁷⁸nk, d⁴st⁸n⁴d to aff⁴ct b⁴st pract⁸c⁴s ⁸n t⁴st⁸mony coll⁴ct⁸on, ⁴sp⁴c⁸ally ⁸n t⁷⁴ cas⁴ o⁵ uncoop⁴rat⁸v⁴ sub⁹⁴cts. Obv⁸ously, t⁷⁴ m⁴t⁷ods Vr⁸⁹ ⁸s d⁴v⁴lop⁸n⁶ could also b⁴ us⁴⁵ul ⁵rom t⁷⁴ p⁴rsp⁴ct⁸v⁴ o⁵ l⁸n⁶u⁸st⁸c analys⁴s, w⁷⁸c⁷ can b⁴n⁴fit ⁵rom an ⁸ncr⁴as⁴d ⁴xpr⁴ss⁸on o⁵ l⁸n⁶u⁸st⁸c cu⁴s to d⁴c⁴pt⁸on, as t⁷⁴ add⁸t⁸onal lan⁶ua⁶⁴ ⁴ncoura⁶⁴d by t⁷⁴ d⁴s⁸⁶n o⁵ t⁷⁴ ⁴xp⁴r⁸m⁴nts s⁷ows.
2.4
THE FORENSIC LITERATURE
⁴ fi⁴lds o⁵ appl⁸⁴d psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴, w⁷⁸c⁷ draw t⁷⁴⁸r data ⁵rom cr⁸m⁸nal cas⁴s and/or prov⁸d⁴ support ⁵or t⁷⁴ us⁴ o⁵ sp⁴c⁸fic t⁴c⁷n⁸qu⁴s and tools ⁵or ⁵or⁴ns⁸c appl⁸cat⁸ons, ar⁴ also a sourc⁴ o⁵ ⁸ns⁸⁶⁷t ⁸nto d⁴c⁴pt⁸v⁴ b⁴⁷av⁸or. ⁴s⁴ fi⁴lds ⁷av⁴ spawn⁴d s⁴v⁴ral approac⁷⁴s to v⁴rbal d⁴c⁴pt⁸on t⁷at w⁴ cov⁴r ⁷⁴r⁴. ⁴ approac⁷⁴s cap⁸tal⁸z⁴ on d⁸ff⁴r⁴nc⁴s b⁴tw⁴⁴n ⁴xp⁴r⁸⁴nc⁴d ⁴v⁴nts and ⁸ma⁶⁸n⁴d ⁴v⁴nts. R⁴c⁴nt work on t⁷⁴ r⁴lat⁸ons⁷⁸p b⁴tw⁴⁴n d⁴c⁴pt⁸v⁴ op⁸n⁸ons and ⁸ma⁶⁸nat⁸v⁴ wr⁸t⁸n⁶ ⁸n t⁷⁴ L commun⁸ty mak⁴s t⁷⁸s ar⁴a o⁵ ⁵or⁴ns⁸cs part⁸cularly r⁴l⁴vant ⁷⁴r⁴.
2.4.1 STATEMENT ANALYSIS As d⁴scr⁸b⁴d ⁸n Vr⁸⁹ [00], Stat⁴m⁴nt Analys⁸s ⁷ad ⁸ts or⁸⁶⁸ns ⁸n a qu⁴st⁸on pos⁴d by t⁷⁴ W⁴st ⁴rman Supr⁴m⁴ ourt ⁸n as to w⁷⁴t⁷⁴r psyc⁷olo⁶⁸sts could ass⁴ss t⁷⁴ cr⁴d⁸b⁸l⁸ty o⁵ c⁷⁸ld
2.4. THE FORENSIC LITERATURE
33
w⁸tn⁴ss⁴s ⁸n s⁴xual abus⁴ cas⁴s—a typ⁴ o⁵ r⁴al world cas⁴ t⁷at o⁵t⁴n lacks ⁸nd⁴p⁴nd⁴nt v⁴r⁸ficat⁸on. ⁴ r⁴sult was a r⁴qu⁸r⁴m⁴nt t⁷at psyc⁷olo⁶⁸cal ass⁴ssm⁴nts b⁴ don⁴ ⁸n all suc⁷ cas⁴s, w⁸t⁷ “a compr⁴⁷⁴ns⁸v⁴ l⁸st o⁵ cr⁸t⁴r⁸a” first prov⁸d⁴d by Und⁴utsc⁷ []. ⁴ cr⁸t⁴r⁸a, all cons⁸d⁴r⁴d attr⁸but⁴s o⁵ trut⁷⁵ul, ⁸.⁴., ⁴xp⁴r⁸⁴nc⁴d, accounts o⁵ an ⁸nc⁸d⁴nt, ⁸nclud⁴d anc⁷or⁸n⁶ o⁵ t⁷⁴ account, concr⁴t⁴n⁴ss, w⁴alt⁷ o⁵ d⁴ta⁸l, or⁸⁶⁸nal⁸ty, ⁸nt⁴rnal cons⁸st⁴ncy, and m⁴nt⁸on o⁵ d⁴ta⁸ls sp⁴c⁸fic to t⁷⁴ typ⁴ o⁵ cas⁴. Und⁴utsc⁷ also ⁸d⁴nt⁸fi⁴d man⁸⁵⁴stat⁸ons o⁵ t⁷⁴s⁴ cr⁸t⁴r⁸a, ⁸nclud⁸n⁶ r⁴⁵⁴r⁴nc⁴ to d⁴ta⁸ls ⁴xc⁴⁴d⁸n⁶ t⁷⁴ c⁷⁸ld’s capab⁸l⁸ty to und⁴rstand, r⁴port⁸n⁶ o⁵ sub⁹⁴ct⁸v⁴ ⁴xp⁴r⁸⁴nc⁴s, m⁴nt⁸on⁸n⁶ o⁵ un⁴xp⁴ct⁴d compl⁸cat⁸ons, spontan⁴ous corr⁴ct⁸ons, and s⁴l⁵-d⁴pr⁴cat⁸n⁶ ⁸nt⁴rsp⁴rs⁸ons. Susan Adams, a psyc⁷olo⁶⁸st and ⁸nv⁴st⁸⁶ator, adapt⁴d Stat⁴m⁴nt Analys⁸s to t⁷⁴ wr⁸tt⁴n narrat⁸v⁴s o⁵ adult susp⁴cts and v⁸ct⁸ms o⁵ cr⁸m⁸nal ⁸nc⁸d⁴nts [Adams, , 00, Adams and Jarv⁸s, 00]. Adams look⁴d at 0 narrat⁸v⁴s, ⁵or w⁷⁸c⁷ ⁸nv⁴st⁸⁶at⁸v⁴ ⁴v⁸d⁴nc⁴ ⁷ad d⁴t⁴rm⁸n⁴d 0 narrat⁸v⁴s to b⁴ tru⁴ and 0 d⁴c⁴pt⁸v⁴. ⁴ narrat⁸v⁴s ⁷ad a m⁴an word count o⁵ words and a ran⁶⁴ ⁵rom words to ,0. Adams t⁴st⁴d s⁸x attr⁸but⁴s, or cu⁴s, ⁵or t⁷⁴⁸r corr⁴lat⁸on w⁸t⁷ ⁴⁸t⁷⁴r trut⁷⁵ul or d⁴c⁴pt⁸v⁴ narrat⁸v⁴s, and ⁵ound t⁷at t⁷⁴r⁴ was a stron⁶ pos⁸t⁸v⁴ r⁴lat⁸on b⁴tw⁴⁴n d⁴c⁴pt⁸on and t⁷r⁴⁴ cu⁴s—⁴qu⁸vocat⁸on, n⁴⁶at⁸on, and r⁴lat⁸v⁴ l⁴n⁶t⁷ o⁵ t⁷⁴ narrat⁸v⁴ prolo⁶u⁴. On⁴ cu⁴, un⁸qu⁴ s⁴nsory d⁴ta⁸ls, s⁷ow⁴d a stron⁶ pos⁸t⁸v⁴ r⁴lat⁸on w⁸t⁷ trut⁷, w⁷⁸l⁴ v⁴rbal ⁴xpr⁴ss⁸on o⁵ ⁴mot⁸on was w⁴akly assoc⁸at⁴d w⁸t⁷ trut⁷. Quot⁴d d⁸scours⁴, w⁷⁸c⁷ pr⁴v⁸ous work by Rask⁸n and spl⁸n [], St⁴ll⁴r and Kö⁷nk⁴n [], and Und⁴utsc⁷ [] ⁷ad ⁵ound to b⁴ assoc⁸at⁴d w⁸t⁷ v⁴rac⁸ty ⁸n oral narrat⁸v⁴s, s⁷ow⁴d no suc⁷ r⁴lat⁸on ⁸n Adams’ narrat⁸v⁴s. Us⁸n⁶ t⁷⁴ s⁸x cu⁴s, Adams t⁴st⁴d var⁸ous r⁴⁶r⁴ss⁸on mod⁴ls on t⁷⁴⁸r ab⁸l⁸ty to d⁸scr⁸m⁸nat⁴ t⁷⁴ tru⁴ ⁵rom t⁷⁴ d⁴c⁴pt⁸v⁴ narrat⁸v⁴s. S⁷⁴ ⁵ound t⁷at t⁷⁴ mod⁴l t⁷at r⁴pr⁴s⁴nt⁴d t⁷⁴ ⁵r⁴qu⁴ncy o⁵ ⁴ac⁷ cu⁴ d⁸v⁸d⁴d by t⁷⁴ total word count—Adams’ d⁴ns⁸ty rat⁸o mod⁴l—⁵or t⁷⁴ ⁴nt⁸r⁴ narrat⁸v⁴ as w⁴ll as t⁷⁴ word count p⁴rc⁴nta⁶⁴s o⁵ t⁷⁴ prolo⁶u⁴ part⁸t⁸on was t⁷⁴ b⁴st pr⁴d⁸ctor o⁵ t⁷⁴ v⁴rac⁸ty o⁵ ⁴ac⁷ narrat⁸v⁴, class⁸⁵y⁸n⁶ .% o⁵ t⁷⁴ narrat⁸v⁴s corr⁴ctly ov⁴rall, w⁸t⁷ % o⁵ t⁷⁴ trut⁷⁵ul narrat⁸v⁴s and % o⁵ t⁷⁴ d⁴c⁴pt⁸v⁴ narrat⁸v⁴s corr⁴ctly class⁸fi⁴d. Most o⁵ Adams’ cu⁴s, c⁷os⁴n b⁴caus⁴ o⁵ t⁷⁴⁸r r⁴l⁴vanc⁴ to wr⁸tt⁴n narrat⁸v⁴, w⁴r⁴ tak⁴n ⁵rom t⁷⁴ psyc⁷olo⁶y l⁸t⁴ratur⁴, ⁸nclud⁸n⁶ bot⁷ work cov⁴r⁴d ⁸n ⁴Paulo ⁴t al. [00] and work ⁵rom Stat⁴m⁴nt Analys⁸s. ow⁴v⁴r, t⁷⁴ ⁸d⁴a t⁷at narrat⁸v⁴s ⁷av⁴ a b⁴⁶⁸nn⁸n⁶, a m⁸ddl⁴, and an ⁴nd, w⁸t⁷ d⁴c⁴pt⁸v⁴ narrat⁸v⁴s ⁷av⁸n⁶ a lon⁶⁴r b⁴⁶⁸nn⁸n⁶ (poss⁸bly to cov⁴r ⁵or t⁷⁴ unknowns ⁸n t⁷⁴ sk⁸mpy m⁸ddl⁴) com⁴s ⁵rom t⁷⁴ ⁵or⁴ns⁸c work o⁵ [Rabon, , Rudac⁸ll⁴, ] and [Sap⁸r, ].
2.4.2 STATEMENT VALIDITY ANALYSIS Stat⁴m⁴nt Analys⁸s ⁷as ⁴volv⁴d ⁸nto a syst⁴mat⁸c cr⁴d⁸b⁸l⁸ty ass⁴ssm⁴nt ⁸nstrum⁴nt known as Stat⁴m⁴nt Val⁸d⁸ty Analys⁸s (SVA), w⁷⁸c⁷ ⁷as roots ⁸n bot⁷ ⁴rmany and Sw⁴d⁴n [Trank⁴ll, ]. SVA prov⁸d⁴s a s⁴t o⁵ tools to ass⁴ss v⁴rac⁸ty ⁸n c⁷⁸ld w⁸tn⁴ss⁴s ⁸n s⁴xual abus⁴ cas⁴s, alt⁷ou⁶⁷ ⁸t ⁷as b⁴⁴n ⁴xt⁴nd⁴d to adult w⁸tn⁴ss⁴s ⁸n ot⁷⁴r ar⁴as. ⁴ SVA approac⁷ conc⁴ntrat⁴s on attr⁸but⁴s o⁵ trut⁷⁵uln⁴ss and asks t⁷⁴ bas⁸c qu⁴st⁸on “W⁷at ⁸s t⁷⁴ sourc⁴ o⁵ t⁷⁸s stat⁴m⁴nt
34
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
o⁴s t⁷⁴ stat⁴m⁴nt d⁴scr⁸b⁴ p⁴rsonal ⁴xp⁴r⁸⁴nc⁴s o⁵ t⁷⁴ w⁸tn⁴ss or do⁴s ⁸t ⁷av⁴ anot⁷⁴r sourc⁴” [Kö⁷nk⁴n, 00]. ⁴r⁴ ar⁴ ⁵our sta⁶⁴s to an SVA analys⁸s () a back⁶round cas⁴-fil⁴ analys⁸s ⁸n w⁷⁸c⁷ ⁷ypot⁷⁴s⁴s about t⁷⁴ sourc⁴ o⁵ t⁷⁴ stat⁴m⁴nt ar⁴ ⁶⁴n⁴rat⁴d, () a s⁴m⁸-structur⁴d ⁸nt⁴rv⁸⁴w, () an analys⁸s o⁵ t⁷⁴ cont⁴nt o⁵ t⁷⁴ ⁸nt⁴rv⁸⁴w, and () an ⁴valuat⁸on o⁵ sta⁶⁴ () bas⁴d on a val⁸d⁸ty c⁷⁴ckl⁸st. Sta⁶⁴ () ⁸s r⁴⁵⁴rr⁴d to as cr⁸t⁴r⁸a-bas⁴d cont⁴nt analys⁸s (A), w⁷⁸c⁷ ⁸s p⁴r⁵orm⁴d by ⁷⁸⁶⁷ly tra⁸n⁴d analysts on transcr⁸pts o⁵ t⁷⁴ ⁸nt⁴rv⁸⁴w. t cons⁸sts o⁵ cr⁸t⁴r⁸a ⁹ud⁶⁴d on a t⁷r⁴⁴po⁸nt scal⁴ “0” ⁸⁵ t⁷⁴ cr⁸t⁴r⁸on ⁸s abs⁴nt, “” ⁸⁵ t⁷⁴ cr⁸t⁴r⁸on ⁸s pr⁴s⁴nt, “” ⁸⁵ t⁷⁴ cr⁸t⁴r⁸on ⁸s stron⁶ly pr⁴s⁴nt. t ⁸s “bas⁴d on t⁷⁴ ⁷ypot⁷⁴s⁸s t⁷at stat⁴m⁴nts bas⁴d on m⁴mory o⁵ on⁴’s own ⁴xp⁴r⁸⁴nc⁴s d⁸ff⁴r ⁸n c⁴rta⁸n cont⁴nt ⁵⁴atur⁴s ⁵rom ⁵abr⁸cat⁴d stat⁴m⁴nts” [Kö⁷nk⁴n, 00]. All t⁷⁴ cr⁸t⁴r⁸a ar⁴ assoc⁸at⁴d w⁸t⁷ trut⁷⁵uln⁴ss ly⁸n⁶ can only b⁴ ⁸n⁵⁴rr⁴d ⁵rom a low scor⁴. ⁴ cr⁸t⁴r⁸a, ⁵rom St⁴ll⁴r and Kö⁷nk⁴n [], w⁸t⁷ ⁴xplanat⁸ons add⁴d ⁸n par⁴nt⁷⁴s⁴s, ar⁴ t⁷⁴ ⁵ollow⁸n⁶. • ⁴n⁴ral c⁷aract⁴r⁸st⁸cs . Lo⁶⁸cal structur⁴ (stat⁴m⁴nt ⁸s co⁷⁴r⁴nt and lo⁶⁸cally cons⁸st⁴nt)
. Unstructur⁴d product⁸on (⁸n⁵ormat⁸on ⁸s pr⁴s⁴nt⁴d ⁸n non-c⁷ronolo⁶⁸cal ord⁴r) . Quant⁸ty o⁵ d⁴ta⁸ls (stat⁴m⁴nt ⁸s r⁸c⁷ ⁸n d⁴ta⁸ls). • Sp⁴c⁸fic cont⁴nts . ont⁴xtual ⁴mb⁴dd⁸n⁶ (⁴v⁴nts ar⁴ plac⁴d ⁸n t⁸m⁴ and locat⁸on)
. ⁴scr⁸pt⁸ons o⁵ ⁸nt⁴ract⁸ons (stat⁴m⁴nt ⁷as ⁸n⁵ormat⁸on t⁷at l⁸nks t⁷⁴ all⁴⁶⁴d p⁴rp⁴trator and w⁸tn⁴ss) . R⁴product⁸on o⁵ conv⁴rsat⁸on (sp⁴c⁸fic d⁸alo⁶u⁴, not summar⁸⁴s o⁵ w⁷at p⁴opl⁴ sa⁸d) . R⁴port⁸n⁶ o⁵ un⁴xp⁴ct⁴d compl⁸cat⁸ons dur⁸n⁶ t⁷⁴ ⁸nc⁸d⁴nt. • P⁴cul⁸ar⁸t⁸⁴s o⁵ cont⁴nt . Unusual d⁴ta⁸ls (tattoos, stutt⁴rs, ⁸nd⁸v⁸dual qu⁸rks)
. Sup⁴rfluous d⁴ta⁸ls (d⁴ta⁸ls t⁷at ar⁴ non-⁴ss⁴nt⁸al to t⁷⁴ all⁴⁶at⁸on)
0. Accurat⁴ly r⁴port⁴d d⁴ta⁸ls m⁸sund⁴rstood (m⁴nt⁸on⁸n⁶ o⁵ d⁴ta⁸ls outs⁸d⁴ a p⁴rson’s scop⁴ o⁵ und⁴rstand⁸n⁶) . R⁴lat⁴d ⁴xt⁴rnal assoc⁸at⁸ons
. Accounts o⁵ sub⁹⁴ct⁸v⁴ m⁴ntal stat⁴ (d⁴scr⁸pt⁸on o⁵ a c⁷an⁶⁴ ⁸n a sub⁹⁴ct’s ⁵⁴⁴l⁸n⁶s dur⁸n⁶ t⁷⁴ ⁸nc⁸d⁴nt) . Attr⁸but⁸on o⁵ p⁴rp⁴trator’s m⁴ntal stat⁴ (w⁸tn⁴ss d⁴scr⁸b⁴s p⁴rp⁴trator’s ⁵⁴⁴l⁸n⁶s).
• Mot⁸vat⁸on r⁴lat⁴d cont⁴nts
2.4. THE FORENSIC LITERATURE
35
. Spontan⁴ous corr⁴ct⁸ons
. Adm⁸tt⁸n⁶ lack o⁵ m⁴mory
. Ra⁸s⁸n⁶ doubts about on⁴’s own t⁴st⁸mony . S⁴l⁵-d⁴pr⁴cat⁸on
. Pardon⁸n⁶ t⁷⁴ p⁴rp⁴trator. • Off⁴nc⁴-sp⁴c⁸fic ⁴l⁴m⁴nts . ⁴ta⁸ls c⁷aract⁴r⁸st⁸c o⁵ t⁷⁴ off⁴nc⁴. ⁴ sub⁹⁴ct⁸v⁴ natur⁴ o⁵ many o⁵ t⁷⁴ A cr⁸t⁴r⁸a ra⁸s⁴s t⁷⁴ qu⁴st⁸on o⁵ ⁸nt⁴r-rat⁴r r⁴l⁸ab⁸l⁸ty. Vr⁸⁹’s 00 SVA r⁴v⁸⁴w ⁵ound t⁷at “Many ⁸nt⁴r-rat⁴r a⁶r⁴⁴m⁴nt rat⁴s w⁴r⁴ abov⁴ ., and ⁸nt⁴r⁴st⁸n⁶ly, all t⁷r⁴⁴ stud⁸⁴s ⁸n w⁷⁸c⁷ ⁸nt⁴r-rat⁴r a⁶r⁴⁴m⁴nt was calculat⁴d ⁵or t⁷⁴ total A scor⁴ ⁵⁴ll ⁸n t⁷⁸s ⁴xc⁴ll⁴nt ran⁶⁴. . . . ⁴s⁴ find⁸n⁶s su⁶⁶⁴st t⁷at total A scor⁴s ar⁴ mor⁴ r⁴l⁸abl⁴ t⁷an scor⁴s ⁵or t⁷⁴ ⁸nd⁸v⁸dual cr⁸t⁴r⁸a” [Vr⁸⁹, 00].
2.4.3 REALITY MONITORING n , Marc⁸a Jo⁷nson and arol Ray⁴ propos⁴d a co⁶n⁸t⁸v⁴ mod⁴l t⁷at sou⁶⁷t to d⁸ff⁴r⁴nt⁸at⁴ t⁷⁴ op⁴rat⁸ons o⁵ t⁷⁴ m⁸nd t⁷at d⁸st⁸n⁶u⁸s⁷ a r⁴m⁴mb⁴r⁴d ⁴xt⁴rnal ⁴xp⁴r⁸⁴nc⁴ ⁵rom a r⁴m⁴mb⁴r⁴d ⁸nt⁴rnal t⁷ou⁶⁷t. Add⁸t⁸onally, t⁷⁴y w⁴r⁴ ⁸nt⁴r⁴st⁴d ⁸n ⁷ow and w⁷y t⁷⁴s⁴ two d⁸ff⁴r⁴nt typ⁴s o⁵ m⁴mory may som⁴t⁸m⁴s b⁴ con⁵us⁴d. ⁴ mod⁴l propos⁴d “⁸m⁴ns⁸ons on w⁷⁸c⁷ t⁷⁴ class⁴s o⁵ ⁴xt⁴rnally ⁶⁴n⁴rat⁴d and ⁸nt⁴rnally ⁶⁴n⁴rat⁴d m⁴mor⁸⁴s typ⁸cally d⁸ff⁴r.” ⁴ mod⁴l pos⁸ts t⁷at ⁴xt⁴rnal m⁴mor⁸⁴s ⁷av⁴ mor⁴ cont⁴xtual attr⁸but⁴s (⁴.⁶., mor⁴ spat⁸al and t⁴mporal ⁸n⁵ormat⁸on), mor⁴ s⁴nsory attr⁸but⁴s, and mor⁴ s⁴mant⁸c d⁴ta⁸l (⁸.⁴., “mor⁴ ⁸n⁵ormat⁸on or mor⁴ sp⁴c⁸fic ⁸n⁵ormat⁸on,”) w⁷⁸l⁴ ⁸nt⁴rnal m⁴mor⁸⁴s ⁸nclud⁴ mor⁴ ⁸n⁵ormat⁸on about co⁶n⁸t⁸v⁴ op⁴rat⁸ons (⁸.⁴., s⁴l⁵-⁶⁴n⁴rat⁴d ⁸n⁵ormat⁸on l⁸k⁴ “S⁷⁴ must ⁷av⁴ b⁴⁴n runn⁸n⁶”) [Jo⁷nson and Ray⁴, ]. ⁴ proc⁴ss⁴s by w⁷⁸c⁷ on⁴ d⁴c⁸d⁴s w⁷⁴t⁷⁴r a m⁴mory ⁷as an ⁴xt⁴rnal or ⁸nt⁴rnal sourc⁴ t⁷⁴y dubb⁴d R⁴al⁸ty Mon⁸tor⁸n⁶ (RM). To prov⁸d⁴ an ⁴mp⁸r⁸cal bas⁸s ⁵or t⁷⁴ mod⁴l, Jo⁷nson d⁴v⁴lop⁴d a -⁸t⁴m M⁴mory ⁷aract⁴r⁸st⁸c Qu⁴st⁸onna⁸r⁴ (MQ) [Jo⁷nson ⁴t al., ]. R⁴s⁴arc⁷⁴rs ⁸n d⁴c⁴pt⁸on ⁵ound RM part⁸cularly app⁴al⁸n⁶ b⁴caus⁴, w⁷⁸l⁴ Stat⁴m⁴nt Val⁸d⁸ty Analys⁸s prov⁸d⁴s a s⁴t o⁵ ⁷⁴ur⁸st⁸c tools, RM’s co⁶n⁸t⁸v⁴ mod⁴l prov⁸d⁴d “a t⁷⁴or⁴t⁸cal bas⁸s conc⁴rn⁸n⁶ w⁷y trut⁷⁵ul stat⁴m⁴nts s⁷ould d⁸ff⁴r ⁵rom ⁸nv⁴nt⁴d accounts” [Spor⁴r, ]. rom a ⁵actor analys⁸s o⁵ t⁷⁴ MQ, Spor⁴r prov⁸d⁴d ⁴⁸⁶⁷t “subscal⁴s,” ⁸nclud⁸n⁶ . clar⁸ty . s⁴nsory ⁴xp⁴r⁸⁴nc⁴s . spat⁸al ⁸n⁵ormat⁸on
36
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
. t⁸m⁴ ⁸n⁵ormat⁸on
. ⁴mot⁸ons and ⁵⁴⁴l⁸n⁶s . r⁴constructab⁸l⁸ty o⁵ t⁷⁴ story (d⁴sp⁸t⁴ compl⁴x⁸ty o⁵ t⁷⁴ act⁸on pr⁴sum⁴d and ⁵actual cons⁴qu⁴nc⁴s and c⁴rta⁸nty/doubts about t⁷⁴ m⁴mory) . r⁴al⁸sm and . co⁶n⁸t⁸v⁴ op⁴rat⁸ons. ⁴ ⁸t⁴ms o⁵ t⁷⁴ scal⁴ “s⁴rv⁴ as t⁷⁴ bas⁸s ⁵or any ⁵urt⁷⁴r analys⁴s,” w⁸t⁷ subscal⁴s ()– () ⁴xp⁴ct⁴d to occur mor⁴ ⁸n trut⁷⁵ul stat⁴m⁴nts, and co⁶n⁸t⁸v⁴ op⁴rat⁸ons mor⁴ ⁸n d⁴c⁴pt⁸v⁴ stat⁴m⁴nts. As w⁸t⁷ A, t⁷⁴ sub⁹⁴ct⁸v⁴ natur⁴ o⁵ t⁷⁴ RM cod⁸n⁶ ⁷as prompt⁴d ⁸nt⁴r-rat⁴r r⁴l⁸ab⁸l⁸ty ass⁴ssm⁴nts. Spor⁴r [00] r⁴ports a⁶r⁴⁴m⁴nt scor⁴s ⁵rom [Strömwall ⁴t al., 00] “abov⁴ r ., ⁴xc⁴pt ⁵or R⁴al⁸sm, ⁵or w⁷⁸c⁷ ⁸t was only r ..” As ⁵ar as t⁷⁴ p⁴r⁵ormanc⁴ o⁵ RM t⁴sts ⁸s conc⁴rn⁴d, Vr⁸⁹ [00] compar⁴s t⁷⁴ r⁴port⁴d accuracy o⁵ 0 stud⁸⁴s t⁷at t⁴st⁴d bot⁷ A and RM. ⁴ total accuracy scor⁴ across t⁷⁴ 0 stud⁸⁴s ⁵or A was ., ⁵or RM .0, and ⁵or t⁷⁴ two comb⁸n⁴d .00, ⁷⁸⁶⁷ ⁴nou⁶⁷ to su⁶⁶⁴st an ⁴mp⁸r⁸cal bas⁸s ⁵or v⁴rbal d⁸ff⁴r⁴nc⁴s b⁴tw⁴⁴n accounts o⁵ ⁴xp⁴r⁸⁴nc⁴d ⁴v⁴nts and ⁸ma⁶⁸n⁴d on⁴s. Conclusion Stat⁴m⁴nt (Val⁸d⁸ty) Ass⁴ssm⁴nt and R⁴al⁸ty Mon⁸tor⁸n⁶ ar⁴ bas⁴d on w⁷at ⁸s r⁴⁵⁴rr⁴d to by St⁴ll⁴r as t⁷⁴ Und⁴utsc⁷ ypot⁷⁴s⁸s, w⁷⁸c⁷ ⁷olds t⁷at “stat⁴m⁴nts w⁷⁸c⁷ ar⁴ bas⁴d on m⁴mor⁸⁴s o⁵ r⁴al (s⁴l⁵-⁴xp⁴r⁸⁴nc⁴d) ⁴v⁴nts ar⁴ d⁸ff⁴r⁴nt ⁸n qual⁸ty ⁵rom stat⁴m⁴nts w⁷⁸c⁷ ar⁴ not bas⁴d on ⁴xp⁴r⁸⁴nc⁴ but ar⁴ m⁴r⁴ products o⁵ ⁵antasy” [St⁴ll⁴r, ]. ⁴s⁴ approac⁷⁴s prov⁸d⁴ attr⁸but⁴s, som⁴t⁸m⁴s o⁵ a som⁴w⁷at sub⁹⁴ct⁸v⁴ natur⁴, o⁵ v⁴rbal narrat⁸v⁴s t⁷at may b⁴ wort⁷ cons⁸d⁴r⁸n⁶ ⁸n att⁴mpts to d⁸st⁸n⁶u⁸s⁷ trut⁷⁵ul ⁵rom d⁴c⁴pt⁸v⁴ accounts automat⁸cally. R⁴al⁸ty Mon⁸tor⁸n⁶ also prov⁸d⁴s an ⁴xplanat⁸on o⁵ w⁷y t⁷⁴s⁴ accounts m⁸⁶⁷t b⁴ d⁸ff⁴r⁴nt. ⁴ ⁴xplanat⁸on ⁸s w⁴ll wort⁷ cons⁸d⁴r⁸n⁶ ⁸n l⁸⁶⁷t o⁵ r⁴c⁴nt work by Ott ⁴t al. [0] t⁷at finds a r⁴lat⁸ons⁷⁸p b⁴tw⁴⁴n d⁴c⁴pt⁸v⁴ op⁸n⁸on spam and ⁸ma⁶⁸nat⁸v⁴ wr⁸t⁸n⁶, a top⁸c w⁴ addr⁴ss ⁸n ⁷apt⁴r .
2.5
FORENSIC IMPLEMENTATIONS OF THE LITERATURE
A numb⁴r o⁵ lan⁶ua⁶⁴-bas⁴d m⁴t⁷ods ⁷av⁴ b⁴⁴n propos⁴d ⁵or ass⁴ss⁸n⁶ t⁷⁴ l⁸k⁴l⁸⁷ood o⁵ trut⁷ and d⁴c⁴pt⁸on ⁸n wr⁸tt⁴n and spok⁴n stat⁴m⁴nts by susp⁴cts, v⁸ct⁸ms, and w⁸tn⁴ss⁴s. ⁴s⁴ ar⁴ ⁸mpl⁴m⁴ntat⁸ons t⁷at draw upon ⁴mp⁸r⁸cal obs⁴rvat⁸ons o⁵ ⁵or⁴ns⁸c pract⁸t⁸on⁴rs as w⁴ll as t⁷⁴ m⁴t⁷ods or⁸⁶⁸nally ⁵ormulat⁴d ⁵or A. ⁴y ar⁴ not ⁸nt⁴nd⁴d to prov⁸d⁴ or ⁴v⁴n ⁵avor a t⁷⁴or⁴t⁸cal ⁵ram⁴work b⁴yond t⁷⁴ ⁴ss⁴nt⁸al Und⁴utsc⁷ ⁷ypot⁷⁴s⁸s. Accord⁸n⁶ to Adams [], all v⁴rs⁸ons mak⁴ two cr⁸t⁸cal assumpt⁸ons (⁸) lan⁶ua⁶⁴ can prov⁸d⁴ ⁸n⁵ormat⁸on about t⁷⁴ v⁴rac⁸ty
2.5. FORENSIC IMPLEMENTATIONS OF THE LITERATURE
37
o⁵ a narrat⁸v⁴ ⁸nd⁴p⁴nd⁴nt o⁵ ⁵actual cont⁴nt and (⁸⁸) ⁸t ⁸s poss⁸bl⁴ to d⁴scr⁸b⁴ proc⁴dur⁴s ⁵or analyz⁸n⁶ l⁸n⁶u⁸st⁸c cu⁴s so t⁷at p⁴opl⁴ w⁷o ar⁴ tau⁶⁷t t⁷⁴ proc⁴dur⁴s can b⁴com⁴ mor⁴ profic⁸⁴nt at d⁸scr⁸m⁸nat⁸n⁶ trut⁷ ⁵rom d⁴c⁴pt⁸on. S⁴v⁴ral t⁴c⁷n⁸qu⁴s ⁷av⁴ b⁴⁴n d⁴v⁴lop⁴d pr⁸mar⁸ly ⁵or us⁴ by law ⁴n⁵orc⁴m⁴nt and r⁴lat⁴d pract⁸t⁸on⁴rs. ⁴ b⁴st known ⁸s Sc⁸⁴nt⁸fic ont⁴nt Analys⁸s (SAN) d⁴v⁴lop⁴d and mark⁴t⁴d by t⁷⁴ Laboratory ⁵or Sc⁸⁴nt⁸fic nt⁴rro⁶at⁸on (LS, www.lsiscan.com).
2.5.1 SCAN AS AN INVESTIGATIVE TOOL AND TRAINING PROGRAM SAN and ot⁷⁴r stat⁴m⁴nt analys⁸s syst⁴ms ar⁴ us⁴d and tau⁶⁷t by pract⁸t⁸on⁴rs t⁷rou⁶⁷out t⁷⁴ world accord⁸n⁶ to Vr⁸⁹ [00]. ⁴⁸r popular⁸ty sp⁴aks to a stron⁶ n⁴⁴d ⁵or syst⁴ms o⁵ v⁴rac⁸ty ass⁴ssm⁴nt t⁷at ar⁴ bot⁷ r⁴l⁸abl⁴ and pract⁸cal. Y⁴t ⁵⁴w r⁴s⁴arc⁷⁴rs ⁷av⁴ att⁴mpt⁴d to conduct t⁷⁴ stud⁸⁴s t⁷at would val⁸dat⁴ or d⁸sprov⁴ t⁷⁴ cla⁸ms o⁵ SAN and ot⁷⁴r stat⁴m⁴nt analys⁸s m⁴t⁷ods. Port⁴r and Yu⁸ll⁴ [] ar⁴ amon⁶ t⁷⁴ first to ⁴mp⁷as⁸z⁴ t⁷⁴ n⁴⁴d ⁵or ⁴mp⁸r⁸cal ⁴v⁸d⁴nc⁴ t⁷at can ⁹ust⁸⁵y t⁷⁴ r⁴l⁸anc⁴ on l⁸n⁶u⁸st⁸c cu⁴s ⁸n ⁵or⁴ns⁸c appl⁸cat⁸ons “mp⁸r⁸cal stud⁸⁴s ⁷ad b⁴⁴n conduct⁴d only on t⁷⁴ ut⁸l⁸ty o⁵ som⁴ o⁵ t⁷⁴s⁴ clu⁴s w⁸t⁷ w⁸tn⁴ss⁴s or v⁸ct⁸ms (or analo⁶s o⁵ t⁷⁴s⁴ populat⁸ons) or un⁸nvolv⁴d und⁴r⁶raduat⁴s. As ⁵or cr⁸m⁴ susp⁴cts, t⁷⁴ ⁴v⁸d⁴nc⁴ or⁸⁶⁸nat⁴s only ⁵rom an⁴cdot⁴s or t⁷⁴ unm⁴r⁸t⁴d assumpt⁸on t⁷at t⁷⁴ find⁸n⁶s w⁸t⁷ w⁸tn⁴ss⁴s/v⁸ct⁸ms w⁴r⁴ ⁶⁴n⁴ral⁸zabl⁴ to t⁷⁸s populat⁸on. ⁸v⁴n t⁷at many pol⁸c⁴ ⁵orc⁴s ar⁴ b⁴⁸n⁶ tra⁸n⁴d ⁸n ⁷ow to ⁸d⁴nt⁸⁵y d⁴c⁴pt⁸v⁴ stat⁴m⁴nts bas⁴d on c⁴rta⁸n v⁴rbal clu⁴s, ⁸t ⁸s ⁸mportant t⁷at t⁷⁴ val⁸d⁸ty o⁵ t⁷⁴s⁴ clu⁴s b⁴ ⁴stabl⁸s⁷⁴d or n⁴⁶at⁴d” [Port⁴r and Yu⁸ll⁴, , p. ]. A ⁷and⁵ul o⁵ stud⁸⁴s ⁷av⁴ ⁴xam⁸n⁴d SAN w⁸t⁷ su⁶⁶⁴st⁸v⁴ but ⁸ncons⁸st⁴nt r⁴sults. Two stud⁸⁴s, r⁸scoll [] and Sm⁸t⁷ [00], ar⁴ a⁸m⁴d at ⁴valuat⁸n⁶ SAN as a fi⁴ld m⁴t⁷od ⁵or v⁴rac⁸ty ass⁴ssm⁴nt and as a tra⁸n⁸n⁶ m⁴t⁷od, r⁴sp⁴ct⁸v⁴ly. Adams and Jarv⁸s [00] ⁴xam⁸n⁴ t⁷⁴ p⁴r⁵ormanc⁴ o⁵ a subs⁴t o⁵ SAN cr⁸t⁴r⁸a on t⁷⁴⁸r ⁵or⁴ns⁸c data. Port⁴r and Yu⁸ll⁴ [] and Na⁷ar⁸ ⁴t al. [0] ⁴xam⁸n⁴ SAN’s p⁴r⁵ormanc⁴ on laboratory data ⁸n contrast w⁸t⁷ t⁷⁴ t⁷⁴or⁴t⁸cally ⁶round⁴d approac⁷⁴s o⁵ R⁴al⁸ty Mon⁸tor⁸n⁶ and A. W⁸t⁷ t⁷⁴ ⁴xc⁴pt⁸on o⁵ Port⁴r and Yu⁸ll⁴, w⁷os⁴ data sourc⁴ ⁸s spok⁴n ⁸nt⁴rv⁸⁴ws, t⁷⁴ SAN stud⁸⁴s r⁴ly on wr⁸tt⁴n stat⁴m⁴nts d⁴scr⁸b⁸n⁶ t⁷⁴ ⁸nc⁸d⁴nt und⁴r ⁸nv⁴st⁸⁶at⁸on. ⁸s ⁸s ⁸n k⁴⁴p⁸n⁶ w⁸t⁷ SAN’s pr⁴⁵⁴r⁴nc⁴ ⁵or wr⁸tt⁴n stat⁴m⁴nts pr⁴par⁴d w⁸t⁷out t⁷⁴ ⁸nflu⁴nc⁴ o⁵ a qu⁴st⁸on⁴r.² ⁴ stud⁸⁴s ⁷av⁴ l⁸ttl⁴ ⁸n common ot⁷⁴r t⁷an t⁷⁴⁸r ⁵ocus on SAN cr⁸t⁴r⁸a and prov⁸d⁴ only mod⁴rat⁴ support ⁵or SAN’s cla⁸ms at b⁴st. ow⁴v⁴r, t⁷⁴ cas⁴ ⁵or or a⁶a⁸nst SAN ⁸s ⁵ar ⁵rom clos⁴d. 2.5.2 EVALUATIONS OF SCAN r⁸scoll [] ⁷as two ⁶oals (⁸) to ⁴valuat⁴ LS’s cla⁸m t⁷at SAN can d⁸scr⁸m⁸nat⁴ b⁴tw⁴⁴n stat⁴m⁴nts t⁷at ar⁴ l⁸k⁴ly tru⁴ and t⁷os⁴ t⁷at ar⁴ l⁸k⁴ly ⁵als⁴ and (⁸⁸) to propos⁴ a quant⁸tat⁸v⁴ ⁴valuat⁸on m⁴asur⁴ t⁷at ⁸s s⁸m⁸lar to t⁷⁴ scor⁸n⁶ syst⁴m us⁴d ⁸n A and t⁷at would prov⁸d⁴ mor⁴ ⁸n⁵ormat⁸v⁴ and cons⁸st⁴nt r⁴sults t⁷an t⁷⁴ qual⁸tat⁸v⁴ ⁴valuat⁸on typ⁸cally us⁴d ⁸n SAN ass⁴ssm⁴nts. ⁴ study ⁴xam⁸n⁴s t⁴n SAN cr⁸t⁴r⁸a, l⁸st⁴d ⁸n r⁸scoll’s App⁴nd⁸x, t⁷at ar⁴ ⁴mp⁷as⁸z⁴d ⁸n SAN tra⁸n⁸n⁶ works⁷ops. ⁴ data s⁴t cons⁸sts o⁵ wr⁸tt⁴n stat⁴m⁴nts t⁷at w⁴r⁴ voluntar⁸ly ²Ot⁷⁴r syst⁴ms, ⁴.⁶., Rabon [] and lark [00], ar⁴ ⁸nt⁴nd⁴d ⁵or us⁴ ⁸n spok⁴n ⁸nt⁴rv⁸⁴ws as w⁴ll as wr⁸tt⁴n stat⁴m⁴nts.
38
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
Table 2.2: Outcom⁴ o⁵ SAN ass⁴ssm⁴nt adapt⁴d ⁵rom r⁸scoll [, p. ]
l⁸k⁴ly trut⁷⁵ul (AAS) l⁸k⁴ly ⁵als⁴ (S)
Pos⁸t⁸v⁴ Scor⁴s
N⁴⁶at⁸v⁴ Scor⁴s
Total
obta⁸n⁴d ⁵rom 0 susp⁴cts ⁹ust b⁴⁵or⁴ tak⁸n⁶ a poly⁶rap⁷ t⁴st. ⁴ stat⁴m⁴nts w⁴r⁴ scor⁴d us⁸n⁶ a -po⁸nt scal⁴ ( , , 0, +, +) to ⁸nd⁸cat⁴ t⁷⁴ pr⁴s⁴nc⁴ or abs⁴nc⁴ o⁵ a SAN cr⁸t⁴r⁸on. Pos⁸t⁸v⁴ valu⁴s ⁸nd⁸cat⁴d trut⁷⁵uln⁴ss, n⁴⁶at⁸v⁴ valu⁴s ⁸nd⁸cat⁴d d⁴c⁴pt⁸on, and z⁴ro ⁸nd⁸cat⁴d t⁷at a cr⁸t⁴r⁸on was not pr⁴s⁴nt ⁸n t⁷⁴ stat⁴m⁴nt. r⁸scoll ⁴xpla⁸ns “or ⁴xampl⁴, ⁸⁵ t⁷⁴ sub⁹⁴ct mad⁴ c⁷an⁶⁴s ⁸n t⁷⁴ lan⁶ua⁶⁴ ⁸n t⁷⁴ stat⁴m⁴nt, a could b⁴ ass⁸⁶n⁴d as t⁷⁴ scor⁴ ⁵or t⁷at cr⁸t⁴r⁸on. ut ⁸⁵ no c⁷an⁶⁴s ⁸n t⁷⁴ lan⁶ua⁶⁴ w⁴r⁴ obs⁴rv⁴d ⁸n a part⁸cular stat⁴m⁴nt, t⁷⁴ ass⁸⁶n⁴d scor⁴ would b⁴ a + or +, d⁴p⁴nd⁸n⁶ on t⁷⁴ l⁴n⁶t⁷ o⁵ t⁷⁴ stat⁴m⁴nt.” [r⁸scoll, , p. ]. ⁴ scor⁸n⁶ r⁴sults s⁷ow⁴d a s⁸⁶n⁸ficant d⁸ff⁴r⁴nc⁴ b⁴tw⁴⁴n l⁸k⁴ly tru⁴ (AAS) and l⁸k⁴ly ⁵als⁴ (S) stat⁴m⁴nts as Tabl⁴ . s⁷ows. Alt⁷ou⁶⁷ t⁷⁴s⁴ r⁴sults prov⁸d⁴ som⁴ support ⁵or r⁸scoll’s approac⁷, only on⁴ cr⁸t⁴r⁸on stands out w⁷⁴n t⁷⁴ r⁴sults ar⁴ brok⁴n down by ⁸nd⁸cator typ⁴ d⁴n⁸al o⁵ all⁴⁶at⁸ons and lack o⁵ d⁴n⁸al ar⁴ t⁷⁴ stron⁶⁴st ⁸nd⁸cators o⁵ trut⁷⁵uln⁴ss and d⁴c⁴pt⁸on, r⁴sp⁴ct⁸v⁴ly. As r⁸scoll obs⁴rv⁴s, t⁷⁴s⁴ r⁴sults s⁷ould only b⁴ cons⁸d⁴r⁴d su⁶⁶⁴st⁸v⁴. ⁴ stat⁴m⁴nts r⁴pr⁴s⁴nt a small sampl⁴ l⁸m⁸t⁴d to t⁷⁴ fil⁴s o⁵ a s⁸n⁶l⁴ poly⁶rap⁷⁴r and SAN ass⁴ssm⁴nts w⁴r⁴ p⁴r⁵orm⁴d by a s⁸n⁶l⁴ rat⁴r. t also app⁴ars ⁵rom a conclud⁸n⁶ comm⁴nt t⁷at t⁷⁴ ⁶round trut⁷ ass⁴ssm⁴nts w⁴r⁴ known by t⁷⁴ rat⁴r ass⁸⁶n⁸n⁶ SAN cr⁸t⁴r⁸a ass⁴ssm⁴nts “utur⁴ r⁴s⁴arc⁷ w⁸ll n⁴⁴d to ⁸nclud⁴ a bl⁸nd analys⁸s o⁵ t⁷⁴ stat⁴m⁴nts, w⁷⁴r⁴ t⁷⁴ ⁴valuator ⁷as no knowl⁴d⁶⁴ conc⁴rn⁸n⁶ t⁷⁴ cas⁴s” [r⁸scoll, , p. ]. Adams and Jarv⁸s [00] t⁴st s⁸x cr⁸t⁴r⁸a c⁸t⁴d ⁸n stat⁴m⁴nt analys⁸s appl⁸cat⁸ons and r⁴s⁴arc⁷. A subs⁴t o⁵ t⁷⁴s⁴ ar⁴ us⁴d ⁸n SAN ⁴qu⁸vocat⁸on, ⁴mot⁸ons, and narrat⁸v⁴ balanc⁴. n t⁷⁴ l⁸t⁴ratur⁴ and SAN docum⁴ntat⁸on (LS SAN works⁷op, 00) ⁴qu⁸vocat⁸on ⁸s usually r⁴⁵⁴rr⁴d to as unc⁴rta⁸nty or ⁷⁴d⁶⁸n⁶, ⁴.⁶., I think, kind of, to the best of my knowledge. ⁴ ⁴mot⁸ons cr⁸t⁴r⁸on r⁴⁵⁴rs to t⁷⁴ numb⁴r and locat⁸on o⁵ ⁴mot⁸on words. W⁷⁴n t⁷⁴s⁴ words occur ⁸n t⁷⁴ b⁴⁶⁸nn⁸n⁶ o⁵ a narrat⁸v⁴, t⁷⁴y ⁸nd⁸cat⁴ a ⁵abr⁸cat⁴d account t⁷⁴ pr⁴s⁴nc⁴ o⁵ ⁴mot⁸on words ⁸n t⁷⁴ d⁴scr⁸pt⁸on o⁵ t⁷⁴ ⁸nc⁸d⁴nt and ⁸ts ⁴p⁸lo⁶u⁴ ⁸s cons⁸st⁴nt w⁸t⁷ trut⁷⁵uln⁴ss. Narrat⁸v⁴ balanc⁴ ⁸s an ⁸mportant ⁴l⁴m⁴nt o⁵ SAN analys⁸s. t r⁴⁵⁴rs to t⁷⁴ structur⁸n⁶ o⁵ t⁷⁴ narrat⁸v⁴ ⁸nto t⁷r⁴⁴ parts—prolo⁶u⁴, ⁸nc⁸d⁴nt, and ⁴p⁸lo⁶u⁴—w⁷⁴r⁴ trut⁷⁵uln⁴ss ⁸s ⁸nd⁸cat⁴d w⁷⁴n t⁷⁴ ⁸nc⁸d⁴nt d⁴scr⁸pt⁸on mak⁴s up around ⁷al⁵ o⁵ t⁷⁴ narrat⁸v⁴. Accord⁸n⁶ to SAN, a l⁴n⁶t⁷y prolo⁶u⁴ can o⁵t⁴n b⁴ assoc⁸at⁴d w⁸t⁷ d⁴c⁴pt⁸on. Adams and Jarv⁸s ⁵ound a stron⁶ r⁴lat⁸on b⁴tw⁴⁴n stat⁴m⁴nt structur⁴ and d⁴c⁴pt⁸on— narrat⁸v⁴s ⁸n w⁷⁸c⁷ t⁷⁴ prolo⁶u⁴ ⁸s l⁴n⁶t⁷y by compar⁸son w⁸t⁷ t⁷⁴ ⁸nc⁸d⁴nt and ⁴p⁸lo⁶u⁴ s⁴ct⁸ons ⁸s mor⁴ l⁸k⁴ly to b⁴ a ⁵abr⁸cat⁸on. ⁴y also ⁸d⁴nt⁸fi⁴d a pos⁸t⁸v⁴ r⁴lat⁸on b⁴tw⁴⁴n ⁴qu⁸vocat⁸on and d⁴c⁴pt⁸on. W⁸t⁷ r⁴sp⁴ct to ⁴mot⁸ons, t⁷⁴ r⁴sults s⁷ow⁴d “a m⁸ld pos⁸t⁸v⁴ r⁴lat⁸ons⁷⁸p b⁴-
2.5. FORENSIC IMPLEMENTATIONS OF THE LITERATURE
39
tw⁴⁴n v⁴rac⁸ty and d⁴ns⁸ty o⁵ ⁴mot⁸ons ⁸n t⁷⁴ ⁴p⁸lo⁶u⁴ part⁸t⁸ons o⁵ t⁷⁴ ⁴xam⁸n⁴d stat⁴m⁴nts. No r⁴lat⁸ons⁷⁸ps w⁴r⁴ ⁵ound b⁴tw⁴⁴n v⁴rac⁸ty and ⁴mot⁸ons ⁸n t⁷⁴ r⁴ma⁸n⁸n⁶ s⁴ct⁸ons [Adams and Jarv⁸s, 00, p. ]. ⁴s⁴ r⁴sults would s⁴⁴m to prov⁸d⁴ ⁴mp⁸r⁸cal support ⁵or SAN’s cla⁸ms about t⁷⁴ s⁸⁶n⁸ficanc⁴ o⁵ narrat⁸v⁴ balanc⁴, ⁴qu⁸vocat⁸on, and t⁷⁴ locat⁸on o⁵ ⁴mot⁸ons. ow⁴v⁴r, som⁴ qu⁴st⁸ons r⁴ma⁸n as to ⁴xactly ⁷ow t⁷⁴ r⁴s⁴arc⁷⁴rs mana⁶⁴d t⁷⁴ ass⁸⁶nm⁴nt o⁵ ⁶round trut⁷ and t⁷⁴ ass⁴ssm⁴nt o⁵ d⁴c⁴pt⁸on/v⁴rac⁸ty cat⁴⁶or⁸⁴s and w⁷⁴t⁷⁴r t⁷⁴s⁴ tasks w⁴r⁴ p⁴r⁵orm⁴d by d⁸ff⁴r⁴nt p⁴opl⁴ [Adams and Jarv⁸s, 00, p. ]. ⁸nally, as a v⁸s⁸t to t⁷⁴ LS w⁴bs⁸t⁴ w⁸ll s⁷ow, SAN’s pr⁸mary act⁸v⁸ty ⁸s t⁷⁴ mark⁴t⁸n⁶ o⁵ SAN tra⁸n⁸n⁶. Works⁷ops on SAN’s approac⁷ and m⁴t⁷ods ar⁴ conduct⁴d by SAN ⁴xp⁴rts—o⁵t⁴n law ⁴n⁵orc⁴m⁴nt pro⁵⁴ss⁸onals—and off⁴r⁴d ⁸n a s⁴r⁸⁴s o⁵ pro⁶r⁴ss⁸v⁴ly mor⁴ compl⁴x tra⁸n⁸n⁶ s⁴ss⁸ons. ⁴ ⁸n⁸t⁸al works⁷op ⁸s a two- to t⁷r⁴⁴-day cours⁴ ⁵ocus⁸n⁶ on t⁷⁴ SAN cr⁸t⁴r⁸a and t⁷⁴⁸r appl⁸cat⁸on to wr⁸tt⁴n cr⁸m⁸nal stat⁴m⁴nts. Sm⁸t⁷ [00] conduct⁴d an ⁴valuat⁸on o⁵ SAN tra⁸n⁸n⁶ ⁸n ord⁴r to d⁴t⁴rm⁸n⁴ w⁷⁴t⁷⁴r SAN s⁷ould b⁴ us⁴d ⁸n ⁴ducat⁸n⁶ r⁸t⁸s⁷ pol⁸c⁴ d⁴t⁴ct⁸v⁴s. Sub⁹⁴cts ⁵or t⁷⁴ study w⁴r⁴ pol⁸c⁴ offic⁴rs t⁷at ⁵⁴ll ⁸nto t⁷r⁴⁴ ⁶⁴n⁴ral cat⁴⁶or⁸⁴s— (⁸) offic⁴rs w⁸t⁷ som⁴ knowl⁴d⁶⁴ o⁵ SAN (⁸⁸) ⁴xp⁴r⁸⁴nc⁴d d⁴t⁴ct⁸v⁴s w⁸t⁷ no knowl⁴d⁶⁴ o⁵ SAN and (⁸⁸⁸) n⁴wly r⁴cru⁸t⁴d offic⁴rs w⁸t⁷ no ⁴xposur⁴ to SAN. Sm⁸t⁷’s [00] study s⁷ow⁴d t⁷at ⁶roup (⁸) (offic⁴rs w⁸t⁷ ⁴xposur⁴ to SAN) and ⁶roup (⁸⁸) (⁴xp⁴r⁸⁴nc⁴d d⁴t⁴ct⁸v⁴s w⁷o r⁴l⁸⁴d on ⁸ntu⁸t⁸on) bot⁷ p⁴r⁵orm⁴d s⁸⁶n⁸ficantly b⁴tt⁴r ⁸n ⁸d⁴nt⁸⁵y⁸n⁶ d⁴c⁴pt⁸on t⁷an t⁷⁴ nov⁸c⁴ offic⁴rs w⁷o ⁷ad no SAN tra⁸n⁸n⁶ and r⁴l⁸⁴d on ⁸ntu⁸t⁸on (⁶roup ⁸⁸⁸). W⁷⁸l⁴ prov⁸d⁸n⁶ mod⁴st support ⁵or LSs cla⁸m t⁷at SAN ⁸mprov⁴s d⁴t⁴ct⁸v⁴s ab⁸l⁸ty to ass⁴ss v⁴rac⁸ty, t⁷⁴s⁴ r⁴sults w⁴r⁴ v⁸⁴w⁴d by Sm⁸t⁷ [00] as ⁸nsuffic⁸⁴nt to ⁹ust⁸⁵y t⁷⁴ ⁸nv⁴stm⁴nt ⁸n a nat⁸onal tra⁸n⁸n⁶ pro⁶ram.
Laboratory Tests of SCAN Criteria
Two laboratory stud⁸⁴s, Port⁴r and Yu⁸ll⁴ [] and Na⁷ar⁸ ⁴t al. [0], ⁴xam⁸n⁴ t⁷⁴ ⁴ff⁴ct⁸v⁴n⁴ss o⁵ SAN cr⁸t⁴r⁸a ⁸n compar⁸son to t⁷⁴or⁴t⁸cally mot⁸vat⁴d ⁸nd⁸cators. Port⁴r and Yu⁸ll⁴ d⁴scr⁸b⁴ a cr⁸m⁴ s⁸mulat⁸on study ⁸n w⁷⁸c⁷ sub⁹⁴cts cr⁴at⁴d trut⁷⁵ul accounts or al⁸b⁸s accord⁸n⁶ to t⁷⁴⁸r rol⁴ ⁸n t⁷⁴ ⁴xp⁴r⁸m⁴nt. or t⁷⁴⁸r ⁴valuat⁸on, t⁷⁴ aut⁷ors s⁴l⁴ct⁴d a subs⁴t o⁵ cr⁸t⁴r⁸a ⁵rom A, RM, and SAN. ⁴y ⁵ound t⁷at all m⁴t⁷ods d⁸d poorly. Only t⁷r⁴⁴ ⁸nd⁸cators, all ⁵rom A and all cons⁸st⁴nt w⁸t⁷ trut⁷⁵uln⁴ss, p⁴r⁵orm⁴d w⁴ll amount o⁵ d⁴ta⁸l, co⁷⁴r⁴nc⁴, and lack o⁵ m⁴mory. N⁴⁸t⁷⁴r SAN nor RM s⁷ow⁴d any us⁴⁵uln⁴ss ⁵or ass⁴ss⁸n⁶ v⁴rac⁸ty ⁸n t⁷⁸s study. ow⁴v⁴r, t⁷⁴r⁴ may b⁴ r⁴asons ot⁷⁴r t⁷an doubt⁵ul cr⁸t⁴r⁸a to ⁴xpla⁸n t⁷⁴ poor p⁴r⁵ormanc⁴ o⁵ RM and SAN. Port⁴r and Yu⁸ll⁴ [] po⁸nt out t⁷at t⁷⁴ two A cod⁴rs ⁸n t⁷⁴⁸r study w⁴r⁴ car⁴⁵ully tra⁸n⁴d, r⁴c⁴⁸v⁸n⁶ “⁴xt⁴ns⁸v⁴ -day tra⁸n⁸n⁶” (p. ) ⁸n A ta⁶⁶⁸n⁶. ⁴r⁴ ⁸s no r⁴⁵⁴r⁴nc⁴ to tra⁸n⁸n⁶ or ot⁷⁴r pr⁴parat⁸on ⁵or RM and SAN ta⁶⁶⁸n⁶. RM cr⁸t⁴r⁸a w⁴r⁴ scor⁴d bas⁴d on automat⁴d ⁵r⁴qu⁴ncy counts o⁵ ⁷⁴d⁶⁴s (I believe, It seems to me, ⁴tc.), counts o⁵ s⁴l⁵r⁴⁵⁴r⁴nc⁴s (I; me; mi ne; my; myself ), and t⁷⁴ word count o⁵ t⁷⁴ ⁴nt⁸r⁴ narrat⁸v⁴. SAN cr⁸t⁴r⁸a w⁴r⁴ scor⁴d bas⁴d on automat⁴d ⁵r⁴qu⁴ncy counts o⁵ “unn⁴c⁴ssary conn⁴ctors” (afterwards, the next thing I remember, ⁴tc.), a count o⁵ d⁴v⁸at⁸ons ⁵rom us⁴ o⁵ t⁷⁴ first p⁴rson s⁸n⁶ular pronoun
40
2. THE BACKGROUND LITERATURE ON BEHAVIORAL CUES TO DECEPTION
I (⁸t ⁸s uncl⁴ar ⁸⁵ t⁷⁸s was conduct⁴d manually or automat⁸cally) and stat⁴m⁴nt balanc⁴ (r⁴lat⁸v⁴ l⁴n⁶t⁷s o⁵ prolo⁶u⁴ and ⁴p⁸lo⁶u⁴). n t⁷⁴ abs⁴nc⁴ o⁵ an ⁴xpl⁸c⁸t d⁴scr⁸pt⁸on, ⁸t s⁴⁴ms l⁸k⁴ly t⁷at all o⁵ t⁷⁴ automat⁴d ass⁸⁶nm⁴nts w⁴r⁴ don⁴ w⁸t⁷ k⁴y word and k⁴y p⁷ras⁴ s⁴arc⁷⁴s. Pronoun d⁴v⁸at⁸ons may also ⁷av⁴ b⁴⁴n computat⁸onally ass⁸⁶n⁴d but d⁴ta⁸ls about t⁷⁴ pro⁶ram or ⁷uman ⁸nvolv⁴m⁴nt ar⁴ not prov⁸d⁴d. ⁸s l⁴av⁴s op⁴n t⁷⁴ qu⁴st⁸on as to w⁷⁴t⁷⁴r t⁷⁴ pro⁶ram or a tra⁸n⁴d ⁷uman rat⁴r ⁸d⁴nt⁸fi⁴d firstp⁴rson pronoun d⁴v⁸at⁸ons. A s⁷⁸⁵t ⁵rom “” to a d⁸ff⁴r⁴nt pronoun may ⁸nd⁸cat⁴ d⁴c⁴pt⁸on, as t⁷⁴ aut⁷ors obs⁴rv⁴, or ⁸t may s⁸mply s⁴rv⁴ to advanc⁴ t⁷⁴ narrat⁸v⁴. or ⁴xampl⁴, ⁸⁵ t⁷⁴ sp⁴ak⁴r says “ ask⁴d ⁷⁸m w⁷⁴r⁴ ⁷⁴ was y⁴st⁴rday morn⁸n⁶” and cont⁸nu⁴s w⁸t⁷ “⁴ r⁴pl⁸⁴d t⁷at ⁷⁴ was at ⁷om⁴,” t⁷⁴ pronoun c⁷an⁶⁴ ⁵rom “” to “⁷⁴” fits t⁷⁴ narrat⁸v⁴ flow. Tra⁸n⁴d ⁷uman rat⁴rs may r⁴co⁶n⁸z⁴ t⁷⁴s⁴ d⁸ff⁴r⁴nt ⁵unct⁸ons o⁵ pronoun c⁷an⁶⁴ but ⁸t ⁸s unl⁸k⁴ly t⁷at a k⁴yword-bas⁴d syst⁴m could do so. n summary, t⁷⁴ lack o⁵ suffic⁸⁴nt tra⁸n⁸n⁶ and t⁷⁴ poss⁸bl⁴ om⁸ss⁸on o⁵ cont⁴xt or cr⁸t⁸cal words ⁵rom automat⁴d k⁴y word/k⁴y p⁷ras⁴ s⁴arc⁷⁴s could ⁷av⁴ aff⁴ct⁴d t⁷⁴ RM and SAN scor⁴s. ⁴ aut⁷ors su⁶⁶⁴st as muc⁷ “t s⁷ould b⁴ not⁴d t⁷at non⁴ o⁵ t⁷⁴ comput⁴r-⁶⁴n⁴rat⁴d m⁴asur⁴s d⁸scr⁸m⁸nat⁴d amon⁶ t⁷⁴ ⁶roups. t ⁸s, o⁵ cours⁴, poss⁸bl⁴ t⁷at a computat⁸onal approac⁷ (comput⁴r or manually ⁶⁴n⁴rat⁴d) was not a val⁸d on⁴ ⁸n tapp⁸n⁶ t⁷⁴ var⁸ous var⁸abl⁴s b⁴⁸n⁶ ⁴xam⁸n⁴d” [Port⁴r and Yu⁸ll⁴, , p. 0]. t may b⁴ ⁵a⁸r to say t⁷at t⁷⁴ r⁴sults r⁴port⁴d ⁸n Port⁴r and Yu⁸ll⁴ prov⁸d⁴ support ⁵or t⁷⁴ w⁸nn⁸n⁶ A cr⁸t⁴r⁸a but ar⁴ ⁸nconclus⁸v⁴ w⁸t⁷ r⁴sp⁴ct to t⁷⁴ SAN and RM cr⁸t⁴r⁸a. Na⁷ar⁸ ⁴t al. [0] compar⁴ t⁷⁴ p⁴r⁵ormanc⁴ o⁵ a lar⁶⁴r s⁴t o⁵ SAN and RM cr⁸t⁴r⁸a ⁸n a mock t⁷⁴⁵t sc⁴nar⁸o. Sub⁹⁴cts wrot⁴ narrat⁸v⁴ accounts t⁷at w⁴r⁴ lat⁴r cod⁴d us⁸n⁶ SAN cr⁸t⁴r⁸a ⁵rom Vr⁸⁹ [00] and RM cr⁸t⁴r⁸a ⁵rom Spor⁴r [00]. u⁴ ta⁶⁶⁸n⁶ was p⁴r⁵orm⁴d by cod⁴rs, ⁴ac⁷ o⁵ w⁷om r⁴c⁴⁸v⁴d 0 m⁸n o⁵ tra⁸n⁸n⁶ ⁸n SAN and RM cr⁸t⁴r⁸a. ac⁷ narrat⁸v⁴ account was ta⁶⁶⁴d by two cod⁴rs and d⁸sa⁶r⁴⁴m⁴nts as to w⁷⁴t⁷⁴r an ⁸nd⁸cator was pr⁴s⁴nt or not w⁴r⁴ r⁴⁵⁴rr⁴d to a t⁷⁸rd cod⁴r w⁷o mad⁴ t⁷⁴ final d⁴c⁸s⁸on. Low ⁸nt⁴r-rat⁴r r⁴l⁸ab⁸l⁸ty mot⁸vat⁴d t⁷⁴ ⁴xp⁴r⁸m⁴nt⁴rs to adopt a s⁸mpl⁸fi⁴d ta⁶⁶⁸n⁶ sc⁷⁴m⁴ ⁸n w⁷⁸c⁷ cu⁴s w⁴r⁴ d⁴⁴m⁴d pr⁴s⁴nt or abs⁴nt but ⁸n⁵ormat⁸on about ta⁶ ⁵r⁴qu⁴ncy, w⁷⁸c⁷ was part o⁵ t⁷⁴ ⁸n⁸t⁸al ta⁶⁶⁸n⁶ sc⁷⁴m⁴, was ⁴xclud⁴d. ⁸s ⁸mprov⁴d rat⁴r p⁴r⁵ormanc⁴ but may ⁷av⁴ aff⁴ct⁴d t⁷⁴ scor⁸n⁶ o⁵ SAN cr⁸t⁴r⁸a t⁷at mak⁴ us⁴ o⁵ ⁵r⁴qu⁴ncy, ⁴.⁶., first-p⁴rson pronouns.³ v⁴n so, ⁸nt⁴r-rat⁴r r⁴l⁸ab⁸l⁸ty ⁸s low to mod⁴rat⁴ (< .0) on som⁴ o⁵ t⁷⁴ most ⁸mportant SAN cr⁸t⁴r⁸a—c⁷an⁶⁴s ⁸n pronoun us⁴, structur⁴ o⁵ t⁷⁴ stat⁴m⁴nt and m⁸ss⁸n⁶ ⁸n⁵ormat⁸on. R⁴sults o⁵ t⁷⁴ scor⁸n⁶ s⁷ow⁴d RM succ⁴⁴d⁴d ⁸n d⁸st⁸n⁶u⁸s⁷⁸n⁶ l⁸ars ⁵rom trut⁷-t⁴ll⁴rs w⁷⁴r⁴ SAN ⁵a⁸l⁴d to do so. ow⁴v⁴r, t⁷⁴ poss⁸b⁸l⁸ty ⁴x⁸sts t⁷at ⁸nsuffic⁸⁴nt rat⁴r tra⁸n⁸n⁶, a s⁸mpl⁸fi⁴d scor⁸n⁶ syst⁴m and rat⁴r d⁸sa⁶r⁴⁴m⁴nts may ⁷av⁴ play⁴d a rol⁴ ⁸n SAN’s poor p⁴r⁵ormanc⁴. ⁸nally, Na⁷ar⁸ ⁴t al. [0] mak⁴ a stron⁶ ar⁶um⁴nt ⁵or t⁷⁴ valu⁴ o⁵ RM’s “t⁷⁴or⁴t⁸cal und⁴rp⁸nn⁸n⁶.” As a psyc⁷olo⁶⁸cal mod⁴l, RM ⁷olds t⁷⁴ ⁷⁸⁶⁷ ⁶round compar⁴d to SAN. A drawback o⁵ RM as a ⁵ormal mod⁴l, ⁷ow⁴v⁴r, ⁸s t⁷at many o⁵ t⁷⁴ cr⁸t⁴r⁸a ar⁴ ⁸mpr⁴ss⁸on⁸st⁸c and ³rom t⁷⁴ SAN works⁷op ⁶u⁸d⁴l⁸n⁴s “⁵ w⁴ find only on⁴ (or two or t⁷r⁴⁴) first-p⁴rson pronoun(s) (I; we ) ⁸n t⁷⁴ stat⁴m⁴nt, t⁷⁴ s⁴nt⁴nc⁴(s) w⁷⁴r⁴ t⁷⁴y app⁴ar s⁷ould b⁴ cons⁸d⁴r⁴d as v⁴ry un⁸qu⁴” (LS SAN works⁷op, 00.)
2.5. FORENSIC IMPLEMENTATIONS OF THE LITERATURE
41
r⁴⁵⁴r to qual⁸t⁸⁴s suc⁷ as “v⁸brancy,” “r⁸c⁷n⁴ss,” “clar⁸ty,” and “r⁴al⁸sm” t⁷at p⁴opl⁴ r⁴co⁶n⁸z⁴ but t⁷at ar⁴ d⁸fficult to r⁴pr⁴s⁴nt ⁸n an automat⁴d syst⁴m. Many o⁵ t⁷⁴ SAN cr⁸t⁴r⁸a, w⁷⁸l⁴ lack⁸n⁶ t⁷⁴or⁴t⁸cal mot⁸vat⁸on, l⁴nd t⁷⁴ms⁴lv⁴s to pract⁸cal d⁴scr⁸pt⁸ons t⁷at work ⁵or ⁸nstruct⁸on and ⁸mpl⁴m⁴ntat⁸on.
Conclusion ⁴ r⁴sults obta⁸n⁴d ⁵rom laboratory t⁴sts o⁵ SAN ar⁴ at odds w⁸t⁷ t⁷os⁴ obta⁸n⁴d ⁵rom fi⁴ld stud⁸⁴s. ot⁷ r⁸scoll [] and Adams and Jarv⁸s [00] ⁵ound t⁷at stat⁴m⁴nt balanc⁴, on⁴ o⁵ t⁷⁴ most ⁸mportant SAN cr⁸t⁴r⁸a, fi⁶ur⁴s s⁸⁶n⁸ficantly ⁸n d⁸scr⁸m⁸nat⁸n⁶ trut⁷ and d⁴c⁴pt⁸on. ⁴ laboratory data stud⁸⁴s ⁵a⁸l⁴d to find any s⁸⁶n⁸ficant ⁴ff⁴ct o⁵ stat⁴m⁴nt balanc⁴. ⁸s s⁷ould not b⁴ surpr⁸s⁸n⁶ s⁸nc⁴ ⁸ncons⁸st⁴ncy across stud⁸⁴s ⁸s pr⁴val⁴nt amon⁶ bot⁷ laboratory r⁴s⁴arc⁷ stud⁸⁴s and fi⁴ld stud⁸⁴s, w⁷⁴t⁷⁴r t⁷⁴y s⁷ar⁴ a common ⁵ram⁴work and ⁶oals [Vr⁸⁹, 00] or ar⁴ ⁸nv⁴st⁸⁶at⁸n⁶ comp⁴t⁸n⁶ ⁵ram⁴works [Na⁷ar⁸ ⁴t al., 0, Port⁴r and Yu⁸ll⁴, ] t would c⁴rta⁸nly prov⁴ product⁸v⁴ to ⁸nt⁴rpr⁴t ⁸ncons⁸st⁴nc⁸⁴s as ⁴v⁸d⁴nc⁴ o⁵ t⁷⁴ n⁴⁴d ⁵or mor⁴ d⁴v⁴lopm⁴nt o⁵ s⁷ar⁴d r⁴sourc⁴s—data, so⁵twar⁴, tra⁸n⁸n⁶ tools—and b⁴st pract⁸c⁴s. Suc⁷ an ⁴⁵⁵ort would off⁴r ⁴normous b⁴n⁴fits to fi⁴ld stud⁸⁴s w⁷⁴r⁴ ass⁴ssm⁴nts o⁵ ⁶round trut⁷ ar⁴ cr⁸t⁸cal to produc⁸n⁶ r⁴l⁸abl⁴ r⁴sults.
[ac⁷⁴nko ⁴t al., 00] us⁴ s⁴v⁴ral cr⁸t⁴r⁸a borrow⁴d ⁵rom t⁷⁴ stat⁴m⁴nt analys⁸s commun⁸ty, ⁸nclud⁸n⁶ SAN.
APTR
43
Data Sources 3.1
INTRODUCTION
As d⁸scuss⁴d ⁸n ⁷apt⁴r , part⁸cularly ⁸n r⁴⁵⁴r⁴nc⁴ to R⁴al⁸ty Mon⁸tor⁸n⁶, t⁷⁴ s⁴arc⁷ ⁵or b⁴⁷av⁸oral cu⁴s to d⁴c⁴pt⁸on ⁸s ⁶u⁸d⁴d by t⁷⁴ assumpt⁸on t⁷at narrat⁸v⁴s bas⁴d on ⁴xp⁴r⁸⁴nc⁴d ⁴v⁴nts d⁸ff⁴r ⁸n m⁴asurabl⁴ ways ⁵rom ⁵abr⁸cat⁴d ⁴xp⁴r⁸⁴nc⁴s. Laboratory ⁴xp⁴r⁸m⁴nts suc⁷ as t⁷os⁴ cons⁸d⁴r⁴d ⁸n t⁷⁴ pr⁴v⁸ous c⁷apt⁴r ⁷av⁴ produc⁴d a body o⁵ ⁴v⁸d⁴nc⁴ ⁸nd⁸cat⁸n⁶ t⁷at s⁴v⁴ral v⁴rbal and nonv⁴rbal b⁴⁷av⁸ors corr⁴lat⁴ w⁸t⁷ ⁵abr⁸cat⁴d accounts. ⁴ sc⁴nar⁸os us⁴d ⁵or laboratory ⁴xp⁴r⁸m⁴nts ar⁴ o⁵t⁴n c⁷aract⁴r⁸z⁴d as “low-stak⁴s d⁴c⁴pt⁸on” b⁴caus⁴ t⁷⁴ sub⁹⁴cts ⁸n t⁷⁴s⁴ stud⁸⁴s ⁷av⁴ l⁸ttl⁴ to los⁴ ⁸⁵ t⁷⁴⁸r l⁸⁴s ar⁴ d⁸scov⁴r⁴d. Most ⁹ur⁸sd⁸ct⁸ons ⁷av⁴ r⁴⁶ulat⁸ons on t⁷⁴ prot⁴ct⁸on o⁵ ⁷uman sub⁹⁴cts t⁷at would pro⁷⁸b⁸t ⁴xp⁴r⁸m⁴ntal d⁴s⁸⁶ns ⁸nvolv⁸n⁶ ⁷⁸⁶⁷-stak⁴s s⁸tuat⁸ons. n a ⁵abr⁸cat⁴d sc⁴nar⁸o, sub⁹⁴cts ar⁴ prot⁴ct⁴d ⁵rom any ⁷arm t⁷at m⁸⁶⁷t com⁴ ⁵rom d⁸scov⁴ry o⁵ d⁴c⁴pt⁸on. urt⁷⁴r, b⁴caus⁴ sub⁹⁴cts ar⁴ ⁸nstruct⁴d to l⁸⁴, t⁷⁴y n⁴⁴d not tak⁴ r⁴spons⁸b⁸l⁸ty ⁵or t⁷⁴ d⁸s⁷on⁴st b⁴⁷av⁸or “n v⁸rtually ⁴v⁴ry study o⁵ d⁴c⁴pt⁸v⁴ b⁴⁷av⁸or, t⁷⁴ r⁴s⁴arc⁷⁴rs ⁷av⁴ sanct⁸on⁴d, ⁴v⁴n ⁴ncoura⁶⁴d, t⁷⁴ sub⁹⁴cts’ d⁸s⁷on⁴st b⁴⁷av⁸or, t⁷us r⁴mov⁸n⁶ t⁷⁴ onus o⁵ r⁴spons⁸b⁸l⁸ty ⁵or t⁷⁴ d⁴c⁴pt⁸on act ⁵rom t⁷⁴ d⁴c⁴⁸v⁴r and plac⁸n⁶ ⁸t on t⁷⁴ r⁴s⁴arc⁷⁴r” [Kop⁴r and Sa⁷lman, , p. ]. ⁸⁶⁷-stak⁴s d⁴c⁴pt⁸on ⁸n, ⁵or ⁴xampl⁴, cr⁸m⁸nal cas⁴s, carr⁸⁴s s⁸⁶n⁸ficant r⁸sk s⁸nc⁴ d⁸scov⁴ry may ⁷av⁴ s⁴r⁸ous cons⁴qu⁴nc⁴s ⁵or t⁷⁴ d⁴c⁴⁸v⁴r and t⁷⁴ d⁴c⁴⁸v⁴r’s ⁵am⁸ly, ⁵r⁸⁴nds and coll⁴a⁶u⁴s. or many r⁴s⁴arc⁷⁴rs and pract⁸t⁸on⁴rs, t⁷⁴ lack o⁵ ⁴colo⁶⁸cal val⁸d⁸ty ⁸n low-stak⁴s d⁴c⁴pt⁸on ra⁸s⁴s doubts about t⁷⁴ ab⁸l⁸ty o⁵ laboratory b⁴⁷av⁸ors to stand ⁵or r⁴al-world d⁴c⁴pt⁸v⁴ b⁴⁷av⁸or. t ⁸s poss⁸bl⁴ t⁷at laboratory r⁴s⁴arc⁷ ⁷as ov⁴rlook⁴d l⁴aka⁶⁴ b⁴⁷av⁸ors ⁸n ⁷⁸⁶⁷-stak⁴s d⁴c⁴⁸v⁴rs s⁸nc⁴ actual r⁸sk may promot⁴ b⁴⁷av⁸ors t⁷at do not occur ⁸n low-stak⁴s sc⁴nar⁸os. t ⁸s also poss⁸bl⁴ t⁷at laboratory t⁴sts ov⁴r⁴st⁸mat⁴ t⁷⁴ ⁸mportanc⁴ o⁵ c⁴rta⁸n b⁴⁷av⁸ors t⁷at, ⁸n ⁵act, ⁷av⁴ b⁴⁴n l⁸nk⁴d to sanct⁸on⁴d d⁴c⁴⁸t, as not⁴d by orvat⁷ ⁴t al. [], Vr⁸⁹ and Mann [00a], and ornac⁸ar⁸ and Po⁴s⁸o [0]. Qu⁴st⁸ons about t⁷⁴ val⁸d⁸ty o⁵ laboratory stud⁸⁴s b⁴com⁴ a cr⁸t⁸cal ⁸ssu⁴ w⁷⁴n w⁴ att⁴mpt to compar⁴ and ⁴valuat⁴ t⁷⁴or⁴t⁸cal mod⁴ls o⁵ d⁴c⁴pt⁸on and, ⁴qually ⁸mportant, w⁷⁴n w⁴ att⁴mpt to bu⁸ld syst⁴ms ⁵or t⁷⁴ r⁴co⁶n⁸t⁸on o⁵ d⁴c⁴pt⁸v⁴ lan⁶ua⁶⁴ by ⁷umans and mac⁷⁸n⁴s. Y⁴t alt⁴rnat⁸v⁴s to laboratory-⁶⁴n⁴rat⁴d data ⁷av⁴ t⁷⁴⁸r own probl⁴ms. ⁴ most ⁸mportant o⁵ t⁷⁴s⁴ ⁸s t⁷⁴ r⁴pr⁴s⁴ntat⁸on o⁵ ⁶round trut⁷. n a typ⁸cal laboratory ⁴xp⁴r⁸m⁴nt, sub⁹⁴cts ar⁴ ass⁸⁶n⁴d to trut⁷⁵ul or d⁴c⁴pt⁸v⁴ cond⁸t⁸ons and plac⁴d ⁸n sc⁴nar⁸os w⁷⁴r⁴ t⁷⁴y d⁴scr⁸b⁴ t⁷⁴⁸r act⁸ons trut⁷⁵ully or d⁴c⁴pt⁸v⁴ly d⁴p⁴nd⁸n⁶ on t⁷⁴ rol⁴ t⁷⁴y ⁷av⁴ b⁴⁴n ass⁸⁶n⁴d. ⁴ ⁴xp⁴r⁸m⁴nt⁴rs know t⁷⁴ ⁶round trut⁷, ⁸.⁴., w⁷⁸c⁷ o⁵ t⁷⁴ accounts ⁸s a l⁸⁴ and w⁷⁸c⁷ ⁸s tru⁴, and can look ⁵or t⁴lltal⁴
44
3. DATA SOURCES
b⁴⁷av⁸ors und⁴r ⁴ac⁷ cond⁸t⁸on. round trut⁷ ⁵or ⁷⁸⁶⁷-stak⁴s s⁸tuat⁸ons ⁸s rar⁴ly as r⁴l⁸abl⁴ or as w⁴ll r⁴pr⁴s⁴nt⁴d. ⁸s c⁷apt⁴r ⁴xam⁸n⁴s s⁴v⁴ral “r⁴al-world” stud⁸⁴s o⁵ lan⁶ua⁶⁴ and d⁴c⁴pt⁸on. W⁴ b⁴⁶⁸n w⁸t⁷ a br⁸⁴⁵ d⁴scr⁸pt⁸on o⁵ data sourc⁴s and ⁶o on to compar⁴ t⁷⁴ d⁸ff⁴r⁴nt m⁴t⁷ods by w⁷⁸c⁷ ⁶round trut⁷ ⁸s ⁴stabl⁸s⁷⁴d. urr⁴nt t⁴c⁷n⁸qu⁴s ⁵or r⁴al-world d⁴c⁴pt⁸on stud⁸⁴s ar⁴ lar⁶⁴ly d⁴r⁸v⁴d ⁵rom m⁴t⁷ods d⁴v⁴lop⁴d ⁵or r⁸t⁴r⁸a-as⁴d ont⁴nt Analys⁸s (A), d⁸scuss⁴d ⁸n ⁷apt⁴r , S⁴ct⁸on . as t⁷⁴ lan⁶ua⁶⁴ analys⁸s compon⁴nt o⁵ Stat⁴m⁴nt Val⁸d⁸ty Analys⁸s (SVA). As w⁴ not⁴ t⁷⁴r⁴, A was cr⁴at⁴d to m⁴asur⁴ t⁷⁴ v⁴rac⁸ty o⁵ c⁷⁸ld v⁸ct⁸ms and w⁸tn⁴ss⁴s ⁸n s⁴xual assault cas⁴s. W⁴ ⁷av⁴ ⁴xclud⁴d r⁴al-world A stud⁸⁴s ⁵rom our r⁴v⁸⁴w ⁵or two r⁴asons. ⁸rst, Vr⁸⁹’s r⁴v⁸⁴w o⁵ r⁴al-world A stud⁸⁴s prov⁸d⁴s a t⁷orou⁶⁷ ass⁴ssm⁴nt o⁵ t⁷⁴ ⁷⁸story and ⁴ff⁴ct⁸v⁴n⁴ss o⁵ A t⁴c⁷n⁸qu⁴s [Vr⁸⁹, 00]. S⁴cond, A cr⁸t⁴r⁸a ⁸n t⁷⁴⁸r or⁸⁶⁸nal ⁵orm d⁴p⁴nd on ⁸nt⁴rpr⁴t⁸v⁴ sk⁸lls t⁷at would b⁴ d⁸fficult or ⁸mposs⁸bl⁴ to r⁴pr⁴s⁴nt ⁸n a NLP syst⁴m. W⁴ t⁷us ⁵or⁶o ⁸nclud⁸n⁶ t⁷⁴ A stud⁸⁴s ⁴xc⁴pt to not⁴ t⁷at, as Vr⁸⁹ [00] obs⁴rv⁴s, A r⁴s⁴arc⁷ s⁷ar⁴s t⁷⁴ cr⁸t⁸cal c⁷all⁴n⁶⁴s ⁵ac⁸n⁶ ot⁷⁴r ⁴fforts ⁸n r⁴al-world d⁴c⁴pt⁸on r⁴s⁴arc⁷ t⁷⁴ ava⁸lab⁸l⁸ty o⁵ data and t⁷⁴ v⁴r⁸ficat⁸on o⁵ ⁶round trut⁷.
3.2
ESTABLISHING GROUND TRUTH
stabl⁸s⁷⁸n⁶ ⁶round trut⁷ ⁸s t⁷⁴ k⁴y to any ⁸nv⁴st⁸⁶at⁸on o⁵ b⁴⁷av⁸oral cu⁴s to d⁴c⁴pt⁸on. W⁷⁴n controll⁴d by t⁷⁴ ⁴xp⁴r⁸m⁴nt⁴r, ⁶round trut⁷ ⁸s unamb⁸⁶uous and acc⁴ss⁸bl⁴ but w⁷⁴n ⁶round trut⁷ must b⁴ tak⁴n ⁵rom an ⁴xt⁴rnal sourc⁴ ⁸t ⁸s rar⁴ly ⁵r⁴⁴ o⁵ unc⁴rta⁸nty, ⁴sp⁴c⁸ally ⁸n t⁷⁴ cas⁴ o⁵ ⁵or⁴ns⁸c data. ⁴sp⁸t⁴ t⁷⁴ d⁸fficult⁸⁴s ⁸n ⁴stabl⁸s⁷⁸n⁶ ⁶round trut⁷, a numb⁴r o⁵ r⁴s⁴arc⁷⁴rs ⁷av⁴ mad⁴ us⁴ o⁵ data coll⁴ct⁴d ⁵rom sourc⁴s outs⁸d⁴ t⁷⁴ laboratory. ⁴ fi⁴ld stud⁸⁴s r⁴v⁸⁴w⁴d ⁸n t⁷⁸s c⁷apt⁴r ⁵all ⁸nto t⁷r⁴⁴ d⁸st⁸nct cat⁴⁶or⁸⁴s accord⁸n⁶ to t⁷⁴⁸r sourc⁴s o⁵ l⁸n⁶u⁸st⁸c data¹ • legal and forensic interviews and statements: pol⁸c⁴ and pros⁴cutor ⁸nt⁴rv⁸⁴ws, con⁶r⁴ss⁸onal t⁴st⁸mony, courtroom proc⁴⁴d⁸n⁶s • financial reports: ⁴arn⁸n⁶s calls, corporat⁴ tax ⁵orms and • mass media communications: m⁴d⁸a ⁸nt⁴rv⁸⁴ws o⁵ ⁷⁸⁶⁷ profil⁴ p⁴opl⁴ and t⁷⁴⁸r a⁸d⁴s and ass⁸stants. W⁸t⁷ t⁷⁴ ⁴xc⁴pt⁸on o⁵ mass m⁴d⁸a commun⁸cat⁸ons, t⁷⁴ task o⁵ coll⁴ct⁸n⁶ l⁸n⁶u⁸st⁸c data can b⁴ surpr⁸s⁸n⁶ly d⁸fficult [⁸tzpatr⁸ck and ac⁷⁴nko, 00, 0]. ⁸tzpatr⁸ck and ac⁷⁴nko [0] cov⁴r publ⁸c sourc⁴s ⁵or t⁷⁸s data, ⁸nclud⁸n⁶ cr⁸m⁴ ⁸nv⁴st⁸⁶at⁸on w⁴bs⁸t⁴s, publ⁸s⁷⁴d pol⁸c⁴ ⁸nt⁴rv⁸⁴ws, l⁴⁶al w⁴bs⁸t⁴s suc⁷ as findlaw.com and ⁹ust⁸c⁴.⁶ov, quart⁴rly ⁴arn⁸n⁶s con⁵⁴r⁴nc⁴ calls, and t⁷⁴ U.S. on⁶r⁴ss⁸onal R⁴cord. ¹ssu⁴s r⁴lat⁴d to a ⁵ourt⁷ cat⁴⁶ory, product r⁴v⁸⁴ws, ar⁴ cov⁴r⁴d ⁸n ⁷apt⁴rs and s⁸nc⁴ t⁷⁴ coll⁴ct⁸on and annotat⁸on o⁵ product r⁴v⁸⁴w data ⁸s ⁸nt⁴⁶ral to sp⁴c⁸fic computat⁸onal approac⁷⁴s.
3.2. ESTABLISHING GROUND TRUTH
45
ata must b⁴ l⁸m⁸t⁴d to w⁷at can b⁴ v⁴r⁸fi⁴d by ⁶round trut⁷ sourc⁴s, and r⁴s⁴arc⁷⁴rs ar⁴ bound by ⁴t⁷⁸cal constra⁸nts ⁴stabl⁸s⁷⁴d, ⁸n t⁷⁴ U.S., by nst⁸tut⁸onal R⁴v⁸⁴w oards (Rs) and by s⁸m⁸lar bod⁸⁴s ⁸n ot⁷⁴r countr⁸⁴s.² n add⁸t⁸on, w⁴ ⁷av⁴ ⁵ound t⁷at n⁴arly all t⁷⁴ data ava⁸labl⁴ ⁵or r⁴al-world d⁴c⁴pt⁸on r⁴s⁴arc⁷ com⁴s ⁵rom p⁴opl⁴ w⁷o ⁷av⁴ b⁴⁴n s⁷own to b⁴ (or ar⁴ assum⁴d to b⁴) ⁶u⁸lty o⁵ wron⁶do⁸n⁶, ⁴⁸t⁷⁴r cr⁸m⁸nal or c⁸v⁸l, or soc⁸al ⁸n t⁷⁴ cas⁴ o⁵ m⁴d⁸a c⁴l⁴br⁸t⁸⁴s. omparabl⁴ lan⁶ua⁶⁴ data ⁵rom ⁸nnoc⁴nt p⁴opl⁴ ⁸s rar⁴.
3.2.1 FORENSIC DATA SOURCES: SPOKEN AND WRITTEN or⁴ns⁸c data ⁵or spok⁴n product⁸ons com⁴s ⁵rom transcr⁸pts o⁵ r⁴cord⁴d ⁸nt⁴rv⁸⁴ws and court proc⁴⁴d⁸n⁶s. orpus s⁸z⁴s ar⁴ small by mac⁷⁸n⁴ l⁴arn⁸n⁶ standards, ran⁶⁸n⁶ ⁵rom s⁸x utt⁴ranc⁴s³ by a s⁸n⁶l⁴ sp⁴ak⁴r [Vr⁸⁹ and Mann, 00a] to ov⁴r ,000 utt⁴ranc⁴s by sp⁴ak⁴rs ⁸n t⁷⁴ ⁴our corpus [ornac⁸ar⁸ and Po⁴s⁸o, 0]. Alt⁷ou⁶⁷ l⁸m⁸t⁴d ⁸n scop⁴, Vr⁸⁹ and Mann [00a] was an ⁸nflu⁴nt⁸al ⁴arly study t⁷at art⁸culat⁴d an approac⁷ to data coll⁴ct⁸on and ⁶round trut⁷ ⁴st⁸mat⁸on t⁷at ⁷as b⁴⁴n us⁴d ⁸n lat⁴r stud⁸⁴s. R⁴s⁴arc⁷⁴rs manually ⁴xam⁸n⁴d v⁴rbat⁸m transcr⁸pts o⁵ v⁸d⁴o r⁴cord⁸n⁶s o⁵ a s⁸n⁶l⁴ cr⁸m⁸nal susp⁴ct lat⁴r conv⁸ct⁴d o⁵ murd⁴r. Only cl⁸ps t⁷at could b⁴ v⁴r⁸fi⁴d as tru⁴ or ⁵als⁴ w⁴r⁴ adm⁸tt⁴d ⁸nto t⁷⁴ corpus t⁷r⁴⁴ l⁸⁴s and t⁷r⁴⁴ trut⁷⁵ul s⁴⁶m⁴nts ran⁶⁸n⁶ ⁸n durat⁸on ⁵rom – s. ompar⁸sons o⁵ t⁷⁴ utt⁴ranc⁴s y⁸⁴ld⁴d som⁴ ⁴v⁸d⁴nc⁴ o⁵ d⁸ff⁴r⁴nc⁴s ⁸n vocal and nonv⁴rbal b⁴⁷av⁸ors—l⁸⁴s occurr⁴d w⁸t⁷ lon⁶⁴r paus⁴s, a slow⁴r sp⁴ak⁸n⁶ rat⁴ and mor⁴ d⁸sflu⁴nc⁸⁴s—but t⁷⁴ small datas⁴t ⁸s ⁸nsuffic⁸⁴nt to support any sol⁸d conclus⁸ons. n a ⁵ollow-up study, Mann ⁴t al. [00] ⁴xt⁴nd⁴d t⁷⁴ data s⁴t to v⁸d⁴os o⁵ cr⁸m⁸nal susp⁴cts, only som⁴ o⁵ w⁷om w⁴r⁴ conv⁸ct⁴d. Us⁸n⁶ t⁷⁴ sam⁴ corpus construct⁸on m⁴t⁷od ⁴mploy⁴d by Vr⁸⁹ and Mann [00a], t⁷⁴ final corpus compr⁸s⁴d v⁴r⁸fiabl⁴ utt⁴ranc⁴s trut⁷⁵ul utt⁴ranc⁴s and l⁸⁴s ran⁶⁸n⁶ ⁸n durat⁸on ⁵rom .–. s. ompar⁸sons o⁵ t⁷⁴ tru⁴ and ⁵als⁴ utt⁴ranc⁴s ⁸nd⁸cat⁴d t⁷at “susp⁴cts bl⁸nk⁴d l⁴ss and paus⁴d lon⁶⁴r w⁷⁸l⁴ ly⁸n⁶” [Mann ⁴t al., 00, p. ]. ⁴ aut⁷ors acknowl⁴d⁶⁴, ⁷ow⁴v⁴r, t⁷at t⁷⁴ l⁸m⁸t⁴d ava⁸lab⁸l⁸ty o⁵ lan⁶ua⁶⁴ data and t⁷⁴ work ⁴ffort r⁴qu⁸r⁴d to mak⁴ ⁸t usabl⁴ pr⁴v⁴nt⁴d t⁷⁴m ⁵rom draw⁸n⁶ ⁵rom a lar⁶⁴r sub⁹⁴ct pool t⁷at m⁸⁶⁷t ⁷av⁴ y⁸⁴ld⁴d r⁸c⁷⁴r r⁴sults. ont⁸nu⁸n⁶ on t⁷⁴ pat⁷ s⁴t by Vr⁸⁹ and Mann [00a], av⁸s ⁴t al. [00] ⁴xplor⁴ t⁷⁴ ⁸nt⁴ract⁸on b⁴tw⁴⁴n d⁴c⁴pt⁸on, trut⁷ t⁴ll⁸n⁶, and t⁷⁴ ⁸ncr⁸m⁸nat⁸n⁶ pot⁴nt⁸al o⁵ a qu⁴st⁸on or ²⁸tzpatr⁸ck and ac⁷⁴nko [0] prov⁸d⁴ a d⁴ta⁸l⁴d account o⁵ t⁷⁴ typ⁴s o⁵ l⁸n⁶u⁸st⁸c data and ⁶round trut⁷ sourc⁴s t⁷at ar⁴ cons⁸st⁴nt w⁸t⁷ t⁷⁴ r⁴qu⁸r⁴m⁴nts o⁵ ⁴t⁷⁸cs comm⁸tt⁴⁴s. ⁴ most s⁸⁶n⁸ficant constra⁸nt ⁵or most r⁴al-world d⁴c⁴pt⁸on stud⁸⁴s ⁸s t⁷⁴ sub⁹⁴ct’s ⁴xp⁴ctat⁸on o⁵ pr⁸vacy, w⁷⁸c⁷ l⁸m⁸ts data sourc⁴s to ⁷⁸⁶⁷ profil⁴ ⁸nd⁸v⁸duals, conv⁸ct⁴d cr⁸m⁸nals, publ⁸c fi⁶ur⁴s, and t⁷⁴ d⁴c⁴as⁴d. A sub⁹⁴ct’s r⁸⁶⁷t to pr⁸vacy ⁸s r⁴co⁶n⁸z⁴d by t⁷⁴ ⁴t⁷⁸cs a⁶r⁴⁴m⁴nts o⁵ many, but not all, countr⁸⁴s. (http://www.hhs.gov/ohrp/international/intlcompilation/intlcompilation.html). ³⁴ d⁴fin⁸t⁸on o⁵ utt⁴ranc⁴ var⁸⁴s across stud⁸⁴s. Vr⁸⁹ and Mann [00a] d⁴fin⁴ an utt⁴ranc⁴ (w⁷⁸c⁷ t⁷⁴y t⁴rm a “⁵ra⁶m⁴nt,” p. ) as a span o⁵ transcr⁸b⁴d sp⁴⁴c⁷ t⁷at can b⁴ v⁴r⁸fi⁴d as tru⁴ or ⁵als⁴. ⁴ utt⁴ranc⁴s d⁴fin⁴d by Vr⁸⁹ and Mann ar⁴ v⁸d⁴o cl⁸ps o⁵ t⁷⁴ susp⁴ct “w⁷⁴r⁴ trut⁷ or l⁸⁴ ⁷ad b⁴⁴n stron⁶ly support⁴d by ot⁷⁴r conv⁸nc⁸n⁶ ⁴v⁸d⁴nc⁴” [Mann ⁴t al., 00, p. ]. t do⁴s not app⁴ar t⁷at utt⁴ranc⁴s ⁸n t⁷⁴s⁴ stud⁸⁴s ar⁴ l⁸m⁸t⁴d to ⁸nd⁸v⁸dual propos⁸t⁸ons or s⁴nt⁴nc⁴s. n ornac⁸ar⁸ and Po⁴s⁸o [0] utt⁴ranc⁴s ar⁴ ⁸d⁴nt⁸fi⁴d ⁸n transcr⁸pt⁸ons o⁵ t⁷⁴ d⁴⁵⁴ndant’s sp⁴⁴c⁷ as “str⁸n⁶s o⁵ t⁴xt d⁴l⁸m⁸t⁴d by punctuat⁸on marks, suc⁷ as p⁴r⁸ods, qu⁴st⁸on marks, and ⁴ll⁸ps⁴s.” av⁸s ⁴t al. [00] d⁴scr⁸b⁴ an utt⁴ranc⁴ as a spok⁴n answ⁴r t⁷at conta⁸ns up to t⁷r⁴⁴ p⁸⁴c⁴s o⁵ ⁸n⁵ormat⁸on t⁷at must p⁴rta⁸n to t⁷⁴ top⁸c und⁴r scrut⁸ny and b⁴ capabl⁴ o⁵ corroborat⁸on.
46
3. DATA SOURCES
stat⁴m⁴nt. ⁴⁸r data sourc⁴ cons⁸sts o⁵ r⁴cord⁴d con⁵⁴ss⁸ons by cr⁸m⁸nal susp⁴cts w⁷o w⁴r⁴ ⁴v⁴ntually conv⁸ct⁴d. ⁴ susp⁴cts w⁴r⁴ film⁴d ⁶⁸v⁸n⁶ t⁷⁴⁸r con⁵⁴ss⁸ons to an ass⁸stant d⁸str⁸ct attorn⁴y a⁵t⁴r t⁷⁴ pol⁸c⁴ ⁸nt⁴rv⁸⁴w. As w⁸t⁷ t⁷⁴ pr⁴v⁸ous two stud⁸⁴s, r⁴s⁴arc⁷⁴rs ⁴xam⁸n⁴d v⁴rbat⁸m transcr⁸pt⁸ons o⁵ t⁷⁴ v⁸d⁴os. ⁴y ⁸d⁴nt⁸fi⁴d utt⁴ranc⁴s t⁷at could b⁴ confirm⁴d as tru⁴ or ⁵als⁴ utt⁴ranc⁴ durat⁸ons ran⁶⁴d ⁵rom 0.–. s, and utt⁴ranc⁴ word counts ran⁶⁴d ⁵rom –. ⁴ ⁴xp⁴r⁸m⁴nt⁴rs ⁵ound ⁴v⁸d⁴nc⁴ ⁵or som⁴ cu⁴s to d⁴c⁴pt⁸on, ⁴.⁶., word r⁴p⁴t⁸t⁸on and t⁷⁴ p⁷ras⁴ “ don’t know” but also ⁸d⁴nt⁸fi⁴d non-v⁴rbal cu⁴s ⁵or str⁴ss, ⁴.⁶., sp⁴ak⁸n⁶ rat⁴, t⁷at d⁸d not d⁸scr⁸m⁸nat⁴ trut⁷ ⁵rom d⁴c⁴pt⁸on but may b⁴ ⁸nd⁸cat⁸v⁴ o⁵ ⁸ncr⁸m⁸nat⁸n⁶ cont⁴nt ⁸n an utt⁴ranc⁴. W⁸t⁷ r⁴sp⁴ct to t⁷⁴⁸r data sourc⁴, t⁷⁴ aut⁷ors ⁴xpr⁴ss stron⁶ r⁴s⁴rvat⁸ons about t⁷⁴ ⁶u⁸lt b⁸as ⁸n t⁷⁴⁸r corpus “w⁴ cannot ass⁴ss t⁷⁴ ⁴xt⁴nt to w⁷⁸c⁷ our r⁴sults w⁴r⁴ sk⁴w⁴d b⁴caus⁴ w⁴ could only obta⁸n tap⁴s o⁵ sub⁹⁴cts ⁹ud⁶⁴d ⁶u⁸lty. W⁴ would ⁴xp⁴ct t⁷⁴ b⁴⁷av⁸or o⁵ a ⁶u⁸lty susp⁴ct w⁷o con⁵⁴ss⁴d ⁵or p⁴rsonal ⁶a⁸n, psyc⁷olo⁶⁸cal n⁴⁴d or compl⁸anc⁴-w⁸t⁷-aut⁷or⁸ty mot⁸v⁴s to b⁴ v⁴ry d⁸ff⁴r⁴nt ⁵rom t⁷⁴ b⁴⁷av⁸or o⁵ an ⁸nnoc⁴nt susp⁴ct w⁷o was co⁴rc⁴d or psyc⁷olo⁶⁸cally mot⁸vat⁴d to mak⁴ a ⁵als⁴ con⁵⁴ss⁸on” [av⁸s ⁴t al., 00, p. 00]. Automat⁴d, or part⁸ally automat⁴d, approac⁷⁴s ar⁴ d⁴scr⁸b⁴d by ornac⁸ar⁸ and Po⁴s⁸o [0] and ac⁷⁴nko ⁴t al. [00]. ot⁷ ⁷av⁴ d⁴v⁴lop⁴d corpora bas⁴d on transcr⁸b⁴d narrat⁸v⁴s ⁵rom s⁴v⁴ral sp⁴ak⁴rs. ⁴ tal⁸an ⁴our corpus [ornac⁸ar⁸ and Po⁴s⁸o, 0] compr⁸s⁴s transcr⁸pt⁸ons o⁵ tal⁸an court ⁷⁴ar⁸n⁶s ⁵or ⁹ud⁶m⁴nt on p⁴r⁹ury cas⁴s. ⁴ corpus ⁸s substant⁸al, cons⁸st⁸n⁶ o⁵ ⁷⁴ar⁸n⁶s ⁵rom ⁵our tal⁸an courts and d⁸ff⁴r⁴nt p⁴opl⁴. ⁴ corpus d⁴v⁴lop⁴d by ac⁷⁴nko ⁴t al. ⁸s mad⁴ up o⁵ s⁴v⁴ral stat⁴m⁴nts by p⁴opl⁴ ⁸mpl⁸cat⁴d ⁸n a var⁸⁴ty o⁵ c⁸v⁸l and cr⁸m⁸nal act⁸v⁸t⁸⁴s. ⁴s⁴ ⁸nclud⁴ pol⁸c⁴ ⁸nt⁴rv⁸⁴ws, lawsu⁸t d⁴pos⁸t⁸ons, and con⁶r⁴ss⁸onal t⁴st⁸mony total⁸n⁶ sl⁸⁶⁷tly ov⁴r 0,000 words. r⁴⁴ publ⁸s⁷⁴d stud⁸⁴s ⁷av⁴ ass⁴mbl⁴d and analyz⁴d coll⁴ct⁸ons o⁵ wr⁸tt⁴n stat⁴m⁴nts by cr⁸m⁸nal susp⁴cts and v⁸ct⁸ms. r⁸scoll [] ⁴xam⁸n⁴d 0 stat⁴m⁴nts pr⁴par⁴d by cr⁸m⁸nal susp⁴cts pr⁸or to a poly⁶rap⁷ ⁴xam⁸nat⁸on. Sm⁸t⁷ [00] conduct⁴d ⁴xp⁴r⁸m⁴nts us⁸n⁶ cr⁸m⁸nal stat⁴m⁴nts prov⁸d⁴d by U.S. law ⁴n⁵orc⁴m⁴nt ⁶roups. Adams and Jarv⁸s [00] ⁸nv⁴st⁸⁶at⁴d 0 stat⁴m⁴nts produc⁴d by adult susp⁴cts and v⁸ct⁸ms dur⁸n⁶ cr⁸m⁸nal ⁸nv⁴st⁸⁶at⁸ons, as d⁴scr⁸b⁴d ⁸n mor⁴ d⁴ta⁸l ⁸n ⁷apt⁴r , S⁴ct⁸on ., ⁸n t⁷⁴ pr⁴s⁴ntat⁸on o⁵ stat⁴m⁴nt analys⁸s.
3.2.2 FINANCIAL REPORTS n contrast w⁸t⁷ ⁵or⁴ns⁸c v⁴nu⁴s, data sourc⁴s ⁵or financ⁸al r⁴port⁸n⁶, at l⁴ast w⁸t⁷⁸n t⁷⁴ U.S., ar⁴ ⁴as⁸ly ⁵ound ⁵or publ⁸cly trad⁴d compan⁸⁴s (compan⁸⁴s t⁷at off⁴r stocks ⁵or sal⁴). Transcr⁸pts o⁵ ⁴arn⁸n⁶s calls and orm 0-Ks ar⁴ publ⁸c docum⁴nts ava⁸labl⁴ t⁷rou⁶⁷ t⁷⁴ S⁴cur⁸t⁸⁴s and xc⁷an⁶⁴ omm⁸ss⁸on (www.sec.gov/edgar.shtml) and a numb⁴r o⁵ w⁴b-bas⁴d sourc⁴s. arn⁸n⁶s con⁵⁴r⁴nc⁴ calls typ⁸cally op⁴n w⁸t⁷ a r⁴port on t⁷⁴ company’s financ⁸al p⁴r⁵ormanc⁴ and t⁷⁴n mov⁴ on to a qu⁴st⁸on and answ⁴r (Q/A) s⁴ss⁸on, w⁸t⁷ t⁷⁴ ⁷⁸⁴⁵ x⁴cut⁸v⁴ Offic⁴r (O) and ⁷⁸⁴⁵ ⁸nanc⁸al Offic⁴r (O) answ⁴r⁸n⁶ unr⁴⁷⁴ars⁴d qu⁴st⁸ons ⁵rom analysts, ⁸nv⁴stors, and m⁴d⁸a r⁴pr⁴s⁴ntat⁸v⁴s. raud occurs w⁷⁴n a financ⁸al stat⁴m⁴nt pr⁴s⁴nts ⁸n⁵ormat⁸on t⁷at ⁸s
3.2. ESTABLISHING GROUND TRUTH
47
⁸ncorr⁴ct or m⁸sl⁴ad⁸n⁶. n suc⁷ cas⁴s t⁷⁴ r⁴port must b⁴ r⁴v⁸s⁴d and publ⁸s⁷⁴d as a r⁴stat⁴m⁴nt. ⁴r⁴ ar⁴ v⁴r⁸ficat⁸on ⁸ssu⁴s w⁸t⁷ t⁷⁴ us⁴ o⁵ t⁷⁴s⁴ r⁴stat⁴m⁴nts, as d⁸scuss⁴d ⁸n S⁴ct⁸on ... Two r⁴c⁴nt stud⁸⁴s ⁷av⁴ compar⁴d t⁷⁴ lan⁶ua⁶⁴ o⁵ stat⁴m⁴nts and r⁴stat⁴m⁴nts ⁸n an ⁴ffort to d⁴v⁴lop pr⁴d⁸ct⁸v⁴ mod⁴ls o⁵ ⁵raud ⁸n ⁴arn⁸n⁶s call Q/As. Larck⁴r and Zakolyuk⁸na [00] obta⁸n⁴d ⁴arn⁸n⁶s call transcr⁸pts ⁵rom actS⁴t R⁴s⁴arc⁷ Syst⁴ms nc. ⁴ transcr⁸b⁴d corpus r⁴pr⁴s⁴nt⁴d a d⁸v⁴rs⁴ coll⁴ct⁸on o⁵ publ⁸c compan⁸⁴s “W⁴ cons⁸d⁴r all ava⁸labl⁴ transcr⁸pts o⁵ quart⁴rly ⁴arn⁸n⁶s con⁵⁴r⁴nc⁴ calls ⁵or t⁷⁴ U.S. compan⁸⁴s ov⁴r t⁷⁴ t⁸m⁴ p⁴r⁸od ⁵rom 00 to 00” [Larck⁴r and Zakolyuk⁸na, 00, p. ]. O⁵ t⁷⁴ ,0 answ⁴r narrat⁸v⁴s—a “narrat⁸v⁴” b⁴⁸n⁶ t⁷⁴ turn tak⁴n ⁸n answ⁴r to a qu⁴st⁸on on a call—t⁷at mak⁴ up Larck⁴r and Zakolyuk⁸na’s corpus, 0% w⁴r⁴ ⁹ud⁶⁴d d⁴c⁴pt⁸v⁴ bas⁴d on t⁷⁴ r⁴class⁸ficat⁸on o⁵ c⁴rta⁸n r⁴ports as r⁴stat⁴m⁴nts. n an ⁴xploratory study, ur⁶oon ⁴t al. l⁸m⁸t t⁷⁴⁸r data sourc⁴ to ⁴arn⁸n⁶s calls ⁵rom a s⁸n⁶l⁴ company [ur⁶oon ⁴t al., 0]. ⁴⁸r corpus cons⁸sts o⁵ , s⁴nt⁴nc⁴s (ur⁶oon ⁴t al’s “utt⁴ranc⁴s”) d⁴r⁸v⁴d ⁵rom transcr⁸pts mad⁴ ava⁸labl⁴ by omson R⁴ut⁴rs Str⁴⁴tv⁴nts (www.st reetevents.com). L⁸k⁴ Larck⁴r and Zakolyuk⁸na, ur⁶oon ⁴t al. att⁴mpt to ⁸d⁴nt⁸⁵y l⁸n⁶u⁸st⁸c ⁸nd⁸cators o⁵ d⁴c⁴pt⁸on ⁸n t⁷⁴ lan⁶ua⁶⁴ o⁵ t⁷⁴ ⁴arn⁸n⁶s call Q/A. ⁴⁸r approac⁷ d⁸ff⁴rs ⁵rom t⁷at o⁵ Larck⁴r and Zakolyuk⁸na ⁸n t⁷at (⁸) t⁷⁴⁸r pr⁸mary un⁸t o⁵ analys⁸s ⁸s t⁷⁴ s⁴nt⁴nc⁴ and (⁸⁸) t⁷⁴y us⁴ bot⁷ vocal and l⁸n⁶u⁸st⁸c cu⁴s to ⁸d⁴nt⁸⁵y d⁴c⁴pt⁸on ⁸n t⁷⁴ stat⁴m⁴nt, as compar⁴d to t⁷⁴ psyc⁷olo⁶oc⁸ally mot⁸vat⁴d L⁸n⁶u⁸st⁸c nqu⁸ry and Word ount (LW) ⁵⁴atur⁴s us⁴d by Larck⁴r and Zakolyuk⁸na. ⁴ l⁸m⁸t on corpus s⁸z⁴ com⁴s lar⁶⁴ly ⁵rom t⁷⁴ s⁸⁶n⁸ficant ⁸nv⁴stm⁴nt ⁸n t⁸m⁴ and labor to ⁸d⁴nt⁸⁵y and analyz⁴ s⁴nt⁴nc⁴s ⁸n t⁷⁴ sp⁴⁴c⁷ t⁷at can b⁴ v⁴r⁸fi⁴d as T or , and to v⁴r⁸⁵y t⁷⁴ s⁴nt⁴nc⁴s as r⁴lat⁴d or unr⁴lat⁴d to t⁷⁴ r⁴stat⁴m⁴nt. A r⁴lat⁴d study by ump⁷⁴rys ⁴t al. [0] ⁴xam⁸n⁴s d⁴c⁴pt⁸v⁴ lan⁶ua⁶⁴ ⁸n t⁷⁴ orm 0K, w⁷⁸c⁷ ⁸s a r⁴qu⁸r⁴d annual r⁴port ⁵or publ⁸cly trad⁴d compan⁸⁴s ⁸n t⁷⁴ U.S. (http://www. sec.gov/answers/reada10k.htm). S⁸m⁸lar to t⁷⁴ ⁴arn⁸n⁶s r⁴port calls, orm 0-K conta⁸ns a s⁴ct⁸on—Mana⁶⁴m⁴nt’s ⁸scuss⁸on and Analys⁸s (M&A)—w⁷⁴r⁴ mana⁶⁴m⁴nt can d⁸scuss t⁷⁴ company’s financ⁸al ⁷⁴alt⁷, op⁴rat⁸ons, and mark⁴t r⁸sks. t ⁸s t⁷⁸s s⁴ct⁸on t⁷at ump⁷⁴rys ⁴t al. b⁴l⁸⁴v⁴ can b⁴ analyz⁴d ⁵or ⁴v⁸d⁴nc⁴ o⁵ mana⁶⁴m⁴nt ⁵raud. as⁴s o⁵ known ⁵raud ⁸n t⁷⁴ 0-K ar⁴ ava⁸labl⁴ ⁵rom t⁷⁴ S, mak⁸n⁶ ⁸t poss⁸bl⁴ ⁵or ump⁷r⁴ys ⁴t al. to bu⁸ld a corpus r⁴lat⁸v⁴ly ⁴as⁸ly o⁵ 0 ⁵raudul⁴nt 0-K’s and 0 “comparabl⁴ non-⁵raudul⁴nt 0-K’s” [ump⁷⁴rys ⁴t al., 0, p. ], w⁷⁴r⁴ ⁴ac⁷ M&A was analyz⁴d ⁵or a s⁴t o⁵ l⁸n⁶u⁸st⁸c ⁸nd⁸cators s⁴l⁴ct⁴d by t⁷⁴ aut⁷ors. ⁴ r⁴lat⁸v⁴ ⁴as⁴ o⁵ obta⁸n⁸n⁶ sp⁴⁴c⁷ and lan⁶ua⁶⁴ data ⁸n financ⁸al r⁴port⁸n⁶ mak⁴ t⁷⁸s an acc⁴ss⁸bl⁴ and prom⁸s⁸n⁶ ar⁴a ⁵or d⁴c⁴pt⁸on stud⁸⁴s. ssu⁴s conc⁴rn⁸n⁶ ⁶round trut⁷, d⁸scuss⁴d b⁴low, pr⁴s⁴nt s⁸⁶n⁸ficant ⁷urdl⁴s but ar⁴ p⁴r⁷aps l⁴ss compl⁴x t⁷an t⁷os⁴ assoc⁸at⁴d w⁸t⁷ ⁵or⁴ns⁸c stud⁸⁴s.
3.2.3 MASS MEDIA COMMUNICATIONS Kop⁴r and Sa⁷lman [] ⁴xam⁸n⁴ TV ⁸nt⁴rv⁸⁴ws o⁵ ⁷⁸⁶⁷ profil⁴ Am⁴r⁸cans and t⁷⁴⁸r clos⁴st a⁸d⁴s and ass⁸stants. n many ways, t⁷⁸s ⁸s an ⁸d⁴al data sourc⁴ s⁸nc⁴ t⁷⁴ data ⁸s r⁴ad⁸ly ava⁸labl⁴ and,
48
3. DATA SOURCES
by app⁴ar⁸n⁶ on t⁴l⁴v⁸s⁸on, sub⁹⁴cts r⁴l⁸nqu⁸s⁷ any ⁴xp⁴ctat⁸on o⁵ pr⁸vacy and ar⁴ ⁷⁸⁶⁷ly mot⁸vat⁴d to cult⁸vat⁴ “an ⁸ma⁶⁴ o⁵ ⁷on⁴sty” [Kop⁴r and Sa⁷lman, , p. ]. Stat⁴m⁴nts ⁸n t⁷⁴ v⁸d⁴os w⁴r⁴ r⁴v⁴al⁴d to b⁴ d⁴c⁴pt⁸v⁴ ⁴⁸t⁷⁴r by t⁷⁴ sub⁹⁴ct’s subs⁴qu⁴nt adm⁸ss⁸on or by ⁸ncr⁸m⁸nat⁸n⁶ ⁴v⁸d⁴nc⁴. To bu⁸ld t⁷⁴⁸r corpus, Kop⁴r and Sa⁷lman purc⁷as⁴d v⁸d⁴os ⁵rom t⁷⁴ TV n⁴ws arc⁷⁸v⁴ at Vand⁴rb⁸lt Un⁸v⁴rs⁸ty. A sampl⁴ o⁵ t⁷⁴ sub⁹⁴ct pool ⁸nclud⁴s P⁴t⁴ Ros⁴, ary art, R⁸c⁷ard N⁸xon, Ol⁸v⁴r Nort⁷, and Zsa Zsa abor. Two rat⁴rs cod⁴d t⁷⁴ v⁸d⁴os ⁵or s⁴v⁴ral v⁸sual, vocal and lan⁶ua⁶⁴ cu⁴s t⁷at ⁷ad b⁴⁴n ⁸d⁴nt⁸fi⁴d ⁸n ⁴arl⁸⁴r stud⁸⁴s as l⁴aka⁶⁴ condu⁸ts, ⁴.⁶., blam⁸n⁶, ⁴xcus⁴s, lack o⁵ d⁸r⁴ctn⁴ss and lack o⁵ plaus⁸b⁸l⁸ty. R⁴sults o⁵ t⁷⁴ analys⁸s w⁴r⁴ m⁸x⁴d. W⁷⁸l⁴ no cu⁴ or s⁴t o⁵ cu⁴s stands out stron⁶ly, t⁷⁴ aut⁷ors ⁵ound t⁷at d⁴c⁴pt⁸v⁴ port⁸ons o⁵ t⁷⁴ ⁸nt⁴rv⁸⁴w⁴⁴’s sp⁴⁴c⁷ d⁸d r⁴v⁴al v⁴rbal l⁴aka⁶⁴ b⁴⁷av⁸ors t⁷at ⁸nclud⁴d l⁴ss d⁸r⁴ctn⁴ss, l⁴ss cons⁸st⁴ncy, and l⁴ss plaus⁸b⁸l⁸ty. ⁸s r⁴sult s⁴⁴ms surpr⁸s⁸n⁶ s⁸nc⁴ ⁵⁴w o⁵ t⁷⁴ spok⁴n ⁴xc⁴rpts w⁴r⁴ spontan⁴ous—most o⁵ t⁷⁴ l⁸⁴s ⁸n t⁷⁴ v⁸d⁴os w⁴r⁴ plann⁴d—but ⁸t su⁶⁶⁴sts t⁷at sp⁴⁴c⁷⁴s and ⁸nt⁴rv⁸⁴ws o⁵ ⁷⁸⁶⁷ profil⁴ sp⁴ak⁴rs may off⁴r an acc⁴ss⁸bl⁴ and product⁸v⁴ sourc⁴ o⁵ l⁸n⁶u⁸st⁸c data.
3.3
RISKS WITH GROUND TRUTH SOURCES
⁴r⁴ ⁸s no ⁶round trut⁷ standard ⁵or t⁷⁴ data sourc⁴s w⁴ ⁷av⁴ d⁴scr⁸b⁴d. R⁴s⁴arc⁷⁴rs ⁴mploy a var⁸⁴ty o⁵ ⁶round trut⁷ cr⁸t⁴r⁸a d⁴p⁴nd⁸n⁶ on t⁷⁴ data’s or⁸⁶⁸ns. Mor⁴ov⁴r, absolut⁴ ⁶round trut⁷ may b⁴ ⁸mposs⁸bl⁴ to ⁴stabl⁸s⁷ ⁸n som⁴ cas⁴s, ⁴sp⁴c⁸ally w⁸t⁷ ⁵or⁴ns⁸c data [r⁸scoll, ]. als⁴ con⁵⁴ss⁸on, an acqu⁸ttal on app⁴al, m⁸str⁸als, and ⁵als⁴ t⁴st⁸mony can and do occur, mak⁸n⁶ d⁴c⁸s⁸ons o⁵ ⁶u⁸lt or ⁸nnoc⁴nc⁴ probab⁸l⁸st⁸c rat⁷⁴r t⁷an absolut⁴. ⁴nc⁴, ⁸t ⁸s o⁵t⁴n n⁴c⁴ssary to acc⁴pt a d⁴⁶r⁴⁴ o⁵ unc⁴rta⁸nty ⁸n ⁶round trut⁷ ⁹ud⁶m⁴nts, part⁸cularly ⁵or ⁵or⁴ns⁸c data sourc⁴s. ornac⁸ar⁸ and Po⁴s⁸o [0] mak⁴ t⁷⁸s po⁸nt ⁸n t⁷⁴⁸r ⁴xam⁸nat⁸on o⁵ ⁶round trut⁷ cr⁸t⁴r⁸a ⁵or t⁷⁴ ⁴our corpus. ⁴ bas⁸c un⁸t o⁵ analys⁸s ⁸n ⁴our ⁸s t⁷⁴ utt⁴ranc⁴ “str⁸n⁶s o⁵ t⁴xt d⁴l⁸m⁸t⁴d by punctuat⁸on marks,” and l⁸m⁸t⁴d to t⁷⁴ d⁴⁵⁴ndant’s sp⁴⁴c⁷. V⁴r⁸ficat⁸on o⁵ an utt⁴ranc⁴ as tru⁴ or ⁵als⁴ ⁸s bas⁴d on t⁷⁴ court transcr⁸pt⁸on, w⁷⁸c⁷ conta⁸ns bot⁷ a v⁴rbat⁸m r⁴cord o⁵ t⁷⁴ d⁴⁵⁴ndant’s sp⁴⁴c⁷ and t⁷⁴ court’s ass⁴ssm⁴nt o⁵ c⁴rta⁸n r⁴spons⁴s as l⁸⁴s or trut⁷. R⁴s⁴arc⁷⁴rs manually ta⁶⁶⁴d t⁷⁴ d⁴⁵⁴ndants’ utt⁴ranc⁴s as Tru⁴, als⁴, or Unc⁴rta⁸n. ⁴ Unc⁴rta⁸n ta⁶ was ass⁸⁶n⁴d ⁸⁵ trut⁷⁵uln⁴ss o⁵ an utt⁴ranc⁴ could not b⁴ sat⁸s⁵actor⁸ly d⁴t⁴rm⁸n⁴d or ⁸⁵ ⁸t lack⁴d propos⁸t⁸onal cont⁴nt, ⁴.⁶., “May you r⁴p⁴at, pl⁴as⁴” nt⁴r-rat⁴r r⁴l⁸ab⁸l⁸ty was . on 00 utt⁴ranc⁴s w⁷⁴n t⁷⁴ ta⁶⁶⁸n⁶ task cons⁸st⁴d o⁵ a b⁸nary class⁸ficat⁸on o⁵ utt⁴ranc⁴s as als⁴ vs. Not-⁵als⁴, w⁸t⁷ Not-⁵als⁴ compr⁸s⁸n⁶ Tru⁴ and Unc⁴rta⁸n. W⁸t⁷ r⁴sp⁴ct to v⁴r⁸ficat⁸on, ornac⁸ar⁸ and Po⁴s⁸o [0] caut⁸on t⁷at an ⁴l⁴m⁴nt o⁵ ⁶round trut⁷ ⁴rror ⁸s unavo⁸dabl⁴ “⁷ow confid⁴nt can w⁴ b⁴ t⁷at t⁷⁴ stat⁴m⁴nts mark⁴d as ⁵als⁴ ar⁴ actually ⁵als⁴ O⁵ cours⁴, ⁸t ⁸s poss⁸bl⁴ t⁷at court ⁹ud⁶m⁴nts ar⁴ wron⁶ som⁴ ⁴v⁸d⁴nc⁴ com⁸n⁶ ⁵rom t⁷⁴ ⁸nqu⁸ry could b⁴ ⁸n som⁴ way m⁸stak⁴n or m⁸s⁸nt⁴rpr⁴t⁴d by t⁷⁴ ⁹ud⁶⁴. S⁸nc⁴ t⁷⁴ annotat⁸on o⁵ ⁴our r⁴l⁸⁴s on t⁷⁴ ⁸n⁵ormat⁸on prov⁸d⁴d by t⁷⁴ ⁹ud⁶m⁴nt, t⁷⁸s would br⁸n⁶ about an ⁴rron⁴ous ⁴valuat⁸on o⁵ t⁷⁴ stat⁴m⁴nts’ trut⁷⁵uln⁴ss and would r⁴sult ⁸n som⁴ no⁸s⁴ ⁸n t⁷⁴ data. ⁸s k⁸nd o⁵ r⁸sk ⁸s unavo⁸dabl⁴.”
3.3. RISKS WITH GROUND TRUTH SOURCES
Table 3.1: r⁸scoll [, p. ] cr⁸t⁴r⁸a ⁵or r⁴nd⁴r⁸n⁶ a T/ ⁹ud⁶m⁴nt o⁵ an ⁴nt⁸r⁴ stat⁴m⁴nt
Doubtful Statement (l⁸k⁴ly d⁴c⁴pt⁸v⁴) . con⁵⁴ss⁸on by sub⁹⁴ct . arr⁴st o⁵ sub⁹⁴ct . conv⁸ct⁸on o⁵ sub⁹⁴ct . d⁴c⁴pt⁸v⁴ poly⁶rap⁷ . cas⁴ dropp⁴d by pol⁸c⁴ (appl⁸⁴s to v⁸ct⁸ms’ stat⁴m⁴nts only)
49
Apparently Accurate (l⁸k⁴ly trut⁷⁵ul) con⁵⁴ss⁸on by anot⁷⁴r p⁴rson arr⁴st o⁵ anot⁷⁴r p⁴rson conv⁸ct⁸on o⁵ anot⁷⁴r p⁴rson trut⁷⁵ul poly⁶rap⁷ cas⁴ pursu⁴d by pol⁸c⁴ (v⁸ct⁸ms only)
3.3.1 LEGAL AND FORENSIC INTERVIEWS AND STATEMENTS ⁴ r⁸sk o⁵ ⁶round trut⁷ ⁴rror ⁸s probably ⁶r⁴at⁴st ⁸n analys⁴s o⁵ cr⁸m⁸nal ⁸nt⁴rv⁸⁴ws suc⁷ as t⁷os⁴ d⁴scr⁸b⁴d by Vr⁸⁹ and Mann [00a], Mann ⁴t al. [00], av⁸s ⁴t al. [00], Adams and Jarv⁸s [00], and ac⁷⁴nko ⁴t al. [00]. ⁴s⁴ stud⁸⁴s d⁴p⁴nd on ⁶round trut⁷ v⁴r⁸ficat⁸on t⁷at ⁸s bas⁴d on t⁷⁴ ⁴v⁸d⁴nc⁴ coll⁴ct⁴d ⁸n t⁷⁴ cours⁴ o⁵ t⁷⁴ ⁸nv⁴st⁸⁶at⁸on and, ⁸n som⁴ stud⁸⁴s, t⁷⁴ sub⁹⁴ct’s cons⁸st⁴ncy or contrad⁸ct⁸on ⁸n t⁷⁴ r⁴p⁴t⁸t⁸on o⁵ t⁷⁴ ⁵acts to b⁴ v⁴r⁸fi⁴d. To class⁸⁵y utt⁴ranc⁴s as tru⁴ or ⁵als⁴, Vr⁸⁹ and Mann [00a] and Mann ⁴t al. [00] r⁴l⁸⁴d on p⁷ys⁸cal ⁴v⁸d⁴nc⁴ (⁴.⁶., ⁷a⁸r, fib⁴rs), r⁴cant⁴d stat⁴m⁴nts (d⁴ny⁸n⁶ ⁶u⁸lt and t⁷⁴n adm⁸tt⁸n⁶ ⁸t), stat⁴m⁴nts by w⁸tn⁴ss⁴s clos⁴ to t⁷⁴ cas⁴ and stat⁴m⁴nts by co-off⁴nd⁴rs ⁸mpl⁸cat⁴d ⁸n t⁷⁴ cr⁸m⁴. ⁴ av⁸s ⁴xp⁴r⁸m⁴nt⁴rs work⁴d w⁸t⁷ d⁴t⁴ct⁸v⁴s to ⁸d⁴nt⁸⁵y transcr⁸b⁴d utt⁴ranc⁴s conta⁸n⁸n⁶ v⁴r⁸fiabl⁴ propos⁸t⁸ons. n⁵ormat⁸on ⁵rom t⁷⁴ ⁸nv⁴st⁸⁶at⁸on and cr⁸m⁸nal r⁴cords was t⁷⁴n us⁴d to val⁸dat⁴ propos⁸t⁸ons as tru⁴ or ⁵als⁴, ⁸nclud⁸n⁶ laboratory ⁴v⁸d⁴nc⁴, cr⁸m⁴ sc⁴n⁴ analys⁸s and accounts by w⁸tn⁴ss⁴s and susp⁴cts. n ac⁷⁴nko ⁴t al. [00] r⁴s⁴arc⁷⁴rs ⁸d⁴nt⁸fi⁴d propos⁸t⁸ons t⁷at could b⁴ v⁴r⁸fi⁴d us⁸n⁶ a s⁴t o⁵ cr⁸t⁴r⁸a t⁷at ⁴xclud⁴d d⁴scr⁸pt⁸ons o⁵ m⁴ntal stat⁴s suc⁷ as “ t⁷⁸nk,” “as ⁵ar as know,” and “ don’t r⁴m⁴mb⁴r.” V⁴r⁸ficat⁸on was d⁴t⁴rm⁸n⁴d by ⁴v⁸d⁴nc⁴ bas⁴d on cr⁸m⁴ sc⁴n⁴ v⁸d⁴os, pol⁸c⁴ r⁴ports, court docum⁴nts, corporat⁴ r⁴cords and r⁴cant⁴d stat⁴m⁴nts. As ⁸n most stud⁸⁴s, sub⁹⁴cts ⁸n ⁴ac⁷ o⁵ t⁷⁴s⁴ w⁴r⁴ ⁹ud⁶⁴d to b⁴ ⁶u⁸lty o⁵ wron⁶do⁸n⁶. Adams and Jarv⁸s [00] us⁴ ⁶round trut⁷ cr⁸t⁴r⁸a alon⁶ t⁷⁴ sam⁴ l⁸n⁴s as ot⁷⁴r stud⁸⁴s “conv⁸ct⁸on o⁵ t⁷⁴ sub⁹⁴ct by a ⁹ud⁶⁴ or ⁹ury, ov⁴rw⁷⁴lm⁸n⁶ p⁷ys⁸cal cas⁴ ⁴v⁸d⁴nc⁴, or corroborat⁴d con⁵⁴ss⁸on by t⁷⁴ off⁴nd⁴r” [Adams and Jarv⁸s, 00, p. ]. Un⁵ortunat⁴ly, suc⁷ d⁴scr⁸pt⁸ons o⁵ ⁶round trut⁷ r⁴sourc⁴s ar⁴ qu⁸t⁴ va⁶u⁴. t ⁸s d⁸fficult to r⁴pl⁸cat⁴ pr⁴c⁸s⁴ly a s⁴t o⁵ ⁶round trut⁷ cr⁸t⁴r⁸a t⁷at m⁴⁴t standards ⁴xpr⁴ss⁴d as “substant⁸al r⁴l⁸abl⁴ ⁸nd⁴p⁴nd⁴nt w⁸tn⁴ss” [Mann ⁴t al., 00, p. ], or w⁸tn⁴ss⁴s w⁸t⁷ “clos⁴ up and ⁵a⁸rly protract⁴d ⁴xposur⁴ to t⁷⁴ cr⁸m⁸nal act⁸v⁸ty” [av⁸s ⁴t al., 00, p. ]. r⁸scoll [] and Sm⁸t⁷ [00] l⁸st ⁴xpl⁸c⁸t cr⁸t⁴r⁸a ⁵or ⁹ud⁶⁸n⁶ an ⁴nt⁸r⁴ stat⁴m⁴nt as tru⁴ or ⁵als⁴ bas⁴d on t⁷⁴ outcom⁴ o⁵ t⁷⁴ cr⁸m⁸nal ⁸nv⁴st⁸⁶at⁸on. n bot⁷ stud⁸⁴s, ass⁸⁶nm⁴nt to t⁷⁴ ”l⁸k⁴ly d⁴c⁴pt⁸v⁴” or ”l⁸k⁴ly trut⁷⁵ul” cat⁴⁶ory occurr⁴d w⁷⁴n at l⁴ast two cr⁸t⁴r⁸a o⁵ a cat⁴⁶ory w⁴r⁴ m⁴t. r⁸scoll’s cr⁸t⁴r⁸a ar⁴ summar⁸z⁴d ⁸n Tabl⁴ ..
50
3. DATA SOURCES
Sm⁸t⁷ [00, p. ] r⁴plac⁴s r⁸scoll’s cr⁸t⁴r⁸a and w⁸t⁷ t⁷⁴ ⁵ollow⁸n⁶
• Trut⁷⁵ul cr⁸t⁴r⁸on Un⁴qu⁸vocal ⁴v⁸d⁴nc⁴ to support t⁷⁴ trut⁷ o⁵ t⁷⁴ stat⁴m⁴nt • ⁴c⁴pt⁸v⁴ cr⁸t⁴r⁸on Un⁴qu⁸vocal ⁴v⁸d⁴nc⁴ to support t⁷⁴ d⁴c⁴⁸t w⁸t⁷⁸n t⁷⁴ stat⁴m⁴nt Sm⁸t⁷ ⁸s uncl⁴ar about t⁷⁴ sourc⁴ o⁵ ⁶round trut⁷ corroborat⁸on, say⁸n⁶ only t⁷at “⁴ stat⁴m⁴nts w⁴r⁴ or⁸⁶⁸nally class⁸fi⁴d by t⁷⁴ U.S. pol⁸c⁴ as ’tru⁴’, ’d⁴c⁴⁸t⁵ul’ or ’⁸nconclus⁸v⁴’ .... nconclus⁸v⁴ stat⁴m⁴nts w⁴r⁴ cat⁴⁶or⁸z⁴d as t⁷os⁴ t⁷at poss⁴ss⁴d only on⁴, ⁸⁵ any, o⁵ t⁷⁴s⁴ cr⁸t⁴r⁸a” [Sm⁸t⁷, 00, p. ]. l⁴arly t⁷⁴r⁴ ⁸s a n⁴⁴d ⁸n ⁵or⁴ns⁸c r⁴s⁴arc⁷ ⁵or ⁶round trut⁷ standards t⁷at ⁸d⁴nt⁸⁵y pr⁴⁵⁴rr⁴d sourc⁴s o⁵ val⁸dat⁸on alon⁶ w⁸t⁷ ⁷⁴ur⁸st⁸cs ⁵or w⁴⁸⁶⁷t⁸n⁶ t⁷⁴ probab⁸l⁸ty o⁵ ⁴rror ⁸n ⁶round trut⁷ d⁴t⁴rm⁸nat⁸on. v⁴n so ⁸t ⁸s poss⁸bl⁴ t⁷at alt⁷ou⁶⁷ ⁶round trut⁷ corpora t⁴nd to b⁴ small t⁷⁴y ar⁴ suffic⁸⁴nt to support b⁴⁷av⁸oral stud⁸⁴s o⁵ trut⁷ and d⁴c⁴pt⁸on. Tabl⁴ . summar⁸z⁴s t⁷⁴ ⁶round trut⁷ corpus s⁸z⁴ ⁵or t⁷⁴ ⁵or⁴ns⁸c stud⁸⁴s d⁸scuss⁴d abov⁴. Table 3.2: ounts o⁵ ⁶round trut⁷ un⁸ts
Type of GT unit Wr⁸tt⁴n Stat⁴m⁴nts
Utt⁴ranc⁴s
Propos⁸t⁸ons
Count of ground truth units Source Total True False r⁸scoll () 0 Sm⁸t⁷ (00) 0 Adams and Jarv⁸s (00) 0 0 0 Vr⁸⁹ and Mann (00) Mann ⁴t al. (00) ornac⁸ar⁸ and Po⁴s⁸o (0) 0 0 av⁸s ⁴t al. (00) 0 ac⁷⁴nko ⁴t al. (00)
Uncertain
—
3.3.2 FINANCIAL REPORTS As w⁸t⁷ data coll⁴ct⁸on, ⁶round trut⁷ ⁵or financ⁸al r⁴ports ⁸s cons⁸d⁴rably mor⁴ acc⁴ss⁸bl⁴ t⁷an data ⁵or ⁵or⁴ns⁸c and r⁴lat⁴d doma⁸ns. Publ⁸c r⁴sourc⁴s and pr⁸vat⁴ compan⁸⁴s can b⁴ us⁴d to ⁸d⁴nt⁸⁵y ⁵raudul⁴nt r⁴port⁸n⁶ and obta⁸n stat⁴m⁴nts and ot⁷⁴r docum⁴nts. or ⁴xampl⁴, t⁷⁴ U.S. S⁴cur⁸t⁸⁴s and xc⁷an⁶⁴ omm⁸ss⁸on produc⁴s Account⁸n⁶ and Aud⁸t⁸n⁶ n⁵orc⁴m⁴nt R⁴l⁴as⁴s (AARs) t⁷at prov⁸d⁴ ⁸n⁵ormat⁸on uncov⁴r⁴d dur⁸n⁶ t⁷⁴ ⁸nv⁴st⁸⁶at⁸on o⁵ a company ⁵or financ⁸al m⁸sconduct. ump⁷⁴rys ⁴t al. [0] r⁴l⁸⁴d on AARs to ⁸d⁴nt⁸⁵y a s⁴t o⁵ ⁵raud cas⁴s t⁷at ⁸nvolv⁴d orm 0-Ks. ⁴ or⁸⁶⁸nal orm 0-Ks t⁷at AARs s⁷ow⁴d to b⁴ ⁵raudul⁴nt w⁴r⁴ us⁴d to bu⁸ld t⁷⁴ d⁴c⁴pt⁸v⁴ corpus. ⁴ trut⁷⁵ul corpus was bu⁸lt ⁵rom cas⁴s w⁷⁴r⁴ t⁷⁴r⁴ was no ⁸nd⁸cat⁸on o⁵ ⁵raud. n t⁷⁴ cas⁴ o⁵ r⁴stat⁴m⁴nts and ⁵raud, r⁴stat⁴m⁴nt databas⁴s ar⁴ ma⁸nta⁸n⁴d by t⁷⁴ ov⁴rnm⁴nt Accountab⁸l⁸ty Offic⁴ and a numb⁴r o⁵ pr⁸vat⁴ compan⁸⁴s. Larck⁴r and Zakolyuk-
3.3. RISKS WITH GROUND TRUTH SOURCES
51
⁸na [00] us⁴d lass, L⁴w⁸s & o. to ⁸d⁴nt⁸⁵y r⁴stat⁴m⁴nts t⁷at t⁷⁴y us⁴d ⁸n construct⁸n⁶ t⁷⁴⁸r corpus o⁵ d⁴c⁴pt⁸v⁴ r⁴ports. S⁴v⁴ral p⁸⁴c⁴s o⁵ ⁴v⁸d⁴nc⁴ may ⁶o ⁸nto a ⁵raud ⁸nv⁴st⁸⁶at⁸on and b⁴ us⁴d to ⁴stabl⁸s⁷ ⁶round trut⁷. ⁴ company s⁴l⁴ct⁴d ⁵or ur⁶oon ⁴t al. [0] analys⁸s subm⁸tt⁴d a r⁴stat⁴m⁴nt to corr⁴ct t⁷⁴ ov⁴rstat⁴d ⁴arn⁸n⁶s ⁶⁸v⁴n ⁸n t⁷⁴ or⁸⁶⁸nal stat⁴m⁴nt. n add⁸t⁸on, s⁴v⁴ral class act⁸on su⁸ts w⁴r⁴ fil⁴d a⁶a⁸nst t⁷⁴ O and O and s⁴ttl⁴d ⁸n ⁵avor o⁵ t⁷⁴ pla⁸nt⁸ffs, t⁷⁴ all⁴⁶at⁸ons a⁶a⁸nst t⁷⁴ O and O l⁴d to t⁷⁴⁸r l⁴av⁸n⁶ t⁷⁴ company and t⁷⁴ S start⁴d an ⁸nv⁴st⁸⁶at⁸on. Larck⁴r and Zakolyuk⁸na [00] lab⁴l a r⁴stat⁴m⁴nt narrat⁸v⁴ d⁴c⁴pt⁸v⁴ only ⁸⁵ “t⁷⁴y ⁸nvolv⁴ substant⁸al subs⁴qu⁴nt r⁴stat⁴m⁴nt or n⁴t ⁸ncom⁴ and ar⁴ assoc⁸at⁴d w⁸t⁷ mor⁴ s⁴v⁴r⁴ typ⁴s o⁵ r⁴stat⁴m⁴nts suc⁷ as t⁷⁴ d⁸sclosur⁴ o⁵ a mat⁴r⁸al w⁴akn⁴ss, t⁷⁴ c⁷an⁶⁴ o⁵ an aud⁸tor, a lat⁴ fil⁸n⁶, or a orm -K fil⁸n⁶” (pp. -). ⁴s⁴ cr⁸t⁴r⁸a ar⁴ conv⁸nc⁸n⁶ but t⁷⁴y do not ⁶uarant⁴⁴ t⁷⁴ val⁸d⁸ty o⁵ ⁶round trut⁷. ⁴ta⁸l⁴d ⁸n⁵ormat⁸on about t⁷⁴ ⁵raud—⁴xactly ⁷ow ⁸t was p⁴rp⁴trat⁴d—may not b⁴ publ⁸cly ava⁸labl⁴. t ⁸s poss⁸bl⁴ t⁷at lawsu⁸ts, lat⁴r r⁴stat⁴m⁴nts and n⁴w d⁸scov⁴r⁸⁴s w⁸ll alt⁴r ⁶round trut⁷. n add⁸t⁸on, w⁴ cannot always know ⁸⁵ ⁴x⁴cut⁸v⁴s w⁴r⁴ awar⁴ o⁵ t⁷⁴ d⁴c⁴pt⁸on “⁴x⁴cut⁸v⁴s may not know about t⁷⁴ man⁸pulat⁸on at t⁷⁴ t⁸m⁴ o⁵ t⁷⁴ con⁵⁴r⁴nc⁴ call” [Larck⁴r and Zakolyuk⁸na, 00, p. ]. P⁴rsonal commun⁸cat⁸ons and confl⁸ct⁸n⁶ financ⁸al stat⁴m⁴nts may r⁴v⁴al w⁷at ⁴x⁴cut⁸v⁴s kn⁴w but w⁸t⁷out d⁸r⁴ct ⁴v⁸d⁴nc⁴ ⁸t ⁸s rar⁴ly poss⁸bl⁴ to asc⁴rta⁸n stat⁴ o⁵ m⁸nd. Mor⁴ stud⁸⁴s ar⁴ n⁴⁴d⁴d ⁸n t⁷⁸s prom⁸s⁸n⁶ ar⁴a ⁸n ord⁴r to d⁴v⁴lop a cl⁴ar and poss⁸bly rank⁴d s⁴t o⁵ ⁶round trut⁷ standards.
3.3.3 MASS MEDIA COMMUNICATIONS ⁴ ava⁸lab⁸l⁸ty o⁵ ⁶round trut⁷ as w⁴ll as spok⁴n data off⁴rs a stron⁶ ar⁶um⁴nt ⁵or ⁴xpand⁸n⁶ ⁵ormal stud⁸⁴s o⁵ d⁴c⁴pt⁸on cu⁴s to data d⁴r⁸v⁴d ⁵rom mass m⁴d⁸a commun⁸cat⁸ons. Kop⁴r and Sa⁷lman [] p⁴r⁵orm⁴d t⁷⁴⁸r study b⁴⁵or⁴ t⁷⁴ curr⁴nt ⁴ra o⁵ ub⁸qu⁸tous m⁴d⁸a and ⁵act c⁷⁴ck⁸n⁶ r⁴sourc⁴s. ⁴ constant pr⁴s⁴nc⁴ o⁵ ⁷⁸⁶⁷ profil⁴ sp⁴ak⁴rs on rad⁸o, TV, and t⁷⁴ ⁸nt⁴rn⁴t, and t⁷⁴ ⁴x⁸st⁴nc⁴ o⁵ or⁶an⁸zat⁸ons suc⁷ as act⁷⁴ck.or⁶ and Pol⁸t⁸act.com ⁷av⁴ cr⁴at⁴d a ⁵undam⁴ntal c⁷an⁶⁴ ⁸n t⁷⁴ acc⁴ss⁸b⁸l⁸ty o⁵ lan⁶ua⁶⁴ data and ⁶round trut⁷ v⁴r⁸ficat⁸on. ⁴v⁴lopm⁴nt o⁵ t⁷⁸s v⁴nu⁴ could solv⁴ many o⁵ t⁷⁴ data and ⁶round trut⁷ ⁸ssu⁴s t⁷at mak⁴ lar⁶⁴ d⁴c⁴pt⁸on corpora so d⁸fficult to construct.
APTR
53
e Language of Deception: Computational Approaches 4.1
COMPUTATIONAL APPROACHES TO VERBAL DECEPTION
⁸s c⁷apt⁴r r⁴v⁸⁴ws t⁷⁴ NLP stud⁸⁴s on t⁷⁴ automat⁸c d⁴t⁴ct⁸on o⁵ d⁴c⁴pt⁸on t⁷at pr⁴s⁴nt a pr⁴d⁸ct⁸v⁴ mod⁴l o⁵ d⁴c⁴pt⁸v⁴ v⁴rbal product⁸ons. As suc⁷, t⁷⁴ stud⁸⁴s ⁷av⁴ t⁷⁴ c⁷aract⁴r⁸st⁸cs o⁵ appl⁸⁴d work ⁸n t⁷⁴ NLP commun⁸ty, ⁸nclud⁸n⁶ • t⁷⁴ us⁴ o⁵ comparat⁸v⁴ m⁴asur⁴s o⁵ syst⁴m p⁴r⁵ormanc⁴ • t⁷⁴ us⁴ o⁵ a class⁸ficat⁸on sc⁷⁴m⁴ to s⁴parat⁴ t⁷⁴ tru⁴ ⁵rom t⁷⁴ d⁴c⁴pt⁸v⁴ product⁸ons or a rank⁸n⁶ sc⁷⁴m⁴ to ⁴st⁸mat⁴ t⁷⁴ l⁸k⁴l⁸⁷ood o⁵ a product⁸on b⁴⁸n⁶ d⁴c⁴pt⁸v⁴ • tra⁸n⁸n⁶ on a port⁸on o⁵ t⁷⁴ data and t⁴st⁸n⁶ on t⁷⁴ r⁴ma⁸nd⁴r, o⁵t⁴n ⁴nta⁸l⁸n⁶ crossval⁸dat⁸on du⁴ to t⁷⁴ small s⁸z⁴ o⁵ many datas⁴ts ⁸n d⁴c⁴pt⁸on work and • t⁷⁴ us⁴ o⁵ on⁴ or mor⁴ ⁴valuat⁸on m⁴asur⁴s to m⁴asur⁴ succ⁴ss and ⁴nabl⁴ compar⁸son across syst⁴ms. Many stud⁸⁴s also ⁸nvolv⁴ s⁸⁶n⁸ficant data pr⁴parat⁸on, pr⁸mar⁸ly ⁸nclud⁸n⁶ tok⁴n⁸zat⁸on and st⁴mm⁸n⁶.
4.1.1
ESTABLISHING COMPARATIVE MEASURES OF SYSTEM PERFORMANCE Accuracy ⁸n pr⁴d⁸ct⁸n⁶ w⁷⁴t⁷⁴r a narrat⁸v⁴ ⁸s T(ru⁴) or (als⁴) ⁸s t⁷⁴ most common m⁴asur⁴ o⁵ p⁴r⁵ormanc⁴ ⁸n NLP d⁴c⁴pt⁸on r⁴s⁴arc⁷, so w⁴ n⁴⁴d to cons⁸d⁴r w⁷at l⁴v⁴l o⁵ accuracy ⁸s ⁶ood. s 0% accuracy ⁶ood or bad ood as compar⁴d to w⁷at d⁴ally, to ⁴valuat⁴ a class⁸ficat⁸on syst⁴m w⁴ ⁷av⁴ to compar⁴ ⁸t to an ⁴nt⁸ty t⁷at p⁴r⁵orms t⁷⁴ sam⁴ task on t⁷⁴ sam⁴ data. To d⁴monstrat⁴ ⁷⁸⁶⁷ p⁴r⁵ormanc⁴, t⁷⁴ syst⁴m s⁷ould p⁴r⁵orm s⁸⁶n⁸ficantly b⁴tt⁴r t⁷an t⁷⁴ curr⁴nt c⁴⁸l⁸n⁶, or upp⁴r-bound l⁴v⁴l o⁵ p⁴r⁵ormanc⁴. or many NLP tasks, t⁷⁴ c⁴⁸l⁸n⁶ ⁸s ⁷uman p⁴r⁵ormanc⁴. Marcus ⁴t al. [], ⁵or ⁴xampl⁴, c⁸t⁴ ⁷umans as a⁶r⁴⁴⁸n⁶ on –% o⁵ t⁷⁴ ta⁶s ⁸n t⁷⁴ P⁴nn Tr⁴⁴bank v⁴rs⁸on o⁵ t⁷⁴ rown corpus. ow⁴v⁴r, ⁸n d⁴c⁴pt⁸on r⁴s⁴arc⁷, ⁷uman p⁴r⁵ormanc⁴ ⁸s o⁵t⁴n c⁸t⁴d as t⁷⁴ bas⁴l⁸n⁴, or low⁴r-bound, s⁸nc⁴ ⁷uman p⁴r⁵ormanc⁴ ⁸n d⁴t⁴rm⁸n⁸n⁶ w⁷⁴t⁷⁴r som⁴on⁴ ⁸s ly⁸n⁶ or not, ⁸s notor⁸ously low, w⁸t⁷
54
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
av⁴ra⁶⁴ accuracy scor⁴s o⁵ 0%, [ond and Paulo, 00, kman and O’Sull⁸van, ]. ⁴ ⁴arl⁸⁴st d⁴c⁴pt⁸on work ⁸n t⁷⁴ NLP parad⁸⁶m ⁷ad no ot⁷⁴r fi⁶ur⁴ to compar⁴ to, and so c⁸t⁴d t⁷⁸s randomn⁴ss ⁸n ⁷uman ab⁸l⁸ty to spot a l⁸ar as a typ⁴ o⁵ p⁴r⁵ormanc⁴ t⁷at could b⁴ ⁸mprov⁴d upon. At t⁷⁸s po⁸nt ⁸n d⁴c⁴pt⁸on r⁴s⁴arc⁷, ⁷ow⁴v⁴r, t⁷⁸s approac⁷ ⁸s no lon⁶⁴r t⁴nabl⁴. Mor⁴ r⁴c⁴nt r⁴s⁴arc⁷ ⁷as s⁷own t⁷at t⁷⁴r⁴ ⁸s a w⁸d⁴ ran⁶⁴ o⁵ sk⁸lls amon⁶ ⁷umans, and ⁷umans w⁸t⁷ ⁴xp⁴r⁸⁴nc⁴ ⁸n d⁴t⁴ct⁸n⁶ d⁴c⁴pt⁸on p⁴r⁵orm b⁴tt⁴r [kman ⁴t al., , Mann ⁴t al., 00, Vr⁸⁹ and Mann, 00b]. Mor⁴ ⁸mportantly, t⁷⁴ data ⁵or a ⁶⁸v⁴n ⁴xp⁴r⁸m⁴nt may b⁴ qu⁸t⁴ d⁸ff⁴r⁴nt ⁵rom t⁷⁴ ⁴xp⁴r⁸m⁴nts t⁷at ⁵ound t⁷⁴ c⁷anc⁴ find⁸n⁶s. or NLP appl⁸cat⁸ons, a mor⁴ r⁴l⁸abl⁴ bas⁴l⁸n⁴ may b⁴ d⁴t⁴rm⁸n⁴d by us⁸n⁶ ⁷uman ⁹ud⁶⁴s ⁸n t⁷⁴ sam⁴ ⁴xp⁴r⁸m⁴nt t⁷at ⁸s t⁴st⁸n⁶ t⁷⁴ automat⁸c d⁴t⁴ct⁸on syst⁴m. ⁸s approac⁷ was tak⁴n by N⁴wman ⁴t al. [00], w⁷⁸c⁷ ⁵ound t⁷⁴⁸r ⁷uman ⁹ud⁶⁴s corr⁴ctly class⁸⁵y⁸n⁶ % o⁵ t⁷⁴ narrat⁸v⁴s ⁸n on⁴ o⁵ t⁷⁴⁸r fiv⁴ narrat⁸v⁴ tasks, and Ott ⁴t al. [0], w⁷⁸c⁷ ⁵ound a ma⁹or⁸ty accuracy o⁵ % amon⁶ ⁸ts t⁷r⁴⁴ ⁷uman ⁹ud⁶⁴s ⁴st⁸mat⁸n⁶ T/ on t⁷⁴ sam⁴ ⁷ot⁴l r⁴v⁸⁴ws as d⁸d t⁷⁴⁸r automat⁸c syst⁴m. Anot⁷⁴r approac⁷ cons⁸d⁴rs t⁷⁴ bas⁴l⁸n⁴ to b⁴ t⁷⁴ balanc⁴ o⁵ T and narrat⁸v⁴s ⁸⁵ t⁷⁴r⁴ ar⁴ 0 tru⁴ narrat⁸v⁴s, ⁵or ⁴xampl⁴, and 0 ⁵als⁴ narrat⁸v⁴s, t⁷⁴n t⁷⁴ bas⁴l⁸n⁴ ⁸s cons⁸d⁴r⁴d to b⁴ 0%. ⁸s approac⁷ ⁸s tak⁴n by M⁸⁷alc⁴a and Strapparava [00]. As ⁵or a c⁴⁸l⁸n⁶ a⁶a⁸nst w⁷⁸c⁷ to compar⁴, t⁷⁴ fi⁴ld ⁸s now prov⁸d⁸n⁶ pr⁸or ⁴xp⁴r⁸m⁴nts on t⁷⁴ sam⁴ data t⁷anks to ⁷ot⁴l op⁸n⁸on spam data mad⁴ publ⁸cly ava⁸labl⁴ by Myl⁴ Ott.¹ Ott ⁴t al.’s publ⁸s⁷⁴d r⁴sults [Ott ⁴t al., 0, 0] ⁷av⁴ b⁴⁴n us⁴d as t⁷⁴ c⁴⁸l⁸n⁶, or b⁴st pract⁸c⁴, a⁶a⁸nst w⁷⁸c⁷ to compar⁴ subs⁴qu⁴nt syst⁴ms.² ⁸s c⁷apt⁴r cov⁴rs s⁴v⁴ral pap⁴rs us⁸n⁶ t⁷⁴ Ott ⁷ot⁴l data and r⁴⁵⁴r⁴nc⁸n⁶ t⁷⁴ Ott ⁴t al. [0] r⁴sults. O⁵ cours⁴, ⁴xpand⁸n⁶ to ot⁷⁴r data s⁴ts—⁴v⁴n r⁴v⁸⁴w data o⁵ a d⁸ff⁴r⁴nt typ⁴, suc⁷ as t⁷⁴ Amazon book r⁴v⁸⁴w data o⁵ ornac⁸ar⁸ and Po⁴s⁸o [0]—w⁸ll r⁴qu⁸r⁴ a n⁴w bas⁴l⁸n⁴ and ⁴stabl⁸s⁷ a n⁴w c⁴⁸l⁸n⁶.
4.1.2 CLASSIFICATION AND RANKING ⁴ task o⁵ s⁴parat⁸n⁶ tru⁴ ⁵rom ⁵als⁴ narrat⁸v⁴s ⁸s most o⁵t⁴n v⁸⁴w⁴d as a b⁸nary class⁸ficat⁸on task alt⁷ou⁶⁷, d⁴p⁴nd⁸n⁶ on t⁷⁴ appl⁸cat⁸on, ⁸t may b⁴ v⁸⁴w⁴d as a task ⁴st⁸mat⁸n⁶ t⁷⁴ l⁸k⁴l⁸⁷ood o⁵ d⁴c⁴pt⁸on. Most o⁵ t⁷⁴ appl⁸cat⁸ons cov⁴r⁴d ⁷⁴r⁴ ⁸nvolv⁴ sup⁴rv⁸s⁴d l⁴arn⁸n⁶, us⁸n⁶ a tra⁸n⁸n⁶ s⁴t o⁵ narrat⁸v⁴s ⁴ac⁷ o⁵ w⁷⁸c⁷ ⁸s known to b⁴ a m⁴mb⁴r o⁵ ⁴⁸t⁷⁴r t⁷⁴ T or class. us, t⁷⁴ most commonly us⁴d ⁸mpl⁴m⁴ntat⁸on al⁶or⁸t⁷ms ar⁴ t⁷⁴ Naïv⁴ ay⁴s class⁸fi⁴r and t⁷⁴ Support V⁴ctor Mac⁷⁸n⁴ class⁸fi⁴r. S⁴v⁴ral mat⁷⁴mat⁸cal mod⁴ls ⁷av⁴ b⁴⁴n put to t⁷⁴ s⁴rv⁸c⁴ o⁵ t⁷⁸s task, w⁸t⁷ ⁸nput to t⁷⁴ class⁸fi⁴rs at l⁸n⁶u⁸st⁸c l⁴v⁴ls ⁵rom t⁷⁴ c⁷aract⁴r and n-⁶ram up to and ⁸nclud⁸n⁶ t⁷⁴ d⁸scours⁴ structur⁴. Z⁷ou ⁴t al. [00b] compar⁴s t⁷⁴ r⁴sults o⁵ ⁵our mat⁷⁴mat⁸cal mod⁴ls—d⁸scr⁸m⁸nant analys⁸s, lo⁶⁸st⁸c r⁴⁶r⁴ss⁸on, d⁴c⁸s⁸on tr⁴⁴s, and n⁴ural n⁴tworks—on an ⁴ma⁸l commun⁸cat⁸on ¹http://myleott.com/op_spam/ ²Som⁴ o⁵ t⁷⁴ pap⁴rs cov⁴r⁴d ⁸n t⁷⁸s c⁷apt⁴r us⁴ t⁷⁴ t⁴rm “bas⁴l⁸n⁴” s⁸mply to m⁴an “syst⁴m a⁶a⁸nst w⁷⁸c⁷ to compar⁴” rat⁷⁴r t⁷an low⁴r-bound.
4.1. COMPUTATIONAL APPROACHES TO VERBAL DECEPTION
55
task. ⁴ study s⁷ows ⁷⁸⁶⁷ ov⁴rall accuracy rat⁴s o⁵ around 0% ⁵or lo⁶⁸st⁸c r⁴⁶r⁴ss⁸on and n⁴ural n⁴tworks.
4.1.3 TRAINING AND TESTING A standard m⁴t⁷od o⁵ tra⁸n⁸n⁶ a lan⁶ua⁶⁴ mod⁴l ⁸s by cross-val⁸dat⁸on (V) ⁸n w⁷⁸c⁷ a lar⁶⁴ port⁸on o⁵ t⁷⁴ data, as muc⁷ as 0%, ⁸s us⁴d to tra⁸n t⁷⁴ mod⁴l and t⁷⁴ r⁴ma⁸n⁸n⁶ port⁸on ⁸s ⁷⁴ld out ⁵or t⁴st⁸n⁶. ⁸s ⁸s usually don⁴ mult⁸pl⁴ t⁸m⁴s, or ⁵olds, so t⁷at all o⁵ t⁷⁴ data ⁶⁴ts to b⁴ us⁴d ⁵or tra⁸n⁸n⁶ and all ⁵or t⁴st⁸n⁶, as ⁸n “-⁵old” or “0-⁵old” cross-val⁸dat⁸on. Many o⁵ t⁷⁴ d⁴c⁴pt⁸on stud⁸⁴s vary t⁷⁴ cont⁴xtual ⁵⁴atur⁴s o⁵ t⁷⁴ narrat⁸v⁴s, pr⁸mar⁸ly ⁸n t⁴rms o⁵ narrat⁸v⁴ top⁸c. Som⁴ stud⁸⁴s us⁴ data ⁵rom t⁷⁴ ⁵ull var⁸⁴ty o⁵ narrat⁸v⁴s ⁵or tra⁸n⁸n⁶ and t⁴st⁸n⁶. ⁴ mor⁴ c⁷all⁴n⁶⁸n⁶ tra⁸n/t⁴st protocol ⁸n t⁷⁸s s⁸tuat⁸on ⁸s to ⁷old out on⁴ narrat⁸v⁴ top⁸c ⁵rom tra⁸n⁸n⁶ to us⁴ ⁵or t⁴st⁸n⁶. ⁴ a⁸m o⁵ t⁷⁸s typ⁴ o⁵ t⁴st⁸n⁶ ⁸s to s⁴⁴ ⁷ow ⁶⁴n⁴ral⁸zabl⁴ ar⁴ t⁷⁴ ⁵⁴atur⁴s t⁷at ar⁴ b⁴⁸n⁶ us⁴d to d⁸scr⁸m⁸nat⁴ trut⁷⁵ul ⁵rom d⁴c⁴pt⁸v⁴ narrat⁸v⁴s. N⁴wman ⁴t al. [00], ⁵or ⁴xampl⁴, cr⁴at⁴ a part⁸cularly c⁷all⁴n⁶⁸n⁶ task by ⁸nclud⁸n⁶ fiv⁴ narrat⁸v⁴ typ⁴s t⁷at vary ⁸n ⁴⁸t⁷⁴r top⁸c or mod⁴ (abort⁸on (v⁸d⁴o, typ⁴d, or wr⁸tt⁴n), ⁵r⁸⁴nds (v⁸d⁴o), mock cr⁸m⁴ (v⁸d⁴o)) and t⁴st⁸n⁶ on ⁴ac⁷ on⁴ us⁸n⁶ t⁷⁴ data ⁵rom t⁷⁴ ot⁷⁴r ⁵our as t⁷⁴ tra⁸n⁸n⁶ s⁴t. As w⁴ w⁸ll s⁴⁴ b⁴low, t⁷⁴⁸r accuracy rat⁴s ar⁴ s⁸⁶n⁸ficantly b⁴tt⁴r w⁷⁴n t⁷⁴ top⁸c ⁸n t⁷⁴ t⁴st⁸n⁶ occurs ⁸n t⁷⁴ tra⁸n⁸n⁶ also. 4.1.4
SYSTEM EVALUATION
Table 4.1: A con⁵us⁸on matr⁸x ⁵or d⁴c⁴pt⁸on
Actual lass
Tru⁴ als⁴
Pr⁴d⁸ct⁴d lass Tru⁴ als⁴ A
⁸v⁴n t⁷⁴ con⁵us⁸on matr⁸x ⁸n Tabl⁴ ., w⁷⁴r⁴ t⁷⁴ rows corr⁴spond to t⁷⁴ known class o⁵ t⁷⁴ data, ⁸.⁴., t⁷⁴ lab⁴ls T and obta⁸n⁴d ⁵rom t⁷⁴ ⁶round trut⁷ data and t⁷⁴ columns corr⁴spond to t⁷⁴ pr⁴d⁸ct⁸ons mad⁴ by t⁷⁴ mod⁴l, t⁷⁴r⁴ ar⁴ s⁴v⁴ral standard ⁴valuat⁸on m⁴asur⁴s, l⁸st⁴d b⁴low, t⁷at can b⁴ obta⁸n⁴d. Accuracy ⁸s an ⁴st⁸mat⁴ o⁵ t⁷⁴ succ⁴ss o⁵ a mod⁴l ⁸n d⁸scr⁸m⁸nat⁸n⁶ Tru⁴ ⁵rom als⁴ narrat⁸v⁴s t⁷⁴ ot⁷⁴r m⁴asur⁴s ar⁴ us⁴d to ⁴valuat⁴ t⁷⁴ succ⁴ss o⁵ a mod⁴l ⁸n d⁴t⁴rm⁸n⁸n⁶ Tru⁴ narrat⁸v⁴s or als⁴ narrat⁸v⁴s.³ Accuracy ⁸s a m⁴asur⁴ o⁵ t⁷⁴ narrat⁸v⁴s t⁷at ar⁴ corr⁴ctly class⁸fi⁴d (A+)/(A+++). ³⁵ only on⁴ fi⁶ur⁴ ⁸s r⁴port⁴d, ⁸t ⁸s most commonly a m⁴asur⁴ o⁵ t⁷⁴ als⁴ docum⁴nts s⁸nc⁴ most appl⁸cat⁸ons ar⁴ conc⁴rn⁴d w⁸t⁷ ⁸d⁴nt⁸⁵y⁸n⁶ t⁷⁴ d⁴c⁴pt⁸v⁴ commun⁸cat⁸ons.
56
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
False Positive ⁸s a m⁴asur⁴ o⁵ t⁷⁴ Tru⁴ narrat⁸v⁴s t⁷at ar⁴ class⁸fi⁴d as als⁴ /(A+), or als⁴ narrat⁸v⁴s t⁷at ar⁴ class⁸fi⁴d as Tru⁴ /(+). Precision ⁸s a m⁴asur⁴ o⁵ actual Tru⁴ narrat⁸v⁴s t⁷at ar⁴ pr⁴d⁸ct⁴d Tru⁴ A/(A+), or o⁵ actual als⁴ narrat⁸v⁴s t⁷at ar⁴ pr⁴d⁸ct⁴d als⁴ /(+). Recall ⁸s a m⁴asur⁴ o⁵ t⁷⁴ actual Tru⁴s t⁷at w⁴r⁴ ⁵ound by t⁷⁴ syst⁴m A/(A+), or o⁵ t⁷⁴ actual als⁴ narrat⁸v⁴s t⁷at w⁴r⁴ ⁵ound /(+). F-measure: t⁷⁴ ⁷armon⁸c m⁴an o⁵ pr⁴c⁸s⁸on and r⁴call ( * (pr⁴c⁸s⁸on * r⁴call)/pr⁴c⁸s⁸on + r⁴call)).
⁴ m⁴asur⁴ us⁴d s⁷ould b⁴ d⁴p⁴nd⁴nt on t⁷⁴ appl⁸cat⁸on. Accuracy ⁸s t⁷⁴ most commonly us⁴d m⁴asur⁴ ⁸n d⁴c⁴pt⁸on class⁸ficat⁸on s⁸nc⁴ most appl⁸cat⁸ons a⁸m to ⁶⁸v⁴ ⁴qual ⁸mportanc⁴ to t⁷⁴ Tru⁴ and t⁷⁴ als⁴ ⁸d⁴nt⁸ficat⁸on. ut t⁷⁴r⁴ ar⁴ appl⁸cat⁸ons t⁷at n⁴⁴d to find ⁵als⁴ stat⁴m⁴nts at t⁷⁴ ⁴xp⁴ns⁴ o⁵ ov⁴rta⁶⁶⁸n⁶ tru⁴s as ⁵als⁴—⁴l⁴ctron⁸c commun⁸cat⁸ons t⁷at m⁸⁶⁷t s⁸⁶nal cr⁸m⁸nal act⁸v⁸ty, ⁵or ⁴xampl⁴. n suc⁷ cas⁴s, a ⁷⁸⁶⁷ r⁴call valu⁴ ⁸s ⁸mportant. ⁸v⁴n t⁷⁴ d⁸v⁴rs⁸ty o⁵ t⁷⁴ narrat⁸v⁴s b⁴⁸n⁶ class⁸fi⁴d as T/ by curr⁴nt syst⁴ms ⁵rom 0 c⁷aract⁴r tw⁴⁴ts to ⁷ours lon⁶ spok⁴n t⁴st⁸mony and t⁷⁴ purpos⁴ o⁵ ⁴ac⁷ appl⁸cat⁸on, w⁷⁴r⁴ ⁷⁸⁶⁷ ⁵als⁴ pos⁸t⁸v⁴s m⁸⁶⁷t b⁴ mor⁴ or l⁴ss costly, ⁸t ⁸s ⁸mposs⁸bl⁴ to p⁸ck a b⁴st approac⁷. ow⁴v⁴r, t⁷⁴ ⁴valuat⁸on m⁴asur⁴s ⁶⁸v⁴ som⁴ ab⁸l⁸ty to compar⁴ syst⁴ms, k⁴⁴p⁸n⁶ ⁸n m⁸nd t⁷⁴ ⁴xtr⁴m⁴ d⁸v⁴rs⁸ty o⁵ t⁷⁴ appl⁸cat⁸ons ⁵or t⁷⁸s r⁴ason, r⁴c⁴nt stud⁸⁴s o⁵t⁴n prov⁸d⁴ mult⁸pl⁴ m⁴asur⁴s.
4.1.5 PREPPING THE DATA Amon⁶ t⁷⁴ tools us⁴d to pr⁴par⁴ data ⁵or ⁴xp⁴r⁸m⁴ntat⁸on, a stop-l⁸st ⁸s not usually us⁴d w⁷⁴n pr⁴par⁸n⁶ data ⁵or T/ class⁸ficat⁸on. W⁷⁸l⁴ t⁷⁴ ⁵unct⁸on words usually ⁸nclud⁴d on a stopl⁸st ar⁴ t⁷ou⁶⁷t to carry l⁸ttl⁴ s⁴mant⁸c w⁴⁸⁶⁷t, t⁷⁴ d⁸st⁸nct⁸on b⁴tw⁴⁴n s⁴l⁵/⁶roup-r⁴⁵⁴r⁴nc⁸n⁶ pronouns and poss⁴ss⁸v⁴s (I; me; my; we; ⁴tc.) and ot⁷⁴r-r⁴⁵⁴r⁴nc⁸n⁶ pronouns and poss⁴ss⁸v⁴s (you; your; she; them; ⁴tc.) ⁸s cons⁸d⁴r⁴d cr⁸t⁸cal ⁸n muc⁷ d⁴c⁴pt⁸on work, as ar⁴ modal v⁴rbs, w⁷⁸c⁷ can ⁴xpr⁴ss unc⁴rta⁸nty, and pr⁴pos⁸t⁸ons, w⁷⁸c⁷ can captur⁴ spat⁸o-t⁴mporal r⁴lat⁸ons or ⁵unct⁸on as t⁷⁴ ⁷⁴ad o⁵ an ⁴mb⁴dd⁴d s⁴nt⁴nc⁴, as do because; alt hough; ⁴tc. St⁴mm⁸n⁶, ⁸n w⁷⁸c⁷ morp⁷olo⁶⁸cal var⁸ants ar⁴ r⁴duc⁴d to a s⁸n⁶l⁴ ⁵orm, ⁸s also som⁴t⁸m⁴s avo⁸d⁴d. ⁴p⁴nd⁸n⁶ on t⁷⁴ T/ d⁸scr⁸m⁸nat⁸on mod⁴l, vo⁸c⁴, t⁴ns⁴, and asp⁴ct may b⁴ cons⁸d⁴r⁴d d⁸scr⁸m⁸nators. A⁶a⁸n, d⁴p⁴nd⁸n⁶ on t⁷⁴ d⁴c⁴pt⁸on mod⁴l, t⁷⁴ data may also b⁴ ta⁶⁶⁴d ⁵or part-o⁵-sp⁴⁴c⁷ and/or pars⁴d.
4.2
CONSIDERATIONS SPECIFIC TO DECEPTION
⁴r⁴ ar⁴ s⁴v⁴ral typ⁴s o⁵ var⁸at⁸on ⁸n t⁷⁴ data t⁷at ar⁴ common to many NLP appl⁸cat⁸ons and t⁷at ar⁴ part⁸cularly r⁴l⁴vant to d⁴c⁴pt⁸on stud⁸⁴s. ⁴s⁴ ⁸nclud⁴ var⁸at⁸on ⁸n t⁷⁴ typ⁴s o⁵ data to Also r⁴⁵⁴rr⁴d to as ⁴t⁴ct⁸on Rat⁴.
4.2. CONSIDERATIONS SPECIFIC TO DECEPTION
57
b⁴ analyz⁴d, t⁷⁴ un⁸t o⁵ analys⁸s t⁷at ⁸s class⁸fi⁴d as tru⁴ or d⁴c⁴pt⁸v⁴, t⁷⁴ att⁴mpt to c⁷aract⁴r⁸z⁴ l⁸⁴s o⁵ om⁸ss⁸on as w⁴ll as comm⁸ss⁸on, t⁷⁴ top⁸c, and, o⁵ cours⁴, t⁷⁴ l⁸n⁶u⁸st⁸c l⁴v⁴l o⁵ t⁷⁴ data, w⁷⁸c⁷ can ran⁶⁴ ⁵rom t⁷⁴ c⁷aract⁴r up to t⁷⁴ structur⁴ o⁵ t⁷⁴ d⁸scours⁴. n add⁸t⁸on, t⁷⁴r⁴ ar⁴ ⁸ssu⁴s p⁴cul⁸ar to d⁴c⁴pt⁸on stud⁸⁴s t⁷at ⁸nvolv⁴ ⁷ow ⁶round trut⁷ ⁸s obta⁸n⁴d and r⁴cord⁴d.
4.2.1 DATA TYPES AMENABLE TO DECEPTION RESEARCH ⁴r⁴ ar⁴ many doma⁸ns and ⁶⁴nr⁴s ⁸n w⁷⁸c⁷ ⁷⁸⁶⁷-stak⁴s ly⁸n⁶ can b⁴ ⁵ound. ⁴ c⁷o⁸c⁴ o⁵ data typ⁴s to mod⁴l and t⁴st ⁸s only mod⁴rat⁴ly constra⁸n⁴d by t⁷⁴ narrat⁸v⁴’s ab⁸l⁸ty to d⁸splay d⁴c⁴pt⁸on and t⁷⁴ n⁴⁴ds o⁵ NLP mod⁴l⁸n⁶ and t⁴st⁸n⁶. Assum⁸n⁶ t⁷at l⁸ars l⁴ak cu⁴s to t⁷⁴⁸r d⁴c⁴pt⁸on, narrat⁸v⁴s t⁷at ar⁴ r⁴⁷⁴ars⁴d, suc⁷ as publ⁸c sp⁴⁴c⁷⁴s, put t⁷⁴ b⁴⁷av⁸ors o⁵ t⁷⁴ narrator, part⁸cularly lan⁶ua⁶⁴ c⁷o⁸c⁴s, und⁴r consc⁸ous control so narrat⁸v⁴ data n⁴⁴ds to b⁴ spontan⁴ous or s⁴m⁸-spontan⁴ous. ⁸s ⁸s poss⁸bl⁴ ⁴v⁴n ⁵or wr⁸tt⁴n narrat⁸v⁴s und⁴r c⁴rta⁸n cond⁸t⁸ons. n add⁸t⁸on, t⁷⁴ data n⁴⁴ds to ⁷av⁴ mor⁴ lan⁶ua⁶⁴ to b⁴ analyz⁴d t⁷an t⁷⁴ s⁸mpl⁴ “Y⁴s/No” r⁴spons⁴s o⁵, ⁵or ⁴xampl⁴, a poly⁶rap⁷ t⁴st. ⁸nally, ⁸n ord⁴r to t⁴st an NLP d⁴c⁴pt⁸on mod⁴l, t⁷⁴ data or port⁸ons o⁵ t⁷⁴ data ⁷av⁴ to b⁴ v⁴r⁸fiabl⁴. ssu⁴s o⁵ v⁴r⁸fiab⁸l⁸ty ar⁴ cov⁴r⁴d ⁸n ⁷apt⁴r , S⁴ct⁸on . and ⁸n ⁸tzpatr⁸ck and ac⁷⁴nko [0]. ommun⁸cat⁸ons t⁷at ⁸nvolv⁴ bot⁷ substant⁸al narrat⁸v⁴ port⁸ons and t⁷⁴ poss⁸b⁸l⁸ty o⁵ d⁴c⁴pt⁸on can b⁴ d⁸v⁸d⁴d across two d⁸m⁴ns⁸ons, ⁸nt⁴ract⁸v⁸ty and ⁴l⁸c⁸tat⁸on. ⁸alo⁶u⁴s may ⁴⁸t⁷⁴r b⁴ ⁸nt⁴ract⁸v⁴, ⁸n w⁷⁸c⁷ cas⁴ t⁷⁴ narrat⁸v⁴ ⁸s construct⁴d by all ⁸nvolv⁴d part⁸⁴s, or ⁸t can b⁴ non⁸nt⁴ract⁸v⁴, construct⁴d by on⁴ narrator ⁵or an aud⁸⁴nc⁴ o⁵ on⁴ or mor⁴, as ⁸n t⁷⁴ cas⁴ o⁵ a l⁴ctur⁴, a w⁴bpa⁶⁴, or unsol⁸c⁸t⁴d ⁴ma⁸l. ⁴ s⁴cond d⁸m⁴ns⁸on, ⁴l⁸c⁸tat⁸on, captur⁴s w⁷⁴t⁷⁴r t⁷⁴ narrator ⁸s b⁴⁸n⁶ ask⁴d to prov⁸d⁴ t⁷⁴ narrat⁸v⁴ or ⁸s ⁶⁸v⁸n⁶ ⁸t ⁵r⁴⁴ly. ⁴ narrator ⁷as l⁴ss control ov⁴r an ⁴l⁸c⁸t⁴d narrat⁸v⁴, poss⁸bly ⁸ncr⁴as⁸n⁶ t⁷⁴ odds ⁵or t⁷⁴ app⁴aranc⁴ o⁵ cu⁴s to d⁴c⁴pt⁸on. Tabl⁴ . s⁷ows t⁷⁴ poss⁸bl⁴ v⁴nu⁴s, class⁸fi⁴d accord⁸n⁶ to ⁸nt⁴ract⁸v⁸ty and ⁴l⁸c⁸tat⁸on, ⁸n w⁷⁸c⁷ spontan⁴ous or s⁴m⁸-spontan⁴ous commun⁸cat⁸on, and t⁷⁴r⁴⁵or⁴ substant⁸al d⁴c⁴pt⁸on, can occur. W⁴ ⁴xclud⁴ c⁴rta⁸n data typ⁴s b⁴caus⁴ suc⁷ data ⁸s not publ⁸cly ava⁸labl⁴, ⁸ts v⁴rac⁸ty cannot b⁴ asc⁴rta⁸n⁴d, or ⁷uman sub⁹⁴ct ⁸ssu⁴s mak⁴ ⁸t ⁸mposs⁸bl⁴ to us⁴. ⁸tzpatr⁸ck and ac⁷⁴nko [0] cov⁴r t⁷⁴s⁴ ⁸ssu⁴s ⁸n d⁴ta⁸l. 4.2.2 UNIT OF ANALYSIS: THE LIAR OR THE LIE Som⁴ appl⁸cat⁸ons s⁴⁴k to class⁸⁵y t⁷⁴ narrator as a l⁸ar or a trut⁷-t⁴ll⁴r bas⁴d on t⁷⁴ ⁵ull narrat⁸v⁴, w⁷⁸c⁷ m⁴ans t⁷at t⁷⁴ class⁸ficat⁸on sc⁷⁴m⁴ w⁸ll op⁴rat⁴ on t⁷⁴ ⁵ull t⁴xt rat⁷⁴r t⁷an on ⁸nd⁸v⁸dual propos⁸t⁸ons. Ot⁷⁴r appl⁸cat⁸ons s⁴⁴k to class⁸⁵y ⁸nd⁸v⁸dual propos⁸t⁸ons or p⁴r⁷aps a ⁶roup o⁵ propos⁸t⁸ons as a l⁸⁴, w⁸t⁷ ot⁷⁴r cla⁸ms class⁸fi⁴d as ⁴⁸t⁷⁴r trut⁷⁵ul or unknown w⁸t⁷ r⁴sp⁴ct to d⁴c⁴pt⁸on. ⁴ part⁸cular appl⁸cat⁸on may d⁴mand on⁴ l⁴v⁴l rat⁷⁴r t⁷an t⁷⁴ ot⁷⁴r ⁵or ⁴xampl⁴, ⁵or onl⁸n⁴ ⁷ot⁴l r⁴v⁸⁴ws [Ott ⁴t al., 0], t⁷⁴ r⁴ad⁴r wants to know w⁷⁴t⁷⁴r t⁷⁴ r⁴v⁸⁴w was actually ⁴s⁴ d⁸m⁴ns⁸ons and t⁷⁴ bas⁸s ⁵or Tabl⁴ . ar⁴ ⁵rom [ur⁶oon ⁴t al., 00].
58
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
Table 4.2: V⁴nu⁴s o⁵ v⁴rbal d⁴c⁴pt⁸on dott⁴d l⁸n⁴s s⁴parat⁴ spok⁴n ⁵rom wr⁸tt⁴n lan⁶ua⁶⁴
Elicited
Not Elicited
Interactive ac⁴-to-⁵ac⁴ ⁴pos⁸t⁸ons ourt t⁴st⁸mony Ot⁷⁴r publ⁸c t⁴st⁸mony P⁷on⁴ conv⁴rsat⁸ons ————————⁷at/⁴ma⁸l Ov⁴r⁷⁴ar⁸n⁶ W⁸r⁴tap ————————⁷at/⁴ma⁸l
Not Interactive R⁴cord⁴d stat⁴m⁴nts to pol⁸c⁴ ————————Wr⁸tt⁴n stat⁴m⁴nts to pol⁸c⁴
Vo⁸c⁴ma⁸l R⁴cord⁴d sp⁴⁴c⁷ Pol⁸t⁸cal sp⁴⁴c⁷⁴s ————————Onl⁸n⁴ product ⁴ndors⁴m⁴nts Unsol⁸c⁸t⁴d ⁴ma⁸l ac⁴book Ot⁷⁴r soc⁸al m⁴d⁸a W⁴bpa⁶⁴s
wr⁸tt⁴n by som⁴on⁴ w⁷o stay⁴d at t⁷⁴ ⁷ot⁴l, rat⁷⁴r t⁷an w⁷⁴t⁷⁴r ⁴ac⁷ cla⁸m ⁸s ⁴nt⁸r⁴ly val⁸d. Ott ⁴t al.’s [0] r⁴v⁸⁴ws ar⁴ s⁷ort, av⁴ra⁶⁸n⁶ words, so t⁷⁴ d⁸ff⁴r⁴nc⁴ b⁴tw⁴⁴n d⁴t⁴rm⁸n⁸n⁶ t⁷⁴ l⁸ar and t⁷⁴ l⁸⁴ ⁸s m⁸n⁸mal. A lon⁶⁴r narrat⁸v⁴ suc⁷ as occurs ⁸n ⁸nv⁴st⁸⁶at⁸v⁴ ⁸nt⁴rv⁸⁴ws or tr⁸al t⁴st⁸mony may ⁷av⁴ a m⁸x o⁵ trut⁷⁵ul stat⁴m⁴nts and ⁵abr⁸cat⁸ons. n suc⁷ cas⁴s ⁵act find⁸n⁶ ⁸s d⁴p⁴nd⁴nt on t⁷⁴ ⁸nd⁸v⁸dual cla⁸ms and so t⁷⁴ l⁸⁴s ⁸n t⁷⁴ narrat⁸v⁴ n⁴⁴d to b⁴ ⁸d⁴nt⁸fi⁴d rat⁷⁴r t⁷an t⁷⁴ ov⁴rall v⁴rac⁸ty o⁵ t⁷⁴ narrator. Lon⁶⁴r narrat⁸v⁴s also pr⁴s⁴nt t⁷⁴ opportun⁸ty to obs⁴rv⁴ t⁷⁴ l⁸n⁶u⁸st⁸c b⁴⁷av⁸or o⁵ t⁷⁴ narrator and ⁸d⁸osyncras⁸⁴s s/⁷⁴ may ⁴x⁷⁸b⁸t ⁸n ly⁸n⁶.
4.2.3 LIES OF OMISSION AND COMMISSION Most o⁵ t⁷⁴ work don⁴ ⁸n d⁴c⁴pt⁸on d⁴t⁴ct⁸on d⁴als w⁸t⁷ l⁸⁴s o⁵ comm⁸ss⁸on, w⁷⁴r⁴ t⁷⁴ l⁸ar mak⁴s cla⁸ms t⁷at ar⁴ not tru⁴. As w⁴ ⁷av⁴ s⁴⁴n t⁷rou⁶⁷out, t⁷⁴ ⁴xp⁴ctat⁸on ⁸n suc⁷ cas⁴s ⁸s t⁷at t⁷⁴ lan⁶ua⁶⁴ o⁵ t⁷⁴ l⁸⁴ w⁸ll b⁴ d⁸ff⁴r⁴nt ⁵rom t⁷⁴ lan⁶ua⁶⁴ us⁴d ⁸n t⁷⁴ r⁴lay o⁵ a trut⁷. ow⁴v⁴r, t⁷⁴ narrator may t⁴ll a story ⁸n w⁷⁸c⁷ cr⁸t⁸cal ⁵acts ar⁴ s⁸mply om⁸tt⁴d ⁵rom t⁷⁴ t⁴ll⁸n⁶. or ⁴xampl⁴, Scott P⁴t⁴rson, conv⁸ct⁴d o⁵ murd⁴r⁸n⁶ ⁷⁸s pr⁴⁶nant w⁸⁵⁴ ⁸n al⁸⁵orn⁸a ⁸n 00, d⁴scr⁸b⁴s ⁸n d⁴ta⁸l ⁷⁸s boat tr⁸p to an ⁸sland on t⁷⁴ day ⁷⁸s w⁸⁵⁴ w⁴nt m⁸ss⁸n⁶, but ⁵a⁸ls to m⁴nt⁸on t⁷at ⁷⁴r body was ⁸n t⁷⁴ boat and t⁷at ⁷⁴ l⁴⁵t ⁸t on t⁷⁴ ⁸sland. Ot⁷⁴r t⁷an t⁷⁴ dropp⁸n⁶ o⁵ first p⁴rson pronouns, t⁷⁴r⁴ ⁸s no s⁸⁶n⁸ficant d⁸ff⁴r⁴nc⁴ b⁴tw⁴⁴n ⁷⁸s lan⁶ua⁶⁴ us⁴ ⁸n t⁷⁸s d⁴scr⁸pt⁸on and ⁷⁸s lan⁶ua⁶⁴ us⁴
4.3. THE CURRENT SYSTEMS
59
w⁷⁴n d⁴ta⁸l⁸n⁶ ⁵acts about ⁷⁸ms⁴l⁵, ⁷⁸s ⁷ous⁴, ⁴tc. Om⁸ss⁸on ⁸s ⁶⁴n⁴rally ⁷ard⁴r to ⁸d⁴nt⁸⁵y. or on⁴ t⁷⁸n⁶, t⁷⁴ narrator ⁸s mak⁸n⁶ no cla⁸m ⁷⁴ ⁸s only n⁴⁶l⁴ct⁸n⁶ to ⁸nclud⁴ all r⁴l⁴vant ⁸n⁵ormat⁸on.
4.2.4 LEVEL OF DATA USED FOR MODELING ⁴ l⁸n⁶u⁸st⁸c l⁴v⁴l at w⁷⁸c⁷ t⁷⁴ data ⁸s analyz⁴d ⁸n r⁴c⁴nt NLP stud⁸⁴s ran⁶⁴s ⁵rom t⁷at o⁵ t⁷⁴ c⁷aract⁴r to t⁷at o⁵ t⁷⁴ structur⁴ o⁵ t⁷⁴ d⁸scours⁴, w⁸t⁷ most stud⁸⁴s us⁸n⁶ ⁴⁸t⁷⁴r t⁷⁴ n-⁶ram or l⁴x⁸cal ⁵⁴atur⁴s t⁷at ⁷av⁴ b⁴⁴n s⁷own by t⁷⁴ pr⁸or l⁸t⁴ratur⁴ on d⁴c⁴pt⁸on to b⁴ r⁴l⁴vant. 4.2.5 TRAINING DATA AND GROUND TRUTH All work ⁸n d⁴c⁴pt⁸on must d⁴al w⁸t⁷ t⁷⁴ ⁶round trut⁷ probl⁴m. To b⁴ abl⁴ to r⁴co⁶n⁸z⁴ t⁷⁴ l⁸⁴, t⁷⁴ r⁴s⁴arc⁷⁴r must not only ⁸d⁴nt⁸⁵y d⁸st⁸nct⁸v⁴ b⁴⁷av⁸or w⁷⁴n som⁴on⁴ ⁸s ly⁸n⁶ but must asc⁴rta⁸n w⁷⁴t⁷⁴r t⁷⁴ stat⁴m⁴nt b⁴⁸n⁶ mad⁴ ⁸s tru⁴ or not ⁸n ord⁴r to ass⁴ss t⁷⁴ accuracy o⁵ t⁷⁴ class⁸ficat⁸on. As d⁸scuss⁴d ⁸n S⁴ct⁸on ., t⁷⁴ standard m⁴t⁷od o⁵ obta⁸n⁸n⁶ ⁶round trut⁷ ⁸s t⁷rou⁶⁷ a laboratory ⁴xp⁴r⁸m⁴nt w⁷⁴r⁴ sub⁹⁴cts can r⁴port t⁷⁴⁸r l⁸⁴s or b⁴ film⁴d comm⁸tt⁸n⁶ t⁷⁴ acts t⁷at t⁷⁴y subs⁴qu⁴ntly l⁸⁴ about. ⁴ most common alt⁴rnat⁸v⁴ to t⁷⁴ laboratory ⁴xp⁴r⁸m⁴nt ⁸s ⁵act c⁷⁴ck⁸n⁶ a⁵t⁴r t⁷⁴ narrat⁸v⁴ ⁷as b⁴⁴n produc⁴d. ⁸t⁷⁴r m⁴t⁷od y⁸⁴lds a small amount o⁵ v⁴r⁸fi⁴d data, so a standard n-⁶ram ba⁶-o⁵-words approac⁷ ov⁴r m⁸ll⁸ons o⁵ words o⁵ narrat⁸v⁴ ⁸s not poss⁸bl⁴. To ⁶⁴t around t⁷⁴ small datas⁴t probl⁴m, most syst⁴ms r⁴sort to a mor⁴ abstract l⁴v⁴l o⁵ analys⁸s t⁷an t⁷⁴ n-⁶ram. Many syst⁴ms us⁴ ⁵⁴atur⁴s t⁷at pr⁸or psyc⁷olo⁶y and/or cr⁸m⁸nal ⁹ust⁸c⁴ r⁴s⁴arc⁷ ⁷av⁴ s⁷own to b⁴ corr⁴lat⁴d w⁸t⁷ d⁴c⁴pt⁸on. W⁴ ⁴xam⁸n⁴ t⁷⁴s⁴ ⁵⁴atur⁴s as w⁴ll as ot⁷⁴r curr⁴nt NLP approac⁷⁴s to d⁴c⁴pt⁸on d⁴t⁴ct⁸on b⁴low.
4.3
THE CURRENT SYSTEMS
⁴c⁴pt⁸on d⁴t⁴ct⁸on ⁸s a r⁴lat⁸v⁴ly n⁴w appl⁸cat⁸on ⁸n computat⁸onal l⁸n⁶u⁸st⁸cs, w⁸t⁷ most o⁵ t⁷⁴ work app⁴ar⁸n⁶ ⁸n t⁷⁴ last t⁴n y⁴ars. n t⁷at d⁴cad⁴, ⁷ow⁴v⁴r, s⁴v⁴ral ⁶roups ⁷av⁴ produc⁴d mor⁴ t⁷an on⁴ publ⁸cat⁸on ⁸n t⁷⁴ ar⁴a, ⁸n w⁷⁸c⁷ cas⁴ w⁴ conc⁴ntrat⁴ on t⁷⁴ s⁴m⁸nal publ⁸cat⁸on ⁵or t⁷⁴ ⁶roup unl⁴ss ot⁷⁴r publ⁸cat⁸ons ar⁴ substant⁸ally d⁸ff⁴r⁴nt ⁸n t⁴rms ⁴⁸t⁷⁴r o⁵ t⁷⁴ appl⁸cat⁸on or t⁷⁴ m⁴t⁷ods us⁴d to ⁸d⁴nt⁸⁵y ⁵als⁴ narrat⁸v⁴s. ⁴ syst⁴ms cov⁴r⁴d ⁷⁴r⁴ con⁵orm to t⁷⁴ c⁷aract⁴r⁸st⁸cs o⁵ appl⁸⁴d work ⁸n NLP—us⁴ o⁵ a bas⁴l⁸n⁴, a class⁸ficat⁸on sc⁷⁴m⁴, a tra⁸n⁸n⁶/t⁴st⁸n⁶ protocol, and on⁴ or mor⁴ o⁵ t⁷⁴ standard ⁴valuat⁸on m⁴tr⁸cs. ⁸s not only puts t⁷⁴ work squar⁴ly ⁸n t⁷⁴ NLP ⁵ram⁴work, but ⁸t ⁴nabl⁴s us to mak⁴ w⁷at w⁴ ⁷op⁴ to b⁴ us⁴⁵ul compar⁸sons. ⁴ cov⁴ra⁶⁴ ⁸s or⁶an⁸z⁴d by t⁷⁴ l⁴v⁴l at w⁷⁸c⁷ t⁷⁴ mod⁴l ⁸s bu⁸lt, start⁸n⁶ w⁸t⁷ t⁷⁴ c⁷aract⁴r and n-⁶ram t⁷rou⁶⁷ t⁷⁴ l⁴x⁸cal ⁵⁴atur⁴ and syntact⁸c structur⁴ and up to t⁷⁴ lo⁶⁸cal and d⁸scours⁴ structur⁴s.
60
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
4.3.1 CHARACTERS AND n-GRAMS At t⁷⁴ small⁴st l⁴v⁴l o⁵ analys⁸s ⁸s t⁷⁴ c⁷aract⁴r, an appropr⁸at⁴ l⁴v⁴l o⁵ analys⁸s ⁵or s⁷ort narrat⁸v⁴s l⁸k⁴ tw⁴⁴ts, ⁵or w⁷⁸c⁷ ⁷⁴n ⁴t al. [0] ⁷av⁴ bu⁸lt a syst⁴m to d⁴t⁴ct scamm⁸n⁶ tw⁴⁴ts o⁵ t⁷r⁴⁴ typ⁴s . stra⁸⁶⁷t cons, w⁷⁸c⁷ s⁴⁴k to ⁶⁴t mon⁴y ⁵rom us⁴rs t⁷rou⁶⁷ “asy-mon⁴y, work-⁵rom-⁷om⁴” sc⁷⁴m⁴s or “mon⁴y-mak⁸n⁶ w⁸t⁷ Tw⁸tt⁴r” scams
. tw⁸tom⁴rc⁸als, w⁷⁸c⁷ s⁴nd us⁴rs to a ⁵ak⁴ s⁸t⁴ w⁷⁴r⁴ t⁷⁴y ar⁴ ask⁴d to pay a s⁷⁸pp⁸n⁶ ⁵⁴⁴ ⁵or a bo⁶us product or s⁴rv⁸c⁴ and . p⁷⁸s⁷⁸n⁶ and v⁸rus spr⁴ad⁸n⁶ scams, w⁷⁸c⁷ d⁴c⁴⁸v⁴ us⁴rs ⁸nto ⁴nt⁴r⁸n⁶ passwords, ⁴tc. and us⁴ t⁷⁴s⁴ to p⁷⁸s⁷ or to spr⁴ad a v⁸rus.
⁷⁴n ⁴t al. us⁴s a suffix tr⁴⁴ al⁶or⁸t⁷m ⁵or class⁸ficat⁸on. Suffix tr⁴⁴s allow ⁵ast ⁸mpl⁴m⁴ntat⁸on, but r⁴qu⁸r⁴ cons⁸d⁴rably mor⁴ spac⁴ t⁷an stor⁸n⁶ t⁷⁴ str⁸n⁶ ⁸ts⁴l⁵, and so may not b⁴ appropr⁸at⁴ ⁵or lon⁶⁴r narrat⁸v⁴s. To bu⁸ld t⁷⁴⁸r datas⁴t, ⁷⁴n ⁴t al. [0] coll⁴ct⁴d , un⁸qu⁴ tw⁴⁴ts by qu⁴ry⁸n⁶ ⁵or ⁵r⁴qu⁴nt n⁶l⁸s⁷ stop words and p⁷ras⁴s known to occur ⁸n scam tw⁴⁴ts, suc⁷ as “work at ⁷om⁴” and “t⁴⁴t⁷ w⁷⁸t⁴n⁸n⁶.” 0% o⁵ t⁷⁴s⁴ tw⁴⁴ts w⁴r⁴ randomly ass⁸⁶n⁴d to t⁷⁴ t⁴st s⁴t, w⁷⁸c⁷ was lab⁴l⁴d ⁸nd⁴p⁴nd⁴ntly by t⁷⁴ t⁷r⁴⁴ r⁴s⁴arc⁷⁴rs and c⁷⁴ck⁴d ⁵or ⁸nt⁴r-rat⁴r r⁴l⁸ab⁸l⁸ty. N⁸n⁴ tw⁴⁴ts ⁸n t⁷⁴ tra⁸n⁸n⁶ s⁴t w⁴r⁴ also lab⁴l⁴d. ⁴ tra⁸n⁸n⁶ s⁴t was sub⁹⁴ct⁴d to mod⁴l-bas⁴d clust⁴r⁸n⁶ to ⁴nsur⁴ t⁷at t⁷⁴ t⁷r⁴⁴ scam typ⁴s and t⁷⁴ non-scam tw⁴⁴ts would b⁴ ⁷⁸⁶⁷ly d⁸ff⁴r⁴nt⁸at⁴d. ac⁷ tw⁴⁴t was t⁷⁴n ⁸t⁴rat⁸v⁴ly ⁸nclud⁴d ⁸nto t⁷⁴ tra⁸n⁸n⁶ s⁴t as scam or non-scam bas⁴d on ⁸ts matc⁷ w⁸t⁷ ⁴ac⁷ class. ⁷⁴n ⁴t al. r⁴port an accuracy rat⁴ o⁵ 0.%, w⁷⁸c⁷, a⁵t⁴r s⁴l⁵-tra⁸n⁸n⁶ on t⁷⁴ ,000 unlab⁴l⁴d tra⁸n⁸n⁶ tw⁴⁴ts, ⁸ncr⁴as⁴d to %. ⁴ datas⁴t t⁷at ⁷⁴n ⁴t al. ar⁴ abl⁴ to produc⁴ w⁸t⁷ t⁷⁴⁸r s⁴m⁸-sup⁴rv⁸s⁴d m⁴t⁷od ⁸s ⁴xtr⁴m⁴ly lar⁶⁴ by d⁴c⁴pt⁸on d⁴t⁴ct⁸on standards. ⁴ lack o⁵ w⁸d⁴ var⁸at⁸on ⁸n t⁷⁴ data mak⁴s t⁷⁴⁸r approac⁷ ⁵⁴as⁸bl⁴. As t⁷⁴ d⁴scr⁸pt⁸on o⁵ t⁷⁴ t⁷r⁴⁴ scam typ⁴s abov⁴ s⁷ows, t⁷⁴r⁴ ⁸s a common s⁴t o⁵ words t⁷at ⁴ac⁷ s⁴t s⁷ar⁴s w⁸t⁷ t⁷⁴ ot⁷⁴r m⁴mb⁴rs o⁵ t⁷⁴ s⁴t. or appl⁸cat⁸ons w⁷⁴r⁴ t⁷⁴ als⁴ and/or Tru⁴ data ⁷as t⁷⁸s c⁷aract⁴r⁸st⁸c, a s⁴m⁸-sup⁴rv⁸s⁴d m⁴t⁷od can produc⁴ a lar⁶⁴ datas⁴t, mak⁸n⁶ an analys⁸s at t⁷⁴ l⁴v⁴l o⁵ t⁷⁴ c⁷aract⁴r ⁵⁴as⁸bl⁴. At t⁷⁴ l⁴v⁴l o⁵ t⁷⁴ n-⁶ram, M⁸⁷alc⁴a and Strapparava [00] (M&S) t⁴st⁴d t⁷⁴ ⁴xt⁴nt to w⁷⁸c⁷ t⁷⁴ t⁴xt ⁸n 00 ⁵als⁴ narrat⁸v⁴s can b⁴ automat⁸cally d⁸st⁸n⁶u⁸s⁷⁴d ⁵rom t⁷at ⁸n 00 tru⁴ narrat⁸v⁴s on t⁷r⁴⁴ w⁸d⁴ly vary⁸n⁶ top⁸cs. ⁴y ⁶av⁴ t⁷⁴ top⁸cs to 00 M⁴c⁷an⁸cal Turk⁴rs w⁸t⁷ ⁸nstruct⁸ons to wr⁸t⁴ a – s⁴nt⁴nc⁴ narrat⁸v⁴ about t⁷⁴⁸r tru⁴ ⁵⁴⁴l⁸n⁶s and t⁷⁴ contrary on ⁴ac⁷ top⁸c. Tok⁴n⁸zat⁸on and st⁴mm⁸n⁶ w⁴r⁴ p⁴r⁵orm⁴d on t⁷⁴ narrat⁸v⁴s stopwords w⁴r⁴ not ⁴l⁸m⁸nat⁴d. Tabl⁴ . s⁷ows t⁷⁴ cross-top⁸c accuracy r⁴sults ac⁷⁸⁴v⁴d by M&S us⁸n⁶ Naïv⁴ ay⁴s (N) and Support V⁴ctor Mac⁷⁸n⁴ (SVM) class⁸fi⁴rs w⁸t⁷ 0-⁵old cross-val⁸dat⁸on ⁵or t⁴st⁸n⁶. ⁴r⁴ ⁸s a ⁶r⁴at d⁴al o⁵ r⁴s⁴arc⁷ on ass⁴ss⁸n⁶ ⁸n⁵ormat⁸on cr⁴d⁸b⁸l⁸ty on t⁷⁴ w⁴b us⁸n⁶ ⁵actors ⁴xt⁴rnal to t⁷⁴ narrat⁸v⁴, ⁸nclud⁸n⁶ ⁸n⁵ormat⁸on sourc⁴s, proport⁸on o⁵ dub⁸ous m⁴ssa⁶⁴s, d⁸stort⁸on o⁵ rat⁸n⁶s bas⁴d on m⁴ssa⁶⁴s, ⁴tc. A ⁶ood sourc⁴ ⁵or t⁷⁸s r⁴s⁴arc⁷ ar⁴ t⁷⁴ W⁴bQual⁸ty works⁷ops assoc⁸at⁴d w⁸t⁷ t⁷⁴ annual nt⁴rnat⁸onal World W⁸d⁴ W⁴b on⁵⁴r⁴nc⁴. W⁴ ⁷av⁴ ass⁸⁶n⁴d t⁷⁴ art⁸cl⁴s t⁷at w⁸ll b⁴ d⁸scuss⁴d ⁸n s⁴v⁴ral plac⁴s t⁷rou⁶⁷out t⁷⁸s c⁷apt⁴r abbr⁴v⁸at⁴d nam⁴s, ⁴.⁶., “M&S” ⁵or M⁸⁷alc⁴a and Strapparava, to ⁵ac⁸l⁸tat⁴ r⁴⁵⁴r⁴nc⁸n⁶.
4.3. THE CURRENT SYSTEMS
Table 4.3: M⁸⁷alc⁴a and Strapparava’s T/ class⁸ficat⁸on accuracy (%)
Topic
NB 0.0 . .0 0.
61
SVM . . 0. 0.
W⁸t⁷ t⁷⁴ bas⁴l⁸n⁴ ⁵or t⁷⁸s study at 0%, ⁶⁸v⁴n t⁷⁴ balanc⁴ o⁵ 00 tru⁴ and 00 ⁵als⁴ narrat⁸v⁴s, t⁷⁴s⁴ r⁴sults ar⁴ s⁸⁶n⁸ficant, d⁴monstrat⁸n⁶ t⁷⁴ prom⁸s⁴ o⁵ automat⁸c class⁸ficat⁸on ⁸n d⁸st⁸n⁶u⁸s⁷⁸n⁶ two typ⁴s o⁵ lan⁶ua⁶⁴. M&S also t⁴st t⁷⁴ ⁴ff⁴ct o⁵ top⁸c on t⁷⁴ class⁸ficat⁸on r⁴sults, us⁸n⁶ t⁷⁴ data ⁵rom two o⁵ t⁷⁴ top⁸cs ⁵or tra⁸n⁸n⁶, w⁸t⁷ t⁴st⁸n⁶ don⁴ on t⁷⁴ r⁴ma⁸n⁸n⁶ top⁸c. Tabl⁴ . s⁷ows t⁷⁴ ⁴ff⁴ct o⁵ top⁸c c⁷an⁶⁴ on t⁷⁴ succ⁴ss o⁵ t⁷⁴ class⁸ficat⁸on, w⁸t⁷ accuracy cut ⁸n ⁷al⁵ or mor⁴. ⁸s ⁸s not surpr⁸s⁸n⁶ ⁶⁸v⁴n ⁴xp⁴ct⁴d d⁸ff⁴r⁴nc⁴s ⁸n t⁷⁴ words (n-⁶rams) us⁴d to d⁸scuss ⁴ac⁷ o⁵ t⁷⁴ top⁸cs. Table 4.4: M⁸⁷alc⁴a and Strapparava’s cross-top⁸c accuracy (%)
Training
+ +
+
Test
NB .0 . . .
SVM .0 . . .
n a post ⁷oc study, M&S also cons⁸d⁴r t⁷⁴ ⁴ff⁴ct⁸v⁴n⁴ss o⁵ t⁷⁴ class-bas⁴d ⁵⁴atur⁴s o⁵ L⁸n⁶u⁸st⁸c nqu⁸ry and Word ount (LW), d⁸scuss⁴d b⁴low, by rank⁸n⁶ t⁷⁴ fiv⁴ LW ⁵⁴atur⁴s t⁷at app⁴ar most o⁵t⁴n ⁸n t⁷⁴⁸r d⁴c⁴pt⁸v⁴ narrat⁸v⁴s and l⁴ast o⁵t⁴n ⁸n t⁷⁴⁸r tru⁴ narrat⁸v⁴s and v⁸c⁴-v⁴rsa. or ⁴xampl⁴, t⁷⁴ LW ⁵⁴atur⁴ w⁷⁸c⁷ most ⁶r⁴atly d⁸stanc⁴d t⁷⁴ d⁴c⁴pt⁸v⁴ t⁴xt ⁵rom t⁷⁴ trut⁷⁵ul t⁴xt was “MTAPYSAL,” w⁷⁸c⁷ ⁸nclud⁴d t⁷⁴ words god; d ie; sacred; mercy; si n; dead; hel l; soul; lord; si ns , w⁷⁸l⁴ t⁷⁴ LW ⁵⁴atur⁴ t⁷at d⁸stanc⁴d t⁷⁴ trut⁷⁵ul ⁵rom t⁷⁴ d⁴c⁴pt⁸v⁴ t⁴xt was “OPTMSM,” w⁷⁸c⁷ ⁸nclud⁴d t⁷⁴ words best; ready; hope; accept; accept ed; det ermi ned; accept ed; won; super . M&S ⁴stabl⁸s⁷⁴d t⁷at ⁸t ⁸s poss⁸bl⁴ to d⁸ff⁴r⁴nt⁸at⁴ l⁴x⁸cally d⁸v⁴rs⁴ tru⁴ ⁵rom d⁴c⁴pt⁸v⁴ narrat⁸v⁴s at t⁷⁴ n-⁶ram l⁴v⁴l w⁸t⁷ a d⁴⁶r⁴⁴ o⁵ s⁸⁶n⁸ficanc⁴ w⁷⁸l⁴ ⁷⁴n ⁴t al., ⁷av⁸n⁶ p⁸ck⁴d a ⁷⁸⁶⁷ly tractabl⁴ narrat⁸v⁴ appl⁸cat⁸on w⁸t⁷ l⁸ttl⁴ l⁴x⁸cal var⁸at⁸on, s⁷ow⁴d t⁷at t⁷⁴ d⁸ff⁴r⁴nt⁸at⁸on can b⁴ accompl⁸s⁷⁴d w⁸t⁷ a ⁷⁸⁶⁷ d⁴⁶r⁴⁴ o⁵ succ⁴ss at t⁷⁴ l⁴v⁴l o⁵ t⁷⁴ c⁷aract⁴r. Ot⁷⁴r stud⁸⁴s ⁷av⁴ cont⁸nu⁴d to us⁴ t⁷⁴ n-⁶ram l⁴v⁴l, pr⁸mar⁸ly as a bas⁴l⁸n⁴ a⁶a⁸nst w⁷⁸c⁷ to compar⁴ mod⁴ls us⁸n⁶ mor⁴ abstract r⁴pr⁴s⁴ntat⁸ons o⁵ t⁷⁴ data, ⁸nclud⁸n⁶ LW and ot⁷⁴r ⁵⁴atur⁴s.
62
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
Ott ⁴t al. [0] (Ott), also t⁴st⁸n⁶ t⁷⁴ ab⁸l⁸ty o⁵ n-⁶rams as w⁴ll as part-o⁵-sp⁴⁴c⁷ (POS) ta⁶ ⁵r⁴qu⁴nc⁸⁴s and LW ⁵⁴atur⁴s to d⁸scr⁸m⁸nat⁴ T ⁵rom narrat⁸v⁴s, ar⁴ abl⁴ to ac⁷⁸⁴v⁴ a r⁴markabl⁴ l⁴v⁴l o⁵ d⁸scr⁸m⁸nat⁸on on – s⁴nt⁴nc⁴ ⁷ot⁴l r⁴v⁸⁴ws o⁵ ⁷⁸ca⁶o ⁷ot⁴ls. ⁴ Ott data ⁸s d⁸ff⁴r⁴nt ⁸n s⁴v⁴ral ways. ⁴ 00 ⁵als⁴ narrat⁸v⁴s ar⁴ cl⁴arly ⁵als⁴ M⁴c⁷an⁸cal Turk⁴rs wrot⁴ t⁷⁴m. ac⁷ Turk⁴r was ⁶⁸v⁴n t⁷⁴ nam⁴ and w⁴bs⁸t⁴ o⁵ t⁷⁴ ⁷ot⁴l s/⁷⁴ was to r⁴v⁸⁴w and told to pr⁴t⁴nd to work ⁵or t⁷⁴ ⁷ot⁴l’s mark⁴t⁸n⁶ d⁴partm⁴nt and produc⁴ a r⁴al⁸st⁸c, ⁵avorabl⁴ r⁴v⁸⁴w. ⁴ aut⁷ors tak⁴ s⁴v⁴ral st⁴ps to r⁴tr⁸⁴v⁴ a comparabl⁴ numb⁴r o⁵ tru⁴ r⁴v⁸⁴ws, m⁸n⁸n⁶ almost ,000 r⁴v⁸⁴ws ⁵rom Tr⁸pAdv⁸sor o⁵ t⁷⁴ 0 most popular ⁷ot⁴ls and ⁴l⁸m⁸nat⁸n⁶ non--star r⁴v⁸⁴ws (assum⁴dly ⁵or par⁸ty w⁸t⁷ t⁷⁴ ⁶low⁸n⁶ ⁵als⁴ r⁴v⁸⁴ws), non-n⁶l⁸s⁷ r⁴v⁸⁴ws, s⁷ort r⁴v⁸⁴ws, and r⁴v⁸⁴ws wr⁸tt⁴n by first-t⁸m⁴ aut⁷ors s⁸nc⁴ t⁷⁸s ⁸s not t⁷⁴ typ⁸cal Tr⁸pAdv⁸sor patt⁴rn. Tabl⁴ . s⁷ows t⁷⁴ Ott r⁴sults on t⁷⁴ n-⁶ram mod⁴ls us⁸n⁶ Naïv⁴ ay⁴s and Support V⁴ctor Mac⁷⁸n⁴ class⁸fi⁴rs and a -⁵old n⁴st⁴d V proc⁴dur⁴ t⁷at ⁴valuat⁴s l⁴arn⁴d mod⁴ls on r⁴v⁸⁴ws ⁵rom uns⁴⁴n ⁷ot⁴ls.
Table 4.5: Ott’s n-⁶ram accuracy (%). ⁴ + ⁸nd⁸cat⁴s t⁷at t⁷⁴ ⁵⁴atur⁴ s⁴t subsum⁴s t⁷⁴ pr⁴c⁴d⁸n⁶ ⁵⁴atur⁴ s⁴t. [Ott ⁴t al., 0, p. ].
Features SVM SVM
+ +
SVM
N
+
N
Accuracy . . .0 . .
⁴ Ott study also r⁴ports ⁶ood accuracy ⁸n d⁸st⁸n⁶u⁸s⁷⁸n⁶ tru⁴ ⁵rom ⁵als⁴ narrat⁸v⁴s us⁸n⁶ POS ta⁶ ⁵r⁴qu⁴nc⁸⁴s (.0%) and LW ⁵⁴atur⁴s (.%), as w⁴ll as LW+b⁸⁶rams (.%) w⁷⁴n t⁴st⁸n⁶ w⁸t⁷ t⁷⁴ SVM class⁸fi⁴r, but t⁷⁴ n-⁶ram scor⁴s ar⁴ part⁸cularly r⁴markabl⁴ ⁸n l⁸⁶⁷t o⁵ t⁷⁴ accuracy scor⁴s t⁷at M&S ac⁷⁸⁴v⁴d w⁸t⁷ n-⁶rams. W⁷at mak⁴s t⁷⁴m so ⁴ ma⁸n d⁸ff⁴r⁴nc⁴ b⁴tw⁴⁴n t⁷⁴ M&S ⁴xp⁴r⁸m⁴nt and t⁷⁴ Ott ⁴xp⁴r⁸m⁴nt ⁸s ⁸n t⁷⁴ ⁷⁸⁶⁷ly constra⁸n⁴d top⁸c o⁵ ⁷ot⁴l r⁴v⁸⁴ws cov⁴r⁴d by Ott as oppos⁴d to sub⁹⁴cts’ ⁴xpr⁴ss⁸ons o⁵ op⁸n⁸ons on ⁷ot-button ⁸ssu⁴s, w⁷⁸c⁷ can b⁴ muc⁷ mor⁴ d⁸v⁴rs⁴. t ⁸s ⁴ncoura⁶⁸n⁶, t⁷ou⁶⁷, t⁷at ⁶⁸v⁴n t⁷⁴ lack o⁵ d⁸v⁴rs⁸ty ⁸n t⁷⁴ ⁷ot⁴l r⁴v⁸⁴ws, at l⁴ast ⁸n t⁷⁴⁸r l⁴x⁸con, Ott ⁴t al. st⁸ll finds a s⁷arp d⁸ff⁴r⁴nc⁴ b⁴tw⁴⁴n t⁷⁴ trut⁷⁵ul and t⁷⁴ d⁴c⁴pt⁸v⁴ r⁴v⁸⁴ws. On t⁷⁴ ot⁷⁴r ⁷and, t⁷⁴ tru⁴ r⁴v⁸⁴w⁴rs ar⁴ prompt⁴d by Tr⁸pAdv⁸sor to rat⁴ t⁷⁴ ⁷ot⁴l on s⁴v⁴ral param⁴t⁴rs, ⁸nclud⁸n⁶ locat⁸on, sl⁴⁴p qual⁸ty, rooms, s⁴rv⁸c⁴, valu⁴, and cl⁴anl⁸n⁴ss, and r⁴v⁸⁴w⁴rs t⁴nd to r⁴p⁴at t⁷⁴s⁴ words ⁸n t⁷⁴⁸r r⁴v⁸⁴ws. Ott do⁴s not r⁴port t⁷⁴ bo⁶us r⁴v⁸⁴w⁴rs b⁴⁸n⁶ ⁶⁸v⁴n t⁷⁴s⁴ prompts. Tr⁸pAdv⁸sor.com r⁴v⁸⁴ws ar⁴ not back⁴d up by transact⁸onal data anyon⁴ can subm⁸t a r⁴v⁸⁴w. To w⁴⁴d out bo⁶us r⁴v⁸⁴ws, Tr⁸pAdv⁸sor ⁴mploys a ⁵raud d⁴t⁴ct⁸on t⁴am and ⁷as so⁵twar⁴ t⁷at tracks t⁷⁴ r⁴v⁸⁴ws ⁵or anomal⁸⁴s [R⁴⁸t⁴r, 00].
4.3. THE CURRENT SYSTEMS
63
Ott ⁴t al. [0] stud⁸⁴d only ⁵avorabl⁴ r⁴v⁸⁴ws, bot⁷ trut⁷⁵ul and d⁴c⁴pt⁸v⁴. ut n⁴⁶at⁸v⁴ r⁴v⁸⁴ws—t⁷os⁴ ass⁸⁶n⁴d or stars by t⁷⁴ r⁴v⁸⁴w⁴r—can b⁴ ⁴v⁴n mor⁴ dama⁶⁸n⁶ to a brand t⁷an ar⁴ ⁵avorabl⁴ r⁴v⁸⁴ws ⁵or a comp⁴t⁸tor. Ott-13 looks at r⁴v⁸⁴ws t⁷at bad-mout⁷ t⁷⁴ comp⁴t⁸t⁸on [Ott ⁴t al., 0]. ⁴ d⁴s⁸⁶n o⁵ t⁷⁴ n⁴⁶at⁸v⁴ r⁴v⁸⁴w study ⁸s almost ⁸d⁴nt⁸cal to Ott’s pr⁴v⁸ous study—00 T r⁴v⁸⁴ws, 0 r⁴v⁸⁴ws p⁴r ⁷ot⁴l, and 00 r⁴v⁸⁴ws, 0 r⁴v⁸⁴ws p⁴r ⁷ot⁴l—w⁸t⁷ r⁴v⁸⁴ws wr⁸tt⁴n by Turk⁴rs. ⁴ T r⁴v⁸⁴ws, ⁷ow⁴v⁴r, com⁴ ⁵rom mult⁸pl⁴ sourc⁴s ⁵or t⁷⁴ n⁴⁶at⁸v⁴ study ⁸n add⁸t⁸on to Tr⁸pAdv⁸sor, t⁷⁴s⁴ ⁸nclud⁴ xp⁴d⁸a, ot⁴ls.com, Orb⁸tz, Pr⁸c⁴l⁸n⁴, and Y⁴lp. ⁴ av⁴ra⁶⁴ l⁴n⁶t⁷ o⁵ t⁷⁴ n⁴⁶at⁸v⁴ r⁴v⁸⁴ws, at words, ⁸s also substant⁸ally lon⁶⁴r t⁷an t⁷⁴ pos⁸t⁸v⁴ r⁴v⁸⁴ws, at words. To prov⁸d⁴ a bas⁴l⁸n⁴ ⁵or t⁷⁸s sp⁴c⁸fic study, t⁷r⁴⁴ und⁴r⁶raduat⁴ un⁸v⁴rs⁸ty stud⁴nts w⁴r⁴ ask⁴d to mak⁴ T/ ⁹ud⁶m⁴nts on 0 T and 0 r⁴v⁸⁴ws ⁵rom ⁴ac⁷ o⁵ ⁵our ⁷ot⁴ls. ⁴ us⁴ o⁵ ⁷uman ⁹ud⁶⁴s also ⁴nsur⁴d t⁷at t⁷⁴ r⁴v⁸⁴ws w⁴r⁴ conv⁸nc⁸n⁶. W⁷⁸l⁴ t⁷⁴ b⁴st ⁷uman ⁹ud⁶⁴ ⁷ad an accuracy rat⁴ o⁵ %, t⁷⁴ ⁸nt⁴rrat⁴r a⁶r⁴⁴m⁴nt amon⁶ t⁷⁴ ⁹ud⁶⁴s was low (l⁴⁸ss kappa at 0.0 o⁷⁴n’s kappa at 0. ⁵or t⁷⁴ pa⁸rw⁸s⁴ a⁶r⁴⁴m⁴nt b⁴tw⁴⁴n t⁷⁴ two most accurat⁴ ⁹ud⁶⁴s). ⁴ automat⁴d class⁸ficat⁸on ⁵or t⁷⁴ n⁴⁶at⁸v⁴ r⁴v⁸⁴w study was don⁴ ⁸n t⁷⁴ sam⁴ mann⁴r as ⁵or t⁷⁴ pos⁸t⁸v⁴ r⁴v⁸⁴ws but only un⁸⁶ram and b⁸⁶ram ⁵⁴atur⁴s w⁴r⁴ t⁴st⁴d. ⁴ ⁴xpand⁴d data s⁴t o⁵ 00 pos⁸t⁸v⁴ and 00 n⁴⁶at⁸v⁴ r⁴v⁸⁴ws allow⁴d ⁵or t⁷r⁴⁴ tra⁸n⁸n⁶ s⁸tuat⁸ons tra⁸n us⁸n⁶ pos⁸t⁸v⁴ s⁴nt⁸m⁴nt data only, tra⁸n us⁸n⁶ n⁴⁶at⁸v⁴ only, and tra⁸n us⁸n⁶ bot⁷ pos⁸t⁸v⁴ and n⁴⁶at⁸v⁴ data. Tabl⁴ . ⁶⁸v⁴s t⁷⁴ accuracy scor⁴s ⁵or ⁴ac⁷ s⁸tuat⁸on, s⁷ow⁸n⁶ t⁷at t⁷⁴ comb⁸n⁴d s⁸tuat⁸on ⁶⁸v⁴s t⁷⁴ b⁴st ov⁴rall r⁴sults, poss⁸bly b⁴caus⁴ o⁵ t⁷⁴ lar⁶⁴r tra⁸n⁸n⁶ s⁴t, as t⁷⁴ Ott- pap⁴r sp⁴culat⁴s.
Table 4.6: M⁴asur⁴s (%) o⁵ Ott-’s SVM class⁸fi⁴r on n⁴⁶at⁸v⁴ and pos⁸t⁸v⁴ r⁴v⁸⁴ws. “⁴ld out” r⁴⁵⁴rs to class⁸fi⁴rs tra⁸n⁴d on r⁴v⁸⁴ws o⁵ on⁴ s⁴nt⁸m⁴nt and t⁴st⁴d on t⁷⁴ ot⁷⁴r. “ross Val.” r⁴⁵⁴rs to “-⁵old strat⁸fi⁴d cross-val⁸dat⁸on” w⁷⁴r⁴ t⁷⁴ mod⁴l ⁸s tra⁸n⁴d on r⁴v⁸⁴ws ⁵or ⁷ot⁴ls and t⁴st⁴d on r⁴v⁸⁴ws ⁵or t⁷⁴ r⁴ma⁸n⁸n⁶ ⁷ot⁴ls, w⁸t⁷ s⁴nt⁸m⁴nt as ⁸nd⁸cat⁴d ⁸n t⁷⁴ tabl⁴.
Train Sentiment
(00 r⁴v⁸⁴ws) (00 r⁴v⁸⁴ws) (00 r⁴v⁸⁴ws)
Test Sentiment (00 r⁴v⁸⁴ws,rossVal.) (00 r⁴v⁸⁴ws,⁴ldOut) (00 r⁴v⁸⁴ws,⁴ldOut) (00 r⁴v⁸⁴ws,rossVal.) (00 r⁴v⁸⁴ws,rossVal.) (00 r⁴v⁸⁴ws,rossVal.)
Accuracy
. . , .0 . .0
True F-measure . . .0 . . .
False F-measure . 0. . . . .
⁴ r⁴sults also s⁷ow t⁷at tra⁸n⁸n⁶ on on⁴ s⁴nt⁸m⁴nt and t⁴st⁸n⁶ on t⁷⁴ ot⁷⁴r low⁴rs accuracy, su⁶⁶⁴st⁸n⁶ t⁷at “cu⁴s to d⁴c⁴pt⁸on d⁸ff⁴r d⁴p⁴nd⁸n⁶ on t⁷⁴ s⁴nt⁸m⁴nt o⁵ t⁷⁴ t⁴xt.”
64
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
Ott- also looks at t⁷⁴ lan⁶ua⁶⁴ o⁵ t⁷⁴ r⁴v⁸⁴ws, find⁸n⁶ t⁷at r⁴v⁸⁴ws, unl⁸k⁴ T r⁴v⁸⁴ws, ar⁴ scarc⁴ on spat⁸al d⁴ta⁸ls and ⁸nclud⁴ mor⁴ v⁴rbs r⁴lat⁸v⁴ to nouns, w⁷⁸c⁷, Ott su⁶⁶⁴sts, parall⁴l t⁷⁴ c⁷aract⁴r⁸st⁸cs o⁵ ⁸ma⁶⁸nat⁸v⁴ wr⁸t⁸n⁶ and r⁴call t⁷⁴ cla⁸ms o⁵ R⁴al⁸ty Mon⁸tor⁸n⁶ (s⁴⁴ ⁷apt⁴r ). ow⁴v⁴r, w⁷⁸l⁴ first-p⁴rson pronoun usa⁶⁴ ⁸n t⁷⁴ pos⁸t⁸v⁴-only study occurr⁴d at tw⁸c⁴ t⁷⁴ rat⁴ ⁸n r⁴v⁸⁴ws as ⁸n T, ⁸t was only % ⁷⁸⁶⁷⁴r ⁸n t⁷⁴ n⁴⁶at⁸v⁴ r⁴v⁸⁴ws. Not surpr⁸s⁸n⁶ly, t⁷ou⁶⁷, pos⁸t⁸v⁴ r⁴v⁸⁴ws ⁴xpr⁴ss mor⁴ pos⁸t⁸v⁴ ⁴mot⁸ons and n⁴⁶at⁸v⁴ r⁴v⁸⁴ws mor⁴ n⁴⁶at⁸v⁴ on⁴s, captur⁸n⁶, as Ott obs⁴rv⁴s, t⁷⁴ s⁴nt⁸m⁴nt o⁵ t⁷⁴ wr⁸t⁴rs rat⁷⁴r t⁷an t⁷⁴ n⁴⁶at⁸v⁸ty assoc⁸at⁴d w⁸t⁷ d⁴c⁴pt⁸on, as r⁴pr⁴s⁴nt⁴d ⁸n ⁴Paulo ⁴t al.’s [00] S⁴l⁵-Pr⁴s⁴ntat⁸onal approac⁷ d⁸scuss⁴d ⁸n ⁷apt⁴r . ⁴rnánd⁴z us⁸l⁸⁴r ⁴t al. [0] (H-F), c⁸t⁸n⁶ t⁷⁴ d⁸fficulty o⁵ construct⁸n⁶ lar⁶⁴ data s⁴ts l⁸k⁴ t⁷at o⁵ Ott- ⁸n r⁴al appl⁸cat⁸on sc⁴nar⁸os, propos⁴ t⁷⁴ s⁴m⁸-sup⁴rv⁸s⁴d PU-l⁴arn⁸n⁶ m⁴t⁷od to bu⁸ld a tra⁸n⁸n⁶ s⁴t o⁵ pos⁸t⁸v⁴ and n⁴⁶at⁸v⁴ r⁴v⁸⁴ws ⁵rom known pos⁸t⁸v⁴ (P), d⁴c⁴pt⁸v⁴ data and unlab⁴l⁴d (U) data [⁴rnánd⁴z us⁸l⁸⁴r ⁴t al., 0]. ⁴ m⁴t⁷od ⁸s c⁷os⁴n b⁴caus⁴ o⁵ pr⁸or succ⁴ss ⁸n d⁴al⁸n⁶ w⁸t⁷ data s⁷ow⁸n⁶ ⁷⁸⁶⁷ co⁷⁴s⁸on, as do⁴s t⁷⁴ tar⁶⁴t (pos⁸t⁸v⁴) class o⁵ T r⁴v⁸⁴ws, and ⁷⁸⁶⁷ var⁸at⁸on ⁸n t⁷⁴ unlab⁴l⁴d s⁴t du⁴, w⁴ would assum⁴, to t⁷⁴ m⁸xtur⁴ o⁵ co⁷⁴s⁸v⁴ pos⁸t⁸v⁴ T r⁴v⁸⁴ws w⁸t⁷ mor⁴ d⁸v⁴rs⁴ pos⁸t⁸v⁴ r⁴v⁸⁴ws as w⁴ll as n⁴⁶at⁸v⁴ T and r⁴v⁸⁴ws. - t⁴sts t⁷⁴ or⁸⁶⁸nal PU-l⁴arn⁸n⁶ al⁶or⁸t⁷m, w⁷⁸c⁷ ⁸t⁴rat⁸v⁴ly bu⁸lds a tra⁸n⁸n⁶ s⁴t o⁵ “r⁴l⁸abl⁴ n⁴⁶at⁸v⁴” data, a⁶a⁸nst t⁷⁴⁸r mod⁸fi⁴d PU-l⁴arn⁸n⁶ al⁶or⁸t⁷m, w⁷⁸c⁷ ⁴l⁸m⁸nat⁴s n⁴⁶at⁸v⁴ data o⁵ l⁴ss⁴r r⁴l⁸ab⁸l⁸ty, y⁸⁴ld⁸n⁶ a small⁴r numb⁴r o⁵ “⁷⁸⁶⁷ qual⁸ty n⁴⁶at⁸v⁴ ⁸nstanc⁴s.” ⁸v⁴n t⁷⁸s -way s⁴t o⁵ class⁴s—pos⁸t⁸v⁴/n⁴⁶at⁸v⁴, T/, and PU-or⁸⁶⁸nal/PU-mod⁸fi⁴d—- t⁴st () t⁷⁴ ⁵⁴as⁸b⁸l⁸ty o⁵ PU-l⁴arn⁸n⁶ ⁵or t⁷⁸s r⁴al⁸st⁸c s⁸tuat⁸on, () t⁷⁴ sup⁴r⁸or⁸ty o⁵ t⁷⁴ PU-mod⁸fi⁴d al⁶or⁸t⁷m, and () t⁷⁴ ⁴ff⁴ct o⁵ t⁷⁴ pos⁸t⁸v⁴/n⁴⁶at⁸v⁴ polar⁸ty und⁴r PU-l⁴arn⁸n⁶. ⁴y also compar⁴ t⁷⁴ p⁴r⁵ormanc⁴ o⁵ un⁸⁶rams vs. un⁸+b⁸⁶rams and Naïv⁴ ay⁴s vs. SVM class⁸fi⁴rs. ⁴s⁴ t⁴sts s⁷ow t⁷at t⁷⁴ PU-mod⁸fi⁴d al⁶or⁸t⁷m ac⁷⁸⁴v⁴d a b⁴st -m⁴asur⁴ o⁵ 0. ⁸n t⁷⁴ class⁸ficat⁸on o⁵ n⁴⁶at⁸v⁴ op⁸n⁸ons us⁸n⁶ 0 lab⁴l⁴d d⁴c⁴pt⁸v⁴ op⁸n⁸ons ⁵or tra⁸n⁸n⁶ and a b⁴st -m⁴asur⁴ o⁵ 0. “⁸n t⁷⁴ d⁴t⁴ct⁸on o⁵ pos⁸t⁸v⁴ d⁴c⁴pt⁸v⁴ and trut⁷⁵ul op⁸n⁸ons us⁸n⁶ only 00 lab⁴l⁴d tra⁸n⁸n⁶ sampl⁴s.” - not⁴s t⁷at t⁷⁴ vocabulary o⁵ t⁷⁴ n⁴⁶at⁸v⁴ op⁸n⁸ons ⁸s lar⁶⁴r t⁷an t⁷at ⁵or t⁷⁴ pos⁸t⁸v⁴ “⁸nd⁸cat⁸n⁶ t⁷⁴⁸r cont⁴nt ⁸s ⁸n ⁶⁴n⁴ral mor⁴ d⁴ta⁸l⁴d and d⁸v⁴rs⁴,” r⁴qu⁸r⁸n⁶ t⁷⁴ lar⁶⁴r tra⁸n⁸n⁶ s⁴t. Ov⁴rall, t⁷⁴ PU-mod⁸fi⁴d al⁶or⁸t⁷m outp⁴r⁵orms bot⁷ a bas⁴l⁸n⁴ m⁴asur⁴ us⁸n⁶ t⁷⁴ w⁷ol⁴ unlab⁴l⁴d s⁴t as n⁴⁶at⁸v⁴s ⁵or t⁷⁴ tra⁸n⁸n⁶ s⁴t as w⁴ll as t⁷⁴ PU-or⁸⁶⁸nal al⁶or⁸t⁷m, w⁸t⁷ an -m⁴asur⁴ o⁵ 0. ⁵or bot⁷ T and pos⁸t⁸v⁴ op⁸n⁸ons w⁷⁴n tra⁸n⁴d on 00 pos⁸t⁸v⁴ op⁸n⁸ons and 0 unlab⁴l⁴d op⁸n⁸ons. or bot⁷ T and n⁴⁶at⁸v⁴ op⁸n⁸ons, t⁷⁴ PU-mod⁸fi⁴d al⁶or⁸t⁷m ac⁷⁸⁴v⁴s t⁷⁴ sam⁴ b⁴st -m⁴asur⁴ as t⁷⁴ PU-or⁸⁶⁸nal al⁶or⁸t⁷m, 0., w⁷⁴n tra⁸n⁴d on 0 n⁴⁶at⁸v⁴ op⁸n⁸ons and 0 unlab⁴l⁴d op⁸n⁸ons. A⁶a⁸n, t⁷⁴ low⁴r -m⁴asur⁴ and lar⁶⁴r tra⁸n⁸n⁶ s⁴t r⁴qu⁸r⁴d d⁴monstrat⁴s t⁷⁴ ⁶r⁴at⁴r d⁸fficulty o⁵ class⁸⁵y⁸n⁶ n⁴⁶at⁸v⁴ op⁸n⁸ons. W⁸t⁷ r⁴sp⁴ct to t⁷⁴ us⁴ o⁵ a s⁸n⁶l⁴ polar⁸ty, pos⁸t⁸v⁴ or n⁴⁶at⁸v⁴, to tra⁸n as oppos⁴d to comb⁸n⁸n⁶ t⁷⁴ two polar⁸t⁸⁴s ⁸nto a lar⁶⁴r tra⁸n⁸n⁶ s⁴t, -’s scor⁴s s⁷ow t⁷at t⁷⁴ comb⁸nat⁸on r⁴sults ⁸n sup⁴r⁸or p⁴r⁵ormanc⁴. A comb⁸n⁴d tra⁸n⁸n⁶ s⁴t o⁵ 0 op⁸n⁸ons and ,00 unlab⁴l⁴d op⁸n⁸ons ac⁷⁸⁴v⁴s t⁷⁴ top -m⁴asur⁴ o⁵ 0. on op⁸n⁸ons and 0.0 on T op⁸n⁸ons. ⁸s com-
4.3. THE CURRENT SYSTEMS
65
par⁴s w⁴ll to Ott-’s -m⁴asur⁴s on comb⁸n⁴d polar⁸ty tra⁸n⁸n⁶ s⁴ts (s⁴⁴ Tabl⁴ .), ⁶⁸v⁴n t⁷at Ott-’s tra⁸n⁸n⁶ data ⁸s all lab⁴l⁴d. ⁸nally, - t⁴st⁴d t⁷⁴ p⁴r⁵ormanc⁴ o⁵ t⁷⁴ PU-l⁴arn⁸n⁶ m⁴t⁷od on d⁸scr⁸m⁸nat⁸on o⁵ pos⁸t⁸v⁴ vs. n⁴⁶at⁸v⁴ r⁴v⁸⁴ws us⁸n⁶ un⁸⁶rams and un⁸+b⁸⁶rams w⁸t⁷ Naïv⁴ ay⁴s and SVM class⁸fi⁴rs. - ⁵ound t⁷at PU-l⁴arn⁸n⁶ prov⁸d⁸n⁶ data to t⁷⁴ N class⁸fi⁴r ⁶⁸v⁴s t⁷⁴ b⁴st r⁴sults on pr⁴d⁸ct⁸n⁶ pos⁸t⁸v⁴ r⁴v⁸⁴ws, w⁸t⁷ an -m⁴asur⁴ on un⁸⁶rams o⁵ 0. (tra⁸n⁸n⁶ data o⁵ 00 d⁴c⁴pt⁸v⁴ n⁴⁶at⁸v⁴ and 0 unlab⁴l⁴d r⁴v⁸⁴ws) and an -m⁴asur⁴ on un⁸+b⁸⁶rams o⁵ 0. (tra⁸n⁸n⁶ data o⁵ 0 d⁴c⁴pt⁸v⁴ n⁴⁶at⁸v⁴ and 0 unlab⁴l⁴d r⁴v⁸⁴ws). or t⁷⁴ pr⁴d⁸ct⁸on o⁵ n⁴⁶at⁸v⁴ r⁴v⁸⁴ws, t⁷⁴ N class⁸fi⁴r prov⁸d⁴d t⁷⁴ b⁴st r⁴sults, w⁸t⁷ an -m⁴asur⁴ o⁵ 0. on un⁸⁶rams and 0. on un⁸+b⁸⁶rams (tra⁸n⁸n⁶ data o⁵ 0 d⁴c⁴pt⁸v⁴ n⁴⁶at⁸v⁴ and 0 unlab⁴l⁴d r⁴v⁸⁴ws). n summary, - prov⁸d⁴s confirmat⁸on o⁵ Ott-’s cla⁸m w⁸t⁷ r⁴sp⁴ct to us⁸n⁶ comb⁸n⁴d polar⁸ty data t⁷at mor⁴ data, r⁴⁶ardl⁴ss o⁵ polar⁸ty, ⁸s b⁴tt⁴r, ⁵rom w⁷⁸c⁷ w⁴ can ⁸n⁵⁴r t⁷at pos⁸t⁸v⁴ and n⁴⁶at⁸v⁴ r⁴v⁸⁴ws ⁷av⁴ mor⁴ ⁸n common w⁸t⁷ ⁴ac⁷ ot⁷⁴r t⁷an do Tru⁴ and als⁴ r⁴v⁸⁴ws. N⁴⁶at⁸v⁴ d⁴c⁴pt⁸v⁴ op⁸n⁸ons ar⁴ ⁷ard⁴r to d⁴t⁴ct t⁷an pos⁸t⁸v⁴ d⁴c⁴pt⁸v⁴ op⁸n⁸ons. And, ⁵or appl⁸cat⁸ons t⁷at r⁴qu⁸r⁴ lar⁶⁴ tra⁸n⁸n⁶ s⁴ts, PU-l⁴arn⁸n⁶, part⁸cularly t⁷⁴ mod⁸fi⁴d l⁴arn⁸n⁶ propos⁴d by -, prov⁸d⁴s a r⁴asonabl⁴ m⁴ans o⁵ ⁶⁴n⁴rat⁸n⁶ a lar⁶⁴ tra⁸n⁸n⁶ s⁴t ⁵rom a small s⁴t o⁵ known pos⁸t⁸v⁴ d⁴c⁴pt⁸v⁴ op⁸n⁸ons and a lar⁶⁴ s⁴t o⁵ unlab⁴l⁴d op⁸n⁸ons. ornac⁸ar⁸ and Po⁴s⁸o [0] (F&P-14) advanc⁴ t⁷⁴ sam⁴ ⁶oal as - to bu⁸ld a data s⁴t ⁵or d⁴c⁴pt⁸on r⁴s⁴arc⁷ by max⁸m⁸z⁸n⁶ t⁷⁴ ⁴st⁸mat⁴ o⁵ ⁶round trut⁷ ⁵or r⁴v⁸⁴ws w⁷os⁴ T/ valu⁴ ⁸s not known w⁸t⁷ a ⁷⁸⁶⁷ d⁴⁶r⁴⁴ o⁵ c⁴rta⁸nty [ornac⁸ar⁸ and Po⁴s⁸o, 0]. nst⁴ad o⁵ t⁷⁴ s⁴m⁸-sup⁴rv⁸s⁴d approac⁷ o⁵ -, ⁷ow⁴v⁴r, t⁷⁴ aut⁷ors us⁴ w⁷at w⁴ m⁸⁶⁷t t⁴rm sl⁴ut⁷⁸n⁶ to d⁸scov⁴r t⁷⁴ v⁴rac⁸ty o⁵ Amazon book r⁴v⁸⁴ws, work⁸n⁶ w⁸t⁷ J⁴r⁴my uns, a wr⁸t⁴r w⁷o ⁷ad unmask⁴d a numb⁴r o⁵ bo⁶us Amazon book r⁴v⁸⁴w⁴rs. &P- ⁴stabl⁸s⁷⁴s r⁴v⁸⁴ws as ⁵als⁴ “w⁸t⁷ a ⁷⁸⁶⁷ d⁴⁶r⁴⁴ o⁵ confid⁴nc⁴,” bas⁴d pr⁸mar⁸ly on r⁴v⁸⁴w⁴r adm⁸ss⁸on o⁵ t⁷⁴⁸r ⁵als⁸ty and r⁴v⁸⁴ws as tru⁴ bas⁴d on t⁷⁴ po⁸ntl⁴ssn⁴ss o⁵ pay⁸n⁶ som⁴on⁴ to wr⁸t⁴ a als⁴ r⁴v⁸⁴w o⁵ books by d⁴ad aut⁷ors (onan oyl⁴ and K⁸pl⁸n⁶ ar⁴ nam⁴d) or b⁴st-s⁴ll⁸n⁶ l⁸v⁸n⁶ aut⁷ors (K⁴n oll⁴tt and St⁴p⁷⁴n K⁸n⁶ ar⁴ nam⁴d). ⁸s ⁸s t⁷⁴⁸r ⁶old-standard data. An add⁸t⁸onal , r⁴v⁸⁴ws ar⁴ lab⁴l⁴d as tru⁴ and , ar⁴ lab⁴l⁴d als⁴ bas⁴d on ⁵our cu⁴s to d⁴c⁴pt⁸on () r⁴v⁸⁴ws ⁸d⁴nt⁸fi⁴d ⁸n n⁴ws art⁸cl⁴s as purc⁷as⁴d () r⁴v⁸⁴ws o⁵ a s⁸n⁶l⁴ book t⁷at all app⁴ar w⁸t⁷⁸n a s⁷ort p⁴r⁸od o⁵ t⁸m⁴, su⁶⁶⁴st⁸n⁶ t⁷at t⁷⁴ r⁴v⁸⁴ws w⁴r⁴ purc⁷as⁴d on d⁴adl⁸n⁴ () r⁴v⁸⁴ws by r⁴v⁸⁴w⁴rs w⁷o d⁸d not us⁴ t⁷⁴⁸r r⁴al nam⁴¹⁰ and () lack o⁵ transact⁸onal data t⁷at t⁷⁴ r⁴v⁸⁴w⁴r actually purc⁷as⁴d t⁷⁴ book ⁵rom Amazon. ⁸s ⁸s t⁷⁴⁸r s⁸lv⁴r-standard data. ⁴ m⁴an l⁴n⁶t⁷ o⁵ ⁴ac⁷ r⁴v⁸⁴w was . tok⁴ns—clos⁴ to t⁷⁴ l⁴n⁶t⁷ o⁵ t⁷⁴ n⁴⁶at⁸v⁴ r⁴v⁸⁴ws o⁵ Ott-. &P- do not d⁸scuss t⁷⁴ ov⁴rall cont⁴nt o⁵ t⁷⁴ books r⁴v⁸⁴w⁴d ⁷ow⁴v⁴r, t⁷⁴ on⁴ ⁵als⁴ r⁴v⁸⁴w t⁷at ⁸s assoc⁸at⁴d w⁸t⁷ a book ⁸s t⁸tl⁴d Write Your First Book, w⁷⁸l⁴ t⁷⁴ tru⁴ r⁴v⁸⁴ws, w⁷os⁴ cont⁴nt m⁸⁶⁷t b⁴ ⁸n⁵⁴rr⁴d bas⁴d on t⁷⁴ nam⁴d aut⁷ors, app⁴ar to b⁴ fict⁸on. ⁵ non-fict⁸on books ⁵⁴⁴d t⁷⁴ ⁵als⁴ r⁴v⁸⁴ws w⁷⁸l⁴ fict⁸on books ⁵⁴⁴d t⁷⁴ tru⁴ cat⁴⁶or⁸⁴s, t⁷⁸s would b⁴ a con⁵ound⁸n⁶ W⁴ ⁶⁸v⁴ a mor⁴ d⁴ta⁸l⁴d d⁴scr⁸pt⁸on o⁵ t⁷⁴ &P- data coll⁴ct⁸on and v⁴r⁸ficat⁸on work ⁸n ⁷apt⁴r , w⁷⁴r⁴ w⁴ d⁸scuss ⁶round trut⁷ annotat⁸on. ¹⁰Amazon ⁴nabl⁴s r⁴v⁸⁴w⁴rs to r⁴⁶⁸st⁴r us⁸n⁶ a ps⁴udonym.
66
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
⁵actor ⁵or &P- s⁸nc⁴ t⁷⁴ vocabulary us⁴d to r⁴v⁸⁴w a non-fict⁸on book may b⁴ qu⁸t⁴ d⁸ff⁴r⁴nt ⁵rom t⁷at us⁴d to r⁴v⁸⁴w a work o⁵ fict⁸on. &P- advanc⁴ a s⁴cond ⁶oal to l⁴arn t⁷⁴ b⁴st class⁸fi⁴r ⁹o⁸ntly w⁸t⁷ t⁷⁴ ⁶round trut⁷. or t⁷⁴ class⁸ficat⁸on ⁴xp⁴r⁸m⁴nts, &P- us⁴ a ⁵a⁸rly standard s⁴t o⁵ ⁵⁴atur⁴s ⁵or t⁷⁴ r⁴v⁸⁴w v⁴ctors, ⁸nclud⁸n⁶ t⁷⁴ most ⁵r⁴qu⁴nt n-⁶rams w⁸t⁷ t⁷⁴ ⁷⁸⁶⁷⁴st n⁵ormat⁸on a⁸n (d⁸scr⁸m⁸natory pow⁴r), t⁷⁴⁸r l⁴mmas and parts o⁵ sp⁴⁴c⁷, and t⁷⁴ l⁴n⁶t⁷ o⁵ t⁷⁴ r⁴v⁸⁴w. ⁴ s⁸lv⁴r-standard r⁴v⁸⁴ws compr⁸s⁴ t⁷⁴ tra⁸n⁸n⁶ s⁴t t⁷⁴ ⁶old-standard r⁴v⁸⁴ws, t⁷⁴ t⁴st s⁴t. &P- t⁴st two m⁴t⁷ods o⁵ class⁸ficat⁸on, Ma⁹or⁸ty Vot⁸n⁶ and L⁴arn⁸n⁶ ⁵rom rowds [Raykar ⁴t al., 00]. Raykar’s al⁶or⁸t⁷m ⁸mprov⁴s on Ma⁹or⁸ty Vot⁸n⁶ by ⁴valuat⁸n⁶ t⁷⁴ sk⁸ll o⁵ ⁴ac⁷ vot⁴r (or “annotator”) on Tru⁴ vs. als⁴ ⁵rom past p⁴r⁵ormanc⁴ on alr⁴ady-s⁴⁴n annotat⁸on. ⁴ study’s us⁴ o⁵ ⁸ts d⁴c⁴pt⁸on cu⁴s d⁸st⁸n⁶u⁸s⁷⁴s t⁷⁴ two m⁴t⁷ods. Und⁴r Ma⁹or⁸ty Vot⁸n⁶, ⁸⁵ a r⁴v⁸⁴w ⁷as mor⁴ t⁷an two o⁵ t⁷⁴ ⁵our cu⁴s, ⁸t ⁸s class⁸fi⁴d as als⁴, ot⁷⁴rw⁸s⁴, Tru⁴. Und⁴r L⁴arn⁸n⁶ ⁵rom rowds, ⁴ac⁷ cu⁴¹¹ ⁸s tr⁴at⁴d as an annotator and w⁴⁸⁶⁷t⁴d bas⁴d on pr⁸or p⁴r⁵ormanc⁴ d⁸scr⁸m⁸nat⁸n⁶ T ⁵rom on alr⁴ady s⁴⁴n r⁴v⁸⁴ws. A⁶a⁸n, t⁷⁴ cu⁴s ar⁴ b⁴⁸n⁶ us⁴d to ⁴stabl⁸s⁷ t⁷⁴ s⁸lv⁴r-standard ⁶round trut⁷, but w⁸t⁷ track b⁴⁸n⁶ k⁴pt o⁵ t⁷⁴⁸r succ⁴ss at d⁸scr⁸m⁸nat⁸n⁶ T ⁵rom . &P- ac⁷⁸⁴v⁴ a total accuracy scor⁴ o⁵ .% on Ma⁹or⁸ty Vot⁸n⁶, and .% on L⁴arn⁸n⁶ ⁵rom rowds, w⁸t⁷ -m⁴asur⁴s o⁵ .% and .%, r⁴sp⁴ct⁸v⁴ly, w⁷⁸c⁷ ⁸s comparabl⁴ to -s PU-l⁴arn⁸n⁶ r⁴sults on T/ d⁸scr⁸m⁸nat⁸on. Ma⁹or⁸ty Vot⁸n⁶ do⁴s part⁸cularly w⁴ll on Pr⁴c⁸s⁸on (.%), but at t⁷⁴ sacr⁸fic⁴ o⁵ R⁴call (.%). L⁴arn⁸n⁶ ⁵rom rowds, on t⁷⁴ ot⁷⁴r ⁷and, ⁷as mor⁴ balanc⁴d r⁴sults Pr⁴c⁸s⁸on .0% R⁴call .0%. M⁴t⁷ods t⁷at bu⁸ld lar⁶⁴ tra⁸n⁸n⁶ s⁴ts ⁵rom product r⁴v⁸⁴w data w⁷os⁴ Tru⁴/als⁴ status ⁸s not known a pr⁸or⁸, suc⁷ as t⁷os⁴ o⁵ - and &P-, ar⁴ su⁸tabl⁴ ⁵or appl⁸cat⁸ons l⁸k⁴ t⁷⁴ ⁴st⁸mat⁸on o⁵ t⁷⁴ l⁸k⁴l⁸⁷ood o⁵ a product r⁴v⁸⁴w b⁴⁸n⁶ ⁵als⁴. or most products purc⁷as⁴d onl⁸n⁴, t⁷⁸s ⁴st⁸mat⁴ s⁷ould b⁴ ⁷⁴lp⁵ul ⁴nou⁶⁷. t ⁸s ⁸mportant to k⁴⁴p ⁸n m⁸nd, t⁷ou⁶⁷, t⁷at som⁴ typ⁴s o⁵ r⁴v⁸⁴w—doctor r⁴comm⁴ndat⁸ons, ⁵or ⁴xampl⁴—ar⁴ mor⁴ cr⁸t⁸cal and may r⁴qu⁸r⁴ t⁷⁴ ⁷⁸⁶⁷⁴r l⁴v⁴l o⁵ confid⁴nc⁴ t⁷at only a ⁶old standard can ⁶⁸v⁴. t ⁸s conc⁴⁸vabl⁴ t⁷at, w⁸t⁷ ⁴nou⁶⁷ ⁴ffort, pos⁸t⁸v⁴ r⁴v⁸⁴ws could b⁴ obta⁸n⁴d ⁸n t⁷⁴s⁴ cr⁸t⁸cal ar⁴as ⁸t ⁸s ⁷ard⁴r to ⁸ma⁶⁸n⁴ obta⁸n⁸n⁶ muc⁷ n⁴⁶at⁸v⁴ data but &P- d⁴scr⁸b⁴ sl⁴ut⁷⁸n⁶ t⁴c⁷n⁸qu⁴s t⁷at m⁸⁶⁷t b⁴ cl⁴v⁴rly adapt⁴d to t⁷⁸s task.
4.3.2 FEATURES ⁴ d⁸fficulty o⁵ val⁸dat⁸n⁶ narrat⁸v⁴s ⁵or ⁶round trut⁷ r⁴sults ⁸n t⁷⁴ product⁸on o⁵ small datas⁴ts, mak⁸n⁶ t⁷⁴ us⁴ o⁵ stat⁸st⁸cal t⁴c⁷n⁸qu⁴s on n-⁶rams r⁴lat⁸v⁴ly ⁸n⁴ff⁴ct⁸v⁴ unl⁴ss t⁷⁴ data ⁸s constra⁸n⁴d ⁸n ot⁷⁴r ways, pr⁸mar⁸ly by top⁸c c⁷o⁸c⁴, as su⁶⁶⁴st⁴d ⁸n t⁷⁴ pr⁴v⁸ous s⁴ct⁸on. As w⁴ ⁷av⁴ s⁴⁴n t⁷⁴r⁴, t⁷⁸s constra⁸nt ⁴nabl⁴s t⁷⁴ us⁴ o⁵ s⁴m⁸-sup⁴rv⁸s⁴d l⁴arn⁸n⁶ to bu⁸ld lar⁶⁴r datas⁴ts ⁵or appl⁸cat⁸ons l⁸k⁴ t⁷⁴ Tw⁸tt⁴r scams o⁵ ⁷⁴n ⁴t al. [0] and ⁷ot⁴l r⁴v⁸⁴ws o⁵ ⁴rnánd⁴z us⁸l⁸⁴r ⁴t al. [0], w⁷⁴r⁴ t⁷⁴ lan⁶ua⁶⁴ ⁸s ⁷⁸⁶⁷ly constra⁸n⁴d. ¹¹or r⁴asons ⁸nt⁴rnal to t⁷⁴ al⁶or⁸t⁷m, &P do not us⁴ t⁷⁴ first cu⁴.
4.3. THE CURRENT SYSTEMS
67
W⁷⁴r⁴ t⁷⁴ lan⁶ua⁶⁴ ⁸s l⁴ss constra⁸n⁴d, ⁷ow⁴v⁴r, most computat⁸onal stud⁸⁴s o⁵ d⁴c⁴pt⁸on ⁷av⁴ turn⁴d to som⁴ ⁵orm o⁵ ⁶⁴n⁴ral⁸zat⁸on ov⁴r a s⁴t o⁵ n-⁶rams, abov⁴ and b⁴yond st⁴mm⁸n⁶, to boost t⁷⁴ amount o⁵ r⁴l⁴vant data. ⁴ ⁵⁴atur⁴s and not⁴d by M&S ⁴x⁴mpl⁸⁵y t⁷⁸s t⁴c⁷n⁸qu⁴ and s⁴v⁴ral o⁵ t⁷⁴ computat⁸onal d⁴c⁴pt⁸on stud⁸⁴s ⁷av⁴ mad⁴ us⁴ o⁵ t⁷⁴ LW cat⁴⁶or⁸⁴s t⁷at M&S br⁸⁴fly compar⁴ to t⁷⁴⁸r n-⁶ram approac⁷, w⁷⁸l⁴ ot⁷⁴rs ⁷av⁴ mad⁴ us⁴ o⁵ a var⁸⁴ty o⁵ ⁵⁴atur⁴s to captur⁴ d⁸ff⁴r⁴nc⁴s b⁴tw⁴⁴n trut⁷⁵ul and d⁴c⁴pt⁸v⁴ lan⁶ua⁶⁴. Studies Using Surface Features
ur⁶oon ⁴t al. [00] ⁸s t⁷⁴ ⁴arl⁸⁴st automat⁴d ⁴ffort w⁴ know t⁷at d⁸scr⁸m⁸nat⁴d tru⁴ ⁵rom ⁵als⁴ narrat⁸v⁴s. t r⁴ports t⁷⁴ analys⁸s o⁵ v⁴rbal data ⁵rom a mock t⁷⁴⁵t w⁷⁴r⁴ ⁷al⁵ t⁷⁴ part⁸c⁸pants ar⁴ ⁸nstruct⁴d to st⁴al a wall⁴t and l⁸⁴ about t⁷⁴ t⁷⁴⁵t and t⁷⁴ ot⁷⁴r ⁷al⁵ not st⁴al and t⁴ll t⁷⁴ trut⁷, y⁸⁴ld⁸n⁶ ⁸nt⁴rv⁸⁴w narrat⁸v⁴s, o⁵ w⁷⁸c⁷ w⁴r⁴ conduct⁴d v⁸a t⁴xt c⁷at and 0 v⁸a aud⁸o c⁷at. ur⁶oon us⁴d ⁵⁴atur⁴s t⁷at ar⁴ t⁷⁴ l⁴ast cont⁴xt s⁴ns⁸t⁸v⁴ and t⁷⁴ most am⁴nabl⁴ to automat⁸on, ⁸nclud⁸n⁶ . quant⁸ty (numb⁴r o⁵ syllabl⁴s, words, s⁴nt⁴nc⁴s) . vocabulary ompl⁴x⁸ty (numb⁴r o⁵ b⁸⁶ [sic] words, numb⁴r o⁵ syllabl⁴s p⁴r word) . ⁶rammat⁸cal ompl⁴x⁸ty (numb⁴r o⁵ s⁷ort s⁴nt⁴nc⁴s, numb⁴r o⁵ lon⁶ s⁴nt⁴nc⁴s, l⁴s⁷K⁸nca⁸d ⁶rad⁴ l⁴v⁴l, av⁴ra⁶⁴ numb⁴r o⁵ words p⁴r s⁴nt⁴nc⁴, s⁴nt⁴nc⁴ compl⁴x⁸ty, numb⁴r o⁵ con⁹unct⁸ons) and . sp⁴c⁸fic⁸ty and ⁴xpr⁴ss⁸v⁴n⁴ss (⁴mot⁸v⁴n⁴ss ⁸nd⁴x, rat⁴ o⁵ ad⁹⁴ct⁸v⁴s and adv⁴rbs, numb⁴r o⁵ aff⁴ct⁸v⁴ t⁴rms). ⁴ ⁵⁴atur⁴s w⁴r⁴ obta⁸n⁴d us⁸n⁶ a s⁷allow pars⁴r (“rok or sk⁸m”) and d⁸ct⁸onary look-up. Us⁸n⁶ a . d⁴c⁸s⁸on tr⁴⁴ al⁶or⁸t⁷m [Qu⁸nlan, ] and -⁵old cross-val⁸dat⁸on, ur⁶oon ⁴t al. [00] obta⁸n⁴d an accuracy o⁵ .% as r⁴pr⁴s⁴nt⁴d ⁸n Tabl⁴ . Table 4.7: on⁵us⁸on matr⁸x produc⁴d by ur⁶oon ⁴t al.’s . tr⁴⁴
Actual lass
Tru⁴ als⁴
Pr⁴d⁸ct⁴d lass Tru⁴ als⁴ 0
Us⁸n⁶ ur⁶oon’s bas⁴l⁸n⁴ o⁵ ⁷uman ⁹ud⁶m⁴nts, w⁷⁸c⁷ ar⁴ “typ⁸cally v⁴ry poor at d⁴t⁴ct⁸n⁶ d⁴c⁴pt⁸on and ⁵allac⁸ous ⁸n⁵ormat⁸on, t⁷⁴⁸r “pr⁴d⁸ct⁸on rat⁴” o⁵ 0.%¹² ⁸s s⁸⁶n⁸ficant and support⁴d t⁷⁴ us⁴ o⁵ l⁸n⁶u⁸st⁸c ⁵⁴atur⁴s to s⁴parat⁴ trut⁷⁵ul ⁵rom d⁴c⁴pt⁸v⁴ lan⁶ua⁶⁴. ¹²W⁴ ar⁴ assum⁸n⁶ t⁷at t⁷⁴ % d⁸scr⁴pancy b⁴tw⁴⁴n ur⁶oon ⁴t al.’s accuracy scor⁴ as d⁴r⁸v⁴d ⁵rom Tabl⁴ . and t⁷⁴⁸r quot⁴d “pr⁴d⁸ct⁸on rat⁴” r⁴sults ⁵rom a d⁸ff⁴r⁴nc⁴ b⁴tw⁴⁴n t⁷⁴ two m⁴asur⁴s t⁷at ⁸s not ⁴xpl⁸c⁸tly stat⁴d ⁸n t⁷⁴ art⁸cl⁴.
68
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
Z⁷ou ⁴t al. [00b] t⁴sts cu⁴s mot⁸vat⁴d by var⁸ous t⁷⁴or⁸⁴s o⁵ d⁴c⁴pt⁸on. Z⁷ou ⁴t al. [00a] d⁴scr⁸b⁴s t⁷⁴ cu⁴s, t⁷⁴ NLP t⁴c⁷n⁸qu⁴s us⁴d to ⁸mpl⁴m⁴nt ⁴ac⁷ cu⁴, and t⁷⁴ s⁸⁶n⁸ficanc⁴ l⁴v⁴l o⁵ ⁴ac⁷ cu⁴ w⁸t⁷ r⁴sp⁴ct to ⁸ts d⁸scr⁸m⁸natory pow⁴r. Z⁷ou ⁴t al. [00b] ar⁴ abl⁴ to ac⁷⁸⁴v⁴ a r⁴markabl⁴ l⁴v⁴l o⁵ d⁸scr⁸m⁸nat⁸on on laboratory-⁶⁴n⁴rat⁴d ⁴ma⁸l ⁴xc⁷an⁶⁴s o⁵ stud⁴nt pa⁸rs work⁸n⁶ on a mod⁸fi⁴d v⁴rs⁸on o⁵ t⁷⁴ ⁴s⁴rt Surv⁸val Probl⁴m [Laff⁴rty ⁴t al., ] ⁸n w⁷⁸c⁷ t⁷⁴ sub⁹⁴cts w⁴r⁴ to ac⁷⁸⁴v⁴ a cons⁴nsus rank⁸n⁶ o⁵ ⁸t⁴ms n⁴c⁴ssary ⁵or surv⁸val a⁵t⁴r a ⁹⁴⁴p cras⁷ ⁸n t⁷⁴ Kuwa⁸t⁸ d⁴s⁴rt. n two sl⁸⁶⁷tly d⁸ff⁴r⁴nt ⁴xp⁴r⁸m⁴nts a total o⁵ sub⁹⁴cts ⁴xc⁷an⁶⁴d a max⁸mum o⁵ ⁴ma⁸ls p⁴r sub⁹⁴ct w⁸t⁷ a total o⁵ ⁴ma⁸l s⁴nd⁴rs ⁸nstruct⁴d to d⁴c⁴⁸v⁴ ⁸n t⁷⁴⁸r ar⁶um⁴nts ⁵or t⁷⁴⁸r pr⁴⁵⁴rr⁴d rank⁸n⁶. ⁴ ma⁸n ⁶oal o⁵ Z⁷ou ⁴t al. [00b] ⁸s to compar⁴ t⁷⁴ p⁴r⁵ormanc⁴ o⁵ ⁵our stat⁸st⁸cal m⁴t⁷ods—d⁸scr⁸m⁸nant analys⁸s, lo⁶⁸st⁸c r⁴⁶r⁴ss⁸on, d⁴c⁸s⁸on tr⁴⁴s, and n⁴ural n⁴tworks—⁸n d⁸scr⁸m⁸nat⁸n⁶ T ⁵rom narrat⁸v⁴s. ⁴ study prov⁸d⁴s a rat⁸onal⁴ ⁵or t⁷⁴ vary⁸n⁶ p⁴r⁵ormanc⁴ o⁵ ⁴ac⁷ m⁴t⁷od on () t⁷⁴ coll⁴ct⁸on o⁵ ⁸nd⁸v⁸dual m⁴ssa⁶⁴s (t⁷⁴ “m⁴ssa⁶⁴ data”) and () t⁷⁴ “⁴st⁸mat⁴ o⁵ ⁴ac⁷ cu⁴ ⁸n ⁸nd⁸v⁸dual m⁴ssa⁶⁴s . . . av⁴ra⁶⁴d by sub⁹⁴ct” (t⁷⁴ “sub⁹⁴ct data”). ⁴ study also cons⁸d⁴rs t⁷⁴ p⁴r⁵ormanc⁴ o⁵ t⁷⁴ ⁵our m⁴t⁷ods on t⁷⁴ ⁵ull s⁴t o⁵ cu⁴s compar⁴d to t⁷⁴ p⁴r⁵ormanc⁴ on only t⁷⁴ most d⁸scr⁸m⁸natory cu⁴s w⁸t⁷ t⁷⁴ latt⁴r y⁸⁴ld⁸n⁶ b⁴tt⁴r accuracy scor⁴s across all ⁵our. Z⁷ou ⁴t al. [00b] ac⁷⁸⁴v⁴ a ⁷⁸⁶⁷ d⁴⁶r⁴⁴ o⁵ accuracy—as muc⁷ as .% w⁸t⁷ t⁷⁴ n⁴ural n⁴twork on t⁷⁴ sub⁹⁴ct data and .% on t⁷⁴ m⁴ssa⁶⁴ data—us⁸n⁶ t⁷⁴ ⁷⁸⁶⁷ly d⁸scr⁸m⁸natory cu⁴s. ⁸s ⁸s part⁸cularly r⁴markabl⁴ ⁶⁸v⁴n t⁷at t⁷⁴ study ⁸s amon⁶ t⁷⁴ ⁴arl⁸⁴st d⁴c⁴pt⁸on d⁴t⁴ct⁸on ⁸mpl⁴m⁴ntat⁸ons. ⁴ data ⁶⁴n⁴rat⁴d by t⁷⁴ ⁴s⁴rt Surv⁸val task, ⁸s, l⁸k⁴ t⁷⁴ ⁷ot⁴l r⁴v⁸⁴w data, ⁷⁸⁶⁷ly constra⁸n⁴d, mak⁸n⁶ t⁷⁴ cu⁴ ⁸d⁴nt⁸ficat⁸on mor⁴ tractabl⁴, but t⁷⁴ d⁸scr⁸m⁸nat⁸on task ⁸s st⁸ll d⁸fficult, w⁷⁸c⁷ mak⁴s t⁷⁴ sp⁴c⁸fic cu⁴s and t⁷⁴ n⁴ural n⁴tworks bot⁷ look l⁸k⁴ prom⁸s⁸n⁶ contr⁸butors to t⁷⁴ ⁷⁸⁶⁷ p⁴r⁵ormanc⁴. ⁴ study ⁸s car⁴⁵ul to s⁴parat⁴ out t⁷⁴ poss⁸bl⁴ d⁸ff⁴r⁴nc⁴s ⁶⁸v⁴n ⁶⁴nr⁴ and m⁴d⁸um o⁵ commun⁸cat⁸on. ⁴ on⁴ drawback o⁵ Z⁷ou ⁴t al. [00b] ⁸s t⁷⁴ laboratory s⁴tt⁸n⁶ ⁸n w⁷⁸c⁷ d⁴c⁴⁸v⁴rs ar⁴ ⁸nstruct⁴d to l⁸⁴, w⁸t⁷ no t⁷r⁴at and, ⁸nd⁴⁴d, support ⁵or t⁷⁴⁸r ⁵abr⁸cat⁸ons. ⁸s ra⁸s⁴s t⁷⁴ ⁸ssu⁴ o⁵ ⁴colo⁶⁸cal val⁸d⁸ty and t⁷⁴ r⁴lat⁴d ⁸ssu⁴, cons⁸d⁴r⁴d ⁸n Ott ⁴t al. [0], as to t⁷⁴ r⁴lat⁸on b⁴tw⁴⁴n d⁴c⁴pt⁸v⁴ and ⁸ma⁶⁸nat⁸v⁴ narrat⁸v⁴. W⁴ w⁸ll r⁴turn to t⁷⁸s ⁸ssu⁴ ⁸n ⁷apt⁴r . ac⁷⁴nko ⁴t al. [00] (BFS) was t⁷⁴ first computat⁸onal d⁴c⁴pt⁸on study to us⁴ r⁴al-world data tak⁴n ⁵rom mult⁸pl⁴ sourc⁴s, ⁸nclud⁸n⁶ t⁷r⁴⁴ cr⁸m⁸nal stat⁴m⁴nts, two pol⁸c⁴ ⁸nt⁴rro⁶at⁸ons, a tobacco lawsu⁸t d⁴pos⁸t⁸on, and J⁴ffr⁴y Sk⁸ll⁸n⁶’s con⁶r⁴ss⁸onal t⁴st⁸mony on t⁷⁴ nron ⁵a⁸lur⁴. t ⁴stabl⁸s⁷⁴d t⁷⁴ v⁴rac⁸ty o⁵ ⁸nd⁸v⁸dual propos⁸t⁸ons us⁸n⁶ pol⁸c⁴ r⁴ports, court t⁴st⁸mony, m⁴⁴t⁸n⁶ m⁸nut⁴s, v⁸d⁴otap⁴s, ⁴tc. as w⁴ll as contrad⁸ct⁸ons ⁸n t⁷⁴ narrat⁸v⁴ suc⁷ as a con⁵⁴ss⁸on at ⁸ts ⁴nd. rom t⁷⁴s⁴ corroborat⁸ons, ⁸t was abl⁴ to ass⁸⁶n T/ valu⁴s to propos⁸t⁸ons ⁸n ⁸ts , words o⁵ narrat⁸v⁴, als⁴ and t⁷⁴ r⁴ma⁸nd⁴r Tru⁴.
4.3. THE CURRENT SYSTEMS
69
S us⁴d l⁸n⁶u⁸st⁸c ⁵⁴atur⁴s c⁸t⁴d ⁸n t⁷⁴ psyc⁷olo⁶y and cr⁸m⁸nal ⁹ust⁸c⁴ l⁸t⁴ratur⁴ t⁷at can b⁴ ⁵ormally r⁴pr⁴s⁴nt⁴d and automat⁴d ⁸n an NLP syst⁴m. ⁴ ⁵⁴atur⁴s ar⁴ o⁵ t⁷r⁴⁴ typ⁴s ¹³
. lack o⁵ comm⁸tm⁴nt ⁸n w⁷⁸c⁷ t⁷⁴ narrator us⁴s l⁸n⁶u⁸st⁸c m⁴ans to avo⁸d mak⁸n⁶ a d⁸r⁴ct stat⁴m⁴nt. ⁴s⁴ ⁵⁴atur⁴s ⁸nclud⁴ ⁷⁴d⁶⁴s, qual⁸fi⁴d ass⁴rt⁸ons ⁸n w⁷⁸c⁷ ⁸t r⁴ma⁸ns uncl⁴ar w⁷⁴t⁷⁴r an act⁸on was p⁴r⁵orm⁴d (⁴.⁶., I needed to get my inhaler ), un⁴xpla⁸n⁴d laps⁴s o⁵ t⁸m⁴ (later that day), ov⁴rz⁴alous ⁴xpr⁴ss⁸ons (I swear to God ), and rat⁸onal⁸zat⁸ons (I was unfamiliar with the road ) . n⁴⁶at⁸v⁸ty. ⁴s⁴ ⁵⁴atur⁴s ⁸nclud⁴ n⁴⁶at⁸v⁴ ⁵orms (never, inconceivable), n⁴⁶at⁸v⁴ ⁴mot⁸ons (I was a nervous wreck), and m⁴mory loss (I forget ) and . ⁸ncons⁸st⁴nc⁸⁴s ⁸n v⁴rb and noun ⁵orms ⁸nclud⁸n⁶ v⁴rb t⁴ns⁴ c⁷an⁶⁴s, t⁷⁴mat⁸c rol⁴ c⁷an⁶⁴s (⁴.⁶., a NP s⁷⁸⁵t⁸n⁶ ⁵rom a⁶⁴nt to pat⁸⁴nt), d⁸ff⁴r⁴nt NP ⁵orms ⁵or t⁷⁴ sam⁴ r⁴⁵⁴r⁴nt (my family b⁴com⁴s some people) and c⁷an⁶⁴s ⁸n pronoun usa⁶⁴. S ⁷ypot⁷⁴s⁸z⁴ t⁷at t⁷⁴ d⁴ns⁸ty d⁸str⁸but⁸on o⁵ t⁷⁴s⁴ ⁵⁴atur⁴s corr⁴lat⁴s w⁸t⁷ d⁴c⁴pt⁸on. To ⁸nstant⁸at⁴ t⁷⁸s t⁷⁴y us⁴ a mov⁸n⁶ av⁴ra⁶⁴ to scor⁴ ⁴ac⁷ word ⁸n a narrat⁸v⁴, w⁸t⁷ ⁴ac⁷ ⁸nd⁸cator and prox⁸m⁸ty to an ⁸nd⁸cator low⁴r⁸n⁶ t⁷⁴ av⁴ra⁶⁴. ⁴ mov⁸n⁶ av⁴ra⁶⁴ ⁴nabl⁴s t⁷⁴ automat⁸c s⁴⁶m⁴ntat⁸on o⁵ t⁷⁴ narrat⁸v⁴ ⁸nto non-ov⁴rlapp⁸n⁶ r⁴⁶⁸ons t⁷at ar⁴ ⁸d⁴nt⁸fi⁴d as l⁸k⁴ly tru⁴, l⁸k⁴ly d⁴c⁴pt⁸v⁴ or som⁴w⁷⁴r⁴ ⁸n b⁴tw⁴⁴n. onv⁴rt⁸n⁶ t⁷⁴ l⁸k⁴ly tru⁴ and l⁸k⁴ly d⁴c⁴pt⁸v⁴ s⁴⁶m⁴nts ⁸nto T and , r⁴sp⁴ct⁸v⁴ly, S ac⁷⁸⁴v⁴ t⁷⁴ r⁴sults r⁴pr⁴s⁴nt⁴d ⁸n Tabl⁴ . w⁸t⁷ an accuracy o⁵ .%. Table 4.8: T/ class⁸ficat⁸on bas⁴d on S’s cu⁴ d⁴ns⁸ty al⁶or⁸t⁷m
Actual lass
Tru⁴ als⁴
Pr⁴d⁸ct⁴d lass Tru⁴ als⁴ Accuracy (%) 0 . .
S also po⁸nt out t⁷at by ra⁸s⁸n⁶ t⁷⁴ cut-off scor⁴ ⁵or als⁴, t⁷⁴ syst⁴m can ⁵avor ⁷⁸⁶⁷ r⁴call or ⁷⁸⁶⁷ pr⁴c⁸s⁸on ⁵or T or as t⁷⁴ appl⁸cat⁸on r⁴qu⁸r⁴s ⁵or ⁴xampl⁴, ⁸n cas⁴s w⁷⁴r⁴ ⁸nv⁴st⁸⁶ators ar⁴ look⁸n⁶ ⁵or l⁴ads, ⁷⁸⁶⁷ r⁴call could b⁴ valuabl⁴ ⁸n and o⁵ ⁸ts⁴l⁵ at t⁷⁴ ⁴xp⁴ns⁴ o⁵ ⁷⁸⁶⁷ pr⁴c⁸s⁸on. ⁴ accuracy o⁵ t⁷⁴ S r⁴sult ⁸s substant⁸al ⁶⁸v⁴n t⁷⁴ d⁸v⁴rs⁸ty o⁵ t⁷⁴ data and t⁷⁴ r⁴al world lan⁶ua⁶⁴ and sourc⁴ c⁷⁴ck⁸n⁶. t ⁸s also wort⁷ not⁸n⁶ ⁷⁴r⁴ t⁷⁴ obs⁴rvat⁸on by bot⁷ ⁴Paulo ⁴t al. [00] and Vr⁸⁹ [00] t⁷at t⁷⁴ n⁴xt st⁴p ⁵orward ⁸n d⁴c⁴pt⁸on r⁴s⁴arc⁷ would b⁴ ⁸n t⁷⁴ ⁸d⁴nt⁸ficat⁸on o⁵ clust⁴rs o⁵ cu⁴s. omputat⁸onal m⁴t⁷ods ⁴nabl⁴ t⁷⁴ r⁴co⁶n⁸t⁸on o⁵ clust⁴rs, and t⁷⁴ scor⁸n⁶ and mov⁸n⁶ av⁴ra⁶⁴ t⁴c⁷n⁸qu⁴ o⁵ S ⁸s t⁷⁴ first al⁶or⁸t⁷m, as ⁵ar as w⁴ know, t⁷at ⁸d⁴nt⁸fi⁴s clust⁴rs o⁵ d⁴c⁴pt⁸on cu⁴s. ¹³S⁴v⁴ral o⁵ t⁷⁴s⁴ ⁵⁴atur⁴s ar⁴ not pur⁴ly sur⁵ac⁴ ⁵⁴atur⁴s qual⁸fi⁴d ass⁴rt⁸ons, t⁷⁴mat⁸c rol⁴ c⁷an⁶⁴s, and r⁴⁵⁴r⁴nt c⁷⁴ck⁸n⁶ r⁴qu⁸r⁴ pars⁴ tr⁴⁴ ⁸n⁵ormat⁸on, w⁷⁸l⁴ rat⁸onal⁸zat⁸ons r⁴qu⁸r⁴ ⁷uman ⁹ud⁶m⁴nt.
70
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
Studies that Lack CL Properties but with Interesting Observations
W⁷⁸l⁴ t⁷⁴y do not propos⁴ a pr⁴d⁸ct⁸v⁴ mod⁴l ⁵or d⁴c⁴pt⁸on, K⁴⁸la and Sk⁸ll⁸corn [00] po⁸nt to an ⁸nt⁴r⁴st⁸n⁶ appl⁸cat⁸on o⁵ automat⁸c d⁴c⁴pt⁸on d⁴t⁴ct⁸on t⁷⁴ rank⁸n⁶ o⁵ a lar⁶⁴ s⁴t o⁵ docum⁴nts, ⁸n t⁷⁸s cas⁴ t⁷⁴ , ⁴ma⁸ls o⁵ t⁷⁴ nron datas⁴t, to r⁴fl⁴ct “t⁷⁴⁸r ⁸mportanc⁴ ⁵or d⁸scov⁴r⁸n⁶ mal⁵⁴asanc⁴ ⁸ns⁸d⁴ an or⁶an⁸zat⁸on” [K⁴⁸la and Sk⁸ll⁸corn, 00]. Z⁷ou ⁴t al. [00a] la⁸d t⁷⁴ ⁶roundwork ⁵or Z⁷ou ⁴t al. [00b] by ⁴valuat⁸n⁶ v⁴rbal cu⁴s ⁸d⁴nt⁸fi⁴d as “am⁴nabl⁴ to automat⁸on.” ⁴ study look⁴d ⁵or s⁸⁶n⁸ficant d⁸ff⁴r⁴nc⁴s ⁸n t⁷⁴ occurr⁴nc⁴ o⁵ t⁷⁴s⁴ cu⁴s ⁸n trut⁷⁵ul vs. d⁴c⁴pt⁸v⁴ m⁴ssa⁶⁴s, w⁷⁸c⁷ w⁴r⁴, as m⁴nt⁸on⁴d abov⁴, laboratory-⁶⁴n⁴rat⁴d ⁴ma⁸ls ⁸nvolv⁸n⁶ t⁷⁴ ⁴s⁴rt Surv⁸val Probl⁴m. ⁴ study prov⁸d⁴s som⁴t⁸m⁴s tractabl⁴ ways o⁵ m⁴asur⁸n⁶ abstract t⁷⁴or⁴t⁸cal cla⁸ms a⁶a⁸nst trut⁷⁵ul and d⁴c⁴pt⁸v⁴ ⁴ma⁸ls. Unc⁴rta⁸nty, ⁵or ⁴xampl⁴, ⁸s m⁴asur⁴d by t⁷⁴ occurr⁴nc⁴ o⁵ mod⁸fi⁴rs, modal v⁴rbs, t⁷⁸rd p⁴rson pronouns, as w⁴ll as words t⁷at d⁸r⁴ctly ⁸nd⁸cat⁴ unc⁴rta⁸nty suc⁷ as maybe. W⁷⁸l⁴ t⁷⁴ study do⁴s not prov⁸d⁴ a pr⁴d⁸ct⁸v⁴ mod⁴l o⁵ d⁴c⁴pt⁸on, ⁸t conn⁴cts s⁴v⁴ral o⁵ t⁷⁴ mor⁴ abstract cu⁴s ⁵rom t⁷⁴ l⁸t⁴ratur⁴ to concr⁴t⁴, l⁸n⁶u⁸st⁸cally bas⁴d m⁴asur⁴s. e Lexical Inquiry and Word Count Features
⁴ LW ⁵⁴atur⁴s ⁶r⁴w out o⁵ att⁴mpts by t⁷⁴ psyc⁷olo⁶⁸sts Jam⁴s P⁴nn⁴bak⁴r and Mart⁷a ranc⁸s [P⁴nn⁴bak⁴r and ranc⁸s, , P⁴nn⁴bak⁴r ⁴t al., ] to pr⁴d⁸ct co⁶n⁸t⁸v⁴ c⁷an⁶⁴ by count⁸n⁶ r⁴l⁴vant words ⁸n a narrat⁸v⁴, w⁸t⁷ co⁶n⁸t⁸v⁴ c⁷an⁶⁴ d⁴fin⁴d as t⁷⁴ us⁴ o⁵ words ⁸n two ⁶⁴n⁴ral t⁴xt d⁸m⁴ns⁸ons s⁴l⁵-r⁴fl⁴ct⁸v⁴ t⁷⁸nk⁸n⁶ and causal t⁷⁸nk⁸n⁶. ⁴ s⁴l⁵-r⁴fl⁴ct⁸on cat⁴⁶ory ⁸nclud⁴s words suc⁷ as realize, understand, think, and consider. ⁴ causal t⁷⁸nk⁸n⁶ cat⁴⁶ory ⁸nclud⁴s words suc⁷ as cause, effect, reason, and because, [P⁴nn⁴bak⁴r ⁴t al., , p. ]. n add⁸t⁸on to t⁷⁴ word cat⁴⁶or⁸⁴s captur⁸n⁶ psyc⁷olo⁶⁸cal constructs, t⁷⁴ curr⁴nt LW appl⁸cat⁸on ⁸nclud⁴s ⁶⁴n⁴ral d⁴scr⁸ptor cat⁴⁶or⁸⁴s (total word count, words p⁴r s⁴nt⁴nc⁴, p⁴rc⁴nta⁶⁴ o⁵ words captur⁴d by t⁷⁴ d⁸ct⁸onary, and p⁴rc⁴nta⁶⁴ o⁵ words lon⁶⁴r t⁷an l⁴tt⁴rs), standard l⁸n⁶u⁸st⁸c d⁸m⁴ns⁸ons (⁴.⁶., p⁴rc⁴nta⁶⁴ o⁵ words ⁸n t⁷⁴ t⁴xt t⁷at ar⁴ pronouns, art⁸cl⁴s, aux⁸l⁸ary v⁴rbs, ⁴tc.), p⁴rsonal conc⁴rn cat⁴⁶or⁸⁴s (⁴.⁶., work, ⁷om⁴, l⁴⁸sur⁴ act⁸v⁸t⁸⁴s), paral⁸n⁶u⁸st⁸c d⁸m⁴ns⁸ons (ass⁴nts, fill⁴rs, nonflu⁴nc⁸⁴s), and punctuat⁸on cat⁴⁶or⁸⁴s (p⁴r⁸ods, commas, ⁴tc.) [P⁴nn⁴bak⁴r ⁴t al., 00, p. ]. ⁴ words class⁸fi⁴d w⁸t⁷⁸n t⁷⁴ psyc⁷olo⁶⁸cal cat⁴⁶or⁸⁴s w⁴r⁴ bas⁴d on word l⁸sts ⁵rom common ⁴mot⁸on rat⁸n⁶ scal⁴s, Roget’s esaurus, n⁶l⁸s⁷ d⁸ct⁸onar⁸⁴s, and ⁸nput by ⁷uman ⁹ud⁶⁴s at var⁸ous sta⁶⁴s o⁵ d⁴v⁴lopm⁴nt. As ⁵ar as w⁴ know, word m⁴mb⁴rs⁷⁸p ⁸n t⁷⁴ cat⁴⁶or⁸⁴s ⁷as not b⁴⁴n ⁴mp⁸r⁸cally t⁴st⁴d a⁶a⁸nst psyc⁷olo⁶⁸cal proc⁴ss⁴s l⁸k⁴ adapt⁸v⁴ b⁴r⁴av⁴m⁴nt, ⁵or w⁷⁸c⁷ t⁷⁴ LW pr⁴d⁴c⁴ssor was or⁸⁶⁸nally d⁴s⁸⁶n⁴d, to d⁴t⁴rm⁸n⁴ t⁷⁴ ⁶oodn⁴ss o⁵ fit o⁵ ⁴ac⁷ word ⁸n t⁷⁴ cat⁴⁶ory. ⁴ computat⁸onal us⁴ o⁵ t⁷⁴ LW cat⁴⁶or⁸⁴s to d⁸st⁸n⁶u⁸s⁷ tru⁴ ⁵rom ⁵als⁴ narrat⁸v⁴s b⁴⁶an w⁸t⁷ an adv⁸s⁴⁴ o⁵ Jam⁴s P⁴nn⁴bak⁴r, Matt⁷⁴w N⁴wman, a P⁷.. cand⁸dat⁴ ⁸n psyc⁷olo⁶y at t⁷⁴ Un⁸v⁴rs⁸ty o⁵ T⁴xas-Aust⁸n at t⁷⁴ t⁸m⁴. N⁴wman’s study analyz⁴d t⁷⁴ lan⁶ua⁶⁴ us⁴d by stud⁴nt sub⁹⁴cts ⁶⁸v⁸n⁶ bot⁷ a ⁵als⁴ account and a tru⁴ account o⁵ t⁷⁴⁸r ⁵⁴⁴l⁸n⁶s on t⁷⁴ top⁸cs o⁵ abort⁸on and ⁵r⁸⁴nds [N⁴wman ⁴t al., 00]. On t⁷⁴ top⁸c o⁵ abort⁸on, sub⁹⁴cts w⁴r⁴ ⁴⁸t⁷⁴r
4.3. THE CURRENT SYSTEMS
71
ask⁴d to sp⁴ak t⁷⁴⁸r tru⁴ and ⁵als⁴ op⁸n⁸ons, to typ⁴ t⁷⁴m, or to ⁷andwr⁸t⁴ t⁷⁴m, Sub⁹⁴cts also part⁸c⁸pat⁴d ⁸n a mock cr⁸m⁴ ⁸n w⁷⁸c⁷ ⁷al⁵ w⁴r⁴ told to st⁴al a dollar b⁸ll and all w⁴r⁴ told to d⁴ny tak⁸n⁶ t⁷⁴ mon⁴y. ⁸s r⁴sult⁴d ⁸n fiv⁴ stud⁸⁴s w⁸t⁷ no sub⁹⁴cts ⁸n common. As a bas⁴l⁸n⁴, N⁴wman us⁴d a pan⁴l o⁵ – ⁷uman ⁹ud⁶⁴s w⁷o ⁴valuat⁴d t⁷⁴ abort⁸on narrators as r⁴port⁸n⁶ t⁷⁴⁸r tru⁴ ⁵⁴⁴l⁸n⁶s or not. ⁴ un⁸qu⁴ c⁷aract⁴r⁸st⁸c o⁵ t⁷⁴ N⁴wman work ⁸s t⁷at ⁸t us⁴d t⁷⁴ LW ⁵⁴atur⁴s to cat⁴⁶or⁸z⁴ lan⁶ua⁶⁴ as d⁴c⁴pt⁸v⁴ or tru⁴. To t⁴st t⁷⁴ ab⁸l⁸ty o⁵ t⁷⁴ ⁵⁴atur⁴s to pr⁴d⁸ct T vs. , N⁴wman s⁴l⁴ct⁴d t⁷⁴ fiv⁴ ⁵⁴atur⁴s t⁷at w⁴r⁴ most d⁸scr⁸m⁸natory and ⁴nt⁴r⁴d t⁷⁴m ⁸nto a lo⁶⁸st⁸c r⁴⁶r⁴ss⁸on to t⁴st t⁷⁴ ab⁸l⁸ty o⁵ t⁷⁴ ⁵⁴atur⁴ d⁸str⁸but⁸on cross-top⁸c, tra⁸n⁸n⁶ on ⁵our o⁵ t⁷⁴ stud⁸⁴s and t⁴st⁸n⁶ on t⁷⁴ fi⁵t⁷. ⁴ s⁴l⁴ct⁴d ⁷⁸⁶⁷ly d⁸scr⁸m⁸natory ⁵⁴atur⁴s o⁵ d⁴c⁴pt⁸on w⁴r⁴ ⁵⁴w⁴r first-p⁴rson s⁸n⁶ular pronouns, ⁵⁴w⁴r t⁷⁸rd-p⁴rson pronouns, mor⁴ n⁴⁶at⁸v⁴ ⁴mot⁸on words, ⁵⁴w⁴r ⁴xclus⁸v⁴ words, and mor⁴ mot⁸on v⁴rbs. N⁴wman r⁴ports an ov⁴rall accuracy on t⁷⁴ fiv⁴ stud⁸⁴s o⁵ %, w⁷⁸c⁷ d⁸ff⁴rs s⁸⁶n⁸ficantly ⁵rom t⁷⁴ ⁹ud⁶⁴s accuracy o⁵ %. W⁷⁸l⁴ t⁷⁴ ov⁴rall accuracy ⁸s su⁶⁶⁴st⁸v⁴ o⁵ t⁷⁴ ab⁸l⁸ty o⁵ t⁷⁴ LW cat⁴⁶or⁸⁴s to d⁸st⁸n⁶u⁸s⁷ T ⁵rom narrat⁸v⁴s, t⁷⁴ d⁴p⁴nd⁴nc⁴ o⁵ t⁷⁴ accuracy scor⁴ on top⁸c ⁸s notabl⁴, w⁸t⁷ t⁷⁴ abort⁸on narrat⁸v⁴s ⁸nt⁴r al⁸a produc⁸n⁶ t⁷⁴ ⁷⁸⁶⁷⁴r scor⁴s w⁷⁸l⁴ t⁷⁴ ov⁴rall accuracy o⁵ t⁷⁴ ⁵our stor⁸⁴s to pr⁴d⁸ct t⁷⁴ ⁵r⁸⁴nds narrat⁸v⁴ was % and to pr⁴d⁸ct t⁷⁴ mock cr⁸m⁴ was %, muc⁷ low⁴r t⁷an M&S’s cross-top⁸c accuracy on t⁷⁴⁸r s⁸mpl⁴ n-⁶ram study r⁴pr⁴s⁴nt⁴d ⁸n Tabl⁴ .. Also notabl⁴ ⁸s t⁷⁴ ⁵act t⁷at only on⁴ l⁸n⁶u⁸st⁸c d⁸m⁴ns⁸on, r⁴⁵⁴r⁴nc⁴s to s⁴l⁵, was a d⁸scr⁸m⁸natory ⁵⁴atur⁴ ⁸n bot⁷ t⁷⁴ N⁴wman ⁴t al. and t⁷⁴ M&S post ⁷oc study, d⁴sp⁸t⁴ t⁷⁴ ⁵act t⁷at two o⁵ t⁷⁴ t⁷r⁴⁴ top⁸cs ⁸n bot⁷ stud⁸⁴s w⁴r⁴ abort⁸on and ⁵r⁸⁴nds. W⁴ ⁷av⁴ alr⁴ady cons⁸d⁴r⁴d t⁷⁴ r⁴sults o⁵ Ott ⁴t al. [0] w⁸t⁷ r⁴sp⁴ct to n-⁶rams, but t⁷⁴ study cons⁸d⁴rs t⁷r⁴⁴ p⁴rsp⁴ct⁸v⁴s on t⁷⁴ d⁴t⁴ct⁸on o⁵ d⁴c⁴pt⁸on w⁸t⁷ r⁴sp⁴ct to t⁷⁴⁸r onl⁸n⁴ ⁷ot⁴l r⁴v⁸⁴w data, r⁴⁶ard⁸n⁶ ⁸t not only as () a t⁴xt class⁸ficat⁸on task, but also as () a cas⁴ o⁵ psyc⁷ol⁸n⁶u⁸st⁸c d⁴c⁴pt⁸on d⁴t⁴ct⁸on, ⁸n w⁷⁸c⁷ w⁴ ⁴xp⁴ct d⁴c⁴pt⁸v⁴ stat⁴m⁴nts to ⁴x⁴mpl⁸⁵y t⁷⁴ psyc⁷olo⁶⁸cal ⁴ff⁴cts o⁵ ly⁸n⁶, suc⁷ as ⁸ncr⁴as⁴d n⁴⁶at⁸v⁴ ⁴mot⁸on and psyc⁷olo⁶⁸cal d⁸stanc⁸n⁶ and as () a cas⁴ o⁵ ⁶⁴nr⁴ ⁸d⁴nt⁸ficat⁸on ⁸n w⁷⁸c⁷ t⁷⁴ d⁴c⁴pt⁸v⁴ stat⁴m⁴nts ar⁴ s⁴⁴n as cas⁴s o⁵ ⁸ma⁶⁸nat⁸v⁴ wr⁸t⁸n⁶ w⁷⁸l⁴ t⁷⁴ trut⁷⁵ul stat⁴m⁴nts ar⁴ s⁴⁴n as cas⁴s o⁵ ⁸n⁵ormat⁸v⁴ wr⁸t⁸n⁶. ⁴ r⁴sults o⁵ t⁷⁴ Ott ⁴t al. [0] class⁸ficat⁸on task s⁷ow⁴d accuracy scor⁴s as ⁷⁸⁶⁷ as .% ⁵or b⁸⁶rams us⁸n⁶ a SVM class⁸fi⁴r. ⁴ r⁴sults o⁵ t⁷⁴ psyc⁷olo⁶⁸cal approac⁷, w⁷⁸c⁷ t⁷⁴ study ⁸mpl⁴m⁴nts w⁸t⁷ t⁷⁴ LW ⁵⁴atur⁴s, do⁴s not p⁴r⁵orm w⁴ll by compar⁸son, scor⁸n⁶ .%, alt⁷ou⁶⁷ us⁴ o⁵ t⁷⁴ LW ⁵⁴atur⁴s do⁴s boost t⁷⁴ b⁸⁶ram/SVM p⁴r⁵ormanc⁴ to .%. Ott ⁴t al. [0] also cons⁸d⁴r d⁴c⁴pt⁸v⁴ and trut⁷⁵ul wr⁸t⁸n⁶ as sub⁶⁴nr⁴s o⁵ ⁸ma⁶⁸nat⁸v⁴ and ⁸n⁵ormat⁸v⁴ wr⁸t⁸n⁶ r⁴sp⁴ct⁸v⁴ly, c⁸t⁸n⁶ t⁷⁴ work o⁵ Jo⁷nson and Ray⁴ [] on R⁴al⁸ty Mon⁸tor⁸n⁶ and ⁵ollow⁸n⁶ up on l⁸n⁶u⁸st⁸c work by Rayson ⁴t al. [00], w⁷⁸c⁷ ar⁶u⁴d t⁷at ⁸ma⁶⁸nat⁸v⁴ wr⁸t⁸n⁶ cons⁸sts o⁵ mor⁴ v⁴rbs, adv⁴rbs, pronouns, and pr⁴-d⁴t⁴rm⁸n⁴rs w⁷⁸l⁴ ⁸n⁵ormat⁸v⁴ wr⁸t⁸n⁶ cons⁸sts o⁵ mor⁴ nouns, ad⁹⁴ct⁸v⁴s, pr⁴pos⁸t⁸ons, d⁴t⁴rm⁸n⁴rs, and coord⁸nat⁸n⁶ con⁹unct⁸ons. Us⁸n⁶ t⁷⁴s⁴ patt⁴rns, Ott ⁴t al. ar⁶u⁴s ⁵or t⁷⁴ prom⁸s⁴ o⁵ v⁸⁴w⁸n⁶ T/ d⁸scr⁸m⁸nat⁸on as a
72
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
⁶⁴nr⁴ ⁸d⁴nt⁸ficat⁸on task, w⁸t⁷ POS d⁸str⁸but⁸on captur⁸n⁶ ⁶⁴nr⁴ r⁴pr⁴s⁴ntat⁸on. ⁴ POS /SVM class⁸fi⁴r was abl⁴ to d⁸scr⁸m⁸nat⁴ t⁷⁴⁸r T/ ⁷ot⁴l r⁴v⁸⁴ws w⁸t⁷ an accuracy o⁵ %, n many laboratory ⁴xp⁴r⁸m⁴nts, ⁸t ⁸s uncl⁴ar w⁷⁴t⁷⁴r t⁷⁴ sub⁹⁴cts ar⁴ truly d⁴c⁴⁸v⁸n⁶ or, b⁴caus⁴ o⁵ t⁷⁴ lack o⁵ t⁷r⁴at and t⁷⁴ sanct⁸on⁸n⁶ o⁵ t⁷⁴ ly⁸n⁶, t⁷⁴y ar⁴ m⁴r⁴ly b⁴⁸n⁶ ⁸nv⁴nt⁸v⁴. ⁵ a robust l⁸nk b⁴tw⁴⁴n d⁴c⁴pt⁸v⁴ and ⁸ma⁶⁸nat⁸v⁴ narrat⁸on could b⁴ support⁴d, ⁸t would ⁶o a lon⁶ way toward support⁸n⁶ t⁷⁴ us⁴ o⁵ laboratory data ⁸n d⁴c⁴pt⁸on stud⁸⁴s, an ⁸ssu⁴ t⁷at w⁴ cons⁸d⁴r ⁸n mor⁴ d⁴ta⁸l ⁸n ⁷apt⁴r . Larck⁴r and Zakolyuk⁸na [00] (L&Z) t⁴st⁴d LW cat⁴⁶or⁸⁴s on r⁴al world data t⁷⁴ financ⁸al ⁷⁴alt⁷ o⁵ compan⁸⁴s as r⁴pr⁴s⁴nt⁴d ⁸n quart⁴rly ⁴arn⁸n⁶s con⁵⁴r⁴nc⁴ calls, w⁷⁸c⁷ w⁴ d⁸scuss⁴d ⁸n ⁷apt⁴r . L&Z s⁴l⁴ct⁴d LW cat⁴⁶or⁸⁴s t⁷at t⁷⁴y b⁴l⁸⁴v⁴d to b⁴ part⁸cularly r⁴l⁴vant to t⁷⁴⁸r top⁸c and add⁴d financ⁸al vocabulary to t⁷⁴ LW cat⁴⁶or⁸⁴s. ⁴ L&Z narrat⁸v⁴ data cons⁸sts o⁵ t⁷⁴ Q&A port⁸ons o⁵ quart⁴rly ⁴arn⁸n⁶s con⁵⁴r⁴nc⁴ calls. t conta⁸ns , O narrat⁸v⁴s and , O narrat⁸v⁴s, w⁸t⁷ a “narrat⁸v⁴” b⁴⁸n⁶ t⁷⁴ turn tak⁴n ⁸n answ⁴r to a qu⁴st⁸on on a call. ⁴ ⁶round trut⁷ data com⁴s ⁵rom subs⁴qu⁴nt financ⁸al r⁴stat⁴m⁴nts ⁸d⁴nt⁸fi⁴d by lass L⁴w⁸s & o., w⁸t⁷ con⁵⁴r⁴nc⁴ call narrat⁸v⁴s lab⁴l⁴d as d⁴c⁴pt⁸v⁴ “⁸⁵ t⁷⁴y ⁸nvolv⁴ substant⁸al subs⁴qu⁴nt r⁴stat⁴m⁴nt o⁵ n⁴t ⁸ncom⁴ and ar⁴ assoc⁸at⁴d w⁸t⁷ mor⁴ s⁴v⁴r⁴ typ⁴s o⁵ r⁴stat⁴m⁴nts” [Larck⁴r and Zakolyuk⁸na, 00, pp. -]. ⁸s r⁴sult⁴d ⁸n 0% o⁵ t⁷⁴ narrat⁸v⁴s b⁴⁸n⁶ lab⁴l⁴d as “d⁴c⁴pt⁸v⁴” and a ⁶r⁴at⁴r w⁴⁸⁶⁷t put on d⁴c⁴pt⁸v⁴ narrat⁸v⁴s ⁸n t⁷⁴ analys⁸s to balanc⁴ out t⁷⁴s⁴ “rar⁴” ⁴v⁴nts. A ⁶r⁴at⁴r d⁸fficulty t⁷an t⁷⁴ ⁸mbalanc⁴, ⁷ow⁴v⁴r, ⁸s t⁷⁴ ⁵act t⁷at “som⁴ man⁸pulat⁴d quart⁴rs ar⁴ n⁴v⁴r r⁴stat⁴d or r⁴stat⁴d outs⁸d⁴ o⁵ t⁷⁴ t⁸m⁴ p⁴r⁸od w⁴ ⁴xam⁸n⁴” (⁵n. ), w⁷⁸c⁷ m⁴ans t⁷at som⁴ transcr⁸pts lab⁴l⁴d as tru⁴ may actually b⁴ d⁴c⁴pt⁸v⁴. St⁸ll, L&Z d⁴monstrat⁴ t⁷at t⁷⁴ lan⁶ua⁶⁴ o⁵ t⁷⁴ d⁴c⁴pt⁸v⁴ transcr⁸pts ⁸s d⁸st⁸nct⁸v⁴, w⁸t⁷ accuracy rat⁴s as ⁷⁸⁶⁷ as % ⁵or Os and % ⁵or Os. Anot⁷⁴r ⁸ssu⁴ w⁸t⁷ t⁷⁴ us⁴ o⁵ r⁴stat⁴m⁴nts ⁸s t⁷⁴ qu⁴st⁸on o⁵ w⁷⁴t⁷⁴r t⁷⁴ call part⁸c⁸pants actually know about a man⁸pulat⁸on at t⁷⁴ t⁸m⁴ o⁵ t⁷⁴ call. n ⁸mpos⁸n⁶ t⁷⁴ ⁷⁸⁶⁷⁴r bar on t⁷⁴ typ⁴s and l⁴v⁴ls o⁵ r⁴stat⁴m⁴nt t⁷at ar⁴ us⁴d ⁵or t⁷⁴⁸r ⁶round trut⁷ annotat⁸on, L&Z’s a⁸m ⁸s to ⁴nsur⁴ t⁷at t⁷⁴ or⁸⁶⁸nal ⁵raud would ⁷av⁴ b⁴⁴n so s⁴v⁴r⁴ t⁷at ⁸t would b⁴ d⁸fficult to ⁸ma⁶⁸n⁴ t⁷⁴ part⁸c⁸pants not know⁸n⁶ about ⁸t. ⁴ t⁸m⁴-s⁴ns⁸t⁸v⁸ty o⁵ t⁷⁴ T/ d⁴t⁴rm⁸nat⁸on ⁸s p⁴cul⁸ar to financ⁸al data, w⁷⁴r⁴ a narrat⁸v⁴ can fl⁸p ⁵rom T to s⁴v⁴ral y⁴ars lat⁴r. t ⁸s l⁴ss l⁸k⁴ly to ar⁸s⁴ ⁸n t⁷⁴ stud⁸⁴s t⁷at us⁴ l⁴⁶al data, w⁷⁴r⁴ tru⁴ stat⁴m⁴nts can b⁴ ⁴stabl⁸s⁷⁴d on t⁷⁴ bas⁸s o⁵ ⁸nv⁴st⁸⁶at⁸on, alt⁷ou⁶⁷ l⁴⁶al data ⁸s not ⁸mmun⁴ ⁵rom subs⁴qu⁴nt d⁸scov⁴ry o⁵ n⁴w ⁵acts. ornac⁸ar⁸ and Po⁴s⁸o [0] (F&P) t⁴st var⁸ous p⁴rmutat⁸ons o⁵ t⁷⁴ LW cat⁴⁶or⁸⁴s as w⁴ll as “sur⁵ac⁴ ⁵⁴atur⁴s” on t⁷⁴ r⁴al-world utt⁴ranc⁴s ⁵rom t⁷⁴⁸r tal⁸an ⁴our (⁴c⁴pt⁸on ⁸n ourts) corpus o⁵ cr⁸m⁸nal proc⁴⁴d⁸n⁶s ⁵rom t⁷r⁴⁴ court ⁹ur⁸sd⁸ct⁸ons ⁸n taly w⁷⁴r⁴ t⁷⁴ d⁴⁵⁴ndant was ⁵ound ⁶u⁸lty o⁵ ⁵als⁴ t⁴st⁸mony. &P us⁴ utt⁴ranc⁴s ⁵rom 0 ⁷⁴ar⁸n⁶s as a tra⁸n⁸n⁶ s⁴t and t⁴st on utt⁴ranc⁴s ⁵rom d⁸ff⁴r⁴nt ⁷⁴ar⁸n⁶s. ⁴ T/ status o⁵ t⁷⁴s⁴ utt⁴ranc⁴s was cl⁴arly ⁸d⁴nt⁸fi⁴d. &P us⁴ “sur⁵ac⁴ ⁵⁴atur⁴ v⁴ctors” t⁷at ⁸nclud⁴ l⁴mmas and parts o⁵ sp⁴⁴c⁷ as w⁴ll as b⁸⁶rams and tr⁸⁶rams o⁵ t⁷⁴s⁴
4.3. THE CURRENT SYSTEMS
Table 4.9: P⁴r⁵ormanc⁴ o⁵ &P’s sur⁵ac⁴ ⁵⁴atur⁴s on t⁷⁴⁸r ⁴our data
orr⁴ctly class⁸fi⁴d
ncorr⁴ctly class⁸fi⁴d
als⁴ Tru⁴
Total
0
Pr⁴c⁸s⁸on
R⁴call
-m⁴asur⁴
0. 0.0
0. 0.0
0.0 0.
73
Table 4.10: LW cat⁴⁶or⁸⁴s’ p⁴r⁵ormanc⁴ on &P’s ⁴our data
orr⁴ctly class⁸fi⁴d
ncorr⁴ctly class⁸fi⁴d
als⁴ Tru⁴
Total
0
Pr⁴c⁸s⁸on
R⁴call
-m⁴asur⁴
0. 0.
0. 0.
0. 0.
as a bas⁴l⁸n⁴ a⁶a⁸nst w⁷⁸c⁷ to compar⁴ t⁷⁴ mor⁴ abstract LW ⁵⁴atur⁴s. ⁴ r⁴sults o⁵ t⁷⁴s⁴ sur⁵ac⁴ ⁵⁴atur⁴s, s⁷own ⁸n Tabl⁴ ., r⁴⁸n⁵orc⁴ t⁷⁴ find⁸n⁶s o⁵ M&S t⁷at s⁸⁶n⁸ficant d⁸st⁸nct⁸ons can b⁴ ⁵ound b⁴tw⁴⁴n t⁷⁴ lan⁶ua⁶⁴ o⁵ trut⁷ and o⁵ l⁸⁴s ⁴v⁴n at t⁷⁴ l⁴v⁴l o⁵ t⁷⁴ n-⁶ram, and, ⁵or &P, ⁴v⁴n on l⁸n⁶u⁸st⁸cally qu⁸t⁴ d⁸v⁴rs⁴ r⁴al-world data. or purpos⁴s o⁵ compar⁸son, t⁷⁴ accuracy rat⁴ ⁷⁴r⁴ ⁸s .%, w⁷⁸c⁷ ⁸s r⁴markabl⁴ ⁶⁸v⁴n t⁷⁴ data t⁷⁴ ⁵als⁴ t⁴st⁸mony data r⁴sult⁴d ⁵rom l⁸⁴s told at t⁷⁴ or⁸⁶⁸nal tr⁸als, w⁷⁸c⁷ cov⁴r⁴d an ⁴normous ran⁶⁴ o⁵ data ⁵rom ⁵raudul⁴nt cla⁸ms o⁵ lost c⁷⁴cks to dru⁶ traffick⁸n⁶ to ⁷om⁸c⁸d⁴. To t⁴st w⁷⁴t⁷⁴r som⁴ ⁵orm o⁵ t⁷⁴ LW ⁵⁴atur⁴s could p⁴r⁵orm b⁴tt⁴r, &P bu⁸lt fiv⁴ k⁸nds o⁵ v⁴ctors, us⁸n⁶ t⁷⁴ tal⁸an LW d⁸ct⁸onary [A⁶ost⁸ and R⁴ll⁸n⁸, 00] () all o⁵ t⁷⁴ cat⁴⁶or⁸⁴s us⁴d by N⁴wman, () all ⁵⁴atur⁴s o⁵ t⁷⁴ tal⁸an LW d⁸ct⁸onary plus t⁷⁴ ⁶⁴n⁴ral d⁴scr⁸ptor cat⁴⁶or⁸⁴s, () t⁷⁴ most d⁸scr⁸m⁸natory ⁵⁴atur⁴s ⁵rom t⁷⁴ s⁴cond v⁴ctor, () N⁴wman’s most d⁸scr⁸m⁸natory cat⁴⁶or⁸⁴s, and () t⁷⁴ most d⁸scr⁸m⁸natory ⁵⁴atur⁴s ⁵rom t⁷⁴ s⁴cond v⁴ctor. Tabl⁴ .0 s⁷ows t⁷⁴ r⁴sults o⁵ N⁴wman’s cat⁴⁶or⁸⁴s, t⁷⁴ b⁴st p⁴r⁵orm⁸n⁶ o⁵ t⁷⁴ v⁴ctors, on t⁷⁴ ⁴our data. or purpos⁴s o⁵ compar⁸son, t⁷⁴ accuracy scor⁴ ⁷⁴r⁴ ⁸s .%, only sl⁸⁶⁷tly b⁴tt⁴r t⁷an t⁷⁴ .% o⁵ t⁷⁴ sur⁵ac⁴ ⁵⁴atur⁴ mod⁴l. W⁸t⁷ t⁷⁴ us⁴ o⁵ pr⁴c⁸s⁸on and r⁴call, &P call att⁴nt⁸on to t⁷⁴ us⁴⁵uln⁴ss o⁵ a syst⁴m w⁸t⁷ ⁴v⁴n mod⁴rat⁴ accuracy. All fiv⁴ v⁴ctors ⁷av⁴ r⁴call rat⁴s o⁵ Tru⁴ utt⁴ranc⁴s abov⁴ 0%, w⁸t⁷ &P’s own fiv⁴ v⁴ctors r⁴call⁸n⁶ Tru⁴s at a r⁴markabl⁴ .%. W⁷⁸l⁴ non⁴ o⁵ t⁷⁴ v⁴ctors ⁷av⁴ ⁷⁸⁶⁷ r⁴call o⁵ ⁵als⁴ utt⁴ranc⁴s, t⁷⁴ “N⁴wman ” v⁴ctor ⁷as ⁵a⁸rly ⁷⁸⁶⁷ pr⁴c⁸s⁸on (.%) ⁸n pr⁴d⁸ct⁸n⁶ als⁴. ⁵ an appl⁸cat⁸on n⁴⁴d⁴d to b⁴ sur⁴ to ⁸d⁴nt⁸⁵y all t⁷⁴ tru⁴ utt⁴ranc⁴s and/or b⁴ sur⁴ t⁷at t⁷⁴ utt⁴ranc⁴s ⁸d⁴nt⁸fi⁴d as als⁴ w⁴r⁴ cl⁴arly ⁵als⁴, t⁷⁴s⁴ v⁴ctors would s⁴rv⁴ t⁷⁴ purpos⁴.
74
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
Table 4.11: -m⁴asur⁴ p⁴r⁵ormanc⁴ o⁵ LW by cat⁴⁶ory ⁵rom Alm⁴la ⁴t al.
LW at⁴⁶ory . l⁸n⁶u⁸st⁸c . psyc⁷olo⁶⁸c . d⁴scr⁸ptors . p⁴rsonal
omos⁴xual Adopt⁸on
ullfi⁶⁷t⁸n⁶
⁴st r⁸⁴nd
Total
0. 0. 0.0 0.0
0. 0. 0.0 0.
0. 0.0 0. 0.
0. 0.0 0. 0.
Table 4.12: -m⁴asur⁴ p⁴r⁵ormanc⁴ o⁵ LW by cat⁴⁶ory comb⁸nat⁸ons ⁵rom Alm⁴la ⁴t al.
LW at⁴⁶or⁸⁴s & , , , &
omos⁴xual Adopt⁸on
ullfi⁶⁷t⁸n⁶
⁴st r⁸⁴nd
Total
0.0 0. 0.
0. 0.0 0.
0. 0. 0.
0. 0. 0.
Alm⁴la ⁴t al. [0], us⁸n⁶ a v⁴rs⁸on o⁵ LW00 [Ram⁸r⁴z-sparza ⁴t al., 00] t⁷at “⁷as b⁴⁴n ⁵ully val⁸dat⁴d ⁵or Span⁸s⁷ across s⁴v⁴ral psyc⁷ol⁸n⁶u⁸st⁸c stud⁸⁴s,” class⁸⁵y a data s⁴t s⁸m⁸lar to t⁷at o⁵ M⁸⁷alc⁴a and Strapparava, tra⁸n⁴d and t⁴st⁴d us⁸n⁶ ⁴ac⁷ cat⁴⁶ory o⁵ LW (psyc⁷olo⁶⁸cal constructs, ⁶⁴n⁴ral d⁴scr⁸ptors, standard l⁸n⁶u⁸st⁸c d⁸m⁴ns⁸ons, and p⁴rsonal conc⁴rns) s⁴parat⁴ly and t⁷⁴n ⁸n t⁷⁴ poss⁸bl⁴ comb⁸nat⁸ons o⁵ t⁷⁴ ⁵our cat⁴⁶or⁸⁴s. ⁴ r⁴sults ⁵or t⁷⁴ s⁸n⁶l⁴ton cat⁴⁶or⁸⁴s by top⁸c and cross-top⁸c ar⁴ s⁷own ⁸n Tabl⁴ . t⁷⁴ scor⁴s r⁴pr⁴s⁴nt -m⁴asur⁴. ⁴ b⁴st p⁴r⁵orm⁸n⁶ comb⁸nat⁸ons o⁵ LW cat⁴⁶or⁸⁴s ar⁴ s⁷own ⁸n Tabl⁴ .. A⁶a⁸n, t⁷⁴ scor⁴s r⁴pr⁴s⁴nt -m⁴asur⁴s. Alm⁴la’s top scor⁴ across all LW t⁴sts and top⁸cs ⁸s .% us⁸n⁶ t⁷⁴ comb⁸nat⁸on o⁵ all ⁵our cat⁴⁶or⁸⁴s on t⁷⁴ ⁴st r⁸⁴nd top⁸c. or compar⁸son w⁸t⁷ t⁷⁴ ot⁷⁴r LW stud⁸⁴s t⁷at c⁸t⁴ -m⁴asur⁴, Ott ⁴t al.’s ⁷⁸⁶⁷⁴st -m⁴asur⁴ ⁸s .% us⁸n⁶ t⁷⁴ LW ⁵⁴atur⁴s alon⁴ on t⁷⁴ mor⁴ l⁴x⁸cally constra⁸n⁴d ⁷ot⁴l r⁴v⁸⁴ws and ornac⁸ar⁸ and Po⁴s⁸o’s ⁸s .% Alm⁴la ⁴t al. ar⁶u⁴ t⁷at t⁷⁴ ⁷⁸⁶⁷⁴r p⁴r⁵ormanc⁴, ⁸n ⁶⁴n⁴ral, on t⁷⁴ ⁴st r⁸⁴nd top⁸c s⁷ows t⁷⁴ stron⁶ d⁴p⁴nd⁴nc⁴ o⁵ t⁷⁴ task on top⁸c and ⁷ypot⁷⁴s⁸z⁴ t⁷at t⁷⁴ b⁴tt⁴r p⁴r⁵ormanc⁴ on t⁷⁸s top⁸c may b⁴ du⁴ to t⁷⁴ ⁶r⁴at⁴r ⁴mot⁸onal ⁸nvolv⁴m⁴nt t⁷at narrators ⁷av⁴ ⁸n d⁴scr⁸b⁸n⁶ t⁷⁴⁸r b⁴st ⁵r⁸⁴nd. ⁴r⁴ may also b⁴ a b⁴tt⁴r al⁸⁶nm⁴nt b⁴tw⁴⁴n t⁷⁴ LW vocabulary and t⁷⁴ ⁴st r⁸⁴nd top⁸c t⁷an b⁴tw⁴⁴n LW and t⁷⁴ ot⁷⁴r top⁸cs. or t⁷⁴ purpos⁴s o⁵ cat⁴⁶or⁸z⁸n⁶ d⁴c⁴pt⁸v⁴ narrat⁸v⁴, t⁷⁴n, ⁸t would s⁴⁴m mor⁴ product⁸v⁴ to d⁴t⁴rm⁸n⁴ w⁷⁸c⁷ words ar⁴ mor⁴ ⁷⁸⁶⁷ly d⁸scr⁸m⁸natory and t⁷⁴n to d⁴t⁴rm⁸n⁴ w⁷⁴t⁷⁴r cat⁴⁶ory ⁶⁴n⁴ral⁸zat⁸ons can b⁴ mad⁴ across t⁷⁴s⁴ words.
4.3. THE CURRENT SYSTEMS
75
4.3.3 STUDIES LOOKING ABOVE THE LEXICAL LEVEL n add⁸t⁸on to t⁷⁴ words and p⁷ras⁴s t⁷at m⁸⁶⁷t d⁸st⁸n⁶u⁸s⁷ tru⁴ ⁵rom ⁵als⁴ narrat⁸v⁴s, t⁷⁴ back⁶round l⁸t⁴ratur⁴ ⁷as ⁵ound som⁴ d⁸scr⁸m⁸nat⁸v⁴ pow⁴r ⁸n ot⁷⁴r l⁴v⁴ls o⁵ lan⁶ua⁶⁴ analys⁸s. Accord⁸n⁶ to ⁴Paulo ⁴t al. [00], at t⁷⁴ acoust⁸c and prosod⁸c l⁴v⁴l, ⁵or ⁴xampl⁴, stat⁸st⁸cally s⁸⁶n⁸ficant d⁸ff⁴r⁴nc⁴s ⁷av⁴ b⁴⁴n ⁵ound ⁸n vocal t⁴ns⁸on [orvat⁷, , ] and ⁷⁸⁶⁷⁴r p⁸tc⁷ [Zuck⁴rman ⁴t al., ]. At t⁷⁴ syntact⁸c l⁴v⁴l, stat⁸st⁸cally s⁸⁶n⁸ficant d⁸ff⁴r⁴nc⁴s ⁷av⁴ b⁴⁴n ⁵ound ⁸n lon⁶⁴r, mor⁴ d⁴ta⁸l⁴d s⁴nt⁴nc⁴s [Port⁴r and Yu⁸ll⁴, ]. At t⁷⁴ s⁴mant⁸c l⁴v⁴l, plaus⁸b⁸l⁸ty, narrat⁸v⁴ cons⁸st⁴ncy and co⁷⁴r⁴nc⁴ [Kö⁷nk⁴n ⁴t al., , Port⁴r and Yu⁸ll⁴, , Spor⁴r, , Zaparn⁸uk ⁴t al., ], as w⁴ll as ⁸nt⁴rnal cons⁸st⁴ncy [⁴Paulo ⁴t al., ], [⁴⁸nr⁸c⁷ and ork⁴nau, ], and [Zuck⁴rman ⁴t al., ] ⁷av⁴ b⁴⁴n ⁵ound to s⁷ow stat⁸st⁸cally s⁸⁶n⁸ficant d⁸ff⁴r⁴nc⁴s b⁴tw⁴⁴n tru⁴ and ⁵als⁴ narrat⁸v⁴s. And at t⁷⁴ l⁴v⁴l o⁵ d⁸scours⁴, t⁷⁴ l⁴n⁶t⁷ o⁵ t⁷⁴ prolo⁶u⁴ part⁸t⁸on [Adams, 00, Adams and Jarv⁸s, 00] ⁷as b⁴⁴n s⁷own to b⁴ d⁸ff⁴r⁴nt⁸at⁸n⁶. Stud⁸⁴s ⁷av⁴ only b⁴⁶un to ⁴xplor⁴ t⁷⁴ ab⁸l⁸ty o⁵ t⁷⁴s⁴ ot⁷⁴r l⁴v⁴ls o⁵ analys⁸s to d⁸scr⁸m⁸nat⁴ tru⁴ ⁵rom ⁵als⁴ narrat⁸v⁴s. W⁴ cov⁴r ⁷⁴r⁴ t⁷⁴ r⁴s⁴arc⁷ known to us. Amon⁶ t⁷⁴ laboratory data stud⁸⁴s, ⁸rsc⁷b⁴r⁶ ⁴t al. [00] com⁴s t⁷⁴ clos⁴st to sat⁸s⁵y⁸n⁶ ⁴colo⁶⁸cal val⁸d⁸ty ⁸n a sc⁴nar⁸o s⁸m⁸lar to t⁷at o⁵ kman and r⁸⁴s⁴n’s Nurs⁴s Study. ⁸rty-two MA stud⁴nts w⁴r⁴ ask⁴d to p⁴r⁵orm tasks ⁸n s⁸x ar⁴as w⁷⁴r⁴ t⁷⁴⁸r p⁴r⁵ormanc⁴, t⁷⁴y w⁴r⁴ told, would b⁴ compar⁴d to tw⁴nty-fiv⁴ “top ⁴ntr⁴pr⁴n⁴urs o⁵ Am⁴r⁸ca.” ⁴ sub⁹⁴cts w⁴r⁴ t⁷⁴n ask⁴d to att⁴mpt to conv⁸nc⁴ an ⁸nt⁴rv⁸⁴w⁴r t⁷at t⁷⁴y ⁷ad p⁴r⁵orm⁴d l⁸k⁴ t⁷⁴ ⁴ntr⁴pr⁴n⁴urs, alt⁷ou⁶⁷ t⁷⁴ d⁸fficulty o⁵ t⁷⁴ tasks ⁷ad b⁴⁴n man⁸pulat⁴d so t⁷at only two scor⁴s app⁴ar⁴d to b⁴ s⁸m⁸lar to t⁷⁴ ⁴ntr⁴pr⁴n⁴ur⁸al p⁴r⁵ormanc⁴, w⁷⁸l⁴ two ot⁷⁴rs w⁴r⁴ ⁷⁸⁶⁷⁴r and two w⁴r⁴ low⁴r. us, t⁷⁴y would b⁴ ⁶⁸v⁸n⁶ two tru⁴ accounts and ⁵our ⁵als⁴ on⁴s. ⁴ sub⁹⁴cts w⁴r⁴ also ask⁴d to pr⁴ss ⁵oot l⁴v⁴rs ⁵or ⁴ac⁷ o⁵ t⁷⁴⁸r accounts to r⁴cord w⁷⁴t⁷⁴r ⁴ac⁷ was tru⁴ or ⁵als⁴, as an add⁴d m⁴asur⁴ o⁵ ⁶round trut⁷. W⁷⁸l⁴ t⁷⁴ ⁸rsc⁷b⁴r⁶ study t⁴st⁴d LW cat⁴⁶or⁸⁴s, t⁷⁴y also t⁴st⁴d acoust⁸c and prosod⁸c ⁵⁴atur⁴s t⁷ou⁶⁷t to b⁴ assoc⁸at⁴d w⁸t⁷ d⁴c⁴pt⁸on, as w⁴ll as sp⁴ak⁴r-d⁴p⁴nd⁴nt ⁵⁴atur⁴s, w⁷⁸c⁷ ⁸nclud⁴d rat⁸os o⁵ fill⁴d paus⁴s ⁸n ly⁸n⁶ and trut⁷⁵ul cond⁸t⁸ons ⁵or a ⁶⁸v⁴n sp⁴ak⁴r as w⁴ll as rat⁸os o⁵ lau⁶⁷t⁴r and cu⁴ p⁷ras⁴ us⁴, and fill⁴d paus⁴s to all p⁷ras⁴s. ⁴ LW ⁵⁴atur⁴s r⁴duc⁴d t⁷⁴ ⁴rror rat⁴ ⁵rom a bas⁴l⁸n⁴ .% to % t⁷⁴ acoust⁸c and prosod⁸c ⁵⁴atur⁴s r⁴duc⁴d ⁸t to .%, but t⁷⁴ most ⁸mpr⁴ss⁸v⁴ r⁴duct⁸on com⁴s ⁵rom t⁷⁴ comb⁸n⁴d LW, acoust⁸c-prosod⁸c, and sp⁴ak⁴r-d⁴p⁴nd⁴nt ⁵⁴atur⁴s, w⁷⁸c⁷ r⁴duc⁴d t⁷⁴ ⁴rror rat⁴ to .%, l⁴nd⁸n⁶ support to t⁷⁴ not⁸on t⁷at t⁷⁴ trut⁷⁵ul and ly⁸n⁶ b⁴⁷av⁸or ⁸s, at l⁴ast part⁸ally, sp⁴ak⁴r d⁴p⁴nd⁴nt. R⁴call ⁷⁴r⁴ t⁷⁴ poly⁶rap⁷ proc⁴dur⁴, w⁷⁸c⁷ ⁴stabl⁸s⁷⁴s a bas⁴l⁸n⁴ ⁵or ⁴ac⁷ ⁸nt⁴rv⁸⁴w⁴⁴ w⁸t⁷ mutually known ⁶round trut⁷ qu⁴st⁸ons b⁴⁵or⁴ s⁴r⁸ous qu⁴st⁸on⁸n⁶ b⁴⁶⁸ns. ⁴n⁶ ⁴t al. [0a] (FBC) add Probab⁸l⁸st⁸c ont⁴xt r⁴⁴ rammar product⁸on rul⁴s [P⁴trov and Kl⁴⁸n, 00] to t⁷⁴ n-⁶ram mod⁴l t⁴st⁴d by M⁸⁷alc⁴a and Strapparava (00) and to t⁷⁴ n-⁶ram + part-o⁵-sp⁴⁴c⁷ mod⁴l o⁵ Ott ⁴t al. (0). t⁴st ⁵our d⁸ff⁴r⁴nt l⁴v⁴ls o⁵ P product⁸on rul⁴s
76
4. THE LANGUAGE OF DECEPTION: COMPUTATIONAL APPROACHES
Table 4.13: Top accuracy scor⁴s (%) ⁵or Ott (0), M&S (0), and (0) mod⁴ls. Numb⁴rs ⁸n par⁴nt⁷⁴s⁴s ar⁴ ⁵rom ’s ⁸mpl⁴m⁴ntat⁸on o⁵ M&S M&S do not t⁴st a b⁸⁶ram mod⁴l. Numb⁴rs ⁸n ⁸tal⁸cs ar⁴ r⁴sults r⁴port⁴d by Ott ⁴t al. [0] and M⁸⁷alc⁴a and Strapparava [00].
Tr⁸pAdv⁸sor
Abort⁸on
⁴st r⁸⁴nd
⁴at⁷ P⁴nalty
words
un⁸⁶ram b⁸⁶ram
88.4 89.6
70.0 (.)
77.0 (0.)
67.4 (.)
PCFG C unigram
rO C unigram rO Cunigram rO Cunigram
. 0. 91.2
77.0 .0 .0
. 85.0 .
0. 71.5 .0
• r unl⁴x⁸cal⁸z⁴d product⁸on rul⁴s (⁸.⁴., all product⁸on rul⁴s ⁴xc⁴pt ⁵or t⁷os⁴ w⁸t⁷ t⁴rm⁸nal nod⁴s) ⁴.⁶., NP2 ! NP3 SAR • r l⁴x⁸cal⁸z⁴d product⁸on rul⁴s (⁸.⁴., all product⁸on rul⁴s), ⁸nclud⁸n⁶ t⁷os⁴ w⁸t⁷ t⁴rm⁸nal nod⁴s, ⁴.⁶., PRP !“you” • rO unl⁴x⁸cal⁸z⁴d product⁸on rul⁴s comb⁸n⁴d w⁸t⁷ t⁷⁴ ⁶randpar⁴nt nod⁴, ⁴.⁶., NP2^VP1 ! NP3 SAR and • r O l⁴x⁸cal⁸z⁴d product⁸on rul⁴s (⁸.⁴., all product⁸on rul⁴s) comb⁸n⁴d w⁸t⁷ t⁷⁴ ⁶randpar⁴nt nod⁴, ⁴.⁶., PRP^NP4 !“you” . ⁴y d⁴monstrat⁴ t⁷at t⁷⁴ P rul⁴s prov⁸d⁴ a substant⁸al ⁶a⁸n ov⁴r t⁷⁴s⁴ approac⁷⁴s, as Tabl⁴ . s⁷ow⁸n⁶ b⁴st p⁴r⁵ormanc⁴s ⁸n bold⁵ac⁴, d⁴monstrat⁴s. Most ⁸mpr⁴ss⁸v⁴ ⁸s t⁷⁴ ⁵act t⁷at t⁷⁴ ⁶a⁸n ⁸s ac⁷⁸⁴v⁴d not only on product r⁴v⁸⁴w data ( t⁴st on YLP r⁴staurant r⁴v⁸⁴ws as w⁴ll as on Ott ⁴t al.’s ⁷ot⁴l r⁴v⁸⁴ws) but also on M&S’s ⁴ssay data, w⁷⁸c⁷ ⁸nclud⁴s t⁷⁴ w⁸d⁴ly d⁸v⁴r⁶⁴nt top⁸cs o⁵ abort⁸on, t⁷⁴ d⁴at⁷ p⁴nalty, and b⁴st ⁵r⁸⁴nds. also t⁴st a s⁴cond s⁴t o⁵ Tr⁸pAdv⁸sor ⁷ot⁴l data, w⁸t⁷ 00 trut⁷⁵ul and 00 d⁴c⁴pt⁸v⁴ r⁴v⁸⁴ws bas⁴d on spam d⁴t⁴ct⁸on ⁷⁴ur⁸st⁸cs ⁸ntroduc⁴d ⁸n ⁴n⁶ ⁴t al. [0b], ⁸.⁴., bot⁷ t⁷⁴ Tru⁴ and als⁴ r⁴v⁸⁴ws ⁸n t⁷⁴ ⁴ur⁸st⁸c data s⁴t occurr⁴d “⁸n t⁷⁴ w⁸ld” as oppos⁴d to t⁷⁴ ⁶old-standard data, w⁷⁸c⁷ ⁸nvolv⁴d M⁴c⁷an⁸cal Turk⁴rs wr⁸t⁸n⁶ t⁷⁴ als⁴ r⁴v⁸⁴ws. t ⁸s wort⁷ not⁸n⁶ t⁷at t⁷⁴ var⁸ous mod⁴ls p⁴r⁵orm mor⁴ poorly on t⁷⁸s “Tr⁸pAdv⁸sor-⁴ur⁸st⁸c” data t⁷an on Ott’s ⁶oldstandard data (s⁴⁴ Tabl⁴ .), w⁷⁸c⁷ su⁶⁶⁴sts a r⁴v⁸s⁸t⁸n⁶ o⁵ t⁷⁴ qu⁴st⁸on o⁵ ⁷ow to obta⁸n a ⁶old-standard data s⁴t ⁵or d⁴c⁴pt⁸on. ⁴n⁶ and ⁸rst [0] (F&H) add to ’s P + un⁸⁶ram mod⁴l a “coll⁴ct⁸v⁴ profil⁴” ⁵or ⁴ac⁷ ⁷ot⁴l ⁸n t⁷⁴ Ott datas⁴t, t⁷⁴ coll⁴ct⁸v⁴ profil⁴ b⁴⁸n⁶ ⁴ss⁴nt⁸ally a mod⁴l o⁵ a tru⁴ r⁴v⁸⁴w ⁵or ⁴ac⁷ ⁷ot⁴l. ⁴ bas⁸c ⁸d⁴a ⁸s t⁷at r⁴v⁸⁴ws by custom⁴rs w⁷o ⁷av⁴ ⁴xp⁴r⁸⁴nc⁴d t⁷⁴ ⁷ot⁴l w⁸ll s⁷ar⁴ common ⁵⁴atur⁴s o⁵ t⁷⁴ ⁴xp⁴r⁸⁴nc⁴ t⁷at w⁸ll b⁴ lack⁸n⁶—or ⁴v⁴n contrad⁸ct⁴d by—stat⁴m⁴nts ⁸n
4.3. THE CURRENT SYSTEMS
77
Table 4.14: Accuracy (%) r⁴sults ⁵rom two s⁴ts o⁵ Tr⁸pAdv⁸sor data. Numb⁴rs ⁸n ⁸tal⁸cs ar⁴ r⁴sults r⁴port⁴d by Ott ⁴t al. [0].
Tr⁸pAdv⁸sor old ⁴ur⁸st⁸c words
un⁸⁶ram b⁸⁶ram
88.4 89.6
74.4 .
PCFG C unigram
rO C unigram r Cunigram rO Cunigram
. 0. 91.2
. . 76.6
d⁴c⁴pt⁸v⁴ r⁴v⁸⁴ws. (R⁴call t⁷⁴ co⁷⁴s⁸v⁴ prop⁴rty o⁵ tru⁴ r⁴v⁸⁴ws c⁸t⁴d by ⁴rnánd⁴z us⁸l⁸⁴r ⁴t al. [0].) ⁴ profil⁴ cons⁸sts o⁵ d⁸st⁸nct asp⁴cts usually r⁴al⁸z⁴d as prop⁴r noun p⁷ras⁴s t⁷at r⁴⁵⁴r to landmarks n⁴ar t⁷⁴ ⁷ot⁴l and ⁶⁴n⁴ral asp⁴cts, ⁵or ⁴xampl⁴, ⁵or ⁷ot⁴ls locat⁸on, s⁴rv⁸c⁴, and br⁴ak⁵ast. ⁴ d⁴scr⁸ptors o⁵ t⁷⁴s⁴ asp⁴cts ar⁴ pull⁴d ⁵rom t⁷⁴ narrat⁸v⁴s and ass⁸⁶n⁴d a w⁴⁸⁶⁷t. or d⁸st⁸nct asp⁴cts, t⁷⁴ w⁴⁸⁶⁷t ⁸s bas⁴d on t⁷⁴ ⁵r⁴qu⁴ncy o⁵ t⁷⁴ asp⁴ct ⁸n t⁷⁴ r⁴v⁸⁴w coll⁴ct⁸on ⁵or ⁶⁴n⁴ral asp⁴cts, t⁷⁴ w⁴⁸⁶⁷t ⁸s ass⁸⁶n⁴d to t⁷⁴ d⁴scr⁸ptor words d⁴scr⁸b⁸n⁶ t⁷⁴ asp⁴ct. n t⁷⁴ ⁴xampl⁴ ⁶⁸v⁴n by &,“M⁸c⁷⁸⁶an Av⁴,” a d⁸st⁸nct asp⁴ct, ⁸s ass⁸⁶n⁴d a w⁴⁸⁶⁷t o⁵ .0, w⁷⁸l⁴ t⁷⁴ ⁶⁴n⁴ral asp⁴ct “Room” ⁷as d⁴scr⁸ptors “wond⁴r⁵ul” w⁸t⁷ a w⁴⁸⁶⁷t o⁵ .0, “d⁴lux⁴,” .0, and “⁷u⁶⁴,” .0. A profil⁴ ⁸s also bu⁸lt ⁵or ⁴ac⁷ r⁴v⁸⁴w to b⁴ ⁹ud⁶⁴d as T or and ⁴ac⁷ o⁵ t⁷⁴s⁴ t⁴st profil⁴s ⁸s compar⁴d—“al⁸⁶n⁴d”—to t⁷⁴ coll⁴ct⁸v⁴ profil⁴ and v⁸c⁴ v⁴rsa. W⁷⁴n t⁷⁴ t⁴st profil⁴ ⁸s al⁸⁶n⁴d to t⁷⁴ coll⁴ct⁸v⁴ profil⁴, t⁷⁴ r⁴pr⁴s⁴ntat⁸v⁴n⁴ss o⁵ t⁷⁴ t⁴st profil⁴ as a cas⁴ o⁵ t⁷⁴ coll⁴ct⁸v⁴ profil⁴ ⁸s t⁴st⁴d w⁷⁴n t⁷⁴ coll⁴ct⁸v⁴ profil⁴ ⁸s al⁸⁶n⁴d w⁸t⁷ t⁷⁴ t⁴st profil⁴, any confl⁸cts ⁸n t⁷⁴ t⁴st profil⁴ ar⁴ cau⁶⁷t. ⁸s al⁸⁶nm⁴nt y⁸⁴lds a l⁸st o⁵ compat⁸b⁸l⁸ty ⁵⁴atur⁴s ⁵or ⁴ac⁷ t⁴st profil⁴ t⁷at ⁴nabl⁴ m⁴asur⁴m⁴nt o⁵ t⁷⁴ d⁸stanc⁴ b⁴tw⁴⁴n t⁷⁴ t⁴st profil⁴ and t⁷⁴ coll⁴ct⁸v⁴ profil⁴, t⁷⁴ mor⁴ d⁸stant, t⁷⁴ mor⁴ d⁴c⁴pt⁸v⁴. & r⁴⁸mpl⁴m⁴nt t⁷⁴ Ott and mod⁴ls ⁸n ord⁴r to add t⁷⁴⁸r profil⁴ al⁸⁶nm⁴nt compat⁸b⁸l⁸ty mod⁴l to comb⁸nat⁸ons o⁵ Ott’s n-⁶ram and ’s syntact⁸c (SYN ) mod⁴ls. T⁴st⁸n⁶ on t⁷⁴s⁴ r⁴⁸mpl⁴m⁴nt⁴d bas⁴l⁸n⁴ mod⁴ls y⁸⁴lds t⁷⁴ accuracy scor⁴s ⁸n Tabl⁴ .. & add t⁷⁴⁸r profil⁴ al⁸⁶nm⁴nt mod⁴l to t⁷⁴s⁴ r⁴sults to ac⁷⁸⁴v⁴ an accuracy o⁵ 91.3%, “s⁸⁶n⁸ficantly b⁴tt⁴r . . . t⁷an t⁷⁴ bas⁴l⁸n⁴ ⁸n Tabl⁴ ., us⁸n⁶ t⁷⁴ W⁸lcoxon s⁸⁶n-rank t⁴st (p