Debate words

Lingua Franca 2023-08-25

The Transcript Library at rev.com is a great resource — within 24 hours, they had transcripts of Wednesday's Fox News Republican presidential debate, and also of Tucker Carlson's debate night interview with Donald Trump on X.

So this morning I downloaded the transcripts, and ran the code that I've used several times over the years to identify the characteristic word-choices of an individual or of a group.

Given eight participants in the Fox debate, the number of words that each one used was not very large. In descending order:

Candidate   N Words
Ramaswamy   2465
Pence       2362
Christie    1928
DeSantis    1921
Haley       1701
Burgum      1569
Scott       1241
Hutchinson  1219
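Per-speaker counts of this kind can be sketched as follows. This is a minimal illustration, not the code used for this post: it assumes each transcript turn is a single line of the form "Speaker: text" (as in the excerpts quoted further down), and the function name `words_per_speaker` and the tokenizing pattern are hypothetical choices. A real rev.com download would need its own parsing.

```python
import re
from collections import Counter

# Assumed token pattern: runs of letters, digits, apostrophes, and hyphens.
TOKEN = re.compile(r"[A-Za-z0-9'’-]+")


def words_per_speaker(turns):
    """Count word tokens per speaker, given lines like 'Speaker: text'."""
    counts = Counter()
    for turn in turns:
        # Split on the first colon only, so colons inside the speech survive.
        speaker, sep, speech = turn.partition(":")
        if not sep:
            continue  # skip lines that carry no speaker label
        counts[speaker.strip()] += len(TOKEN.findall(speech))
    return counts


sample_turns = [
    "Pence: Was that one of yours?",
    "Ramaswamy: Not really, Mike, actually.",
]
counts = words_per_speaker(sample_turns)

# most_common() lists speakers in descending order of word count.
for speaker, n in counts.most_common():
    print(speaker, n)
```

Different tokenization choices (for example, how to treat hyphens or numbers) will shift the totals slightly, which is worth keeping in mind when comparing counts across analyses.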

And given that the candidates were asked different questions, and didn't have time to say much overall, it's interesting that the differences are still somewhat interpretable.

For example, Vivek Ramaswamy repeated a number of words that none of the other candidates used even once: generation (6 times), revolution (5 times), reality (5 times), professional (5 times), nuclear (4 times), epidemic (4 times). He used mental 4 times in his 2465 words, for a rate of 1.6 per thousand, whereas the other candidates combined used it just once (that was Mike Pence) across a total of 11952 words, for a rate of 0.08 per thousand, almost 20 times lower.
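The arithmetic behind that comparison is simple enough to check directly. The sketch below uses only the figures quoted above; the helper name `rate_per_thousand` is just an illustrative choice.

```python
def rate_per_thousand(count, total_words):
    """Occurrences of a word per 1000 words of text."""
    return 1000.0 * count / total_words


# Figures for "mental", as quoted in the paragraph above.
ramaswamy_rate = rate_per_thousand(4, 2465)
others_rate = rate_per_thousand(1, 11952)
ratio = ramaswamy_rate / others_rate

print(f"Ramaswamy: {ramaswamy_rate:.2f} per thousand")
print(f"Others:    {others_rate:.3f} per thousand")
print(f"Ratio:     {ratio:.1f}")
```

With counts this small, a single additional use by another candidate would halve the ratio, which is one reason raw rate comparisons need the more careful treatment discussed below.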

Obviously, it matters how and why Ramaswamy used those words; you can check them out in the rev.com transcript. For example, some of his uses of revolution, reality, and professional occur during a chaotic passage, at around 33 minutes, in which DeSantis, Ramaswamy, Pence and the moderators all interrupt one another repeatedly. The immediate context:

Ramaswamy: I just want to respond to Mike for one second because he invoked me back. Listen, now that everybody’s gotten their memorized, pre-prepared slogans out of the way, we can actually have a real discussion now. The reality and the fact of the matter is-

Pence: Was that one of yours?

Ramaswamy: Not really, Mike, actually. We’re just going to have some fun tonight. And the reality is, you have a bunch of people, professional politicians, super PAC puppets, following slogans handed over to them by their 400-page super PACs last week. The real choice we face in this primary is this: do you want a super PAC puppet or do you want a patriot who speaks the truth? Do you want incremental reform, which is what you’re hearing about? Or do you want revolution? And I stand on the side of the American Revolution, rather than this incrementalism.

Comparing overall rates of word usage is a difficult statistical problem, for reasons discussed at length in Monroe, Colaresi & Quinn "Fightin' Words: Lexical Feature Selection and Evaluation for Identifying the Content of Political Conflict", Political Analysis 2009. They argue that a plausible method is to use the "weighted log-odds-ratio, informative Dirichlet prior" algorithm described on pp. 387-8 of their paper. I've used that algorithm in a number of earlier posts (see the list at the bottom), and tried it again here.
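For readers who want to experiment, here is a minimal sketch of a weighted log-odds-ratio with a Dirichlet prior, following the formulation in the Monroe et al. paper as I understand it. It is not the code used for this post. Two things are assumptions: the prior is built from the pooled counts of the two groups being compared, scaled to a hypothetical `prior_scale` pseudo-count total (the paper's informative prior is typically estimated from a larger background corpus), and the toy word counts are invented for illustration.

```python
import math
from collections import Counter


def weighted_log_odds(target, rest, prior_scale=10.0):
    """Z-scored log-odds-ratio of each word in `target` versus `rest`.

    target, rest: Counters of word frequencies for the two groups.
    prior_scale:  total pseudo-count mass of the prior (an assumed value).
    """
    pooled = target + rest
    n_pooled = sum(pooled.values())
    # Assumption: prior pseudo-counts proportional to pooled frequencies.
    prior = {w: prior_scale * c / n_pooled for w, c in pooled.items()}
    a0 = sum(prior.values())
    n_t, n_r = sum(target.values()), sum(rest.values())

    scores = {}
    for w, a_w in prior.items():
        y_t, y_r = target[w], rest[w]  # a Counter returns 0 for absent words
        # Log-odds of the word within each group, smoothed by the prior.
        lo_t = math.log((y_t + a_w) / (n_t + a0 - y_t - a_w))
        lo_r = math.log((y_r + a_w) / (n_r + a0 - y_r - a_w))
        # Approximate variance of the difference of the two log-odds.
        variance = 1.0 / (y_t + a_w) + 1.0 / (y_r + a_w)
        scores[w] = (lo_t - lo_r) / math.sqrt(variance)
    return scores


# Invented toy counts, purely for illustration.
one_speaker = Counter("revolution reality revolution the the we".split())
everyone_else = Counter("the the the we we leadership america the".split())

scores = weighted_log_odds(one_speaker, everyone_else)
for word, z in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{word:12s} {z:6.3f}")
```

Dividing by the estimated standard deviation is what keeps rare words from dominating the ranking merely because their raw log-odds are extreme. The choice of prior and its scale affects the scores, so values from this sketch should not be expected to match the numbers listed below.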

By that method, here are the top ten words for each candidate (followed by their "weighted log-odds-ratios").

Ramaswamy:

generation   2.279
revolution   2.080
reality      2.080
professional 2.080
address      1.997
nuclear      1.861
epidemic     1.861
love         1.665
mental       1.616
homeland     1.616

Pence:

leadership 2.313
american   2.292
united     2.137
vivek      2.096
states     2.030
promise    1.877
clear      1.877
leader     1.687
yet        1.625
proven     1.625

Christie:

democratic 2.575
jersey     2.384
here       2.101
who        1.996
waiting    1.946
incumbent  1.946
stood      1.864
sit        1.784
tonight    1.728
type       1.716

DeSantis:

decline   3.228
florida   3.078
going     1.993
are       1.761
country   1.722
thousands 1.715
her       1.715
tens      1.685
succeed   1.685
reasons   1.685

Haley:

defense   1.983
all       1.959
they      1.935
weeks     1.758
girls     1.717
classroom 1.717
ban       1.680
ukraine   1.642
senate    1.569
less      1.569

Burgum:

town       2.455
small      2.040
innovation 2.040
oil        2.004
dakota     2.004
buying     2.004
buy        2.004
north      1.783
loves      1.736
feds       1.736

Scott:

must     1.938
number   1.792
percent  1.782
package  1.782
illinois 1.782
justice  1.553
poverty  1.543
leave    1.543
fire     1.543
asking   1.543

Hutchinson:

terms        2.727
arkansas     2.525
science      2.061
computer     2.061
under        1.797
whenever     1.785
solution     1.785
disqualified 1.785
attacking    1.785
important    1.559

There are obviously many other ways to approach such transcripts, e.g. via LIWC; and there are often interesting acoustic measures, e.g. as discussed in "Debate quantification: how MAD did he get?", 10/29/2016. And the Carlson/Trump conversation is still waiting.

But that's all I have time for this morning, except to add a table of pronoun usage. The values are percentages of all the words used by each candidate: thus the 4.57 for Mike Pence's 1st person singular pronouns means that he used "I", "me", and "my" 108 times in his 2362 lexical tokens, and 108/2362 = 0.0457, or 4.57%.

Candidate   1st Person Singular  2nd Person  1st Person Plural
Burgum      2.62                 1.02        4.02
Christie    2.44                 1.50        3.48
DeSantis    3.94                 2.38        4.30
Haley       2.06                 2.41        3.59
Hutchinson  2.79                 0.66        4.02
Pence       4.57                 1.40        3.13
Ramaswamy   3.81                 1.70        3.61
Scott       2.81                 1.85        4.59
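Percentages like these can be computed along the following lines. This is a sketch under stated assumptions, not the code behind the table: only the first-person singular set ("I", "me", "my") comes from the description above, while the second-person and first-person plural word sets, the tokenizer, and the treatment of contractions (counting "I'm" as an instance of "I") are my own guesses.

```python
import re
from collections import Counter

PRONOUN_SETS = {
    "1st person singular": {"i", "me", "my"},   # as described in the post
    "2nd person": {"you", "your"},              # assumed word set
    "1st person plural": {"we", "us", "our"},   # assumed word set
}

# A token is a run of letters, optionally followed by an apostrophe clitic.
TOKEN = re.compile(r"[a-z]+(?:'[a-z]+)?")


def pronoun_percentages(text):
    """Percentage of all word tokens that fall in each pronoun set."""
    normalized = text.lower().replace("’", "'")
    # Drop clitics so that "i'm" counts as "i" and "we're" as "we".
    tokens = [t.split("'")[0] for t in TOKEN.findall(normalized)]
    total = len(tokens)
    if total == 0:
        return {label: 0.0 for label in PRONOUN_SETS}
    counts = Counter(tokens)
    return {
        label: 100.0 * sum(counts[w] for w in words) / total
        for label, words in PRONOUN_SETS.items()
    }


# Invented sentence, purely for illustration.
sample = "I promise you that we will keep our word, and I'm proud of my record."
percentages = pronoun_percentages(sample)
for label, pct in percentages.items():
    print(f"{label}: {pct:.2f}")

# The worked example from the text: 108 uses in 2362 tokens.
pence_first_singular = round(100.0 * 108 / 2362, 2)
```

Whether possessives such as "mine" or "yours", or reflexives such as "myself", belong in the sets is a design choice that changes the percentages, so the word lists should be fixed before comparing speakers.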

The fact that Mike Pence leads by a significant margin in first-person singular pronouns doesn't reveal a narcissistic personality disorder; it merely reflects the fact that his central pitch was about his experience as vice president. But I'm not expecting George Will and other pundits to start writing column after column accusing Pence of narcissism, as they (falsely) did with respect to Obama.

A few relevant past posts:

"Fact-checking George F. Will", 6/7/2009
"Fact-checking George F. Will, one more time", 10/6/2009
"Another lie from George Will", 5/7/2012
"More B.S. from George F. Will", 8/28/2015

"Obama's favored and disfavored SOTU words", 1/29/2014
"Male and female word usage", 8/7/2014
"The most Trumpish (and Bushish) words", 9/5/2015
"Make America rather formidable again", 9/10/2015
"Political vocabulary display", 9/10/2015
"The most Kasichoid, Cruzian, Trumpish, and Rubiositous words", 3/11/2016
"More political text analytics", 4/15/2016
"Style shifting in student writing assignments", 10/5/2018