What I tell people is, I don’t listen very hard to the actual dialog, just enough not to get caught short or miss a cue for a super or a commercial break or things like that, but really, what you want to do is watch the preview image of the reaction shot, and listen to the meter and pace and pitch of what’s being said, while watching for the facial cues of the guy that’s not on program, for the moment he is about to respond.
Then is when I cut in the reaction, a beat ahead of the speaking. Seems like mindreading, but it’s just face-reading, watching for the intake of breath and partially-open mouth and slight nod of the head that signals a person is about to reply.
If you count on a shot list, you’re always going to be behind on the switch. It’s all about the anticipation. The really bad-looking talk shows on student-run access always feel like they are switched using an egg timer, in that the choice of when to cut seems regularly timed but randomly applied. Or they always wait until one party completely finishes talking before cutting to the next shot. Good practice trick is to watch any of the evening talk shows with the sound off and try to call the camera cuts out loud just by watching faces. Soon, you discover the rythm.
We do 3-camera live-to-tape public affairs talk shows weekly here so I keep in practice. I open, close, and bump in and out of breaks with a wide establishing shot to orient the viewers, then cut to a tight shot of the host as he or she sets up the conversation. While they are introducing the guest or guests, I’ll often cut to a silent single shot of the guest for just a beat or two, but that’s optional, as the wide shot works for this too. My choice there depends on how dynamic the guest looks. If he’s alert and active, I’ll use the tight shot to show him nodding for a beat.
What I also like to do is use shot choice as nonverbal punctuation.
What I mean is, people on talk shows generally talk in paragraphs, with an opening statement, then more detail, and a sort of summation or conclusion statement for that thought. I like to pop to the wide shot as that “paragraph” is finishing, then the host starts a new question.
If the question is involved and sounds like it will be several sentences by itself, I’ll pop into a tighter shot for the second half of that question, on the beat of a verbal comma or breath, then I’ll have the guest’s matching medium or tight reaction shot ready on one camera and the covering wide 2-shot as backup.
If it looks like the dialog is going to ping-pong rapidly for a bit and thus be too kinetic, I’ll take it to the 2-shot, with a tighter single of the guest I can go to as punctuation or emphasis for a particular point if he takes a breath. Again, the key to that is to watch the faces and body language more than listening to the dialog. When you can’t tell if the dialog is about to wind down, the 2-shot is safe cover.
I save OTS shots as a spice, best used sparingly, and used when one side is monopolizing the conversation and I want my other guy to get soem more screen time. The other time I use OTS is when the guest is talking at length about something very emotional or dramatic; then I’ll use it for the sense of “evesdropping” it gives the audience. The OTS has a particular kind of conspiratorial and confessional emotional subtext to it, and I pick shots for reasons of semiotics as much as anything.
Of course it really helps to have experienced camera operators, because they anticipate what I want and how the show is going, and they know innately when to change up a shot or suggest or “sell” me a shot like an OTS when they are not online. A good director takes such suggestions gratefully. Sometimes I say “yes, 2, good call, steady that up and we’ll take it next, leave room for a lower third” or “Sorry one, two already has a similar shot, I need you to stay tighter on so-and-so until the bumper”.
It’s funny, sometimes the host and guests will ask us after the show how we liked it, and sometimes our honest answer is:
“Frankly, I was too busy watching it to pay attention to it, but it seemd really great”. That’s how I know we did a pretty good show. If I can remember every word said on camera, the show was too boring and I switched it in my sleep.