There are a few ways to do this:
- Figure out people's actual names by listening to how they address each other and/or using the information we know from scraping, like the names of the judges on the panel.
- Just call people speaker1, speaker2, etc.
I haven't looked into how to do this, but I gather there are a bunch of AI methods these days. Definitely something to research. If anybody wants to pick this up, I'd love to see a feature/quality/price/etc comparison across diarization methods.
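As a starting point for the first option, here is a rough sketch of how anonymous diarization labels might be mapped to names. It assumes a diarizer has already produced `(label, text)` turns and that the scraped panel gives a list of surnames; the function name `guess_speaker_names`, the input shape, and the heuristic itself (a name mentioned in a turn most likely refers to whoever spoke just before) are all hypothetical and untested against real recordings.

```python
import re
from collections import Counter, defaultdict


def guess_speaker_names(turns, known_names):
    """Map diarization labels to names using a simple addressing heuristic.

    turns: list of (speaker_label, text) in spoken order (assumed input shape).
    known_names: surnames we already know, e.g. scraped judge names.

    Assumption: when a turn mentions a known name ("Yes, Judge Smith, ..."),
    the person being addressed is the speaker of the previous turn.
    """
    votes = defaultdict(Counter)
    for (prev_label, _), (label, text) in zip(turns, turns[1:]):
        if prev_label == label:
            continue  # same speaker continuing; nobody is being answered
        for name in known_names:
            if re.search(rf"\b{re.escape(name)}\b", text, flags=re.IGNORECASE):
                votes[prev_label][name] += 1

    labels = {label for label, _ in turns}
    # Fall back to the anonymous label when no name was ever voted for it.
    return {
        label: votes[label].most_common(1)[0][0] if votes[label] else label
        for label in labels
    }


if __name__ == "__main__":
    example_turns = [
        ("speaker1", "Counsel, how do you respond to the standing argument?"),
        ("speaker2", "Judge Smith, we think standing is clear here."),
        ("speaker3", "But what about the statute?"),
        ("speaker2", "Judge Lee, the statute supports us."),
        ("speaker1", "Let me follow up."),
        ("speaker2", "Of course, Judge Smith."),
    ]
    print(guess_speaker_names(example_turns, ["Smith", "Lee"]))
```

Speakers who are never addressed by a known name simply keep their `speakerN` label, so this degrades to the second option rather than guessing.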