# Audio Samples for AI-assisted Tagging of Deepfake Audio Calls using Challenge-Response
First is the imposter voice. Second is the verified ground truth. Third is a deepfake: an imposter creates using the target's recording, other than the ground truth. Last column is a short caption.
## Audio Samples of Regular Speech
No Challenge
Deepfake Sounds Genuine
## Audio Samples of Top-11 Valid Machine-Detectable Challenges
Captions are prospective explanations and not machine predictions.
Static Mouth
Audible distortions at 'formalities'
Cup Mouth
Non-compliance and Distortions
Whisper
Non-compliance
Speak Softly
Non-compliance
High Pitch
High Non-Compliance
Foreign Words
Vibrating Voice Distortions (also seen with suspends linguistic chain ya ne)
Sing
Non-compliance towards the last
Emotions
Sounds flatter in comparison to imposter
Crosstalk
Non-compliance and Distortions
## Audio Samples of the 9 Weaker Tasks
Speak Loudly
Non-Compliance (Deepfake still whispers)
Read Quickly
Deepfake Sounds Genuine
Read Slowly
Mild Distortions
## Video Samples of Selected Challenges
High-Pitch
Cross-talk (with a self played audio on phone)
Whisper