AI transcription technology was allegedly used to capture and transcribe physician–patient conversations through microphone-enabled devices in examination rooms, without the necessary consent.
Abstract: Motivated by depression's significant impact on global health, this work proposes MultiDepNet, a novel multi-modal interpretable depression detection system integrating visual, physiological ...
Google's new offline-first dictation app uses Gemma AI models to take on apps like Wispr Flow.
Abstract: Deception detection is a critical research area with applications in security, forensics, and psychology. Traditional multimodal deception detection models incorporate audio and visual ...