Audio–visual speech recognition based on regulated transformer and spatio–temporal fusion strategy for driver assistive systems

DOI: 10.1016/j.eswa.2024.124159 Publication Date: 2024-05-09T15:31:05Z