Abstract: Recently, various audio-visual speech recognition (AVSR) systems have been developed using multimodal learning techniques. One key issue is that most of them are based on 2D audio-visual ...
Abstract: The application of the 3-D radar data cube (RDC), which integrates time, distance, and Doppler frequency information for accurate human activity recognition (HAR), has attracted much recent ...