LipFormer: Learning to Lipread Unseen Speakers Based on Visual-Landmark Transformers.
Feng XueYu LiDeyin LiuYincen XieLin WuRichang HongPublished in: IEEE Trans. Circuits Syst. Video Technol. (2023)
Keyphrases
- previously unseen
- learning process
- visual learning
- learning algorithm
- knowledge acquisition
- neural network
- learning tasks
- learning systems
- active learning
- prior knowledge
- decision trees
- data sets
- machine learning
- elementary school
- visual perception
- inductive inference
- learning scheme
- learning analytics
- incremental learning
- learning scenarios
- object recognition
- unsupervised learning
- empirical studies
- training set
- image classification
- supervised learning
- support vector machine
- low level
- hidden markov models