End-to-End Native Language Identification Using a Modified Vision Transformer(ViT) from L2 English Speech.
Kishan PipariyaDebolina PramanikPuja BharatiSabyasachi ChandraShyamal Kumar Das MandalPublished in: SPECOM (2) (2023)
Keyphrases
- end to end
- language identification
- speaker identification
- english text
- multi lingual
- speech signal
- speech recognition
- computer vision
- gaussian mixture model
- document images
- noisy environments
- vision system
- congestion control
- admission control
- feature extraction
- web information retrieval
- audio features
- text to speech
- spoken language
- audio visual
- application layer
- multimedia
- image processing
- pattern recognition
- hough transform