Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person.
Lucas Rafael Stefanel Gris, Ricardo Marcacini, Arnaldo Cândido Júnior, Edresson Casanova, Anderson da Silva Soares, Sandra Maria Aluísio
Published in: CoRR (2023)
Keyphrases
- topic modeling
- topic models
- prediction accuracy
- text mining
- latent dirichlet allocation
- collaborative filtering
- topic extraction
- text classification
- modeling framework
- speech recognition
- automatic speech recognition
- text documents
- scientific articles
- latent topics
- text corpora
- artificial intelligence
- co-occurrence
- knowledge base