@reillypascal I meant that the experience the article reported of racist words being wrongly transcribed sounds bad!
I am a bit confused because it sometimes sounds like whisper.cpp is producing the speech2text.
Whisper.cpp is just the program that applies a chosen model to a chosen audio file resulting in that model's speech2text.
It is the model that does the speech2text. You can choose different models of different sizes made by different people to call with whisper.cpp.