My next project where I learn in public is about doing transcripts with Whisper.
Whisper is a quite good automatic speech recognition model that is open source and can run on your own computers, provided you have a GPU.
I prefer reading to listening, so I wanted to transcribe my long list of things to listen to, ideally in automatic way.
The first step was to use Whisper to transcribe anything dropped into a folder. I recorded myself while doing this and I plan to do at least 2 more sessions of this.
Lesson learned: OBS studio uses a lot of resources and sometimes the recording has issues because of Whisper also hogging up resources.