Does Fuse automatically transcribe a video if the audio contains no speech?

No. If the video contains no spoken words, Fuse cannot auto-transcribe it.

