AI Video Transcription via Whisper and Howler.js in TS
Software Engineering
4 courses
45 practices
6 hours
Build a TypeScript app that transcribes audio and video files using OpenAI Whisper and Howler.js. Learn how to set up your environment, process media from sources like Google Drive or LinkedIn, integrate transcription APIs, and convert speech into accurate, structured text.
Getting Started with OpenAI Whisper API in TypeScript
3 lessons
9 practices
Learn how to build a video transcription system using TypeScript and OpenAI's GPT-4o Transcription API. This course covers environment setup, audio playback with Howler.js, video scraping, and generating video summaries.
Be a part of our community of 1M+ users who develop and demonstrate their skills on CodeSignal
From our community
Hear what our customers have to say about CodeSignal Learn
I'm impressed by the quality and can't stop recommending it. It's also a lot of fun!
Francisco Aguilar Meléndez
Data Scientist
+11
I love that it's personalized. When I'm stuck, I don't have to hope my Google searches come out successful. The AI mentor Cosmo knows exactly what I need.
Faith Yim
Software Engineer
+14
It's an amazing product and exceeded my expectations, helping me prepare for my job interviews. Hands-on learning requires you to actually know what you are doing.
Alex Bush
Full Stack Engineer
+9
I'm really impressed by the AI tutor Cosmo's feedback about my code. It's honestly kind of insane to me that it's so targeted and specific.
Abbey Helterbran
Tech consultant
+8
I tried Leetcode but it was too disorganized. CodeSignal covers all the topics I'm interested in and is way more structured.
Jonathan Miller
Senior Machine Learning Engineer
+12
I'm impressed by the quality and can't stop recommending it. It's also a lot of fun!
Francisco Aguilar Meléndez
Data Scientist
+11
11 practices
This unit builds a backend-controlled audio player using Howler.js for playback. The backend transcribes a segment of the selected audio file based on when the user starts and stops playback.
Learn how to fetch and process videos from remote sources like Google Drive or LinkedIn. This course walks you through downloading, extracting audio, and generating accurate transcriptions and summaries using the Whisper API.
Discover how to build real-time browser-based transcription tools with the Web Audio API and Whisper. You’ll capture microphone input, process it in chunks, and use Whisper’s advanced features for segment timing, contextual cues, and multilingual transcription.