Generate edited video frames using text prompts
Edit images using text instructions
Transcribe audio to text instantly using WebGPU