Visuals Toolset

All processing happens in your browser

Aspect ratio

Shape reference

16:9  Widescreen video, YouTube / TV
3:2   Standard landscape, classic photo
4:3   Slightly taller than 3:2
1:1   Square
4:5   Vertical rectangle, popular Instagram shape
2:3   Vertical, tallest classic photo shape
9:16  Tall video / Stories, Reels / TikTok
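As an illustration of how a shape from the reference above could map onto a source image, here is a minimal sketch that computes a centred crop rectangle for a target ratio. The names `RATIOS`, `CropRect` and `centeredCrop` are hypothetical and not taken from the tool itself; the tool's actual cropping logic is not documented here and may differ (for example, it may let you drag the crop window).

```typescript
interface CropRect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Ratios from the shape reference, expressed as width / height.
// (Hypothetical lookup table for this sketch.)
const RATIOS: Record<string, number> = {
  "16:9": 16 / 9,
  "3:2": 3 / 2,
  "4:3": 4 / 3,
  "1:1": 1,
  "4:5": 4 / 5,
  "2:3": 2 / 3,
  "9:16": 9 / 16,
};

// Largest rectangle of the given ratio that fits inside the source,
// centred on both axes.
function centeredCrop(srcWidth: number, srcHeight: number, ratio: number): CropRect {
  let width = srcWidth;
  let height = srcHeight;
  if (srcWidth / srcHeight > ratio) {
    // Source is wider than the target: keep full height, trim the sides.
    width = Math.round(srcHeight * ratio);
  } else {
    // Source is taller than (or equal to) the target: keep full width, trim top and bottom.
    height = Math.round(srcWidth / ratio);
  }
  return {
    x: Math.floor((srcWidth - width) / 2),
    y: Math.floor((srcHeight - height) / 2),
    width,
    height,
  };
}
```

For example, `centeredCrop(1920, 1080, RATIOS["1:1"])` keeps the full height and trims equal strips from the left and right.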
Drop an image here, or click to browse
Supports JPEG, PNG, WebP — EXIF orientation handled automatically
Source image is smaller than 1080 px on the short side — exported file may look soft at large sizes.
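The warning above amounts to a check on the shorter of the two image dimensions. A minimal sketch, assuming the 1080 px threshold quoted in the message; the function and constant names are hypothetical:

```typescript
// Threshold taken from the warning text; the name is hypothetical.
const MIN_SHORT_SIDE_PX = 1080;

// True when the shorter image dimension falls below the threshold,
// i.e. when the "may look soft" warning would apply.
function isLowResolution(
  width: number,
  height: number,
  minShortSide: number = MIN_SHORT_SIDE_PX,
): boolean {
  return Math.min(width, height) < minShortSide;
}
```

A 1920 × 1080 image passes the check, while a 1280 × 720 image would trigger the warning.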
Drop videos here, or click to browse
Supports MP4, MOV, MKV, WebM, AVI — multiple files supported
Audio is copied without re-encoding (lossless, fast)
Drop a video or audio file here, or click to browse
Video: MP4, MOV, MKV, WebM — Audio: MP3, AAC, WAV, M4A, OGG — stream copy, near-instant
Drop a video here, or click to browse
MP4, MOV, MKV, WebM, AVI — audio track extracted in-browser
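The page does not say which engine performs the extraction. Purely as an assumption, if it were driven by an ffmpeg-style command line (for instance through a WebAssembly build of ffmpeg), "copied without re-encoding" would correspond to a stream copy of the audio track. The sketch below only builds the argument list; the helper name is hypothetical, while `-i`, `-vn` and `-c:a copy` are standard ffmpeg options.

```typescript
// Builds ffmpeg-style arguments that drop the video stream and copy the
// audio stream as-is (no re-encoding). Assumes an ffmpeg-compatible engine;
// the tool's real implementation is not documented on this page.
function audioStreamCopyArgs(input: string, output: string): string[] {
  return [
    "-i", input,    // input file
    "-vn",          // discard video
    "-c:a", "copy", // copy the audio stream without re-encoding
    output,
  ];
}
```

With a stream copy the output container has to be able to hold the source codec (for example AAC audio into `.m4a`), which is one reason an implementation might pick the output extension from the detected codec.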
Drop images here, or click to browse
JPEG, PNG, WebP, GIF — multiple files supported
Drop an image here, or click to browse
JPEG, PNG, WebP, GIF — extracts dominant colours
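How the dominant colours are extracted is not stated on the page. One simple approach, shown here only as a sketch, is to bucket pixels into a coarse RGB grid and report the average colour of the most populated buckets. The function name and parameters are hypothetical; the input is raw RGBA bytes such as those in a canvas `ImageData.data` array.

```typescript
// Returns up to `count` hex colours, most frequent first.
// Not the tool's documented algorithm: a coarse-histogram sketch.
function dominantColours(
  rgba: Uint8ClampedArray,
  count: number = 5,
  bitsPerChannel: number = 4,
): string[] {
  const shift = 8 - bitsPerChannel;
  const buckets = new Map<number, { n: number; r: number; g: number; b: number }>();

  for (let i = 0; i + 3 < rgba.length; i += 4) {
    const r = rgba[i];
    const g = rgba[i + 1];
    const b = rgba[i + 2];
    const a = rgba[i + 3];
    if (a < 128) continue; // skip mostly transparent pixels

    // Pack the top bits of each channel into one bucket key.
    const key =
      ((r >> shift) << (2 * bitsPerChannel)) |
      ((g >> shift) << bitsPerChannel) |
      (b >> shift);

    const bucket = buckets.get(key) ?? { n: 0, r: 0, g: 0, b: 0 };
    bucket.n += 1;
    bucket.r += r;
    bucket.g += g;
    bucket.b += b;
    buckets.set(key, bucket);
  }

  const toHex = (v: number): string => {
    const h = Math.round(v).toString(16);
    return h.length < 2 ? "0" + h : h;
  };

  return Array.from(buckets.values())
    .sort((x, y) => y.n - x.n)
    .slice(0, count)
    .map((k) => "#" + toHex(k.r / k.n) + toHex(k.g / k.n) + toHex(k.b / k.n));
}
```

Averaging within each bucket keeps the reported colour close to what is actually in the image rather than snapping it to the grid. More elaborate methods (median cut, k-means) give better palettes at higher cost.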
Drop a photo here, or click to browse
JPEG, PNG, WebP — drag the label to position, pull corners to resize

Canvas

Size

Audio

Drop audio or video file

Background (optional)

Drop photo or video

Waveform

Style

Colour

Opacity 90%
Height 40%
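To show how a height setting like the one above might translate into drawing, here is a minimal sketch that reduces decoded audio samples to one peak per bar and scales each bar to a fraction of the canvas height. It assumes samples in the range -1 to 1 (as in Web Audio channel data); the function name and parameters are hypothetical, and the tool's real rendering may differ.

```typescript
// Returns one bar height (in pixels) per bar. A full-scale peak maps to
// canvasHeight * heightFraction, e.g. 40% of the canvas at 0.4.
function waveformBars(
  samples: Float32Array,
  barCount: number,
  canvasHeight: number,
  heightFraction: number = 0.4,
): number[] {
  const maxBarHeight = canvasHeight * heightFraction;
  const windowSize = Math.max(1, Math.floor(samples.length / barCount));
  const bars: number[] = [];

  for (let b = 0; b < barCount; b++) {
    const start = b * windowSize;
    const end = Math.min(samples.length, start + windowSize);

    // Peak absolute amplitude within this bar's window.
    let peak = 0;
    for (let i = start; i < end; i++) {
      peak = Math.max(peak, Math.abs(samples[i]));
    }

    bars.push(Math.min(1, peak) * maxBarHeight);
  }
  return bars;
}
```

An opacity setting such as 90% would then be applied at draw time, for example through the canvas context's `globalAlpha`.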
1. Pick a canvas size
2. Drop an audio or video file
3. Add a background (optional)
4. Style & export

Transcribe

Transcribe audio or video locally — no upload, no API key needed. Powered by OpenAI Whisper running in your browser.

Model

Drop audio or video file — or click to browse

OCR

Extract text from images locally — no upload required. Powered by Tesseract.js running in your browser.

Language in image

Select the language the source is written in. The system uses this to load the right language model.

Drop image (JPEG, PNG, WebP) — or click to browse