Description
We're looking for an AI / generative video specialist to build a self-hosted platform (running on our own server) that generates fully AI UGC videos, starting from an avatar we create ourselves and producing spoken clips in Italian with correct, properly synchronized lip sync. We operate in the supplements and beauty sector, so our videos often feature physical products on camera. It is critical that the product label stays intact, legible, and undistorted throughout AI generation — no warped text, garbled logos, or mangled packaging. This is a hard requirement. We already have active, working accounts on the main services (Seedance, fal.ai, ElevenLabs, and others). The platform must connect to these via API — not rebuild everything from scratch. Important — what already works: the Italian voice generated by ElevenLabs works very well for us. That part of the pipeline is solid, and we want to keep using it. Each avatar will have its own ElevenLabs voice code, which the platform should use as the starting point for generation. What the platform must do Maintain an internal library of avatars (5–6, no more) that we create ourselves using Nano Banana (reference images of the characters). Generation should always start from one of these avatars, paired with its corresponding ElevenLabs voice code. Generate multiple videos in parallel, where each subsequent scene starts from the last frame of the previous scene, to maintain visual consistency and character continuity. When a produ