Posted 1 Days Ago Job ID: 2109205 27 quotes received

Offline Video Generation Platforme

Fixed Price$5k-$10k
Quotes (27)  ·  Premium Quotes (4)  ·  Invited (0)  ·  Hired (0)

  Send before: September 05, 2025

Send a Quote

Full-Stack Developer Team for Hasaballa AI Video Generation Platfo

## 🧠 Project Overview

We are seeking a skilled and dedicated team of developers to build **Hasaballa AI**, an **offline-first**, AI-powered video generation platform designed for **Arabic-speaking creators**.


The platform leverages **advanced AI models** to generate ultra-realistic images, voices, animations, and full videos (1-15 minutes) based on user scripts, supporting **all Arabic dialects** with a **100% Arabic RTL interface**.


🖥️ Hasaballa AI operates entirely offline (**no internet, cloud, credits, or subscriptions**) on a high-performance system:


- Windows 11 Home 64-bit  

- AMD Ryzen™ 9 9955HX  

- RTX 5090 24GB GDDR7  

- 64GB DDR5 RAM  

- 4TB M.2 NVMe SSD  

- 16.0" WQXGA 300Hz display  


🎬 The platform aims to produce cinematic videos **matching or exceeding the quality of the following examples** (please review all links for minimum quality standards):  

🔗 https://tinyurl.com/4py6ntv5  

🔗 https://tinyurl.com/457h6pxb  

🔗 https://tinyurl.com/kye234sk  

🔗 https://tinyurl.com/s65thcdc  

🔗 https://tinyurl.com/mjb7cw8v  

🔗 https://tinyurl.com/r6b2ndn9


---


## 🧩 How Hasaballa AI Works

Hasaballa AI offers three modes of operation for maximum flexibility:


1. 🎬 **Smart Director – Automatic Mode**  

   Users write a full script in the Hasaballa GPT chat window (scenes, images, voices, animations), click the "Smart Director – Auto" button, and the system automatically generates:

   - Ultra-realistic images  

   - Character animations  

   - Voice generation/cloning  

   - Lip-sync  

   - Background audio  

   - Final video with a full timeline  

   ✅ No manual steps are required, but users can review/tweak results afterward.


2. 🎛️ **Smart Director – Manual Mode**  

   Users write a script, click the "Smart Director – Manual" button, and control each production stage (e.g., generate image, skip voice, add audio) step-by-step.  

   🧠 Generated elements are added to the final timeline — ideal for advanced users seeking precise control.


3. 🧩 **Standalone Mode**  

   Users can use individual tools (image generation, voice cloning, animation, lip-sync, background audio) via dedicated buttons for quick tasks or partial content creation (e.g., podcasts, teasers).  

   🛠️ Manual assembly or import into the editor is required for final video output.


---


## 🛠️ Project Scope (Milestones and Tasks)


### 📸 Milestone 1: Ultra-Realistic Image Generation ($1200-$1400, 2-3 weeks)

- **Tasks**:

  - Develop an ultra-realistic image generation system for characters and backgrounds, supporting high-fidelity outputs for video integration.

  - Image generation by:

    - Text to Image  

    - Real image to Image (AI)

  - Integrate with Character Pack (image + voice) for consistent character identity across scenes.


- **Technical Requirements**:

  - Use open-source models like Stable Diffusion or DALL·E-style architectures.  

  - Optimize for RTX 5090 with CUDA and FP16 for performance.  

  - Ensure compatibility with animation and video modules.


---


### 🔊 Milestone 2: Voice Generation & Sound Library ($1000-$1300, 2-3 weeks)

- **Tasks**:

  - Develop a voice generation and cloning system supporting 8+ characters with studio-grade audio (48kHz, natural breathing, pauses, emotional tones).

  - Support all Arabic dialects (e.g., Egyptian, Gulf, Levantine, Moroccan) with 100% voice cloning accuracy.

  - Integrate a local sound library and Pixabay search for background audio (music, applause, etc.).

  - Implement a background audio engine for multi-layered audio tracks with seamless blending.

  - Add a "Skip This Step" button for voice and sound stages, integrated with Smart Director.


- **Technical Requirements**:

  - Use open-source models like VITS or Tacotron 2 for voice generation/cloning.

  - Optimize for performance (8-14 seconds per 10-15 word voice line) using CUDA on RTX 5090.

  - Integrate with Character Pack from Milestone 1.


---


### 🕴️ Milestone 3: Image Animation & Lip Sync ($1200-$1600, 2-3 weeks)

- **Tasks**:

  - Develop an image animation system for full-body motion (eyes, facial expressions, shoulders, hands, legs) and cinematic background motion (e.g., moving trees, people walking).

  - Implement lip-sync and full-body motion matching or exceeding DreamFace or SkyReels-level quality, supporting four animation modes: Narration, Dialogue, Mixed, and No Lip Sync.

  - Add a "Skip This Step" button for animation, integrated with Smart Director.

  - Ensure animations are generated in 12-20 seconds per image with high-fidelity output.


- **Technical Requirements**:

  - Use open-source models like SadTalker or FirstOrderMotion for animation.

  - Optimize for RTX 5090 with CUDA and FP16 for performance.

  - Integrate with Character Pack and voice modules from previous milestones.


---


### 💬 Milestone 4: Chat Window (Hasaballa GPT) & Smart Director ($2000-$2300, 2-3 weeks)

- **Tasks**:

  - Build a chat window powered by an offline AI model (e.g., Mixtral 8x7B or Nous-Hermes-2 Mistral 7B) supporting all Arabic dialects and script analysis (15 scenes in 6-10 seconds).

  - Support text input (15,000 characters), voice commands (speech-to-text), and upload of 8+ reference images/voices.

  - Implement intelligent script parsing for scenes, backgrounds, sounds, and lip-sync instructions.

  - Develop Smart Director with Auto and Manual modes, including a visual progress bar for Manual mode.

  - Add optional internet access buttons ("Smart Internet Access" and "Disconnect") and a saved projects panel.


- **Technical Requirements**:

  - Optimize AI model for offline use with CUDA on RTX 5090.

  - Ensure full RTL support for the Arabic interface.

  - Integrate with previous modules (image, voice, animation).


---


### 🎞️ Milestone 5: Final Video, Editing, and Integrations ($1000-$1400, 3 weeks)

- **Tasks**:

  - Develop a CapCut-style video editor with trimming, transitions, subtitles (manual/AI-generated), filters, zoom/pan, and drag-and-drop timeline.

  - Support video export in HD, Full HD, 720p, 1080p, 4K with aspect ratios (16:9, 9:16, 1:1).

  - Implement Smart Edit Layer for editing via Arabic prompts with timecodes.

  - Integrate with Google AdSense and YouTube Studio for revenue tracking and performance analytics (requires internet connection button).

  - Build a 100% Arabic RTL interface with large buttons and tooltips.

  - Add auto-update, save project, project folder system, and backup/restore functions.

  - Fully connect and integrate all modules (image generation, voice, animation, GPT chat, and timeline editor) into a unified, seamless platform** — ensuring smooth transitions and full workflow continuity.

  - Provide full source code ownership upon completion.

- **Technical Requirements**:

  - Use FFmpeg for video processing and export.

  - Implement APIs for AdSense/YouTube integration.

  - Optimize for RTX 5090 with CUDA and FP16.

  - Ensure secure file management and backup systems.


---


## 🎯 Required Skills

- **AI & Machine Learning**: Open-source models like Stable Diffusion, VITS, Tacotron 2, SadTalker, Mixtral 8x7B.

- **Video Processing**: FFmpeg, CapCut-style editing tools.

- **Web Development**: HTML, JS, React – with full RTL Arabic interface design.

- **API Integration**: Google AdSense, YouTube Studio.

- **System Design**: Secure file systems, offline-first architecture.

- **Hardware Optimization**: CUDA, FP16, RTX 5090.

- **Arabic Fluency**: Understanding dialects, full Arabic support.


---


## 🖥️ Platform Requirements

- Full offline operation (up to 200GB)

- Image gen: 10–18 sec/image  

- Voice gen: 8–14 sec / 10–15 words  

- Animation: 12–20 sec/image  

- Script analysis: 15 scenes in 6–10 sec  

- Video output: 7 mins in 45 mins / 15 mins in 90 mins  

- Must match or exceed reference video quality


---


## 💰 Budget & Timeline

- **Total Budget**: $6400–$8000  

- **Milestone Breakdown**:

  - M1: $1200–$1400  

  - M2: $1000–$1300  

  - M3: $1200–$1600  

  - M4: $2000–$2300  

  - M5: $1000–$1400  

- **Timeline**: 11–15 weeks total


🔔 *Note*: Milestone 5 may need increased time for quality — 3–4 weeks.


"We aim to maximize quality within our total budget cap of $8000, prioritizing efficient resource allocation across all milestones."

---


## 📦 Deliverables

- Fully functional Hasaballa AI platform  

- Source code with docs (Python, JS, etc.)  

- Optimized for RTX 5090  

- RTL Arabic UI  

- Full source code ownership


---


## 👥 Team Requirements

- Team of 2–6 devs (AI, video, front-end, back-end)  

- Senior or mid-level  

- Weekly progress updates  

- Portfolio required


---


## 📬 How to Apply

Please include:

1. 🧩 Breakdown per milestone  

2. 🧠 Relevant portfolio links  

3. 🕒 Timeline + budget confirmation  

4. 👥 Team composition  

5. ✅ Availability to start now

6. Developers must optimize the platform (up to 200GB) for efficient performance on the specified hardware using techniques like FP16 and model compression.


---


📌 Notes  

- Full offline operation (Pixabay, AdSense, YouTube optional online)  

- No proprietary code unless open-license  

- Must review quality links  

- Ownership must be fully transferred


✅ Official Screening Questions for Hasaballa AI Developers:


1. Are you willing to deliver a high-quality, realistic 37-second demo to prove your readiness for this project?


2. What are the key AI tools and frameworks you’ve mastered or worked with in the areas of:

   - Image generation

   - Voice synthesis

   - Photo animation

   - Lip-syncing


3. Are you capable of building the entire platform on your own, or do you work with a full development team?


4. Will you strictly follow our quality standards?

   Please note: any compromise in image generation, voice quality, animation, backgrounds, or lip-syncing will immediately terminate the contract.



🚀 Apply now and help build the next-generation Arabic AI video platform!


... Show more
Ahmed S Germany