Posted 1 Days Ago Job ID: 2109205 27 quotes received

Offline Video Generation Platforme

Fixed Price$5k-$10k

Quotes (27) · Premium Quotes (4) · Invited (0) · Hired (0)

Send before: September 05, 2025

Programming & Development Programming & Software

Python API JSON Open Source Computer Graphics Embedded Development Operating Systems Version Control Data Extraction Amazon Web Services Artificial Intelligence Object-Oriented Programming Chatbots General / Other Programming & Software

Full-Stack Developer Team for Hasaballa AI Video Generation Platfo

## 🧠 Project Overview

We are seeking a skilled and dedicated team of developers to build **Hasaballa AI**, an **offline-first**, AI-powered video generation platform designed for **Arabic-speaking creators**.

The platform leverages **advanced AI models** to generate ultra-realistic images, voices, animations, and full videos (1-15 minutes) based on user scripts, supporting **all Arabic dialects** with a **100% Arabic RTL interface**.

🖥️ Hasaballa AI operates entirely offline (**no internet, cloud, credits, or subscriptions**) on a high-performance system:

- Windows 11 Home 64-bit

- AMD Ryzen™ 9 9955HX

- RTX 5090 24GB GDDR7

- 64GB DDR5 RAM

- 4TB M.2 NVMe SSD

- 16.0" WQXGA 300Hz display

🎬 The platform aims to produce cinematic videos **matching or exceeding the quality of the following examples** (please review all links for minimum quality standards):

🔗 https://tinyurl.com/4py6ntv5

🔗 https://tinyurl.com/457h6pxb

🔗 https://tinyurl.com/kye234sk

🔗 https://tinyurl.com/s65thcdc

🔗 https://tinyurl.com/mjb7cw8v

🔗 https://tinyurl.com/r6b2ndn9

---

## 🧩 How Hasaballa AI Works

Hasaballa AI offers three modes of operation for maximum flexibility:

1. 🎬 **Smart Director – Automatic Mode**

Users write a full script in the Hasaballa GPT chat window (scenes, images, voices, animations), click the "Smart Director – Auto" button, and the system automatically generates:

- Ultra-realistic images

- Character animations

- Voice generation/cloning

- Lip-sync

- Background audio

- Final video with a full timeline

✅ No manual steps are required, but users can review/tweak results afterward.

2. 🎛️ **Smart Director – Manual Mode**

Users write a script, click the "Smart Director – Manual" button, and control each production stage (e.g., generate image, skip voice, add audio) step-by-step.

🧠 Generated elements are added to the final timeline — ideal for advanced users seeking precise control.

3. 🧩 **Standalone Mode**

Users can use individual tools (image generation, voice cloning, animation, lip-sync, background audio) via dedicated buttons for quick tasks or partial content creation (e.g., podcasts, teasers).

🛠️ Manual assembly or import into the editor is required for final video output.

---

## 🛠️ Project Scope (Milestones and Tasks)

### 📸 Milestone 1: Ultra-Realistic Image Generation ($1200-$1400, 2-3 weeks)

- **Tasks**:

- Develop an ultra-realistic image generation system for characters and backgrounds, supporting high-fidelity outputs for video integration.

- Image generation by:

- Text to Image

- Real image to Image (AI)

- Integrate with Character Pack (image + voice) for consistent character identity across scenes.

- **Technical Requirements**:

- Use open-source models like Stable Diffusion or DALL·E-style architectures.

- Optimize for RTX 5090 with CUDA and FP16 for performance.

- Ensure compatibility with animation and video modules.

---

### 🔊 Milestone 2: Voice Generation & Sound Library ($1000-$1300, 2-3 weeks)

- **Tasks**:

- Develop a voice generation and cloning system supporting 8+ characters with studio-grade audio (48kHz, natural breathing, pauses, emotional tones).

- Support all Arabic dialects (e.g., Egyptian, Gulf, Levantine, Moroccan) with 100% voice cloning accuracy.

- Integrate a local sound library and Pixabay search for background audio (music, applause, etc.).

- Implement a background audio engine for multi-layered audio tracks with seamless blending.

- Add a "Skip This Step" button for voice and sound stages, integrated with Smart Director.

- **Technical Requirements**:

- Use open-source models like VITS or Tacotron 2 for voice generation/cloning.

- Optimize for performance (8-14 seconds per 10-15 word voice line) using CUDA on RTX 5090.

- Integrate with Character Pack from Milestone 1.

---

### 🕴️ Milestone 3: Image Animation & Lip Sync ($1200-$1600, 2-3 weeks)

- **Tasks**:

- Develop an image animation system for full-body motion (eyes, facial expressions, shoulders, hands, legs) and cinematic background motion (e.g., moving trees, people walking).

- Implement lip-sync and full-body motion matching or exceeding DreamFace or SkyReels-level quality, supporting four animation modes: Narration, Dialogue, Mixed, and No Lip Sync.

- Add a "Skip This Step" button for animation, integrated with Smart Director.

- Ensure animations are generated in 12-20 seconds per image with high-fidelity output.

- **Technical Requirements**:

- Use open-source models like SadTalker or FirstOrderMotion for animation.

- Optimize for RTX 5090 with CUDA and FP16 for performance.

- Integrate with Character Pack and voice modules from previous milestones.

---

### 💬 Milestone 4: Chat Window (Hasaballa GPT) & Smart Director ($2000-$2300, 2-3 weeks)

- **Tasks**:

- Build a chat window powered by an offline AI model (e.g., Mixtral 8x7B or Nous-Hermes-2 Mistral 7B) supporting all Arabic dialects and script analysis (15 scenes in 6-10 seconds).

- Support text input (15,000 characters), voice commands (speech-to-text), and upload of 8+ reference images/voices.

- Implement intelligent script parsing for scenes, backgrounds, sounds, and lip-sync instructions.

- Develop Smart Director with Auto and Manual modes, including a visual progress bar for Manual mode.

- Add optional internet access buttons ("Smart Internet Access" and "Disconnect") and a saved projects panel.

- **Technical Requirements**:

- Optimize AI model for offline use with CUDA on RTX 5090.

- Ensure full RTL support for the Arabic interface.

- Integrate with previous modules (image, voice, animation).

---

### 🎞️ Milestone 5: Final Video, Editing, and Integrations ($1000-$1400, 3 weeks)

- **Tasks**:

- Develop a CapCut-style video editor with trimming, transitions, subtitles (manual/AI-generated), filters, zoom/pan, and drag-and-drop timeline.

- Support video export in HD, Full HD, 720p, 1080p, 4K with aspect ratios (16:9, 9:16, 1:1).

- Implement Smart Edit Layer for editing via Arabic prompts with timecodes.

- Integrate with Google AdSense and YouTube Studio for revenue tracking and performance analytics (requires internet connection button).

- Build a 100% Arabic RTL interface with large buttons and tooltips.

- Add auto-update, save project, project folder system, and backup/restore functions.

- Fully connect and integrate all modules (image generation, voice, animation, GPT chat, and timeline editor) into a unified, seamless platform** — ensuring smooth transitions and full workflow continuity.

- Provide full source code ownership upon completion.

- **Technical Requirements**:

- Use FFmpeg for video processing and export.

- Implement APIs for AdSense/YouTube integration.

- Optimize for RTX 5090 with CUDA and FP16.

- Ensure secure file management and backup systems.

---

## 🎯 Required Skills

- **AI & Machine Learning**: Open-source models like Stable Diffusion, VITS, Tacotron 2, SadTalker, Mixtral 8x7B.

- **Video Processing**: FFmpeg, CapCut-style editing tools.

- **Web Development**: HTML, JS, React – with full RTL Arabic interface design.

- **API Integration**: Google AdSense, YouTube Studio.

- **System Design**: Secure file systems, offline-first architecture.

- **Hardware Optimization**: CUDA, FP16, RTX 5090.

- **Arabic Fluency**: Understanding dialects, full Arabic support.

---

## 🖥️ Platform Requirements

- Full offline operation (up to 200GB)

- Image gen: 10–18 sec/image

- Voice gen: 8–14 sec / 10–15 words

- Animation: 12–20 sec/image

- Script analysis: 15 scenes in 6–10 sec

- Video output: 7 mins in 45 mins / 15 mins in 90 mins

- Must match or exceed reference video quality

---

## 💰 Budget & Timeline

- **Total Budget**: $6400–$8000

- **Milestone Breakdown**:

- M1: $1200–$1400

- M2: $1000–$1300

- M3: $1200–$1600

- M4: $2000–$2300

- M5: $1000–$1400

- **Timeline**: 11–15 weeks total

🔔 *Note*: Milestone 5 may need increased time for quality — 3–4 weeks.

"We aim to maximize quality within our total budget cap of $8000, prioritizing efficient resource allocation across all milestones."

---

## 📦 Deliverables

- Fully functional Hasaballa AI platform

- Source code with docs (Python, JS, etc.)

- Optimized for RTX 5090

- RTL Arabic UI

- Full source code ownership

---

## 👥 Team Requirements

- Team of 2–6 devs (AI, video, front-end, back-end)

- Senior or mid-level

- Weekly progress updates

- Portfolio required

---

## 📬 How to Apply

Please include:

1. 🧩 Breakdown per milestone

2. 🧠 Relevant portfolio links

3. 🕒 Timeline + budget confirmation

4. 👥 Team composition

5. ✅ Availability to start now

6. Developers must optimize the platform (up to 200GB) for efficient performance on the specified hardware using techniques like FP16 and model compression.

---

📌 Notes

- Full offline operation (Pixabay, AdSense, YouTube optional online)

- No proprietary code unless open-license

- Must review quality links

- Ownership must be fully transferred

✅ Official Screening Questions for Hasaballa AI Developers:

1. Are you willing to deliver a high-quality, realistic 37-second demo to prove your readiness for this project?

2. What are the key AI tools and frameworks you’ve mastered or worked with in the areas of:

- Image generation

- Voice synthesis

- Photo animation

- Lip-syncing

3. Are you capable of building the entire platform on your own, or do you work with a full development team?

4. Will you strictly follow our quality standards?

Please note: any compromise in image generation, voice quality, animation, backgrounds, or lip-syncing will immediately terminate the contract.

🚀 Apply now and help build the next-generation Arabic AI video platform!

Job Q&A

Become a member to ask a question, view Q&A, and get more benefits.

Similar Jobs

AI Sermon Platform Developer Needed ASAP
Fixed Price or HourlyPosted: July 17, 2025
Extract Videos and PDFs from Protected
Fixed Price or HourlyPosted: June 24, 2025
Intake AI Voice Agent wth Doc Processing
Fixed Price or HourlyPosted: July 30, 2025

Posted By

Ahmed S

Germany


Feedback	No Feedback 0.0%
Total Spend	$0
Jobs Posted	1
Jobs Paid	0
Paid Invoices	0
Outstanding Invoices	0

More Jobs from Ahmed S (0)

Add to Watchlist Send a Quote