
ContentGeneration Pipeline

This project runs a 3-step video pipeline:

  1. Generate shot videos from images + prompts.
  2. Merge each generated video with its audio.
  3. Concatenate merged clips into one final output.

The pipeline entrypoint is run_video_pipeline.py.
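Steps 2 and 3 come down to two ffmpeg invocations. A minimal sketch of how those commands could be assembled (the function names are illustrative, not the pipeline's actual helpers; running them assumes ffmpeg is on PATH):

```python
from pathlib import Path

def build_merge_cmd(video: Path, audio: Path, out: Path) -> list[str]:
    # Step 2: mux a shot's video with its audio track. Copy the video
    # stream, encode audio to AAC, and stop at the shorter input.
    return ["ffmpeg", "-y", "-i", str(video), "-i", str(audio),
            "-c:v", "copy", "-c:a", "aac", "-shortest", str(out)]

def build_concat_cmd(list_file: Path, out: Path) -> list[str]:
    # Step 3: concatenate clips listed in an ffmpeg concat-demuxer
    # file (one "file 'merged/clip.mp4'" line per clip).
    return ["ffmpeg", "-y", "-f", "concat", "-safe", "0",
            "-i", str(list_file), "-c", "copy", str(out)]
```

Each command list can be passed to subprocess.run(cmd, check=True).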

Quick Start

Local Python:

cp .env.example .env
uv sync --dev
uv run python run_video_pipeline.py

Docker (GPU):

cp .env.example .env
docker build -t content-generation:latest .
docker run --rm --gpus all --env-file .env \
  -v "$(pwd)":/app \
  -v "$HOME/.cache/huggingface":/root/.cache/huggingface \
  -w /app content-generation:latest

First run (skip S3 upload):

python run_video_pipeline.py --skip-s3-upload

Docker first run (skip S3 upload):

docker run --rm --gpus all --env-file .env \
  -v "$(pwd)":/app \
  -v "$HOME/.cache/huggingface":/root/.cache/huggingface \
  -w /app \
  content-generation:latest \
  python run_video_pipeline.py --skip-s3-upload

Project Layout

  • run_video_pipeline.py: main entrypoint.
  • src/: helper scripts used by the pipeline.
  • HunyuanVideo-1.5/: Hunyuan inference code and model dependencies.
  • reel_script.json: required script input with shots.
  • images/, audios/, videos/, merged/, results/: working/output folders.
  • .env.example: environment variable template.

Prerequisites

  1. Linux with NVIDIA GPU and CUDA runtime.
  2. ffmpeg and ffprobe available on PATH.
  3. Python 3.10+.
  4. uv installed (https://docs.astral.sh/uv/).
  5. Hunyuan model checkpoints under HunyuanVideo-1.5/ckpts.
  6. If downloading FLUX locally, approved access to black-forest-labs/FLUX.1-schnell on Hugging Face.

Environment Variables

  1. Create a local env file:
cp .env.example .env
  2. Fill required variables in .env:
  • ELEVENLABS_API_KEY for audio generation.
  • HUGGINGFACE_HUB_TOKEN if gated Hugging Face model access is needed.
  • AWS_S3_BUCKET (+ optional AWS vars) if you want final output uploaded to S3.
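A filled-in .env might look like the following (all values are placeholders; see .env.example for the full variable list):

```shell
ELEVENLABS_API_KEY=sk_your_key_here
HUGGINGFACE_HUB_TOKEN=hf_your_token_here
AWS_S3_BUCKET=my-output-bucket
AWS_REGION=us-east-1
```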

Run Locally (Python)

  1. Create and activate a virtual environment:
uv venv
source .venv/bin/activate
  2. Install Python dependencies:
uv sync --dev
  3. Install Hunyuan dependencies:
uv pip install -r HunyuanVideo-1.5/requirements.txt
uv pip install --upgrade tencentcloud-sdk-python
uv pip install sgl-kernel==0.3.18
  4. Run the full pipeline:
uv run python run_video_pipeline.py
  5. Common options:
# Skip generation and only merge + concat
python run_video_pipeline.py --skip-generate

# Skip S3 upload
python run_video_pipeline.py --skip-s3-upload

# Override base directory
python run_video_pipeline.py --base-dir /absolute/path/to/workdir

# Change logging verbosity
python run_video_pipeline.py --log-level DEBUG

Run with Docker

  1. Build the image:
docker build -t content-generation:latest .
  2. Optionally build with extra attention backends:
docker build -t content-generation:latest --build-arg INSTALL_OPTIONAL_ATTENTION=1 .
  3. Run the pipeline in a container (GPU required):
docker run --rm --gpus all \
  --env-file .env \
  -v "$(pwd)":/app \
  -v "$HOME/.cache/huggingface":/root/.cache/huggingface \
  -w /app \
  content-generation:latest
  4. Pass extra pipeline args:
docker run --rm --gpus all \
  --env-file .env \
  -v "$(pwd)":/app \
  -v "$HOME/.cache/huggingface":/root/.cache/huggingface \
  -w /app \
  content-generation:latest \
  python run_video_pipeline.py --skip-s3-upload --log-level DEBUG

Input Expectations

  1. reel_script.json must exist and contain a shots array.
  2. images/shot_<n>.png and audios/output_<n>.mp3 should align by shot number.
  3. Final output is written by default to results/final_output.mp4.
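The alignment rule in point 2 can be checked up front with a short script. A sketch under the naming convention above (it assumes shots are numbered consecutively from 1, which the source does not state explicitly):

```python
import json
from pathlib import Path

def missing_assets(base: Path) -> list[str]:
    # Cross-check reel_script.json's shots array against the expected
    # images/shot_<n>.png and audios/output_<n>.mp3 files.
    shots = json.loads((base / "reel_script.json").read_text())["shots"]
    problems = []
    for n in range(1, len(shots) + 1):  # assumed: shot numbers start at 1
        for rel in (f"images/shot_{n}.png", f"audios/output_{n}.mp3"):
            if not (base / rel).exists():
                problems.append(rel)
    return problems
```

An empty return value means every shot has both its image and its audio file.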

S3 Upload Behavior

  1. If AWS_S3_BUCKET is set, the pipeline uploads final output to S3 using S3VideoStorage.
  2. If AWS_S3_BUCKET is missing, upload is skipped with a warning.
  3. Disable upload explicitly with --skip-s3-upload.

Troubleshooting

  1. torch.cuda.is_available() returns False in Docker.
  • Run with GPU flags: docker run --gpus all ...
  • Verify NVIDIA Container Toolkit is installed on host.
  • Check host GPU visibility: nvidia-smi.
  2. ffmpeg or ffprobe not found.
  • Local: install ffmpeg with your package manager.
  • Docker: ffmpeg is installed in the provided Dockerfile.
  3. Hunyuan generate step fails due to missing checkpoints.
  • Ensure checkpoints are available under HunyuanVideo-1.5/ckpts.
  • Confirm mounted project path in Docker includes checkpoints.
  4. Hugging Face model download fails (401/403).
  • Accept model access terms for gated models (for example FLUX.1-schnell).
  • Set HUGGINGFACE_HUB_TOKEN in .env.
  5. S3 upload fails.
  • Confirm AWS_S3_BUCKET is set.
  • If needed, set AWS_REGION and credentials (AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, optional AWS_SESSION_TOKEN).
  • For S3-compatible providers, set AWS_S3_ENDPOINT_URL.
  6. Permission issues when running Docker with mounted volumes.
  • Use your host user mapping if needed: docker run --rm --gpus all -u "$(id -u):$(id -g)" ...
  7. Out-of-memory during video generation.
  • Keep PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True,max_split_size_mb:128.
  • Reduce workload by skipping optional enhancements or lowering resolution/steps in generation scripts.
  8. Verify syntax quickly before running.
uv run python -m py_compile run_video_pipeline.py src/*.py

Testing

Run tests with coverage:

uv run pytest