YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Multi View
Multi-view face as condition for human-centric video generation.
✔️ TODO List
- Base code for single-shot video generation
- Spliting RoPE for video and reference images
- Face selecting router
- Supporting multi-shot video generation
- Four level shot RoPE
- Inter-shot self-attention and frame-pack based intra-shot attention
🚀 Training
bash train.sh
🚀 Inference
bash test.sh
⚙️ Configuration
YAML:
train_args:
max_checkpoints_to_keep: 3
resume_from_checkpoint: True
seed: 42
save_steps: 150
save_epoches: 1
batch_size: 8
visual_log_project_name: Wan2.2_5B-Multi_view-normal_rope_384_640-3ref
output_path: /root/paddlejob/workspace/qizipeng/baidu/personal-code/Multi-view/multi_view/ckpts
local_model_path: /root/paddlejob/workspace/qizipeng/wanx_pretrainedmodels
zero_face_ratio: 0.1
split_rope: False
split1: False
split2: False
split3: False
infer_args:
infer_step: 1350
epoch_id: 17
dataset_args:
base_path: /root/paddlejob/workspace/qizipeng/baidu/personal-code/Multi-view/multi_view/datasets/merged_wangpan_artgrid_taobao_visionchina_123rf_nasuyun_xinpianchang_disk.json
height: 384
width: 640
num_frames: 81
ref_num: 3
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support