[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-72036":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":9,"language":10,"languages":9,"totalLinesOfCode":9,"stars":11,"forks":12,"watchers":13,"openIssues":14,"contributorsCount":15,"subscribersCount":15,"size":15,"stars1d":16,"stars7d":17,"stars30d":18,"stars90d":15,"forks30d":15,"starsTrendScore":19,"compositeScore":20,"rankGlobal":9,"rankLanguage":9,"license":21,"archived":22,"fork":22,"defaultBranch":23,"hasWiki":22,"hasPages":24,"topics":25,"createdAt":9,"pushedAt":9,"updatedAt":26,"readmeContent":27,"aiSummary":28,"trendingCount":15,"starSnapshotCount":15,"syncStatus":29,"lastSyncTime":30,"discoverSource":31},72036,"InfiniteTalk","MeiGen-AI\u002FInfiniteTalk","MeiGen-AI","​​Unlimited-length talking video generation​​ that supports image-to-video and video-to-video generation",null,"Python",6844,1208,59,158,0,43,95,294,129,40.25,"Apache License 2.0",false,"main",true,[],"2026-06-12 02:02:57","\u003Cdiv align=\"center\">\n\n\u003Cp align=\"center\">\n  \u003Cimg src=\"assets\u002Flogo2.jpg\" alt=\"InfinteTalk\" width=\"440\"\u002F>\n\u003C\u002Fp>\n\n\u003Ch1>InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing\u003C\u002Fh1>\n\n\n[Shaoshu Yang*](https:\u002F\u002Fscholar.google.com\u002Fcitations?user=JrdZbTsAAAAJ&hl=en) · [Zhe Kong*](https:\u002F\u002Fscholar.google.com\u002Fcitations?user=4X3yLwsAAAAJ&hl=zh-CN) · [Feng Gao*](https:\u002F\u002Fscholar.google.com\u002Fcitations?user=lFkCeoYAAAAJ) · [Meng Cheng*]() · [Xiangyu Liu*]() · [Yong Zhang](https:\u002F\u002Fyzhang2016.github.io\u002F)\u003Csup>&#9993;\u003C\u002Fsup> · [Zhuoliang Kang](https:\u002F\u002Fscholar.google.com\u002Fcitations?user=W1ZXjMkAAAAJ&hl=en)\n\n[Wenhan Luo](https:\u002F\u002Fwhluo.github.io\u002F) · [Xunliang Cai](https:\u002F\u002Fopenreview.net\u002Fprofile?id=~Xunliang_Cai1) · [Ran He](https:\u002F\u002Fscholar.google.com\u002Fcitations?user=ayrg9AUAAAAJ&hl=en)· [Xiaoming Wei](https:\u002F\u002Fscholar.google.com\u002Fcitations?user=JXV5yrZxj5MC&hl=zh-CN) \n\n\u003Csup>*\u003C\u002Fsup>Equal Contribution\n\u003Csup>&#9993;\u003C\u002Fsup>Corresponding Authors\n\n\u003Ca href='https:\u002F\u002Fmeigen-ai.github.io\u002FInfiniteTalk\u002F'>\u003Cimg src='https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FProject-Page-green'>\u003C\u002Fa>\n\u003Ca href='https:\u002F\u002Farxiv.org\u002Fabs\u002F2508.14033'>\u003Cimg src='https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FTechnique-Report-red'>\u003C\u002Fa>\n\u003Ca href='https:\u002F\u002Fhuggingface.co\u002FMeiGen-AI\u002FInfiniteTalk'>\u003Cimg src='https:\u002F\u002Fimg.shields.io\u002Fbadge\u002F%F0%9F%A4%97%20Hugging%20Face-Model-blue'>\u003C\u002Fa>\n\u003C\u002Fdiv>\n\n> **TL; DR:**  InfiniteTalk is an unlimited-length talking video generation​​ model that supports both audio-driven video-to-video and image-to-video generation\n\n\u003Cp align=\"center\">\n  \u003Cimg src=\"assets\u002Fpipeline.png\">\n\u003C\u002Fp>\n\n\n\n\n\n\n\n## 🔥 Latest News\n* Dec 16, 2025: 🚀 We are excited to announce the release of **[LongCat-Video-Avatar](https:\u002F\u002Fgithub.com\u002FMeiGen-AI\u002FLongCat-Video-Avatar)**, a unified model that delivers expressive and highly dynamic audio-driven character animation, supporting native tasks including Audio-Text-to-Video, Audio-Text-Image-to-Video, and Video Continuation with seamless compatibility for both single-stream and multi-stream audio inputs. The release includes our Technical Report, [code](https:\u002F\u002Fgithub.com\u002Fmeituan-longcat\u002FLongCat-Video), [model weights](https:\u002F\u002Fhuggingface.co\u002Fmeituan-longcat\u002FLongCat-Video-Avatar), and [project page](https:\u002F\u002Fmeigen-ai.github.io\u002FLongCat-Video-Avatar\u002F).\n* August 19, 2025: We release the [Technique-Report](https:\u002F\u002Farxiv.org\u002Fabs\u002F2508.14033) , weights, and code of **InfiniteTalk**. The Gradio and the [ComfyUI](https:\u002F\u002Fgithub.com\u002FMeiGen-AI\u002FInfiniteTalk\u002Ftree\u002Fcomfyui) branch have been released. \n* August 19, 2025: We release the [project page](https:\u002F\u002Fmeigen-ai.github.io\u002FInfiniteTalk\u002F) of **InfiniteTalk** \n\n\n## ✨ Key Features\nWe propose **InfiniteTalk**​​, a novel sparse-frame video dubbing framework. Given an input video and audio track, InfiniteTalk synthesizes a new video with ​​accurate lip synchronization​​ while ​​simultaneously aligning head movements, body posture, and facial expressions​​ with the audio. Unlike traditional dubbing methods that focus solely on lips, InfiniteTalk enables ​​infinite-length video generation​​ with accurate lip synchronization and consistent identity preservation. Beside, InfiniteTalk can also be used as an image-audio-to-video model with an image and an audio as input. \n- 💬 ​​Sparse-frame Video Dubbing​​ – Synchronizes not only lips, but aslo head, body, and expressions\n- ⏱️ ​​Infinite-Length Generation​​ – Supports unlimited video duration\n- ✨ ​​Stability​​ – Reduces hand\u002Fbody distortions compared to MultiTalk\n- 🚀 ​​Lip Accuracy​​ – Achieves superior lip synchronization to MultiTalk\n\n\n\n## 🌐 Community  Works\n- [Wan2GP](https:\u002F\u002Fgithub.com\u002Fdeepbeepmeep\u002FWan2GP\u002F): Thanks [deepbeepmeep](https:\u002F\u002Fgithub.com\u002Fdeepbeepmeep) for integrating InfiniteTalk in Wan2GP that is optimized for low VRAM and offers many video edtiting option and other models (MMaudio support, Qwen Image Edit, ...). \n- [ComfyUI](https:\u002F\u002Fgithub.com\u002Fkijai\u002FComfyUI-WanVideoWrapper): Thanks for the comfyui support of [kijai](https:\u002F\u002Fgithub.com\u002Fkijai). \n\n\n\n## 📑 Todo List\n\n- [x] Release the technical report\n- [x] Inference\n- [x] Checkpoints\n- [x] Multi-GPU Inference\n- [ ] Inference acceleration\n  - [x] TeaCache\n  - [x] int8 quantization\n  - [ ] LCM distillation\n  - [ ] Sparse Attention\n- [x] Run with very low VRAM\n- [x] Gradio demo\n- [x] ComfyUI\n\n## Video Demos\n\n\n### Video-to-video (HQ videos can be found on [Google Drive](https:\u002F\u002Fdrive.google.com\u002Fdrive\u002Ffolders\u002F1BNrH6GJZ2Wt5gBuNLmfXZ6kpqb9xFPjU?usp=sharing) )\n\n\n\u003Ctable border=\"0\" style=\"width: 100%; text-align: left; margin-top: 20px;\">\n  \u003Ctr>\n      \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F04f15986-8de7-4bb4-8cde-7f7f38244f9f\" width=\"320\" controls loop>\u003C\u002Fvideo>\n      \u003C\u002Ftd>\n       \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F1500f72e-a096-42e5-8b44-f887fa8ae7cb\" width=\"320\" controls loop>\u003C\u002Fvideo>\n     \u003C\u002Ftd>\n     \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F28f484c2-87dc-4828-a9e7-cb963da92d14\" width=\"320\" controls loop>\u003C\u002Fvideo>\n     \u003C\u002Ftd>\n     \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F665fabe4-3e24-4008-a0a2-a66e2e57c38b\" width=\"320\" controls loop>\u003C\u002Fvideo>\n     \u003C\u002Ftd>\n  \u003C\u002Ftr>\n\u003C\u002Ftable>\n\n### Image-to-video\n\n\u003Ctable border=\"0\" style=\"width: 100%; text-align: left; margin-top: 20px;\">\n  \u003Ctr>\n      \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F7e4a4dad-9666-4896-8684-2acb36aead59\" width=\"320\" controls loop>\u003C\u002Fvideo>\n      \u003C\u002Ftd>\n      \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002Fbd6da665-f34d-4634-ae94-b4978f92ad3a\" width=\"320\" controls loop>\u003C\u002Fvideo>\n      \u003C\u002Ftd>\n       \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F510e2648-82db-4648-aaf3-6542303dbe22\" width=\"320\" controls loop>\u003C\u002Fvideo>\n     \u003C\u002Ftd>\n     \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F27bb087b-866a-4300-8a03-3bbb4ce3ddf9\" width=\"320\" controls loop>\u003C\u002Fvideo>\n     \u003C\u002Ftd>\n     \n  \u003C\u002Ftr>\n  \u003Ctr>\n      \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F3263c5e1-9f98-4b9b-8688-b3e497460a76\" width=\"320\" controls loop>\u003C\u002Fvideo>\n      \u003C\u002Ftd>\n      \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F5ff3607f-90ec-4eee-b964-9d5ee3028005\" width=\"320\" controls loop>\u003C\u002Fvideo>\n      \u003C\u002Ftd>\n       \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002Fe504417b-c8c7-4cf0-9afa-da0f3cbf3726\" width=\"320\" controls loop>\u003C\u002Fvideo>\n     \u003C\u002Ftd>\n     \u003Ctd>\n          \u003Cvideo src=\"https:\u002F\u002Fgithub.com\u002Fuser-attachments\u002Fassets\u002F56aac91e-c51f-4d44-b80d-7d115e94ead7\" width=\"320\" controls loop>\u003C\u002Fvideo>\n     \u003C\u002Ftd>\n     \n  \u003C\u002Ftr>\n\u003C\u002Ftable>\n\n## Quick Start\n\n### 🛠️Installation\n\n#### 1. Create a conda environment and install pytorch, xformers\n```\nconda create -n multitalk python=3.10\nconda activate multitalk\npip install torch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 --index-url https:\u002F\u002Fdownload.pytorch.org\u002Fwhl\u002Fcu121\npip install -U xformers==0.0.28 --index-url https:\u002F\u002Fdownload.pytorch.org\u002Fwhl\u002Fcu121\n```\n#### 2. Flash-attn installation:\n```\npip install misaki[en]\npip install ninja \npip install psutil \npip install packaging\npip install wheel\npip install flash_attn==2.7.4.post1\n```\n\n#### 3. Other dependencies\n```\npip install -r requirements.txt\nconda install -c conda-forge librosa\n```\n\n#### 4. FFmeg installation\n```\nconda install -c conda-forge ffmpeg\n```\nor\n```\nsudo yum install ffmpeg ffmpeg-devel\n```\n\n### 🧱Model Preparation\n\n#### 1. Model Download\n\n| Models        |                       Download Link                                           |    Notes                      |\n| --------------|-------------------------------------------------------------------------------|-------------------------------|\n| Wan2.1-I2V-14B-480P  |      🤗 [Huggingface](https:\u002F\u002Fhuggingface.co\u002FWan-AI\u002FWan2.1-I2V-14B-480P)       | Base model\n| chinese-wav2vec2-base |      🤗 [Huggingface](https:\u002F\u002Fhuggingface.co\u002FTencentGameMate\u002Fchinese-wav2vec2-base)          | Audio encoder\n| MeiGen-InfiniteTalk      |      🤗 [Huggingface](https:\u002F\u002Fhuggingface.co\u002FMeiGen-AI\u002FInfiniteTalk)              | Our audio condition weights\n\nDownload models using huggingface-cli:\n``` sh\nhuggingface-cli download Wan-AI\u002FWan2.1-I2V-14B-480P --local-dir .\u002Fweights\u002FWan2.1-I2V-14B-480P\nhuggingface-cli download TencentGameMate\u002Fchinese-wav2vec2-base --local-dir .\u002Fweights\u002Fchinese-wav2vec2-base\nhuggingface-cli download TencentGameMate\u002Fchinese-wav2vec2-base model.safetensors --revision refs\u002Fpr\u002F1 --local-dir .\u002Fweights\u002Fchinese-wav2vec2-base\nhuggingface-cli download MeiGen-AI\u002FInfiniteTalk --local-dir .\u002Fweights\u002FInfiniteTalk\n\n```\n\n### 🔑 Quick Inference\n\nOur model is compatible with both 480P and 720P resolutions. \n> Some tips\n> - Lip synchronization accuracy:​​ Audio CFG works optimally between 3–5. Increase the audio CFG value for better synchronization.\n> - FusionX： While it enables faster inference and higher quality, FusionX LoRA exacerbates color shift over 1 minute and reduces ID preservation in videos.\n> - V2V generation: Enables unlimited length generation. The model mimics the original video's camera movement, though not identically. Using SDEdit improves camera movement accuracy significantly but introduces color shift and is best suited for short clips. Improvements for long video camera control are planned.\n> - I2V generation: Generates good results from a single image for up to 1 minute. Beyond 1 minute, color shifts become more pronounced. One trick for the high-quailty generation beyond 1 min is to copy the image to a video by translating or zooming in the image.  Here is a script to [convert image to video](https:\u002F\u002Fgithub.com\u002FMeiGen-AI\u002FInfiniteTalk\u002Fblob\u002Fmain\u002Ftools\u002Fconvert_img_to_video.py).  \n> - Quantization model: If your inference process is killed due to insufficient memory, we suggest using the quantization model, which can help **reduce memory usage**.\n\n#### Usage of InfiniteTalk\n```\n--mode streaming: long video generation.\n--mode clip: generate short video with one chunk. \n--use_teacache: run with TeaCache.\n--size infinitetalk-480: generate 480P video.\n--size infinitetalk-720: generate 720P video.\n--use_apg: run with APG.\n--teacache_thresh: A coefficient used for TeaCache acceleration\n—-sample_text_guide_scale： When not using LoRA, the optimal value is 5. After applying LoRA, the recommended value is 1.\n—-sample_audio_guide_scale： When not using LoRA, the optimal value is 4. After applying LoRA, the recommended value is 2.\n—-sample_audio_guide_scale： When not using LoRA, the optimal value is 4. After applying LoRA, the recommended value is 2.\n--max_frame_num: The max frame length of the generated video, the default is 40 seconds(1000 frames).\n```\n\n#### 1. Inference\n\n##### 1) Run with single GPU\n\n\n```\npython generate_infinitetalk.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fsingle\u002Finfinitetalk.safetensors \\\n    --input_json examples\u002Fsingle_example_image.json \\\n    --size infinitetalk-480 \\\n    --sample_steps 40 \\\n    --mode streaming \\\n    --motion_frame 9 \\\n    --save_file infinitetalk_res\n\n```\n\n##### 2) Run with 720P\n\nIf you want run with 720P, set `--size infinitetalk-720`:\n\n```\npython generate_infinitetalk.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fsingle\u002Finfinitetalk.safetensors \\\n    --input_json examples\u002Fsingle_example_image.json \\\n    --size infinitetalk-720 \\\n    --sample_steps 40 \\\n    --mode streaming \\\n    --motion_frame 9 \\\n    --save_file infinitetalk_res_720p\n\n```\n\n##### 3) Run with very low VRAM\n\nIf you want run with very low VRAM, set `--num_persistent_param_in_dit 0`:\n\n\n```\npython generate_infinitetalk.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fsingle\u002Finfinitetalk.safetensors \\\n    --input_json examples\u002Fsingle_example_image.json \\\n    --size infinitetalk-480 \\\n    --sample_steps 40 \\\n    --num_persistent_param_in_dit 0 \\\n    --mode streaming \\\n    --motion_frame 9 \\\n    --save_file infinitetalk_res_lowvram\n```\n\n##### 4) Multi-GPU inference\n\n```\nGPU_NUM=8\ntorchrun --nproc_per_node=$GPU_NUM --standalone generate_infinitetalk.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fsingle\u002Finfinitetalk.safetensors \\\n    --dit_fsdp --t5_fsdp \\\n    --ulysses_size=$GPU_NUM \\\n    --input_json examples\u002Fsingle_example_image.json \\\n    --size infinitetalk-480 \\\n    --sample_steps 40 \\\n    --mode streaming \\\n    --motion_frame 9 \\\n    --save_file infinitetalk_res_multigpu\n```\n\n##### 5) Multi-Person animation\n\n```\npython generate_infinitetalk.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fmulti\u002Finfinitetalk.safetensors \\\n    --input_json examples\u002Fmulti_example_image.json \\\n    --size infinitetalk-480 \\\n    --sample_steps 40 \\\n    --num_persistent_param_in_dit 0 \\\n    --mode streaming \\\n    --motion_frame 9 \\\n    --save_file infinitetalk_res_multiperson\n```\n\n\n#### 2. Run with FusioniX or Lightx2v(Require only 4~8 steps)\n\n[FusioniX](https:\u002F\u002Fhuggingface.co\u002Fvrgamedevgirl84\u002FWan14BT2VFusioniX\u002Fblob\u002Fmain\u002FFusionX_LoRa\u002FWan2.1_I2V_14B_FusionX_LoRA.safetensors) require 8 steps and [lightx2v](https:\u002F\u002Fhuggingface.co\u002FKijai\u002FWanVideo_comfy\u002Fblob\u002Fmain\u002FWan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors) requires only 4 steps.\n\n```\npython generate_infinitetalk.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fsingle\u002Finfinitetalk.safetensors \\\n    --lora_dir weights\u002FWan2.1_I2V_14B_FusionX_LoRA.safetensors \\\n    --input_json examples\u002Fsingle_example_image.json \\\n    --lora_scale 1.0 \\\n    --size infinitetalk-480 \\\n    --sample_text_guide_scale 1.0 \\\n    --sample_audio_guide_scale 2.0 \\\n    --sample_steps 8 \\\n    --mode streaming \\\n    --motion_frame 9 \\\n    --sample_shift 2 \\\n    --num_persistent_param_in_dit 0 \\\n    --save_file infinitetalk_res_lora\n```\n\n\n\n#### 3. Run with the quantization model (Only support run with single gpu)\n\n```\npython generate_infinitetalk.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fsingle\u002Finfinitetalk.safetensors \\\n    --input_json examples\u002Fsingle_example_image.json \\\n    --size infinitetalk-480 \\\n    --sample_steps 40 \\\n    --mode streaming \\\n    --quant fp8 \\\n    --quant_dir weights\u002FInfiniteTalk\u002Fquant_models\u002Finfinitetalk_single_fp8.safetensors \\\n    --motion_frame 9 \\\n    --num_persistent_param_in_dit 0 \\\n    --save_file infinitetalk_res_quant\n```\n\n\n#### 4. Run with Gradio\n\n\n\n```\npython app.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fsingle\u002Finfinitetalk.safetensors \\\n    --num_persistent_param_in_dit 0 \\\n    --motion_frame 9 \n```\nor\n```\npython app.py \\\n    --ckpt_dir weights\u002FWan2.1-I2V-14B-480P \\\n    --wav2vec_dir 'weights\u002Fchinese-wav2vec2-base' \\\n    --infinitetalk_dir weights\u002FInfiniteTalk\u002Fmulti\u002Finfinitetalk.safetensors \\\n    --num_persistent_param_in_dit 0 \\\n    --motion_frame 9 \n```\n\n\n## 📚 Citation\n\nIf you find our work useful in your research, please consider citing:\n\n```\n@misc{yang2025infinitetalkaudiodrivenvideogeneration,\n      title={InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing}, \n      author={Shaoshu Yang and Zhe Kong and Feng Gao and Meng Cheng and Xiangyu Liu and Yong Zhang and Zhuoliang Kang and Wenhan Luo and Xunliang Cai and Ran He and Xiaoming Wei},\n      year={2025},\n      eprint={2508.14033},\n      archivePrefix={arXiv},\n      primaryClass={cs.CV},\n      url={https:\u002F\u002Farxiv.org\u002Fabs\u002F2508.14033}, \n}\n```\n\n## 📜 License\nThe models in this repository are licensed under the Apache 2.0 License. We claim no rights over the your generated contents, \ngranting you the freedom to use them while ensuring that your usage complies with the provisions of this license. \nYou are fully accountable for your use of the models, which must not involve sharing any content that violates applicable laws, \ncauses harm to individuals or groups, disseminates personal information intended for harm, spreads misinformation, or targets vulnerable populations. \n\n","InfiniteTalk 是一个支持无限长度对话视频生成的模型，能够实现从图像到视频以及从视频到视频的转换。其核心功能在于通过音频驱动的方式生成与输入音频同步的新视频，不仅实现了精确的唇形同步，还能同时调整头部动作、身体姿态和面部表情以匹配音频内容。技术上，InfiniteTalk 采用了稀疏帧视频配音框架，相较于传统仅关注唇部运动的方法，它能够在更长的时间范围内保持自然流畅的表现力。该项目适合于需要高质量音频驱动视频生成的应用场景，如虚拟人物动画制作、在线教育中的教师形象合成等。",2,"2026-06-11 03:40:04","high_star"]