Powerful AI that takes care of your daily tasks. Stop manually processing your text, document, and image data. Let AI work its magic, without a single line of code.
InfiniteTalk is a talking-video system. You feed it a images or an existing video plus an audio track, and it makes a lip-synced clip. There’s a ComfyUI workflow with ready nodes/workflows so you can run it inside ComfyUI. It’s built around the Wan 2.1 i2v pipeline and uses an audio encoder (Wav2Vec2) to drive mouth/face motion.
What I’m doing
I run InfiniteTalk inside ComfyUI to get three results:
Studied Computer Science. Passionate about AI, ComfyUI workflows, and hands-on learning through trial and error. Creator of AIStudyNow — sharing tested workflows, tutorials, and real-world experiments.
Can you post what setup you have for ComfyUI, Python version, pytorch version etc? I use Windows. I have not been able to get this to work with Python 3.12. It starts to run and then gets an error about missing Triton. When I try to install Triton, it always fails saying “no compatible version found”
Example 3 — two people talking (multi) please ma’am help me how to fix “I’m use infinitetalk_multi WanVideoSampler ‘NoneType’ object has no attribute ‘max’ “How to fix””
First of all, I would like to sincerely thank you for sharing your workflow and valuable guidance. I am currently using your workflow (Unlimited Talk – Single AI, studynow.com) and have also downloaded all the recommended models to ensure proper setup.
Here is my current process:
I uploaded my character image.
I added an audio file (9 seconds in length).
I did not change or modify any settings.
I simply pressed the RUN button to generate the output.
The process completed successfully, however, the output video shows correct and clean results only for the first two seconds. After that, the video becomes heavily distorted with high noise. The audio, on the other hand, plays perfectly throughout.
I have attached a screenshot for reference: (https://i.postimg.cc/VNRPNhYs/Screenshot-2025-08-28-170224.png)
.
My system specifications:
Operating System: Windows 11 Pro
GPU: NVIDIA RTX 5070
RAM: 64 GB DDR5
ComfyUI Installation: Manually installed Step by Step (not using the portable version)
Could you please guide me on how I can resolve this issue? I would greatly appreciate any troubleshooting steps or adjustments you might recommend.
hi, i get error in comfy ui about “ResolutionMaster” node. i haven’t it. can you please give the link of download this node? i tried to download from comfy ui but after i restart comfy ui, still the problem is.
Hey, many thanks for sharing the workflows and detailed instructions! (and thanks for the other resources you’re posting on youtube and on this website, very helpful learning how it works)
I was able to run the workflow, however the video looks slo-mo and the audio isn’t synced (although lipsync was generated). Tried to use 48khz and 41khz. Any ideas why this is happening ?
Thanks :)
How do I bypass sageattention? Thanks in advance.
Can you post what setup you have for ComfyUI, Python version, pytorch version etc? I use Windows. I have not been able to get this to work with Python 3.12. It starts to run and then gets an error about missing Triton. When I try to install Triton, it always fails saying “no compatible version found”
download the latest comfyui Portable Version https://github.com/comfyanonymous/ComfyUI/releases/latest/download/ComfyUI_windows_portable_nvidia.7z
Navigate to the python_embeded folder
Open that folder in cmd
./python.exe -m pip install triton
please ma’am help me how to fix “I’m use infinitetalk_multi WanVideoSampler ‘NoneType’ object has no attribute ‘max’ “How to fix””
Example 3 — two people talking (multi) please ma’am help me how to fix “I’m use infinitetalk_multi WanVideoSampler ‘NoneType’ object has no attribute ‘max’ “How to fix””
can you send me image my email 23scienceinsights@gmail.com id so i can check
Please check Emal ma’am I’m sent
Hello Dear Esha,
First of all, I would like to sincerely thank you for sharing your workflow and valuable guidance. I am currently using your workflow (Unlimited Talk – Single AI, studynow.com) and have also downloaded all the recommended models to ensure proper setup.
Here is my current process:
I uploaded my character image.
I added an audio file (9 seconds in length).
I did not change or modify any settings.
I simply pressed the RUN button to generate the output.
The process completed successfully, however, the output video shows correct and clean results only for the first two seconds. After that, the video becomes heavily distorted with high noise. The audio, on the other hand, plays perfectly throughout.
I have attached a screenshot for reference: (https://i.postimg.cc/VNRPNhYs/Screenshot-2025-08-28-170224.png)
.
My system specifications:
Operating System: Windows 11 Pro
GPU: NVIDIA RTX 5070
RAM: 64 GB DDR5
ComfyUI Installation: Manually installed Step by Step (not using the portable version)
Could you please guide me on how I can resolve this issue? I would greatly appreciate any troubleshooting steps or adjustments you might recommend.
Thank you very much for your time and support.
Best regards,
Ch Nisar
hi, i get error in comfy ui about “ResolutionMaster” node. i haven’t it. can you please give the link of download this node? i tried to download from comfy ui but after i restart comfy ui, still the problem is.
Curious if you tried without the lightx2v lora? Was the video quality better or same or worse?
Curious if you tried generating without the LightX lora? What was the quality like without the lora?
Same, i had checked it
Got it. Also did you get a chance to compare between LightX2v and FusionX? Any idea which one turned out better?
Hi! Does this model only fit square 5 characters well? I tried 16*9 and they don’t come out very well, unlike the square or vertical format.
Hey, many thanks for sharing the workflows and detailed instructions! (and thanks for the other resources you’re posting on youtube and on this website, very helpful learning how it works)
I was able to run the workflow, however the video looks slo-mo and the audio isn’t synced (although lipsync was generated). Tried to use 48khz and 41khz. Any ideas why this is happening ?
Thanks :)