i found a model called hunyuan video foley. you give it a quiet video, it makes the sound that fits: wind that pushes, fire that crackles, rocks that rumble, even a light music bed when the scene needs it.
I tried a few shots and it surprised me. i dropped in a beach clip with no audio, hit run
how I set it up in comfyui, then a few quick runs so you can hear what it does.
there isn’t an official comfyui node yet. i made a small custom one so we can test. repo: https://github.com/aistudynow/Comfyui-HunyuanFoley
the weights live on the model page: https://huggingface.co/tencent/HunyuanVideo-Foley/tree/main
More Read
Setup I Used
open ComfyUI/custom_nodes/
in a terminal, run
git clone
https://github.com/aistudynow/Comfyui-HunyuanFoley.git
wait for it to finish. go into the new Custom node/HunyuanVideo-Foley
folder. Open the Folder in cmd
pip install -r requirements.txt
Also
Open a terminal in G:\ComfyUI_windows_portable\
:
.\python_embeded\python.exe -m pip install -r ComfyUI\custom_nodes\ComfyUI_HunyuanFoley\requirements.txt
inside the node, make a folder models/hunyuanvideo_foley/
if you don’t see it. put these four files there and keep names exact: config.yaml
, hunyuanvideo_foley.pth
, synchformer_state_dict.pth
, vae_128d_48k.pth
.
back in comfyui, set the paths in the node:
main model → hunyuanvideo_foley.pth
vae → vae_128d_48k.pth
syncformer → synchformer_state_dict.pth
config → config.yaml
if a path is wrong you’ll get a red warning. point it to the right file and you’re fine.
The Small Graph
load the video with Load Video. a short hint is enough: “ocean sound”, “burning tree”, “windy street”. match frame rate in two spots: set fps in the hunyuanvideo-foley sampler, and set the Combine Video node to the same number. so 24/24, or 30/30, etc. when it looks right, press Generate.
if your graph also rebuilds the picture and you see color flicker, leave any color-match step off on that pass. foley is only making audio; this just avoids visual shimmer if you re-encode frames.
What I Ran
ocean
quiet beach shot. hint “ocean sound”. both fps at 24. it breathes with the water instead of looping flat.
fire
a tree on fire. i typed “burning tree” and ran it. close crackle, a few pops, and it tracks the flame.
beach + thunder + music
person on a small island, dark sky. i asked for “distant thunder rumbles and crackles” and added “dramatic, intense background music”. you get low thunder in the back and a simple score that fits. both at once.
car on wet road
old clip. tiny hint only. when the tires hit water, the splash lands on that frame. i didn’t line it up by hand.
windy street
strong wind, light footsteps. i barely wrote anything. it still adds wind, a soft step layer, and a little bed so it feels finished.