Item 44007313

prettyblocks • 14 hours ago

I'm very interested in working with video inputs, is it possible to do that with Qwen2.5-Omni and Ollama?

tough • 6 hours ago

https://huggingface.co/blog/smolvlm

oezi • 12 hours ago

I have only tested Qwen2.5-Omni for audio and it was hit and miss for my use case of tagging audio.

machinelearning • 10 hours ago

What's a use case are you interested in re: video?

1 reply

prettyblocks • 3 hours ago

I'm curious how effective these models would be at recognizing if the input video was ai generated or heavily manipulated. Also various things around face/object segmentation.