Higgsfield Video Models

Higgsfield Video Generation API Guide

Overview

Higgsfield provides several video-generation workflows, including basic image-to-video, talking-avatar video, and ad-video generation. This page explains when to use each endpoint.

Endpoint Scenarios

1. Basic Image-to-Video - /higgsfield/generate

Applicable scenarios

turn a static image into a dynamic video
create short-form video content
produce animation effects
make social-media clips

When to use

you already have a static image and want to add motion
you need looping animation
you want a product showcase clip
you are creating artistic motion work

Not suitable for

talking-head videos
commercial ad videos with specialized ad effects

2. Talking Video - /higgsfield/speak

Applicable scenarios

digital-human talking videos
training and education content
customer-service videos
personal vlog-style material

When to use

a person needs to speak on screen
you are making an instructional video
you need an automated customer-service reply video
you are producing virtual-host content
you already have audio and need lip-sync

Audio source options

upload an audio file: when you already have a recorded wav/mp3 file
text-to-speech: when you need to generate speech from text and choose a voice or sound style

3. Ad Video - /higgsfield/ads

Applicable scenarios

product ads
e-commerce promotion videos
brand promotion
marketing campaigns

When to use

the product needs a polished advertising effect
you are promoting goods on an e-commerce platform
you are preparing a brand campaign
you need specialized product-motion effects

Notes

the cost is higher than the base workflow
templates need to be obtained from the official site
the endpoint is optimized specifically for commercial ad scenarios

Standard Workflows

Plan A: Basic Image-to-Video

Prepare the image
Get the motion-template list
Choose a suitable motion_id
Call the generation endpoint
Query task status
Get the video URL after completion

Plan B: Talking Video

Prepare the character image
Get avatar presets
Decide the audio source
Upload an audio URL or configure text-to-speech
Optionally choose voice and sound presets
Configure the speak parameters
Call the speaking endpoint
Query task status
Get the video URL

Plan C: Ad Video

Prepare the product image
Visit the official site to get ad templates
Choose product_placement_sample_id
Call the ads endpoint
Query task status
Get the video URL

Cost Guidance

Capability	Relative cost	Recommended use
Basic video (lite)	1x	testing and simple animation
Basic video (standard)	2x	production content and higher quality
Basic video (turbo)	1.4x	fast turnaround
Talking video	10x	important speaking-avatar videos
Ad video	2.6x	commercial promotion

Template Strategy

1. Motion Templates

text

GET {{BASE_URL}}/higgsfield/tpl/motions?size=30&search=keyword

browse available templates first
search based on the motion effect you want
record the target motion_id

2. Avatar Templates

text

GET {{BASE_URL}}/higgsfield/tpl/avatar-presets?size=30

choose a suitable on-screen persona
consider the preference of the target audience
record the avatar_preset_id

3. Voice and Sound Templates

text

GET {{BASE_URL}}/higgsfield/tpl/voices
GET {{BASE_URL}}/higgsfield/tpl/sounds

choose a suitable voice style
add sound effects where needed

Quick Decision Guide

Goal	Recommended endpoint	Key parameter
make an image move	/generate	motion_id
make a person speak	/speak	avatar_preset_id + audio / speak config
make a product ad	/ads	product_placement_sample_id
test quickly	/generate with lite	model: "lite"
highest output quality	/generate with standard	model: "standard"
urgent project	/generate with turbo	model: "turbo"

Troubleshooting

How do I choose the right model?

lite: lowest cost, good for testing and previews
standard: best for formal published output
turbo: good when time is tight but quality still matters

When should I upload audio versus use text-to-speech for talking videos?

upload audio if you already have a recorded file
use text-to-speech if you want fast generation from text
choose a specific voice_id if you need a certain speaking style

Google-Veo

阿里Wan(万相视频

Grok 视频

Seedance(即梦视频

简单版

官方接口格式

任务查询

GoAmzAI格式(兼容版，开发接入请勿对接

官方格式

简单版(goamz/rocket

General版

统一格式

换脸任务提交

任务提交

任务查询(免费

即梦4

OpenAI Chat 格式

OpenAI Dalle 格式

Replicate 官方格式

Bfl 官方格式

Higgsfield Video Models

Higgsfield Video Generation API Guide

Overview

Endpoint Scenarios

1. Basic Image-to-Video - /higgsfield/generate

2. Talking Video - /higgsfield/speak

3. Ad Video - /higgsfield/ads

Standard Workflows

Plan A: Basic Image-to-Video

Plan B: Talking Video

Plan C: Ad Video

Cost Guidance

Template Strategy

1. Motion Templates

2. Avatar Templates

3. Voice and Sound Templates

Quick Decision Guide

Troubleshooting

任务查询

Higgsfield Video Models ​

Higgsfield Video Generation API Guide ​

Overview ​

Endpoint Scenarios ​

1. Basic Image-to-Video - /higgsfield/generate ​

2. Talking Video - /higgsfield/speak ​

3. Ad Video - /higgsfield/ads ​

Standard Workflows ​

Plan A: Basic Image-to-Video ​

Plan B: Talking Video ​

Plan C: Ad Video ​

Cost Guidance ​

Template Strategy ​

1. Motion Templates ​

2. Avatar Templates ​

3. Voice and Sound Templates ​

Quick Decision Guide ​

Troubleshooting ​

Higgsfield Video Models

Higgsfield Video Generation API Guide

Overview

Endpoint Scenarios

1. Basic Image-to-Video - /higgsfield/generate

2. Talking Video - /higgsfield/speak

3. Ad Video - /higgsfield/ads

Standard Workflows

Plan A: Basic Image-to-Video

Plan B: Talking Video

Plan C: Ad Video

Cost Guidance

Template Strategy

1. Motion Templates

2. Avatar Templates

3. Voice and Sound Templates

Quick Decision Guide

Troubleshooting