The maximum size of a chunk in words.
This value must be at most 300 and at least 20 (for the sentence strategy) or 10 (for the word strategy).
Default value is 250.0.
overlap
number
The number of overlapping words for chunks.
It applies only to the word chunking strategy.
This value cannot be higher than half the max_chunk_size value.
Default value is 100.0.
sentence_overlap
number
The number of overlapping sentences for chunks.
It applies only to the sentence chunking strategy.
It can be either 1 or 0.
Default value is 1.0.
strategy
string
The chunking strategy: sentence or word.
Default value is sentence.
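The constraints on the chunking settings above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: the function name `validate_chunking_settings` is hypothetical, the lower bound of 0 on `overlap` is an assumption, and settings that do not apply to the chosen strategy are simply ignored here, which may differ from what the server does.

```python
def validate_chunking_settings(settings: dict) -> dict:
    """Apply the documented defaults and check the documented limits.

    Hypothetical helper: raises ValueError on the first violated rule.
    """
    strategy = settings.get("strategy", "sentence")
    if strategy not in ("sentence", "word"):
        raise ValueError("strategy must be 'sentence' or 'word'")

    max_chunk_size = settings.get("max_chunk_size", 250)
    # Lower bound depends on the strategy: 20 for sentence, 10 for word.
    min_size = 20 if strategy == "sentence" else 10
    if not min_size <= max_chunk_size <= 300:
        raise ValueError(
            f"max_chunk_size must be between {min_size} and 300 for the {strategy} strategy"
        )

    result = {"strategy": strategy, "max_chunk_size": max_chunk_size}

    if strategy == "word":
        overlap = settings.get("overlap", 100)
        # Assumption: overlap is non-negative; the upper bound is documented.
        if overlap < 0 or overlap > max_chunk_size / 2:
            raise ValueError("overlap cannot be higher than half of max_chunk_size")
        result["overlap"] = overlap
    else:
        sentence_overlap = settings.get("sentence_overlap", 1)
        if sentence_overlap not in (0, 1):
            raise ValueError("sentence_overlap must be 0 or 1")
        result["sentence_overlap"] = sentence_overlap

    return result


print(validate_chunking_settings({"strategy": "word", "max_chunk_size": 120, "overlap": 40}))
```

Note that with the word strategy, the default `overlap` of 100 is only valid when `max_chunk_size` is at least 200, so a smaller chunk size needs an explicit `overlap`.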
service
string (required)
The service type.
service_settings
object (required)
Settings specific to the service.
task_settings
object
Task settings specific to the service and task type.
inference_id
string (required)
The inference ID.
task_type
string (required)
The task type.
Values are sparse_embedding, text_embedding, rerank, completion, or chat_completion.
GET /_inference
curl \
--request GET 'http://api.example.com/_inference'
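Once the endpoint objects have been retrieved, the `inference_id` and `task_type` fields described above are enough to build a quick overview. The sketch below assumes the endpoint objects are already available as a list of dictionaries; the function name `group_by_task_type` and all sample values are hypothetical, and the response envelope around the list is not covered here.

```python
from collections import defaultdict

# Task types listed in the field description above.
TASK_TYPES = {
    "sparse_embedding",
    "text_embedding",
    "rerank",
    "completion",
    "chat_completion",
}


def group_by_task_type(endpoints: list[dict]) -> dict[str, list[str]]:
    """Map each task type to the inference IDs that use it."""
    grouped = defaultdict(list)
    for endpoint in endpoints:
        task_type = endpoint["task_type"]
        if task_type not in TASK_TYPES:
            raise ValueError(f"unexpected task_type: {task_type!r}")
        grouped[task_type].append(endpoint["inference_id"])
    return dict(grouped)


# Hypothetical sample data, shaped after the documented fields.
sample_endpoints = [
    {"inference_id": "example-sparse", "task_type": "sparse_embedding",
     "service": "example-service", "service_settings": {}},
    {"inference_id": "example-rerank", "task_type": "rerank",
     "service": "example-service", "service_settings": {}},
    {"inference_id": "example-sparse-2", "task_type": "sparse_embedding",
     "service": "example-service", "service_settings": {}},
]

print(group_by_task_type(sample_endpoints))
```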