
Question about the temperature parameter in the Hugging Face Inference API #1086

Open
xufengduan opened this issue Dec 27, 2024 · 1 comment

@xufengduan

Hi everyone,

I have a question regarding the temperature parameter in the Hugging Face Inference API, particularly in the context of chat models. According to the documentation, the default value for temperature is 1. However, I noticed that some models seem to have a different default, such as 0.6, as specified in their generation_config.json file.

Here are my questions:

  1. When using the Inference API, if I don’t explicitly set the temperature parameter, does the API always use the model’s default value from its generation_config.json? Or does it fall back to the global default of 1 mentioned in the docs?
  2. If I don’t pass any additional parameters (like max_length, top_p, etc.), does the API automatically use all of the defaults specified in the model’s generation_config.json file, or are there other fallback defaults on the API side? (See the sketch after this list for how I’m reading a model’s own defaults.)
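
For reference, a model’s own declared defaults can be read straight from the Hub’s raw files. A minimal sketch of what I mean (the model id is only an example):

```ts
// Fetch a model's generation_config.json from the Hub to see
// which defaults the model itself declares.
// The model id is only an example; gated models additionally
// require an Authorization header.
const modelId = "meta-llama/Llama-3.1-8B-Instruct";

const res = await fetch(
  `https://huggingface.co/${modelId}/resolve/main/generation_config.json`
);
const generationConfig = await res.json();

// For some chat models this prints e.g. 0.6 rather than the documented 1.
console.log(generationConfig.temperature);
```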

Thank you in advance for your help!

@coyotte508 (Member)

Hi @xufengduan

This would be more of a question for the backend, e.g. https://github.com/huggingface/text-generation-inference. @huggingface/inference only passes the values along as-is, without adding anything hidden.
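
In the meantime, a way to avoid depending on whichever default wins is to pass the parameters explicitly. A minimal sketch with @huggingface/inference (the token and model id are placeholders):

```ts
import { HfInference } from "@huggingface/inference";

// Token and model id are placeholders.
const hf = new HfInference("hf_xxx");

const out = await hf.chatCompletion({
  model: "meta-llama/Llama-3.1-8B-Instruct",
  messages: [{ role: "user", content: "Hello!" }],
  // Passed through to the backend exactly as written,
  // overriding whatever default it would otherwise apply.
  temperature: 0.6,
  max_tokens: 100,
});

console.log(out.choices[0].message.content);
```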

cc @oOraph @SBrandeis in case either of you knows the answer
