The newly released Qwen 3.5 model is a thinking model.
Disabling thinking globally in Ollama isn't working, but it does work when "think": false is sent in the generate API request.
Because thinking stays enabled, the response fails JSON decoding and the fallback is used.
It would be good to have an additional configurable parameter that is then passed through in the generate API request.
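
To illustrate the request, here is a minimal sketch of how such a pass-through could look on the client side. The helper names (`build_generate_payload`, `generate`), the `extra_options` setting, and the `qwen3.5` model tag in the usage note are hypothetical and are not taken from this project's code. The only parts assumed from Ollama itself are the `/api/generate` endpoint on the default local port and the top-level `think` field.

```python
import json
import urllib.request

# Ollama's default local endpoint; adjust if the server runs elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_payload(model, prompt, extra_options=None):
    """Build the JSON body for /api/generate.

    extra_options is the hypothetical user-configurable setting requested
    above: any keys in it are merged into the top level of the request.
    """
    payload = {"model": model, "prompt": prompt, "stream": False}
    payload.update(extra_options or {})
    return payload


def generate(model, prompt, extra_options=None, url=OLLAMA_URL):
    """Send the request and return the generated text (requires a running server)."""
    body = json.dumps(build_generate_payload(model, prompt, extra_options)).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With something like this, calling `generate("qwen3.5", prompt, {"think": False})` would send `"think": false` with every request, which is the workaround reported to work above.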