You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Send a request to either /api/generate or /api/chat endpoint.
Observe the server response.
Expected Behavior:
The server should return a successful response with HTTP status code 200.
Actual Behavior:
The server returns an error response with HTTP status code 500.
Example Request:
curl -X POST http://localhost:11434/api/generate \
-H "Content-Type: application/json" \
-d '{"model":"deepseek-r1_Q8_0:14","prompt":"Hi. How are You?"}'
Name and Version
Environment:
qwen2.5-coder:7b
,MFDoom/deepseek-r1-tool-calling:32b
,deepseek-r1:32b
,qwen2.5-coder:32b
]Operating systems
Linux
GGML backends
CUDA
Hardware
NVIDIA-SMI 570.124.04
,Driver Version: 570.124.04
,CUDA Version: 12.8
Models
DeepSeek-R1-Distill-Qwen-14B-Q8_0.gguf
Problem description & steps to reproduce
Steps to Reproduce:
ollama serve
./api/generate
or/api/chat
endpoint.Expected Behavior:
The server should return a successful response with HTTP status code 200.
Actual Behavior:
The server returns an error response with HTTP status code 500.
Example Request:
First Bad Commit
No response
Relevant log output
The text was updated successfully, but these errors were encountered: