Users are experiencing frequent timeouts and rate limit (429) errors when using GLM5 inference.
HTTPSConnectionPool(host='cloud-api.near.ai', port=443): Read timed out. (read timeout=60)
Rate limiting (429): 429 Client Error: Too Many Requests for url: https://cloud-api.near.ai/v1/chat/completions
Problem
Users are experiencing frequent timeouts and rate limit (429) errors when using GLM5 inference.
Error Details
Source
Proposed Resolution