Problem Statement
Transient network errors or local LLM hiccups (e.g., temporary disconnection from Ollama) can kill a long-running session. Currently, if an inference call fails once, the entire session crashes or hangs, leading to a poor user experience.
Proposed Solution
- Add a `retry_count: int = Field(default=3)` field to `SessionConfig` in `config.py`.
- Implement a retry loop in `Agent.generate_response()` (or within the `litellm` call wrapper).
- If an inference call fails, the system should catch the error and retry the completion up to the specified limit before giving up.
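The retry loop could look something like the sketch below. This is illustrative only: `retry_completion` and its parameters are hypothetical names, the wrapped callable stands in for the actual `litellm` completion call, and in the real implementation the caught exceptions should be narrowed to transient network/inference errors rather than bare `Exception`.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")

def retry_completion(call: Callable[[], T],
                     retry_count: int = 3,
                     base_delay: float = 0.0) -> T:
    """Invoke `call` (e.g. a litellm completion), retrying on failure.

    Makes one initial attempt plus up to `retry_count` retries, with
    optional exponential backoff between attempts.
    """
    last_error: Exception | None = None
    for attempt in range(retry_count + 1):  # initial try + retries
        try:
            return call()
        except Exception as exc:  # narrow to transient errors in practice
            last_error = exc
            if attempt < retry_count and base_delay:
                time.sleep(base_delay * (2 ** attempt))
    assert last_error is not None
    raise last_error  # exhausted all retries; surface the last failure
```

With `retry_count=3`, a call that fails twice and succeeds on the third attempt completes normally; only after four consecutive failures does the error propagate to the session.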
Alternatives Considered
Manual retries by the user, which is disruptive to the dynamic conversation flow.
Priority
Medium 🟡
Additional Context
This works in tandem with the 'timeout' feature to provide a much more stable and robust experience for users with varying hardware or network quality.