WIP: Support per-agent rewards in multi-agent setups #1910

Closed
nph4rd wants to merge 13 commits into PrimeIntellect-ai:main from nph4rd:multiagent-heterogeneous-rewards

Conversation

nph4rd (Contributor) commented Feb 27, 2026

Adds support for per-agent rewards and advantages in multi-agent environments. This is a companion change to PrimeIntellect-ai/verifiers#965, which adds abstractions for multi-agent setups and heterogeneous reward functions.
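
For readers unfamiliar with the idea, here is a minimal sketch of what "per-agent rewards and advantages" could look like. It is not taken from this PR's diff: the function name `per_agent_advantages`, the agent names `proposer` and `critic`, and the choice of a group-mean baseline per agent are all assumptions made for illustration.

```python
from collections import defaultdict
from statistics import fmean


def per_agent_advantages(rollouts):
    """Compute a mean-baselined advantage separately for each agent.

    `rollouts` is a list of dicts mapping a (hypothetical) agent id to the
    scalar reward that agent received in that rollout.
    """
    # Collect every reward each agent received across the group of rollouts.
    by_agent = defaultdict(list)
    for rewards in rollouts:
        for agent, reward in rewards.items():
            by_agent[agent].append(reward)

    # Each agent gets its own baseline: the mean of its own rewards.
    baselines = {agent: fmean(vals) for agent, vals in by_agent.items()}

    # Advantage = reward minus that agent's baseline, per rollout.
    return [
        {agent: reward - baselines[agent] for agent, reward in rewards.items()}
        for rewards in rollouts
    ]


# Illustrative data only: four rollouts of a two-agent environment.
group = [
    {"proposer": 1.0, "critic": 0.0},
    {"proposer": 0.0, "critic": 1.0},
    {"proposer": 1.0, "critic": 1.0},
    {"proposer": 0.0, "critic": 0.0},
]
advantages = per_agent_advantages(group)
```

Keeping one baseline per agent means an agent whose reward function has a different scale or mean does not skew the advantages of the others, which is the usual motivation for separating them when reward functions are heterogeneous.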

nph4rd force-pushed the multiagent-heterogeneous-rewards branch 2 times, most recently from 1c71dea to 7e3fa23 on March 7, 2026 06:02
nph4rd force-pushed the multiagent-heterogeneous-rewards branch from 1af84fb to f875ab3 on March 22, 2026 00:03
nph4rd force-pushed the multiagent-heterogeneous-rewards branch from f875ab3 to 9c69f89 on March 22, 2026 00:07
faresobeid closed this Mar 31, 2026
2 participants