Hi, I'm @Rexopia
Projects:
- HawkLM-demo: A 316M-parameter LLM pre-trained on RedPajama-1T.
- HawkLM-Chat-demo: An instruction-tuned version of HawkLM-demo, fine-tuned on OpenOrca.
- CrewForge: A virtual team in your terminal: multiple AI agents discuss and converge, so you don't have to iterate alone.
Research Interests:
- Reinforcement learning for sparse-reward reasoning: boosting reward signal density via latent self-correction trajectories
- Information geometry of synthetic tabular data: Riemannian manifold representations via Fisher Information pullback
Contact Me:
- Email: ruiji.zhang@outlook.com



