i love getting my hands dirty by training deep neural networks and optimizing them. lately, i’ve been focusing on reinforcement learning. portfolio: click here
my repos revolve around machine learning (deep-rl, computer vision, natural language processing, etc.)
- Jhapa, Nepal
- https://ajheshbasnet.com.np
- @ajheshbasnet
Pinned
- Transformer-from-scratch. (Public)
  attention is all you need: pytorch implementation of the original transformer architecture for english-to-nepali neural machine translation (nmt), achieving a bleu score of around 27.
- Mixture-of-Experts-Logic (Public)
  applying Mixture of Experts (MoE) logic
  Jupyter Notebook 1
- openai-gpt-oss (Public)
  A PyTorch reimplementation of OpenAI’s GPT OSS model. Designed for research, experimentation, and learning, featuring MoE layers, mixed-precision training, and modular components for easy customiza…
  Jupyter Notebook 1