SarathL754

Pinned

  1. vigil3d-video-inference (Public)

    End-to-end video violence detection system using a 3D CNN, deployed with FastAPI, Docker, AWS EC2, S3, and a React frontend on Vercel.

    Python

  2. Reducing-Hallucinations-with-Direct-Preference-Optimization (Public)

    An RLHF-inspired DPO framework that explicitly teaches LLMs when to refuse, significantly reducing hallucinations.

  3. Decision-Transformer-from-Scratch-HalfCheetah-Minari-BC-vs-Return-Conditioned-DT (Public)

    Implementing Decision Transformers from scratch for offline RL, benchmarking return-conditioned policies against Behavior Cloning.

    Python

  4. VulneraAI-agent (Public)

    An agentic LLM security scanner that analyzes applications against OWASP Top 10 using tool-calling, LangGraph, and AWS Bedrock.

    Python

  5. Email-Assistant-langgraph (Public)

    Python

  6. Multi-agent-RL-texas-holdem-aec (Public)

    An engineering-focused multi-agent reinforcement learning system for Texas Hold’em using PettingZoo AEC and a custom PyTorch PPO self-play setup.

    Python