Thing I am thinking about: DeepSeek-R1 shows how distillation can be used to turn sparse models into dense ones.
Researching LLMs @ SigmoidLabs. I fine-tune and play with large language models as a hobby.
- Sigmoid Labs
- Earth
Popular repositories
bedrock-wiki (Public, forked from Bedrock-OSS/bedrock-wiki)
This wiki is a knowledge-sharing website for Minecraft Bedrock Add-Ons, containing documentation, tutorials, and general how-to information.
MinecraftConsoles (Public, forked from smartcmd/MinecraftConsoles)
A certain block game
C++