GEM

General Gym - A comprehensive framework for reinforcement learning environments that provides a unified interface for various RL tasks.

Advanced GEM features, custom environments, and training.
GEM makes it simple to create custom environments. To create a new environment, simply add .reset() and .step() methods, and then register the environment. See the examples for more information.

gem.core.Env.reset()

Returns:

- obs (str) - Initial question/observation from the environment.
- info (dict) - Any extra info, e.g. for logging or to aid debugging.

gem.core.Env.step(action)

Returns:

- obs (str) - Next observation/output from the environment.
- reward (float) - Reward for the action.
- terminated (bool) - Whether the episode has ended.
- truncated (bool) - Following Gym environments but currently unused.
- info (dict) - Any extra info.

Your environment should extend the base class gem.core.Env and implement the .reset() and .step() logic:

```python
import random
import string

from gem.core import Env
from gem.envs.registration import register

# Note: TERMINAL_STATE and extract_last_boxed_answer are GEM helpers;
# import them from the corresponding gem modules.


class ReverseStringEnv(Env):
    def __init__(self, str_len: int = 5):
        super().__init__()
        self.str_len = str_len

    def _get_instructions(self) -> str:
        return (
            "You are tasked to reverse a given string.\n"
            "You may provide your response in any manner. Only the content wrapped inside \\boxed{} will be considered as your final answer.\n"
            f"Please reverse the string: {self.gt_str}.\n"
        )

    def reset(self, seed=None):
        super().reset(seed)
        characters = string.ascii_letters + string.digits  # A-Z, a-z, 0-9
        self.gt_str = "".join(random.choices(characters, k=self.str_len))
        return self._get_instructions(), {}

    def step(self, action):
        clean_action = extract_last_boxed_answer(action)
        if clean_action is None:
            reward = 0.0
        else:
            reward = float(clean_action[::-1] == self.gt_str)
        return TERMINAL_STATE, reward, True, True, {}


# Register your environment
register("custom:ReverseString", ReverseStringEnv)
```
GEM supports a diverse range of environments and makes it easy to add your own. GEM provides four main categories of environments, each designed for different types of agent training and evaluation.

All GEM environments follow a consistent interface pattern:

- env.reset() - Start a new episode and return the first observation
- env.step(action) - Apply an action and return the next observation, reward, and episode status
- env.sample_random_action() - Get a random valid action

This design closely follows the Gymnasium standard, making it easy to integrate with existing RL frameworks and tools.
Interactive game environments including Sudoku, Minesweeper, Wordle, and more from the TextArena collection.

We maintain local versions of many of the TextArena games with (i) improved dense game reward design and (ii) a compatible gym-style interface.
| Environment | Description |
|---|---|
| `game:GuessTheNumber` | The agent has multiple guesses to find the hidden number. After each guess, the agent is told whether the hidden number is higher or lower than its guess. |
| `game:Mastermind` | The agent has multiple guesses to find the hidden code. The agent receives black and white pegs depending on the number of correct digits in the right and wrong places. |
| `game:Minesweeper` | The agent must reveal all safe grid squares without revealing a mine. For each revealed square the agent receives the number of adjacent squares that contain mines. |
| `game:Wordle` | The agent must guess the hidden word. After each turn the agent receives feedback ("G" = correct letter + correct position, "Y" = correct letter + incorrect position, "X" = incorrect letter). |
| `game:FifteenPuzzle` | Arrange tiles on the board into ascending order by using the empty space to slide tiles into different positions. |
| `game:Hangman` | Guess the hidden word by providing single-letter guesses or the entire word. |
| `game:Sudoku` | Classic Sudoku game. The `easy` variant renders a 4x4 board. |
| `game:TowerofHanoi` | A classic single-player puzzle game where the objective is to move a stack of disks from one tower to another following specific rules. |
Each environment additionally has `-easy`, `-hard`, and `-random` variants, where `-random` sets the environment to a random difficulty level at each reset.

Adding new games is easy: simply implement the .reset() and .step() methods and register the environment under a new name.
Mathematical reasoning environments with automatic answer parsing and checking, compatible with various math datasets.
GEM’s math environment class includes automatic answer parsing and checking, and is designed to be compatible with any math dataset. To add a new environment, simply register the dataset. A typical use case is combining these environments with access to the Python tool, to train the agent to utilize code.
| Environment | Dataset |
|---|---|
| `math:ASDIV2k` | ASDIV-2k |
| `math:GSM8k` | GSM-8k |
| `math:Math12k` | MATH-12k |
| `math:ORZ57k` | ORZ-57k |
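The automatic answer checking used by the math environments can be sketched as follows. This is an illustrative stand-alone re-implementation, not GEM's actual parser (which handles many more answer formats); the helper name mirrors GEM's but the body here is an assumption.

```python
import re


def extract_last_boxed_answer(text):
    """Return the content of the last \\boxed{...} in the text, or None.
    Simplified sketch: does not handle nested braces."""
    matches = re.findall(r"\\boxed\{([^{}]*)\}", text)
    return matches[-1] if matches else None


def math_reward(response, ground_truth):
    """Reward 1.0 if the boxed answer matches the ground truth, else 0.0."""
    answer = extract_last_boxed_answer(response)
    if answer is None:
        return 0.0
    return float(answer.strip() == ground_truth.strip())


print(math_reward(r"The sum is \boxed{42}.", "42"))  # 1.0
```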
Code generation and evaluation environments that automatically test solutions in sandboxed environments.
GEM’s code environment class automatically evaluates success by running the test cases in a sandbox. It can be used with any code dataset consisting of tasks and test cases.
| Environment | Dataset |
|---|---|
| `code:CodeContest` | CodeContest |
| `code:Taco8k` | TACO-8k |
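Conceptually, the evaluation step runs the candidate program against each test case and scores the fraction that pass. Below is a minimal sketch using a plain subprocess; GEM's actual sandbox is assumed to provide stronger isolation and resource limits than this.

```python
import subprocess
import sys


def run_tests(solution_code, test_cases, timeout=5.0):
    """Run stdin/stdout test cases against a candidate Python program
    in a subprocess. Returns the fraction of test cases passed."""
    passed = 0
    for stdin_data, expected_stdout in test_cases:
        try:
            result = subprocess.run(
                [sys.executable, "-c", solution_code],
                input=stdin_data,
                capture_output=True,
                text=True,
                timeout=timeout,
            )
            if result.stdout.strip() == expected_stdout.strip():
                passed += 1
        except subprocess.TimeoutExpired:
            pass  # a hung solution counts as a failure
    return passed / len(test_cases)


# A doubling program checked against two test cases
score = run_tests("print(int(input()) * 2)", [("2", "4"), ("10", "20")])
print(score)  # 1.0
```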
QA environments designed for integrated search tool usage, to train agents in information retrieval and reasoning.

GEM’s question-answering environments are designed to allow integrated search tool usage, training the agent to use search functionality. Additional question-answering environments can be added by simply registering the dataset.
| Environment | Dataset |
|---|---|
| `qa:NaturalQuestions` | NaturalQuestions |
| `qa:HotpotQA` | HotpotQA |
| `logic:RuleTaker-d0` | RuleTaker-d0-70k |
| `logic:RuleTaker-d1` | RuleTaker-d1-70k |
| `logic:RuleTaker-d2` | RuleTaker-d2-70k |
| `logic:RuleTaker-d3` | RuleTaker-d3-70k |
| `logic:RuleTaker-d5` | RuleTaker-d5-70k |
We include all tasks from Reasoning Gym in our package; they can be used simply by calling `make(rg:[sub_task_name])`.
Following the Gym interface, GEM provides wrappers to easily add and change functionality. Wrappers are registered in the WRAPPER_FACTORY.
The main wrapper types are: observation wrappers, tool wrappers, and episode tracking wrappers.
Observation wrappers convert the sequence of game states and agent actions into a string, which is used as the prompt for the LLM agent at the next step.
| Wrapper name | Description | Example (Mastermind) |
|---|---|---|
| no wrapper | The observation string from the environment. | "At turn 2, you guessed 243. This guess receives 1 black peg(s) and 2 white peg(s)." |
| `concat` | The sequence of environment observation strings from all previous steps concatenated together. | "You are playing Mastermind. [instructions]... Enter your first guess to start the game.\nAt turn 1, you guessed 123. This guess receives 1 black peg(s) and 1 white peg(s).\nAt turn 2, you guessed 243. This guess receives 1 black peg(s) and 2 white peg(s)." |
| `concat_with_action` | The sequence of [environment observation string, agent action, environment observation string, etc.] from all previous steps concatenated together. | "You are playing Mastermind. [instructions]... Enter your first guess to start the game.\nOkay, I will guess a random 3 digit number for now. My first guess will be \\boxed{123}.\nAt turn 1, you guessed 123. This guess receives 1 black peg(s) and 1 white peg(s).\nOkay, let's think. One digit is in the correct place. Perhaps this is 3. One digit is completely incorrect. Let's try switching 1 for 4 and moving the 2. My next guess will be \\boxed{243}.\nAt turn 2, you guessed 243. This guess receives 1 black peg(s) and 2 white peg(s)." |
| `concat_chat` (default) | The sequence of [environment observation string, agent action, environment observation string, etc.] from all previous steps concatenated together with the chat template applied to denote "user" (environment) vs "assistant" (agent) turns. | "<\|im_start\|>user\nYou are playing Mastermind. [instructions]... Enter your first guess to start the game.<\|im_end\|>\n<\|im_start\|>assistant\nOkay, I will guess a random 3 digit number for now. My first guess will be \\boxed{123}<\|im_end\|> <\|im_start\|>user\nAt turn 1, you guessed 123. This guess receives 1 black peg(s) and 1 white peg(s).<\|im_end\|>\n<\|im_start\|>assistant\nOkay, let's think. One digit is in the correct place. Perhaps this is 3. One digit is completely incorrect. Let's try switching 1 for 4 and moving the 2. My next guess will be \\boxed{243}.<\|im_end\|>\n<\|im_start\|>user\nAt turn 2, you guessed 243. This guess receives 1 black peg(s) and 2 white peg(s).<\|im_end\|>\n<\|im_start\|>assistant" |
| `concat_chat_on_reset` | Same as `concat_with_action` but the chat template tag is applied at the start. | "<\|im_start\|>user\nYou are playing Mastermind. [instructions]... Enter your first guess to start the game.\nOkay, I will guess a random 3 digit number for now. My first guess will be \\boxed{123}.\nAt turn 1, you guessed 123. This guess receives 1 black peg(s) and 1 white peg(s).\nOkay, let's think. One digit is in the correct place. Perhaps this is 3. One digit is completely incorrect. Let's try switching 1 for 4 and moving the 2. My next guess will be \\boxed{243}.\nAt turn 2, you guessed 243. This guess receives 1 black peg(s) and 2 white peg(s)." |
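As an illustration, a `concat`-style observation wrapper can be sketched in a few lines. This is a toy version for clarity, not GEM's implementation (GEM's wrappers are registered in the WRAPPER_FACTORY and also handle chat templates); `EchoEnv` is a hypothetical stand-in environment.

```python
class EchoEnv:
    """Toy stand-in environment, for demonstration only."""

    def reset(self, seed=None):
        return "start", {}

    def step(self, action):
        return f"you said {action}", 0.0, False, False, {}


class ConcatObservationWrapper:
    """Sketch of a `concat`-style wrapper: the prompt at each step is the
    concatenation of all environment observations so far."""

    def __init__(self, env):
        self.env = env
        self._history = []

    def reset(self, seed=None):
        obs, info = self.env.reset(seed)
        self._history = [obs]
        return obs, info

    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        self._history.append(obs)
        # The LLM prompt for the next turn is all observations so far.
        return "\n".join(self._history), reward, terminated, truncated, info


env = ConcatObservationWrapper(EchoEnv())
obs, _ = env.reset()
obs, *_ = env.step("hi")
print(obs)  # "start" and "you said hi" on two lines
```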
GEM supports integrating multiple tools into the same agent. Tools are handled by the tool wrapper.

The input to env.step() is "action", a string which is typically the response from the LLM. With the tool wrapper, when env.step(action) is called, the wrapper iterates through each tool and attempts to parse and execute the action. If any tool executes successfully, the observation from that tool is returned. If no tool executes successfully, the action is passed to the wrapped environment.
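The dispatch logic just described can be sketched as follows. `UpperTool` and `DummyEnv` are hypothetical stand-ins for illustration; GEM's actual tool interface may differ.

```python
def tool_step(tools, env, action):
    """Sketch of the tool-wrapper dispatch: each tool tries to parse the
    action; the first tool that succeeds produces the next observation.
    Otherwise the action is forwarded to the wrapped environment."""
    for tool in tools:
        parsed = tool.parse(action)  # e.g. extract a query or code block
        if parsed is not None:
            obs = tool.execute(parsed)
            # Tool call succeeded: return its output, episode continues.
            return obs, 0.0, False, False, {"tool": tool.name}
    # No tool matched: treat the action as a normal environment step.
    return env.step(action)


class UpperTool:
    """Hypothetical tool that upper-cases text wrapped in <upper> tags."""

    name = "upper"

    def parse(self, action):
        if action.startswith("<upper>") and action.endswith("</upper>"):
            return action[len("<upper>"):-len("</upper>")]
        return None

    def execute(self, text):
        return text.upper()


class DummyEnv:
    def step(self, action):
        return f"env saw: {action}", 1.0, True, True, {}


obs, *_ = tool_step([UpperTool()], DummyEnv(), "<upper>hello</upper>")
print(obs)  # HELLO
```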
The tracking wrapper logs statistics over the episode, such as cumulative rewards. It is not required but can be useful for debugging.
GEM supports collecting multiple episodes in parallel, including asynchronously stepping each of the environments (which may include tool calls etc.). VectorEnv environments automatically reset: when an episode in one of the parallel environments ends, that environment resets and begins the next episode.

Use make_vec() instead of make() when creating environments:
```python
import gem

# Create vectorized environment with 8 parallel instances
vec_env = gem.make_vec("game:GuessTheNumber-v0", num_envs=8)

# Reset all environments
observations, infos = vec_env.reset()

# Step all environments
actions = [vec_env.sample_random_action() for _ in range(8)]
observations, rewards, terminated, truncated, infos = vec_env.step(actions)
```
Vectorization is particularly useful for high-throughput experience collection during agent training.
© 2025 Axon-RL. All rights reserved.
GEM is a diverse collection of environments for training LLM agents in the era of experience. The library includes math, code, general reasoning, and question-answering environments, as well as a suite of games (Mastermind, Minesweeper, Hangman, etc.). GEM also features fully integrated Python and search tool use.
`pip install gem-llm`

Here’s a simple example to get you started. The interface closely follows Gym and other popular RL environment suites.

Environments can be initialized with make() (or make_vec() for parallelization), and each environment has Env.reset(), Env.step() and Env.sample_random_action() functions.

```python
import gem

# Initialize the environment
env = gem.make("game:GuessTheNumber-v0")

# Reset the environment to generate the first observation
observation, info = env.reset()
for _ in range(30):
    action = env.sample_random_action()  # insert policy here

    # apply action and receive next observation, reward
    # and whether the episode has ended
    observation, reward, terminated, truncated, info = env.step(action)

    # If the episode has ended then reset to start a new episode
    if terminated or truncated:
        observation, info = env.reset()
```
GEM includes single-file examples for training an LLM agent through the oat or verl framework.

The OAT framework provides a comprehensive solution for training language model agents in reinforcement learning environments.

The VERL framework offers another approach to training agents, with different optimization strategies and capabilities.
GEM provides tools to enhance agent capabilities and enable more sophisticated problem-solving approaches. It currently supports Python and search tools.
Allows agents to write and execute Python code, enabling computational problem-solving and data manipulation capabilities.
GEM’s Python code tool allows the agent to learn to write code. The tool parses code blocks from the agent’s response, runs them, and returns the result.
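The parse-run-return loop can be sketched as below. This is an illustrative stand-in, not GEM's implementation; in particular, a real tool would execute the code in a sandbox rather than calling exec directly.

```python
import contextlib
import io
import re

FENCE = "`" * 3  # a literal triple backtick


def run_python_blocks(response):
    """Extract the last python code fence from the agent's response,
    execute it, and return the captured stdout (or None if no code)."""
    pattern = re.escape(FENCE + "python") + r"\n(.*?)" + re.escape(FENCE)
    blocks = re.findall(pattern, response, flags=re.DOTALL)
    if not blocks:
        return None  # no code found; the action goes to the environment instead
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        exec(blocks[-1], {})  # WARNING: no sandboxing in this sketch
    return buffer.getvalue()


response = "Let me compute it:\n" + FENCE + "python\nprint(2 + 3)\n" + FENCE
print(run_python_blocks(response))  # prints "5"
```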
GEM includes a search tool, enabling the agent to learn to call search engines for information retrieval and knowledge enhancement.

Agents can use the search tool by including search queries in their responses using the <search></search> tags. The tool parses the <search> content and returns the result of the search if a valid search call is found.
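Parsing the search call can be sketched as follows. This is illustrative only; GEM's actual tool also executes the query against a search backend and returns the results as the next observation.

```python
import re


def parse_search_query(action):
    """Extract the query from the last <search>...</search> call, if any."""
    matches = re.findall(r"<search>(.*?)</search>", action, flags=re.DOTALL)
    return matches[-1].strip() if matches else None


print(parse_search_query("I should check. <search>capital of France</search>"))
# prints "capital of France"
```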
Wiring general intelligence through reinforcement learning

General Gym - A comprehensive framework for reinforcement learning environments that provides a unified interface for various RL tasks.

We introduce GEM, our open-source efforts to build a General Experience Maker.

We're building a team of passionate researchers and developers dedicated to advancing reinforcement learning. More information about our team members will be available soon.
+© 2025 Axon-RL. All rights reserved.
+ +