
Commit f7a01f6

apartsin and claude committed
Expand biomedical content in 28.3, add missing-output audit, fix stacked captions across 28 files
- Deepen protein (ESM-2/3, AlphaFold3), genomics (DNABERT-2, Evo-2), and drug discovery (ChemBERTa-2, MolGPT) coverage in section 28.3
- Add new Code Fragment 28.3.4 (ESM-2 protein embedding) and expand 28.3.3 (multi-molecule SMILES)
- Create p2_missing_output.py audit script (detects code blocks missing output panes)
- Add section 34.10 entry to table of contents
- Remove stacked/duplicate captions in 28 section files across Parts 2-8 and appendices

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 13c4b7d commit f7a01f6

31 files changed

Lines changed: 360 additions & 102 deletions
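The commit message references a new p2_missing_output.py audit script that flags code blocks lacking an output pane; the script itself is not among the hunks shown below. As a rough illustration only, such an audit could pair each closing `</pre>` with the markup that follows it (the function name and scan heuristic here are hypothetical, not the actual script):

```python
import re

def find_missing_outputs(html: str) -> list[int]:
    """Return offsets of code blocks (</pre>) that are not followed by a
    <div class="code-output"> pane before the next code block begins."""
    missing = []
    for m in re.finditer(r"</pre>", html):
        tail = html[m.end():]
        nxt = tail.find("<pre")
        if nxt != -1:
            tail = tail[:nxt]  # only look until the next code block starts
        if 'class="code-output"' not in tail:
            missing.append(m.start())
    return missing

sample = (
    '<pre><code>print("hi")</code></pre>\n'
    '<div class="code-caption">Fragment 1</div>\n'
    '<pre><code>x = 1</code></pre>\n'
    '<div class="code-output">hi</div>\n'
)
print(find_missing_outputs(sample))  # one offset: the first block lacks an output pane
```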

File tree

appendices/appendix-c-python-for-llm/section-c.1.html

Lines changed: 0 additions & 2 deletions
@@ -95,8 +95,6 @@ <h3>NumPy and Pandas</h3>
 dataset = df[["instruction", "response"]].to_dict(orient="records")
 print(f"Training examples: {len(dataset)}")</code></pre>
 <div class="code-caption"><strong>Code Fragment C.1.2:</strong> This snippet demonstrates this approach using PyTorch. Study the implementation to understand how each component contributes to the overall workflow.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment C.1.3:</strong> This snippet demonstrates this approach. Study the implementation to understand how each component contributes to the overall workflow.</div>
 <h3>Additional Libraries</h3>
 
 <div class="comparison-table">

appendices/appendix-c-python-for-llm/section-c.2.html

Lines changed: 0 additions & 2 deletions
@@ -60,8 +60,6 @@ <h3>Option 2: Conda (Recommended for GPU Work)</h3>
 # Export environment
 conda env export > environment.yml</code></pre>
 <div class="code-caption"><strong>Code Fragment C.2.1:</strong> This snippet demonstrates this approach using <a href="https://pytorch.org/" target="_blank" rel="noopener">PyTorch</a>. Study the implementation to understand how each component contributes to the overall workflow.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment C.2.2:</strong> This snippet demonstrates this approach using PyTorch. Study the implementation to understand how each component contributes to the overall workflow.</div>
 <div class="callout key-insight">
 <div class="callout-title">Key Insight: Why Conda for GPU Work?</div>
 <p>The main advantage of Conda over venv for LLM work is CUDA management. Installing PyTorch with <code>conda</code> automatically includes the correct CUDA toolkit version, sidestepping the need to install system-level <a href="https://www.nvidia.com/" target="_blank" rel="noopener">NVIDIA</a> drivers and CUDA separately. This is especially valuable on shared machines or when you need different CUDA versions for different projects.</p>

appendices/appendix-c-python-for-llm/section-c.4.html

Lines changed: 0 additions & 6 deletions
@@ -123,12 +123,6 @@ <h3>Pattern 5: Saving and Loading Checkpoints</h3>
 model.push_to_hub("your-username/my-finetuned-model")
 tokenizer.push_to_hub("your-username/my-finetuned-model")</code></pre>
 <div class="code-caption"><strong>Code Fragment C.4.4:</strong> This snippet demonstrates the <code>call_with_retry</code> function using API integration. The function encapsulates reusable logic that can be applied across different inputs.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment C.4.2:</strong> This snippet demonstrates this approach. Study the implementation to understand how each component contributes to the overall workflow.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment C.4.1:</strong> This snippet demonstrates this approach using <a href="https://pytorch.org/" target="_blank" rel="noopener">PyTorch</a>. Study the implementation to understand how each component contributes to the overall workflow.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment C.4.5:</strong> This snippet demonstrates this approach. Study the implementation to understand how each component contributes to the overall workflow.</div>
 <div class="callout fun-note">
 <div class="callout-title">Fun Fact: The Two-Line LLM</div>
 <p>Thanks to the <code>pipeline</code> API, you can run a language model in two lines of Python: one to create the pipeline, one to call it. The entire transformer architecture, tokenization, and decoding are handled behind the scenes. This is both a blessing (rapid prototyping) and a danger (it is easy to treat the model as a black box without understanding its behavior).</p>
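Caption C.4.4 above refers to a `call_with_retry` helper whose body lies outside this hunk. A minimal sketch of what such a retry wrapper typically looks like (the signature and defaults are assumptions, not the book's actual code):

```python
import time

def call_with_retry(fn, max_retries=3, base_delay=1.0):
    """Call fn(); on exception, retry with exponential backoff."""
    for attempt in range(max_retries):
        try:
            return fn()
        except Exception:
            if attempt == max_retries - 1:
                raise  # out of retries: surface the last error
            time.sleep(base_delay * 2 ** attempt)

# Usage with a callable that fails twice, then succeeds:
attempts = {"n": 0}
def flaky():
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise ConnectionError("transient failure")
    return "ok"

print(call_with_retry(flaky, base_delay=0.01))  # prints "ok" on the third attempt
```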

appendices/appendix-e-git-collaboration/section-e.3.html

Lines changed: 0 additions & 2 deletions
@@ -84,8 +84,6 @@ <h3>MLflow</h3>
 # Log the model as an artifact
 mlflow.log_artifact("./output/adapter_model.safetensors")</code></pre>
 <div class="code-caption"><strong>Code Fragment E.3.1:</strong> This snippet demonstrates this approach using experiment tracking, loss computation. Notice how experiment parameters and artifacts are logged together for full reproducibility. Reproducible experiments are the foundation of reliable iteration in production ML systems.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment E.3.2:</strong> This snippet demonstrates this approach using monitoring, experiment tracking. Notice how the metrics are tagged with request metadata so you can slice dashboards by model, user, or endpoint. Proactive monitoring catches regressions before they reach users and simplifies root-cause analysis.</div>
 <div class="comparison-table">
 <div class="comparison-table-title">Feature Comparison</div>
 <table>

appendices/appendix-i-prompt-templates/section-i.5.html

Lines changed: 2 additions & 6 deletions
@@ -47,9 +47,7 @@ <h3>Code Generation with Specification</h3>
 Input: {{input_description}}
 Output: {{output_description}}
 Edge cases to handle: {{edge_cases}}</code></pre>
-<div class="code-caption"><strong>Code Fragment I.5.1:</strong> Instructs the model to generate code from a specification, including type hints, docstrings, and edge case handling for production quality.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment I.5.2:</strong> Supplies the function signature and requirements, giving the model a clear contract to implement.</div>
+<div class="code-caption"><strong>Code Fragment I.5.1:</strong> Instructs the model to generate code from a specification, including type hints, docstrings, and edge case handling for production quality. The user message supplies the function signature and requirements, giving the model a clear contract to implement.</div>
 
 <div class="callout tip">
 <div class="callout-title">Tip</div>
@@ -82,9 +80,7 @@ <h3>Code Review and Improvement</h3>
 ```{{language}}
 {{code}}
 ```</code></pre>
-<div class="code-caption"><strong>Code Fragment I.5.3:</strong> Configures the model as a code reviewer that identifies bugs, performance issues, and style violations with actionable fix suggestions.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment I.5.4:</strong> Presents the code to review with context about the programming language and review focus areas.</div>
+<div class="code-caption"><strong>Code Fragment I.5.3:</strong> Configures the model as a code reviewer that identifies bugs, performance issues, and style violations with actionable fix suggestions. The user message presents the code to review with context about the programming language and review focus areas.</div>
 
 </div>
 
part-2-understanding-llms/module-06-pretraining-scaling-laws/section-6.3.html

Lines changed: 1 addition & 3 deletions
@@ -389,9 +389,7 @@ <h3>Computing the Chinchilla-Optimal Allocation</h3>
 Optimal model size: 288.7B parameters
 Optimal data: 5774B tokens
 </div>
-<div class="code-caption"><strong>Code Fragment 6.3.1:</strong> Empirical data: (parameters, final_loss) from small training runs.</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment 6.3.2:</strong> Example compute budgets.</div>
+<div class="code-caption"><strong>Code Fragment 6.3.1:</strong> Empirical data from small training runs showing (parameters, final_loss) pairs and example compute budgets with optimal model size and data allocations.</div>
 
 <h2>9. Summary Table: Scaling Regimes</h2>
 
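The output pane in this hunk (288.7B parameters, 5774B tokens) is consistent with the Chinchilla rule of thumb of roughly 20 tokens per parameter combined with the approximation C ≈ 6·N·D. A small sketch that reproduces those numbers, assuming a compute budget of 1e25 FLOPs (the budget is inferred from the shown output, not stated in the hunk):

```python
import math

def chinchilla_optimal(compute_flops, tokens_per_param=20.0):
    """Split a FLOPs budget using C ≈ 6·N·D with D ≈ 20·N (Chinchilla)."""
    # C = 6 · N · (20 · N) = 120 · N²  →  N = sqrt(C / 120)
    n_params = math.sqrt(compute_flops / (6 * tokens_per_param))
    n_tokens = tokens_per_param * n_params
    return n_params, n_tokens

n, d = chinchilla_optimal(1e25)
print(f"Optimal model size: {n / 1e9:.1f}B parameters")  # 288.7B
print(f"Optimal data: {d / 1e9:.0f}B tokens")            # 5774B
```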
part-2-understanding-llms/module-07-modern-llm-landscape/section-7.1.html

Lines changed: 1 addition & 3 deletions
@@ -362,9 +362,7 @@ <h3>Pricing Comparison</h3>
 # Similar pattern for Google (Vertex AI) and other providers</code></pre>
 <div class="code-output">GPT-4o: 25 * 37 = 925
 Tokens: 14 in, 8 out</div>
-<div class="code-caption"><strong>Code Fragment 7.1.1:</strong> Approximate pricing comparison (per million tokens, USD).</div>
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment 7.1.2:</strong> Example: Making an API call to compare providers.</div>
+<div class="code-caption"><strong>Code Fragment 7.1.2:</strong> Making an API call to compare providers using the OpenAI-compatible chat format across different model providers.</div>
 
 <div class="callout note">
 <div class="callout-title">Note: Where This Leads Next</div>
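The removed caption 7.1.1 described an approximate per-million-token pricing comparison. The underlying arithmetic is simple enough to sketch; the price figures below are placeholders for illustration only, since real provider pricing changes frequently:

```python
# Placeholder per-million-token prices (USD); check providers for current rates.
PRICES = {
    "gpt-4o":        {"input": 2.50, "output": 10.00},
    "claude-sonnet": {"input": 3.00, "output": 15.00},
}

def request_cost(model: str, tokens_in: int, tokens_out: int) -> float:
    """Cost of one request: tokens / 1M times the per-million-token rate."""
    p = PRICES[model]
    return tokens_in / 1e6 * p["input"] + tokens_out / 1e6 * p["output"]

# The hunk's output pane reports 14 input and 8 output tokens:
print(f"${request_cost('gpt-4o', 14, 8):.6f}")  # $0.000115
```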

part-3-working-with-llms/module-10-llm-apis/section-10.3.html

Lines changed: 0 additions & 3 deletions
@@ -446,9 +446,6 @@ <h3>5.2 Helicone</h3>
 print("Check Helicone dashboard for detailed analytics.")</code></pre>
 <div class="code-output">Response received. Tokens: 89
 Check Helicone dashboard for detailed analytics.</div>
-<div class="code-caption"><strong>Code Fragment 10.3.6:</strong> Configuring Portkey as an AI gateway with fallback routing and semantic caching. The OpenAI client is pointed at Portkey's gateway URL, which transparently handles provider failover and caching.</div>
-
-<!-- FIXME: stacked caption, needs manual review -->
 <div class="code-caption"><strong>Code Fragment 10.3.7:</strong> Routing API calls through Helicone for observability by changing the base URL and adding custom headers. Every request is automatically logged with latency, token counts, cost, and tagged properties.</div>
 
 <div class="callout key-insight">

part-3-working-with-llms/module-11-prompt-engineering/section-11.3.html

Lines changed: 0 additions & 2 deletions
@@ -371,8 +371,6 @@ <h2>2. Meta-Prompting: Prompts That Generate Prompts <span class="level-badge in
 })
 print(messages) # ready to pass to any LangChain LLM</code></pre>
 <div class="code-caption"><strong>Code Fragment 11.3.3:</strong> Meta-prompting via <code>generate_expert_prompt()</code>, which uses an LLM to write system prompts for other LLM calls. The meta-prompt template specifies five structural requirements (role definition, output format, quality criteria, edge cases, and two examples) and requests only the prompt text with no commentary.</div>
-<!-- TODO: insert library shortcut code block for this caption -->
-<div class="code-caption"><strong>Code Fragment 11.3.6:</strong> LangChain <code>ChatPromptTemplate</code> shortcut. Declarative templates separate prompt structure from variable content, making prompts versionable, testable, and composable into chains without manual string formatting.</div>
 
 <div class="callout note">
 <div class="callout-title">Note: Meta-Prompting and Iteration</div>
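Caption 11.3.3 describes `generate_expert_prompt()` as sending a meta-prompt with five structural requirements; the template itself is outside this hunk. A plain-Python sketch of what such a meta-prompt might look like (the template wording and function name here are illustrative assumptions, not the book's code):

```python
META_PROMPT = """You are an expert prompt engineer. Write a system prompt for \
an assistant specializing in {domain}. The prompt must include:
1. A clear role definition
2. An explicit output format
3. Quality criteria for responses
4. Handling for edge cases
5. Two worked examples
Return only the prompt text, with no commentary."""

def build_meta_prompt(domain: str) -> list[dict]:
    """Build the messages that ask an LLM to write a system prompt.
    The actual LLM call (and any LangChain wiring) is omitted."""
    return [{"role": "user", "content": META_PROMPT.format(domain=domain)}]

messages = build_meta_prompt("SQL query optimization")
print(messages[0]["role"])  # user
```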

part-3-working-with-llms/module-12-hybrid-ml-llm/section-12.2.html

Lines changed: 1 addition & 4 deletions
@@ -286,10 +286,7 @@ <h2>4. Combining Embeddings with Structured Features <span class="level-badge in
 Embeddings only                     0.534 (+/- 0.035)
 Combined (structured + embeddings)  0.841 (+/- 0.018)
 </div>
-<div class="code-caption"><strong>Code Fragment 12.2.2:</strong> Local embedding with <code>SentenceTransformer('all-MiniLM-L6-v2')</code>, an 80 MB model producing 384-dimensional vectors. The <code>normalize_embeddings=True</code> flag enables direct dot-product similarity. At 5.7 ms per text on CPU with zero API cost, this is orders of magnitude cheaper than cloud embedding APIs.</div>
-
-<!-- FIXME: stacked caption, needs manual review -->
-<div class="code-caption"><strong>Code Fragment 12.2.3:</strong> Feature ablation study comparing structured-only, embeddings-only, and combined feature sets using XGBoost with 5-fold cross-validation. The combined configuration (<code>StandardScaler</code> on structured features concatenated with 384-dim embeddings) outperforms either source alone, demonstrating complementary signal.</div>
+<div class="code-caption"><strong>Code Fragment 12.2.2:</strong> Local embedding with <code>SentenceTransformer('all-MiniLM-L6-v2')</code> producing 384-dimensional vectors, followed by a feature ablation study comparing structured-only, embeddings-only, and combined feature sets using XGBoost with 5-fold cross-validation. The combined configuration outperforms either source alone, demonstrating complementary signal.</div>
 
 <div class="callout key-insight">
 <div class="callout-title">Key Insight</div>
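The merged caption 12.2.2 describes concatenating standardized structured features with 384-dimensional sentence embeddings. The shape bookkeeping of that combined feature matrix can be sketched with NumPy alone; random arrays stand in for the real SentenceTransformer embeddings and tabular data, and the XGBoost cross-validation step is omitted:

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples = 100

# Stand-ins for real data: 6 tabular columns and 384-dim text embeddings.
structured = rng.normal(size=(n_samples, 6))
embeddings = rng.normal(size=(n_samples, 384))

# Standardize the structured block (as StandardScaler would), then
# concatenate column-wise to form the combined feature matrix.
scaled = (structured - structured.mean(axis=0)) / structured.std(axis=0)
combined = np.hstack([scaled, embeddings])

print(combined.shape)  # (100, 390)
```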
