<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <meta name="description" content="Nathan Hu - Personal Website">
    <title>Nathan Hu</title>
    <link rel="stylesheet" href="styles.css">
</head>
<body>
    <!--
    Hello curious visitor! 👋
    Yes, this website was indeed made with Claude.
    If you're reading this, you probably enjoy looking at source code too.
    Hope you have a great day exploring the web!
    -->
    <div class="container">
        <header>
            <div class="header-content">
                <img src="files/photo.jpg" alt="Nathan Hu" class="profile-photo">
                <div>
                    <h1>Nathan Hu</h1>
                    <div class="email-display">
                        <svg viewBox="0 0 50 25" xmlns="http://www.w3.org/2000/svg">
                            <text x="0" y="20" font-family="-apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif" font-size="20" fill="#8b4513">nathu</text>
                        </svg>
                        <svg viewBox="0 0 20 25" xmlns="http://www.w3.org/2000/svg">
                            <text x="0" y="20" font-family="-apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif" font-size="20" fill="#8b4513">@</text>
                        </svg>
                        <svg viewBox="0 0 140 25" xmlns="http://www.w3.org/2000/svg">
                            <text x="0" y="20" font-family="-apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif" font-size="20" fill="#8b4513">cs.stanford.edu</text>
                        </svg>
                    </div>
                    <nav class="social-links">
                        <a href="https://scholar.google.com/citations?user=4SmMofIAAAAJ&amp;hl=en" target="_blank" title="Google Scholar">
                            <svg class="icon" viewBox="0 0 24 24" xmlns="http://www.w3.org/2000/svg">
                                <path d="M12 24a7 7 0 1 1 0-14 7 7 0 0 1 0 14zm0-24L0 9.5l4.838 3.94A8 8 0 0 1 12 9a8 8 0 0 1 7.162 4.44L24 9.5z"/>
                            </svg>
                        </a>
                        <a href="https://github.com/nathanhu0" target="_blank" title="GitHub">
                            <svg class="icon" viewBox="0 0 24 24" xmlns="http://www.w3.org/2000/svg">
                                <path d="M12 0c-6.626 0-12 5.373-12 12 0 5.302 3.438 9.8 8.207 11.387.599.111.793-.261.793-.577v-2.234c-3.338.726-4.033-1.416-4.033-1.416-.546-1.387-1.333-1.756-1.333-1.756-1.089-.745.083-.729.083-.729 1.205.084 1.839 1.237 1.839 1.237 1.07 1.834 2.807 1.304 3.492.997.107-.775.418-1.305.762-1.604-2.665-.305-5.467-1.334-5.467-5.931 0-1.311.469-2.381 1.236-3.221-.124-.303-.535-1.524.117-3.176 0 0 1.008-.322 3.301 1.23.957-.266 1.983-.399 3.003-.404 1.02.005 2.047.138 3.006.404 2.291-1.552 3.297-1.23 3.297-1.23.653 1.653.242 2.874.118 3.176.77.84 1.235 1.911 1.235 3.221 0 4.609-2.807 5.624-5.479 5.921.43.372.823 1.102.823 2.222v3.293c0 .319.192.694.801.576 4.765-1.589 8.199-6.086 8.199-11.386 0-6.627-5.373-12-12-12z"/>
                            </svg>
                        </a>
                        <a href="https://x.com/nathanhu12" target="_blank" title="X (Twitter)">
                            <svg class="icon" viewBox="0 0 24 24" xmlns="http://www.w3.org/2000/svg">
                                <path d="M23.953 4.57a10 10 0 01-2.825.775 4.958 4.958 0 002.163-2.723c-.951.555-2.005.959-3.127 1.184a4.92 4.92 0 00-8.384 4.482C7.69 8.095 4.067 6.13 1.64 3.162a4.822 4.822 0 00-.666 2.475c0 1.71.87 3.213 2.188 4.096a4.904 4.904 0 01-2.228-.616v.06a4.923 4.923 0 003.946 4.827 4.996 4.996 0 01-2.212.085 4.936 4.936 0 004.604 3.417 9.867 9.867 0 01-6.102 2.105c-.39 0-.779-.023-1.17-.067a13.995 13.995 0 007.557 2.209c9.053 0 13.998-7.496 13.998-13.985 0-.21 0-.42-.015-.63A9.935 9.935 0 0024 4.59z"/>
                            </svg>
                        </a>
                        <a href="https://www.linkedin.com/in/nathan-hu-6598111a9" target="_blank" title="LinkedIn">
                            <svg class="icon" viewBox="0 0 24 24" xmlns="http://www.w3.org/2000/svg">
                                <path d="M20.447 20.452h-3.554v-5.569c0-1.328-.027-3.037-1.852-3.037-1.853 0-2.136 1.445-2.136 2.939v5.667H9.351V9h3.414v1.561h.046c.477-.9 1.637-1.85 3.37-1.85 3.601 0 4.267 2.37 4.267 5.455v6.286zM5.337 7.433c-1.144 0-2.063-.926-2.063-2.065 0-1.138.92-2.063 2.063-2.063 1.14 0 2.064.925 2.064 2.063 0 1.139-.925 2.065-2.064 2.065zm1.782 13.019H3.555V9h3.564v11.452zM22.225 0H1.771C.792 0 0 .774 0 1.729v20.542C0 23.227.792 24 1.771 24h20.451C23.2 24 24 23.227 24 22.271V1.729C24 .774 23.2 0 22.222 0h.003z"/>
                            </svg>
                        </a>
                    </nav>
                </div>
            </div>
        </header>

        <section id="about">
            <p>
                Hi, I'm a first-year PhD student at Stanford studying language model interpretability. I previously completed both my undergraduate and master's degrees at Stanford (yes, I've been here a while, although I did take a gap year).
                During that gap year, I was a research resident at Anthropic working on reward hacking with Evan Hubinger, then worked on Stochastic Parameter Decomposition with Lee Sharkey through the MATS program.
                I am grateful to have spent several years during undergrad working in the <a href="https://irislab.stanford.edu" target="_blank">IRIS Lab</a> and learning so much from <a href="https://ericmitchell.ai" target="_blank">Eric Mitchell</a> and <a href="https://ai.stanford.edu/~cbfinn/" target="_blank">Chelsea Finn</a>.
                My full CV can be found <a href="files/cv.pdf" target="_blank">here</a>.
            </p>
            <p>
                I enjoy tennis, cooking, board games, and card games (especially poker and Magic: The Gathering).
                I also love snowboarding and electronic dance music.
            </p>
        </section>

        <section id="publications">
            <h2>Research</h2>
            <div class="publications-grid">
                <div class="publication-entry">
                    <div class="publication-title">Transcoder Adapters for Reasoning-Model Diffing</div>
                    <div class="publication-authors">
                        <strong>Nathan Hu</strong>, Jake Ward, Thomas Icard, Christopher Potts
                    </div>
                    <div class="publication-venue">Preprint, 2026</div>
                    <div class="publication-links">
                        <a href="https://arxiv.org/abs/2602.20904" target="_blank">[paper]</a>
                        <a href="https://transcoder-adapters.github.io/" target="_blank">[website]</a>
                    </div>
                </div>
                <div class="publication-entry">
                    <div class="publication-title">Measuring Sparse Autoencoder Feature Sensitivity</div>
                    <div class="publication-authors">
                        Claire Tian, Katherine Tian, <strong>Nathan Hu</strong>
                    </div>
                    <div class="publication-venue">NeurIPS Workshop on Mechanistic Interpretability (Spotlight), 2025</div>
                    <div class="publication-links">
                        <a href="https://arxiv.org/abs/2509.23717" target="_blank">[paper]</a>
                    </div>
                </div>
                <div class="publication-entry">
                    <div class="publication-title">Training on Documents About Reward Hacking Induces Reward Hacking</div>
                    <div class="publication-authors">
                        <strong>Nathan Hu*</strong>, Benjamin Wright, Carson Denison, Samuel Marks, Johannes Treutlein, Jonathan Uesato, Evan Hubinger
                    </div>
                    <div class="publication-venue">Anthropic Alignment Science Blog, 2025</div>
                    <div class="publication-links">
                        <a href="https://alignment.anthropic.com/2025/reward-hacking-ooc/" target="_blank">[paper]</a>
                    </div>
                </div>
                <div class="publication-entry">
                    <div class="publication-title">Long-form factuality in large language models</div>
                    <div class="publication-authors">
                        Jerry Wei*, Chengrun Yang*, Xinying Song*, Yifeng Lu*, <strong>Nathan Hu</strong>, Jie Huang, Dustin Tran, Daiyi Peng, Ruibo Liu, Da Huang, Cosmo Du, Quoc V. Le
                    </div>
                    <div class="publication-venue">NeurIPS, 2024</div>
                    <div class="publication-links">
                        <a href="https://arxiv.org/abs/2403.18802" target="_blank">[paper]</a>
                    </div>
                </div>
                <div class="publication-entry">
                    <div class="publication-title">High-Fidelity Cellular Network Control-Plane Traffic Generation without Domain Knowledge</div>
                    <div class="publication-authors">
                        Z. Jonny Kong, <strong>Nathan Hu</strong>, Y. Charlie Hu, Jiayi Meng, Yaron Koral
                    </div>
                    <div class="publication-venue">ACM IMC, 2024</div>
                    <div class="publication-links">
                        <a href="https://dl.acm.org/doi/10.1145/3646547.3688422" target="_blank">[paper]</a>
                    </div>
                </div>
                <div class="publication-entry">
                    <div class="publication-title">Meta-Learning Online Adaptation of Language Models</div>
                    <div class="publication-authors">
                        <strong>Nathan Hu*</strong>, Eric Mitchell*, Christopher D. Manning, Chelsea Finn
                    </div>
                    <div class="publication-venue">EMNLP, 2023</div>
                    <div class="publication-links">
                        <a href="https://aclanthology.org/2023.emnlp-main.268/" target="_blank">[paper]</a>
                    </div>
                </div>
                <div class="publication-entry">
                    <div class="publication-title">Sampling Arborescences in Parallel</div>
                    <div class="publication-authors">
                        Nima Anari, <strong>Nathan Hu</strong>, Amin Saberi, Aaron Schild
                    </div>
                    <div class="publication-venue">ITCS, 2021</div>
                    <div class="publication-links">
                        <a href="https://arxiv.org/abs/2012.09502" target="_blank">[paper]</a>
                    </div>
                </div>
            </div>
        </section>

        <footer>
            <p>© 2025 Nathan Hu · Website made with <a href="https://github.com/anthropics/claude-code" target="_blank" style="color: #8b4513;">Claude Code</a></p>
        </footer>
    </div>
</body>
</html>