Add AI tools policy by Turbo87 · Pull Request #13726 · rust-lang/crates.io

Turbo87 · 2026-05-21T10:01:45Z

This PR proposes adopting the CPython devguide's AI tools policy as crates.io's own, as docs/AI-TOOLS.md (with one Python-specific testing sentence dropped). It is linked from CONTRIBUTING.md and AGENTS.md.

The CPython policy puts responsibility for content on the submitter, requires contributors to be able to explain their changes, and doesn't try to ban AI tooling outright. That matches how things are already done in this project and seems a reasonable starting point.

Project-wide AI policy for rust-lang is still being debated in two open RFCs (neither close to consensus), with a separate rust-lang/rust-scoped policy that explicitly excludes crates.io. If a project-wide policy is later adopted, we can align with it.

Open to feedback on whether this is the right policy and whether anything should be adjusted before merging.

So ultimately I don't expect crates.io to have a lot of PRs or issues so I think that ultimately this is just me giving advice that you're free to ignore. But at least from what I found, even projects which strongly support AI tools have been in favour of disclosing just so they know what they're dealing with, and I think that it's okay to have a hard rule with no punishments for forgetting it.

Like, I figure the rate of issues/PRs is so low that you have the time/resources to just individually talk to each person and figure out what they're working with anyway, but it can be helpful to ask for to see what people are using.

Just a few examples of other policies that feel relevant here:

curl explicitly asks for AI disclosure in security reports

ghostty explicitly states "AI is welcome here" but still requests disclosure in basically the first bullet point

View changes since the review

But at least from what I found, even projects which strongly support AI tools have been in favour of disclosing just so they know what they're dealing with, and I think that it's okay to have a hard rule with no punishments for forgetting it.

This would also be my preference.

I litigated this at some length within the Foundation when we were drafting our internal AI policy, so my apologies to @Turbo87 for the repeat, but the short version is that I believe that we should require disclosure for two primary reasons:

Legal concerns. While I suspect our US-focused legal advice that LLM contributions are not subject to copyright will eventually end up being codified into international copyright law, this isn't even fully settled in the US, and we also have to operate in other, slower moving jurisdictions. If this does not end up being the result, we will need to be able to account for the origin of every contribution. Even if that outcome is unlikely, I think the minimal cost of requiring disclosure now is a small price to pay to make it easier to handle that eventuality.

Review practicalities. While I don't think knowing LLMs have been used would meaningfully change how I review PRs from within the crates.io team, we do accept external contributions, and the sorts of things I expect to look for in review would change for a developer who used LLM assistance versus not, as would how I communicate with that contributor.

I don't want this to be some onerous process. I'd be happy with an Assisted-By or Co-Authored-By trailer in either the commit message or the PR description, much like the Linux kernel policy. But I do think it's important that we require this.

It would be interesting to hear from counsel on the first point. I would expect to hear a mixed set of pros and cons for each choice — widespread disclosure seems likely to carry some risks too for the Project. Regarding the second point, the relevant comment in the Python discussions was this one — in short, reviewers need to assume LLM use anyway.

Disclosure has more than minimal cost:

The transparency dilemma finding includes considerable data showing that, contrary to intuition, disclosure of LLM use erodes trust.

Once primed about LLM use, people start to see LLM fingerprints that aren't actually there — we're creatures that see faces in clouds. Disclosure can increase the risk of false accusations of more LLM use than was disclosed.

We're operating in an environment where people are actively shaming others who use LLMs. Highlighting one's use subjects one to the risk of this shaming.

Mandating disclosure has additional costs. It becomes something we need to enforce fairly and consistently. As part of a moderation policy, we'd be talking about banning people from the Project over this. If it turns out the policy is unenforceable and that people choose not to disclose due to these costs, then we'll have created a kind of policy fiction. That's not any good for us.

Similar to Python, we can encourage disclosure without incurring the costs that accompany mandating it.

So, it is worth reiterating on the legal point that generally the result of using LLMs is the removal of copyright protections, not the addition of extra burdens, which does not matter given the extremely permissive license crates.io uses. It does matter for proprietary code and copyleft code, but for open source, permissively licensed work, it seems fine.

Additionally, it seems incredibly likely that if any copyright questions came up, e.g. someone was upset that code verbatim was used without authorisation, there would be ample time to fix the issue (removing the offending code) rather than these being immediate repercussions.

Most of the projects that I've seen be concerned with copyright issues are copyleft and thus specifically rely on copyright to enforce licencing, which just isn't the case here. So, while I do think there are other points to discuss, I don't think the legal point is particularly compelling here, which also matches the existing advice given by Foundation counsel on the matter.

I think I do most of the PR reviews on crates.io at the moment (obviously excluding my own PRs…), and knowing whether AI was involved wouldn't change much how I review. The "explain your changes" requirement is already what I actually care about. For first-time contributors with zero context I agree disclosure can help, but that's a pretty different situation from someone opening a dozens of PRs each month.

On the legal point: the PSF landed on this exact wording without requiring disclosure. I don't know what their counsel actually said about it, so I'm only guessing, but I'd be a little surprised if a project that size shipped an AI policy without running it past their lawyers at some point. If that's roughly right, it makes me want to understand what's specifically different about our situation before treating the legal angle as a strong reason to go further than they did.

And I think Travis is right that disclosure has real downsides in the current climate. Adding a trailer to every PR also adds quite a bit of friction when LLM tooling is opening them, and getting that tooling to reliably add trailers is a whole separate problem.

That said, here are a few things I'd be happy to put in the policy, if it helps:

Bump "appreciated, while not required" to "encouraged, especially from new contributors"

Add something like "reviewers may ask about AI usage when it would help, and contributors should answer honestly"

Two other options to avoid the additional friction for regular contributors:

We can flip the framing: say explicitly that reviewers should assume AI may have been used, and make disclosure of non-use the optional opt-in. At this point the vast majority of recent crates.io PRs had LLM involvement and IMHO it's better to optimize for the 90% case.

One-time disclosure for regulars, recorded privately: contributors who use AI tooling regularly disclose it once (e.g. via a comment on a designated issue in the team's private ops guide repo) instead of repeating it on every PR. The record exists, so reviewers on the team have access and the legal/review-practicalities angle is covered, but the disclosure isn't sitting in a public list that anyone can scrape and turn into a target. For someone like me who opens a lot of PRs that drops the per-PR friction effectively to zero while still leaving the information available where it matters. First-time or external contributors would still get asked by a PR template or maybe by the reviewer.

I mean, ultimately, I do think that the small number of contributions is more than sufficient to prefer an ad hoc method of explaining/disclosing since, like I said, you probably have the bandwidth to just talk to everyone individually about their changes. I think that encouraging more transparency is good, but if you think the downsides outweigh the upsides here, that's fair.

FYI for those that were not there: we discussed this policy proposal and the disclosure thread in particular in the team meeting last friday.

@rust-lang/crates-io I'm still open to feedback to my last comment in this thread. if I don't hear anything by the end of the week I'll assume that there are no objections to merging this as-is.

In the opening comment here I read:

Project-wide AI policy for rust-lang is still being debated [...] ]If a project-wide policy is later adopted, we can align with it.

The policy discussed here differs in many parts from rust-forge#1040 (for example the disclosure policy) and I appreciate you want something out sooner.

But I want to raise a more general question: this policy is more lenient than the other one, so I think that if you want to adopt rust-forge#1040, you might find yourself deciding to backpedal on some items that are now allowed (or not mentioned), like the disclosure clause which we feel strongly about and many others, like bots reviews, typos fixes, etc.

How does the t-crates.io team feels when (or if) comes the question about adopting the rust-lang/rust policy?

clarfonthey · 2026-05-22T01:49:57Z

This is something I totally didn't even consider until now, but are there any other public repos managed by the crates.io team or is everything besides this just like, private infra accounts for controlling the actual deployed resources? I mostly ask since, this makes the most sense to just say "this is the policy of the crates.io team," but I don't actually know if there's anything else that would be controlled by you folks.

LawnGnome

Approved is probably a strong word here, so I won't use it.

I have significant ethical concerns with the current state-of-the-art with regards to LLM assistants, and will not use them myself in any generative context. However, that is my own boundary, and I accept that everyone gets to make their own choice.

As discussed below, I think we should require disclosure. With that change, I would accept this policy.

View changes since this review

LawnGnome · 2026-05-22T10:41:42Z

+Disclosure of the use of AI tools in the PR description is appreciated,
+while not required. Be prepared to explain how the tool was used and what
+changes it made.


But at least from what I found, even projects which strongly support AI tools have been in favour of disclosing just so they know what they're dealing with, and I think that it's okay to have a hard rule with no punishments for forgetting it.

This would also be my preference.

I litigated this at some length within the Foundation when we were drafting our internal AI policy, so my apologies to @Turbo87 for the repeat, but the short version is that I believe that we should require disclosure for two primary reasons:

Legal concerns. While I suspect our US-focused legal advice that LLM contributions are not subject to copyright will eventually end up being codified into international copyright law, this isn't even fully settled in the US, and we also have to operate in other, slower moving jurisdictions. If this does not end up being the result, we will need to be able to account for the origin of every contribution. Even if that outcome is unlikely, I think the minimal cost of requiring disclosure now is a small price to pay to make it easier to handle that eventuality.

Review practicalities. While I don't think knowing LLMs have been used would meaningfully change how I review PRs from within the crates.io team, we do accept external contributions, and the sorts of things I expect to look for in review would change for a developer who used LLM assistance versus not, as would how I communicate with that contributor.

I don't want this to be some onerous process. I'd be happy with an Assisted-By or Co-Authored-By trailer in either the commit message or the PR description, much like the Linux kernel policy. But I do think it's important that we require this.

LawnGnome · 2026-05-22T10:46:52Z

This is something I totally didn't even consider until now, but are there any other public repos managed by the crates.io team or is everything besides this just like, private infra accounts for controlling the actual deployed resources? I mostly ask since, this makes the most sense to just say "this is the policy of the crates.io team," but I don't actually know if there's anything else that would be controlled by you folks.

That's a good call out, but in this case, this repo is the only public repo owned by the crates.io team.

I expect this policy — if approved — would also apply to our one private repo (our ops guide), but I'm not sure it's necessary to change this to explicitly include that. 🤷

clarfonthey · 2026-05-22T17:41:42Z

Yeah, I figure that this kind of policy doesn't need to be clarified for private repos considering how it's mostly aimed at contributors who aren't on the team and thus don't have expectations set yet. So, with this being the only public crate for the repo and the repo receiving such a small amount of traffic externally as-is this policy is more than sufficient.

I don't know if there are any crates.io team members who aren't Foundation staff but I feel that regardless, just deferring to Foundation policy for internal stuff would be sufficient.

Turbo87 · 2026-05-23T09:04:28Z

That's a good call out, but in this case, this repo is the only public repo owned by the crates.io team.

Quick correction on the scope claim: there are actually a few other public repos in the crates.io team's orbit. The authoritative list is in the team-api (https://team-api.infra.rust-lang.org/v1/repos.json), but off the top of my head:

rust-lang/crates_io_og_image, owned by the crates.io team, used for OG image rendering
rust-lang/crates-io-auth-action, shared with infra I think
rust-lang/crates-io-heroku-metrics, ownership a bit unclear, possibly infra
rust-lang/crates.io-index and the related index repos, but the AI policy probably isn't very relevant there

For the repos where the policy is actually relevant (crates_io_og_image in particular), my preference would be to have the policy explicitly cover them rather than copying it into each repo. Maybe a sentence at the end of the policy like:

This policy applies to all repositories owned by the crates.io team.

That way we don't have to manage copies, and as the team's repo list changes the policy follows automatically.

clarfonthey · 2026-05-24T16:32:20Z

Specifically for crates.io-index and co., I know that GitHub allows disabling both issues and PRs now so I would just disable both for the repo and any relevant related ones. It's not really a project to contribute to, just a dataset.

Might poke around later to see if the current infra we have allows automatically doing this via the team repo, since it is a new feature.

Edit: It appears that this is done manually: https://github.com/rust-lang/team/blob/main/repos/rust-lang/crates.io-index.toml

So, I'll open a thread on Zulip about it.

This imports the AI tools guidelines from the CPython devguide (python/devguide#1778) verbatim, as a starting point for our own policy. The text is licensed under CC0-1.0. Source: https://github.com/python/devguide/blob/main/getting-started/ai-tools.rst

crates.io has no published "testing principles" document equivalent to the one this sentence references, so the guidance does not translate.

rustbot · 2026-05-25T12:54:34Z

This PR was rebased onto a different main commit. Here's a range-diff highlighting what actually changed.

Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers.

Turbo87 added the C-documentation 📜 Category: Documentation of crates.io itself label May 21, 2026

Turbo87 requested a review from a team May 21, 2026 10:02

carols10cents approved these changes May 21, 2026

View reviewed changes

clarfonthey reviewed May 22, 2026

View reviewed changes

Comment thread docs/AI-TOOLS.md

clarfonthey reviewed May 22, 2026

View reviewed changes

LawnGnome reviewed May 22, 2026

View reviewed changes

Turbo87 added 4 commits May 25, 2026 14:54

docs/AI-TOOLS: Drop Python-specific testing principles sentence

4590196

crates.io has no published "testing principles" document equivalent to the one this sentence references, so the guidance does not translate.

AGENTS: Link to AI-TOOLS.md from reference material list

ec1c6b2

docs/CONTRIBUTING: Add Using AI tools section

7ecdf65

Turbo87 force-pushed the ai-tools-policy branch from 7a2230e to 7ecdf65 Compare May 25, 2026 12:54

Turbo87 requested a review from LawnGnome May 28, 2026 18:22

Turbo87 added this to crates.io team meetings May 28, 2026

Turbo87 moved this to For next meeting in crates.io team meetings May 28, 2026

clarfonthey mentioned this pull request May 31, 2026

Project-wide LLM policy rust-lang/rfcs#3959

Open

Conversation

Turbo87 commented May 21, 2026

Related

Uh oh!

Uh oh!

clarfonthey May 22, 2026 • edited by rustbot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LawnGnome May 22, 2026

Choose a reason for hiding this comment

Uh oh!

traviscross May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

clarfonthey May 22, 2026

Choose a reason for hiding this comment

Uh oh!

Turbo87 May 23, 2026

Choose a reason for hiding this comment

Uh oh!

clarfonthey May 23, 2026

Choose a reason for hiding this comment

Uh oh!

Turbo87 Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

apiraino Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

clarfonthey commented May 22, 2026

Uh oh!

LawnGnome left a comment • edited by rustbot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LawnGnome May 22, 2026

Choose a reason for hiding this comment

Uh oh!

LawnGnome commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

clarfonthey commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Turbo87 commented May 23, 2026

Uh oh!

clarfonthey commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rustbot commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

clarfonthey May 22, 2026 •

edited by rustbot

Loading

traviscross May 22, 2026 •

edited

Loading

LawnGnome left a comment •

edited by rustbot

Loading

LawnGnome commented May 22, 2026 •

edited

Loading

clarfonthey commented May 22, 2026 •

edited

Loading

clarfonthey commented May 24, 2026 •

edited

Loading