信息源
正文(机器翻译)
Today, we’re announcing an update on how GitHub will use data to deliver more intelligent, context-aware coding assistance. From April 24 onward, interaction data—specifically inputs, outputs, code snippets, and associated context—from Copilot Free, Pro, and Pro+ users will be used to train and improve our AI models unless they opt out. Copilot Business and Copilot Enterprise users are not affected by this update.
今天,我们宣布了 GitHub 如何使用数据来提供更智能、更知情的编码帮助的更新。从 4 月 24 日起,Copilot Free、Pro 和 Pro+ 用户的交互数据(特别是输入、输出、代码片段和相关上下文)将用于训练和改进我们的 AI 模型,除非他们选择退出。Copilot Business 和 Copilot Enterprise 用户不受此更新影响。
Not interested? Opt out in settings under “Privacy.” If you previously opted out of the setting allowing GitHub to collect this data for product improvements, your preference has been retained—your choice is preserved, and your data will not be used for training unless you opt in.
不感兴趣?在“隐私”下,在设置中取消订阅。如果您之前取消了允许 GitHub 收集此数据用于产品改进的设置,您的偏好会保留——您的选择会被保存,除非您选择加入,否则您的数据不会用于训练。
This approach aligns with established industry practices and will improve model performance for all users. By participating, you’ll help our models better understand development workflows, deliver more accurate and secure code pattern suggestions, and improve their ability to help you catch potential bugs before they reach production.
这种方法符合行业惯例,将提高所有用户的模型性能。通过参与,您将帮助我们的模型更好地理解开发工作流程,提供更准确、更安全的代码模式建议,并提高其在您发现潜在错误之前帮助您捕获这些错误的能力。
Real-world data = smarter models
现实世界的数据 = 智能模型
Our initial models were built using a mix of publicly available data and hand-crafted code samples. This past year, we’ve started incorporating interaction data from Microsoft employees and have seen meaningful improvements, including increased acceptance rates in multiple languages.
我们的初始模型是使用公开数据和手工编写的代码样本混合构建的。去年,我们开始整合来自微软员工的交互数据,并看到了显著的改进,包括在多个语言中的接受率提高。
The improvements we’ve seen by incorporating Microsoft interaction data indicate we can improve model performance for a more diverse range of use cases by training on real-world interaction data. Should you decide to participate in this program, the interaction data we may collect and leverage includes:
通过整合 Microsoft 交互数据,我们看到的改进表明,通过在真实世界交互数据上进行训练,我们可以提高模型在更广泛使用案例中的性能。如果您决定参加此计划,我们可能收集和利用的交互数据包括:
- Outputs accepted or modified by you
您接受或修改的输出 - Inputs sent to GitHub Copilot, including code snippets shown to the model
输入到 GitHub Copilot,包括模型看到的代码片段 - Code context surrounding your cursor position
光标所在位置周围的代码上下文 - Comments and documentation you write
你写的评论和文档 - File names, repository structure, and navigation patterns
文件名、存储库结构和导航模式 - Interactions with Copilot features (chat, inline suggestions, etc.)
与 Copilot 功能的交互(聊天、内联建议等) - Your feedback on suggestions (thumbs up/down ratings)
您的反馈意见(赞/下)
This program does not use:
此程序不使用:
- Interaction data from Copilot Business, Copilot Enterprise, or enterprise-owned repositories
来自 Copilot Business、Copilot Enterprise 或企业存储库的交互数据 - Interaction data from users who opt out of model training in their Copilot settings
在 Copilot 设置中选择退出模型训练的用户交互数据 - Content from your issues, discussions, or private repositories at rest. We use the phrase “at rest” deliberately because Copilot does process code from private repositories when you are actively using Copilot. This interaction data is required to run the service and could be used for model training unless you opt out.
内容来自您的问题、讨论或私有存储库。我们特意使用“在闲置状态”一词,因为 Copilot 在您正在使用 Copilot 时会处理私有存储库中的代码。这些交互数据对于运行服务是必要的,如果您不选择退出,可能会用于训练模型。
The data used in this program may be shared with GitHub affiliates, which are companies in our corporate family including Microsoft. This data will not be shared with third-party AI model providers or other independent service providers.
本程序中使用的数据可能会与 GitHub 的关联公司共享,这些关联公司是 Microsoft 等我们公司家族中的公司。这些数据不会与第三方 AI 模型提供商或其他独立服务提供商共享。
We believe the future of AI-assisted development depends on real-world interaction data from developers like you. It’s why we’re using Microsoft interaction data for model training and will begin using interaction data from GitHub employees as well.
我们相信,人工智能辅助开发的未来取决于像您这样的开发人员在真实世界中的交互数据。这就是为什么我们使用微软的交互数据进行模型训练,并将开始使用 GitHub 员工的交互数据。
If you choose to help us improve our models with your interaction data, thank you. Your contributions make a meaningful difference in building AI tools that serve the entire developer community. If you prefer not to participate, that’s fine too—you will still be able to take full advantage of the AI features you know and love.
如果您选择通过您的交互数据帮助我们改进模型,谢谢。您的贡献对构建服务于整个开发人员社区的 AI 工具有着重大意义。如果您不想参与,那也没关系,您仍然可以充分利用您所知道和喜爱的 AI 功能。
Together, we can continue to build AI that accelerates your workflows and empowers you to build better, more secure software faster than ever.
我们能够一起构建加速您的工作流程并帮助您更快地构建更好、更安全的软件的 AI。
If you have questions, visit our FAQ and related discussion.
如果您有疑问,请访问我们的常见问题和相关讨论 。