返回资讯中心

微软前沿微调:AI代理学习企业独特知识,HR迎来智能助手

Josh Bersin阅读约 15 分钟

原始报道

来自 Josh Bersin

joshbersin.com

打开原文

先看摘要

这条新闻在讲什么

微软在Build 2026大会上展示了前沿微调技术,使AI代理(如Copilot)能够学习企业独特知识、政策和实践,并通过强化学习自我改进。该技术将Galileo智能嵌入Copilot,经HR团队测试,结果更实用、详细且可信。微软还推出7个新模型,优化商业场景,强调数据隐私,不与其他客户共享IP。

站内正文

详细内容

这里展示站内保存的新闻正文,支持划线和批注;需要核对来源时,可以打开原始报道。

The Enormous Potential For Microsoft Frontier Fine Tuning

by
joshbersin

 ·
                        
                            Published June 4, 2026
            · Updated June 4, 2026

One of the most interesting (and disruptive) aspects to enterprise AI is the simple fact that AI Agents and Superagents (read our HR 2030 architecture for more) are not “systems” or applications in the traditional sense. They are “your systems” that learn, grow, and “become your company.”

In other words, an AI Agent that does recruitment, or training, or employee service delivery gets smarter and smarter about your company’s uniqueness over time. In an enterprise setting this has massive implications.

Think about the “tacit knowledge” or historic experiences, policies, and practices you have. These things, many of which are implicit but not written down, are your competitive advantage. Many of these include policies, cultural behaviors, rules, risk management processes, or just basic “how we do business.”

What we’ve done with Microsoft is embedded our entire Galileo intelligence into MS Copilot, and the system “injested and retrained itself” on our intellectual property. The Microsoft HR team tested this implementation and the results were astoundingly more useful, detailed, and trusted (because a knowledgeable source is cited in all inquiries). So the Microsoft Copilot is now a world-class HR business partner and HR and management consultant.

Now Microsoft is productizing this, so you can “fine tune” your Copilot yourself. This means your IT team (or HR team) can put your policies, hiring guides, pay practices, onboarding, or whatever else you want and it is “institutionalized” and embedded into the system.

And there’s more. Unlike a RAG implementation, which does not “train” the system per se, the Frontier Tuning system can learn on its own. Microsoft calls this the “Reinforcement Learning Environment,” which enables you to update the agent from real-world feedback from users.

This was all demonstrated at Build 2026 this week in San Francisco (go to 1:45 to see it) and you can see how powerful it is. Not only can you embed Galileo and similar offerings, but you can embed your own company practices.

Satya Nadella discussed this in his CEO council keynote and I’m in full agreement: the value of an AI model is making it unique and customized to your company, not sharing it online with a million others. In the Enterprise AI space this means making it easy for IT (and HR) to tune, optimize, and personalize the system.

With MS Copilot’s new “harness” (listen to my podcast for definition), you can use Copilot to host OpenAI, Anthropic, Microsoft (below) and your own fine tuned models. I could see a situation where R&D teams have their own fine tuned model, for example, with internal confidential data they use every day.

Reinforced Learning Makes Agents Self-Improving

And there’s more. The Frontier Tuned model uses autonomous reinforced learning to train itself to get smarter over time. You can read more about it in the “Agent Lightning” overview. You, as a user or admin, can “turn on” this reinforcement learning agent to get feedback on the utility of all actions so your model trains itself over time. Just like how humans learn!

One example from Microsoft is the company’s internal agent for crisis management. While it worked well for many situations, the war in Ukraine and then war in Iraq introduced many new problems. (Employees with no internet, no phones, and families that needed to be relocated.) They are using the Reinforcement Learning feature to let this agent “update itself” on new policies needed.

(Here is a demo of Microsoft’s Fine Tuned Copilot for HR Onboarding.)

Now there are other ways to “train” the MS Copilot, including the Microsoft Graph Connector (which we also support) and others. These interfaces let the Copilot see and use all your Sharepoint, Powerpoint, Word, Outlook, and now Work IQ data. But this is not tightly integrated into the agent, and the “reinforcement learning” would not apply.

Microsoft Launches Its Own Models

I was also impressed with Microsoft’s announcements from Mustafa Suleyman to deliver 7 new models, optimized for specific use-cases in business.

The contract with OpenAI previously prevented Microsoft from developing its own cutting-edge models. Now it’s racing to build rivals to models like Anthropic’s Claude and OpenAI’s GPT. This gives Microsoft a clean, low cost set of AI models to sell, making Copilot even more valuable as an open harness.

Plus it’s cheaper. “We pay a lot of money to Anthropic — so our goal is to reduce and ultimately eliminate that cost,” Suleyman said.

These new models are focused on high levels of efficiency, clean and licensed without “stealing” internet content. For me as a business person, this is the type of model I would prefer to use.

He made the point that these models do not “share your IP with other customers” like the Anthropic and OpenAI businesses do. If you don’t “uncheck” Claude’s learning box, for example, you will find that everything you do in Claude is available to Anthropic to be sold to others. As an IP company CEO, I like the idea that maybe I could build a solution that doesn’t accidentally leak data to someone else.

![](https://substackcdn.com/image/fetch/$s_!CmuW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02b5c699-d254-4fe6-b6ba-7e59f7

RELATED

相关阅读

行业动态

领导者忽视员工困境正在制造“尊严债务”

BambooHR调查显示,81%的领导者认为员工生产力提升,但85%的员工表示压力巨大,这种认知差距被称为“尊严债务”。AI工具的普及加剧了这一问题:57%的领导者会解雇拒绝使用AI的员工,而54%的员工称AI干扰工作。此外,54%的领导者因成本或破坏性而不解决已知运营缺陷。员工渴望透明度(58%)、诚实和可见的领导力。尽管73%的员工认为生产力提高,但49%的领导者认为AI未带来实际成果。39%的企业用AI替代员工,57%的领导者仍难招到合格人才。50%的员工认为工作环境有毒,46%缺乏成长机会。

行业动态

数据揭示:当前招聘流程无法识别具备AI能力的新毕业生

全球人才评估公司SHL的数据显示,2026届毕业生是迄今为止最具AI就绪能力的劳动力群体,但许多企业的招聘流程并未针对识别他们而设计。SHL市场洞察与解决方案主管Matt Kirk基于超过100万份跨角色、跨地区的评估得出结论,认为企业因此错失了重要机遇。数据显示,在与AI就绪最相关的行为技能上,毕业生持续优于非毕业生同行。这一发现正在重塑领先雇主对早期职业招聘的策略。2025年雇主收到的申请量增加了26%,其中70%的候选人使用AI辅助申请,导致申请材料越来越精致但也越来越同质化。传统筛选方法无法应对这一环境:AI优化的简历难以揭示候选人真实能力,提前关闭职位申请以控制数量则可能筛掉最值得招聘的人才。Kirk建议,HR团队应评估预测绩效的行为,而非仅仅衡量表明努力的证书。

AI+HR

HR知识库问答系统与传统FAQ技术差异解析

本文深入剖析HR知识库问答系统与传统FAQ在技术底层上的核心差异。传统FAQ依赖静态页面和关键词匹配,维护成本高且无法理解复杂语义,而基于自然语言处理(NLP)、知识图谱和向量检索的智能问答系统能够理解员工意图,实现多轮对话和精准回答。文章从技术架构、知识管理、交互体验、扩展性等维度对比两者,并指出HR知识库系统可自动更新知识、支持上下文关联,显著降低HR重复性咨询负担,提升员工自助服务体验。对于正在考虑引入AI工具优化HR服务的组织,本文提供了清晰的技术选型参考。

讨论 0

登录 后可发表评论

  • 还没有讨论,欢迎第一个发言。

分享到