OpenAI commits to ‘superalignment’ research

2023-07-09

关注

Artificial intelligence lab OpenAI is launching a new “alignment” research division, designed to prepare for the rise of artificial superintelligence and ensure it doesn’t go rogue. This future type of AI is expected to have greater than human levels of intelligence including reasoning capabilities. Researchers are concerned that if it is misaligned to human values, it could cause serious harm.

OpenAI says it is going beyond the threat of AGI and looking to future superintelligences (Photo: Camilo Concha/Shutterstock)

Dubbed “superalignment”, OpenAI, which makes ChatGPT and a range of other AI tools, says there needs to be both scientific and technical breakthroughs to steer and control AI systems that could be considerably more intelligent than the humans that created it. To solve the problem OpenAI will dedicate 20% of its current compute power to running calculations and solving the alignment problem.

AI alignment: Looking beyond AGI

OpenAI co-founder Ilya Sutskever and head of alignment Jan Leike wrote a blog post on the concept of superalignment, suggesting that the power of a superintelligent AI could lead to the disempowerment of humanity or even human extinction. “Currently, we don’t have a solution for steering or controlling a potentially superintelligent AI, and preventing it from going rogue,” the pair wrote.

They have decided to look beyond artificial general intelligence (AGI), which is expected to have human levels of intelligence, and instead focus on what comes next. This is because they believe AGI is on the horizon and superintelligent AI is likely to emerge by the end of this decade, with the latter presenting a much greater threat to humanity.

Current AI alignment techniques, used on models like GPT-4 – the technology that underpins ChatGPT – involve reinforcement learning from human feedback. This relies on human ability to supervise the AI but that won’t be possible if the AI is smarter than humans and can outwit its overseers. “Other assumptions could also break down in the future, like favorable generalisation properties during deployment or our models’ inability to successfully detect and undermine supervision during training,” explained Sutsker and Leike.

This all means that the current techniques and technologies will not scale up to work with superintelligence and so new approaches are needed. “Our goal is to build a roughly human-level automated alignment researcher. We can then use vast amounts of compute to scale our efforts, and iteratively align superintelligence,” the pair declared.

Superintelligent AI could out-think humans

OpenAI has set out three steps to achieving the goal of creating a human-level automated alignment researcher that can be scaled up to keep an eye on any future superintelligence. This includes providing a training signal on tasks that are difficult for humans to evaluate – effectively using AI systems to evaluate other AI systems. They also plan to explore how the models being built by OpenAI generalise oversight tasks that it can’t supervise.

There are also moves to validate the alignment of systems, specifically automating the search for problematic behaviour externally and within systems. Finally the plan is to test the entire pipeline by deliberately training misaligned models, then running the new AI trainer over them to see if it can knock it back into shape, a process known as adversarial testing.

Content from our partners

Why revenue teams need to be empowered through data optimisation

The security challenges of digitalising the energy grid

A renewed demand for film rewards Kodak’s legacy

“We expect our research priorities will evolve substantially as we learn more about the problem and we’ll likely add entirely new research areas,” the pair explained, adding the plan is to share more of the roadmap as this evolution occurs.

View all newsletters Sign up to our newsletters Data, insights and analysis delivered to you By The Tech Monitor team

The main goal is to achieve the core technical challenges of superintelligence alignment – known as superalignment – in four years. This plays to the prediction that the first superintelligence AI will emerge within the next six to seven years. “There are many ideas that have shown promise in preliminary experiments,” according to Sutsker and Leike. “We have increasingly useful metrics for progress and we can use today’s models to study many of these problems empirically.”

AI safety is expected to become a major industry in its own right. Nations are also hoping to capitalise on the future need to align AI to human values. The UK has launched the Foundation Model AI Taskforce with a £100m budget to investigate AI safety issues and will host a global AI summit later this year. This is likely to focus on the more immediate risk from current AI models, as well as the likely emergence of artificial general intelligence in the next few years.

期刊文献

期刊订阅

免费订阅

传感器专家网邮件期刊为您提供业界最新最快的技术应用与市场资讯

OpenAI commits to ‘superalignment’ research

AI alignment: Looking beyond AGI

Superintelligent AI could out-think humans

Content from our partners

Why revenue teams need to be empowered through data optimisation

The security challenges of digitalising the energy grid

A renewed demand for film rewards Kodak’s legacy

Read more: Japan targets light touch AI regulation

相关产品

评论

热门资讯

techmonitor

期刊文献

ＭＥＭＳ微热板结构设计与仿真

基于霍尔脉宽的汽车天窗防夹标定系统设计

振动筒传感器自动增益谐振电路仿真设计和测试

基于ＡｇＮＷｓ＠丙烯酸酯弹性体的柔性应变传感器

基于ＣＮＴｓ／Ｆｅ３Ｏ４的可用于人体动作检测的摩擦纳米发电机

石墨烯在压阻传感器中的应用研究综述

期刊订阅

最新文章

免校准、长寿命，NMP气体泄漏报警器开启高效安全新时代

奔驰，要装国产激光雷达了！

1516亿元！中国智能传感器行业最新数据披露！（全面）

速腾聚创再融资10亿！投向人形机器人传感器研发！

超2.6亿颗传感器增量需求，王传福呼吁加大产能！比亚迪推全民智驾，这些传感器赛道起飞！

相关阅读

如何投资无人机行业:深入研究无人机ETF

本周《无人机黎明》上的G类公司!航空收费，航空等

美国国家航空航天局与“无人机响应者”合作开发紧急响应行动中的自动飞行系统

比较新冠疫情前后智能建筑的物联网部署

人工智能如何重塑研究?

克服智慧工厂的挑战

通过分计量获得有价值的见解

物联网移动应用开发人员所需的技能和应用程序

物联网五云

AI聊天机器人和心理健康

techmonitor

点击进入下一篇

OpenAI commits to ‘superalignment’ research

AI alignment: Looking beyond AGI

Superintelligent AI could out-think humans

Content from our partners

Why revenue teams need to be empowered through data optimisation

The security challenges of digitalising the energy grid

A renewed demand for film rewards Kodak’s legacy

Read more: Japan targets light touch AI regulation

相关产品

评论

热门资讯

techmonitor

期刊文献

ＭＥＭＳ微热板结构设计与仿真

基于霍尔脉宽的汽车天窗防夹标定系统设计

振动筒传感器自动增益谐振电路仿真设计和测试

基于ＡｇＮＷｓ＠丙烯酸酯弹性体的柔性应变传感器

基于ＣＮＴｓ／ Ｆｅ３ Ｏ４的可用于人体动作检测的摩擦纳米发电机

石墨烯在压阻传感器中的应用研究综述

期刊订阅

最新文章

免校准、长寿命，NMP气体泄漏报警器开启高效安全新时代

奔驰，要装国产激光雷达了！

1516亿元！中国智能传感器行业最新数据披露！（全面）

速腾聚创再融资10亿！投向人形机器人传感器研发！

超2.6亿颗传感器增量需求，王传福呼吁加大产能！比亚迪推全民智驾，这些传感器赛道起飞！

相关阅读

如何投资无人机行业:深入研究无人机ETF

本周《无人机黎明》上的G类公司!航空收费，航空等

美国国家航空航天局与“无人机响应者”合作开发紧急响应行动中的自动飞行系统

比较新冠疫情前后智能建筑的物联网部署

人工智能如何重塑研究?

克服智慧工厂的挑战

通过分计量获得有价值的见解

物联网移动应用开发人员所需的技能和应用程序

物联网五云

AI聊天机器人和心理健康

techmonitor

点击进入下一篇

基于ＣＮＴｓ／Ｆｅ３Ｏ４的可用于人体动作检测的摩擦纳米发电机