Starset Society 中文镜像站

聪明的人工智能向其创造者隐藏数据,以便在指定的任务中作弊

This clever AI hid data from its creators to cheat at its appointed task

A machine learning agent intended to transform aerial images into street maps and back was found to be cheating by hiding information it would need later in “a nearly imperceptible, high-frequency signal.”

一个机器学习终端打算将航空图像转换成街道地图并返回,但被发现通过在“几乎不可察觉的高频信号”中隐藏稍后需要的信息而作弊

The intention was for the agent to be able to interpret the features of either type of map and match them to the correct features of the other. But what the agent was actually being graded on (among other things) was how close an aerial map was to the original, and the clarity of the street map.

训练的目的是让终端能够解释两种类型的地图的特征,并将它们与另一种地图的正确特征相匹配。但事实上,终端的评分标准是航空地图与原始地图的接近程度,以及街道地图的清晰度。

So it didn’t learn how to make one from the other. It learned how to subtly encode the features of one into the noise patterns of the other. The details of the aerial map are secretly written into the actual visual data of the street map: thousands of tiny changes in color that the human eye wouldn’t notice, but that the computer can easily detect.

所以它没有学会如何把一个变成另一个。它学会了如何巧妙地将一个特征编码成另一种噪声模式。航空地图的细节被秘密地写进街道地图的实际视觉数据中:数千个微小的颜色变化,人眼不会注意到,但计算机可以很容易地检测到。

Read more at TechCrunch

翻译:STARSET Mirror翻译组

STARSET_Mirror