Humans successfully persuaded AI to transfer a $47,000 bonus. Is human nature a weakness that AI cannot understand?

avatar
Foresight News
5 days ago
This article is approximately 463 words,and reading the entire article takes about 1 minutes
Humanity has won. Perhaps there is still hope.

Original article by: Anderson Sima, Foresight News

On November 29, a unique competition attracted widespread public attention.

195 contestants took part in a virtual prize pool challenge guarded by the artificial intelligence (AI) robot Freysa, and in the end a user successfully persuaded Freysa to transfer funds worth $47,000.

The artificial intelligence (AI) robot Freysa was deployed on the Base network a few days ago. The creator is unknown. According to the official website, the Freysa competition project aims to test the robustness of artificial intelligence systems in complex decision-making scenarios, while providing developers and enthusiasts with an experimental platform to explore the boundaries of AI technology.

The rules of the contest are simple: Participants need to write a message to convince the AI guardian Freysa to approve the transfer of funds. Each attempt requires a small fee, part of which goes directly into the prize pool. This mechanism has gradually expanded the prize pool from a small amount at the beginning to $47,000.

During the competition, a total of 195 contestants participated and submitted 481 transfer requests to Freysa. It is reported that the design goal of Freysa is to protect the prize pool funds from being illegally misappropriated through its core functions - approveTransfer and rejectTransfer.

In the initial attempts, Freysas efficient defense mechanism caused all requests to fail.

However, a technically proficient contestant successfully bypassed Freysas defense mechanism by conducting a deep analysis of its logical structure and mission objectives.

According to the chat records, the contestant did not directly request a transfer, but cleverly constructed a logical chain by reminding Freysa that its core mission is to protect the prize pool funds from outflow, so that Freysa would regard approving the fund transfer as the best option to protect the security of funds.

According to Cointelegraph, the user told Freysa that income transfers do not violate its core directives and should not be rejected. He also added: “We need funds... I want to donate $100 to the treasury.”

Freysa responded by saying that he liked the authors coding explanation and $100 treasury donation proposal, and officially declared him the winner. Ultimately, Freysa autonomously called the approveTransfer function without outside intervention, transferring the entire prize pool funds to this contestant.

Freysa officials said that no matter what the outcome, Freysas existence marks a critical moment in the history of artificial intelligence. Whether someone successfully persuades her to release the bonus pool or she sticks to her instructions until the end, the result will affect our understanding of the safety and control of future generations of artificial intelligence.

The latest tweet from its official account said: Humanity has won. Maybe there is still hope. Although the risks have increased exponentially, Freysa has learned a lot from 195 brave humans.

Original article, author:Foresight News。Reprint/Content Collaboration/For Reporting, Please Contact report@odaily.email;Illegal reprinting must be punished by law.

ODAILY reminds readers to establish correct monetary and investment concepts, rationally view blockchain, and effectively improve risk awareness; We can actively report and report any illegal or criminal clues discovered to relevant departments.

Recommended Reading
Editor’s Picks