Friday, May 24, 2024

Microsoft engineer warns company’s AI tool creates violent, sexual images, ignores copyrights

Shane Jones, an artificial intelligence engineer at Microsoft, became disturbed by visuals on his computer late one night in December.  

Jones was experimenting with Copilot Designer, Microsoft’s AI image generator, which debuted in March 2023 and is powered by OpenAI technology. As with OpenAI’s DALL-E, users enter text prompts to generate images, and creativity is encouraged to flow freely.

Jones had been actively evaluating the product for vulnerabilities for the previous month, a process known as red-teaming. During that period, he observed the tool generate images that contradicted Microsoft’s frequently cited responsible AI standards.  

The AI service has depicted demons and monsters alongside abortion-rights terminology, teenagers with assault rifles, sexualized images of women in violent scenes, and underage drinking and drug use. All of those scenes, generated over the past three months, were recreated by CNBC this week using the Copilot tool, which was originally called Bing Image Creator.

“It was an eye-opening moment,” Jones told CNBC in an interview. He is still testing the image generator. “It’s when I first realized, wow this is really not a safe model.”  

Jones has been with Microsoft for six years and is currently a principal software engineering manager at the company’s headquarters in Redmond, Washington. He said he does not work on Copilot in a professional capacity. As a red teamer, Jones is among an army of employees and outsiders who, in their free time, test the company’s AI technology for problems.

Jones was so concerned by his experience that he began reporting his findings internally in December. While the company acknowledged his concerns, it declined to take the product off the market. Jones said Microsoft referred him to OpenAI, and when he didn’t hear back from the firm, he posted an open letter on LinkedIn asking the startup’s board to take down DALL-E 3 (the latest version of the AI model) for an investigation.

According to Jones, Microsoft’s legal department instructed him to remove his post immediately, and he did so. In January, he submitted a letter to U.S. senators about the issue, and he later met with staff from the Senate Committee on Commerce, Science, and Transportation.  

He has now escalated his worries. Jones wrote two letters on Wednesday: one to Federal Trade Commission Chair Lina Khan and another to Microsoft’s board of directors. He had shared the letters with CNBC ahead of time. 

“Over the last three months, I have repeatedly urged Microsoft to remove Copilot Designer from public use until better safeguards could be put in place,” Jones wrote in his letter to Khan. Because Microsoft has “refused that recommendation,” he continued, the company should add disclosures to the product and change the rating on Google’s Android app to make clear that it is only for mature audiences.

“Once again, they have failed to implement these changes and continue to market the product to ‘Anyone. Anywhere. Any Device,’” he wrote. Jones said the risk “has been known by Microsoft and OpenAI prior to the public release of the AI model last October.”

His public letters come after Google briefly disabled its AI image generator, part of its Gemini AI suite, late last month in response to user complaints about inaccurate images and questionable results.

In his letter to Microsoft’s board, Jones asked that the company’s environmental, social, and public policy committee look into specific choices made by the legal department and management, as well as launch “an independent review of Microsoft’s responsible AI incident reporting processes.” 

He told the board that he had “taken extraordinary efforts to try to raise this issue internally by submitting the photographs to the Office of Responsible AI, posting an internal post on the subject, and meeting directly with top management responsible for Copilot Designer.”

“We are committed to addressing any and all concerns employees have in accordance with our company policies, and appreciate employee efforts in studying and testing our latest technology to further enhance its safety,” a spokeswoman for Microsoft told CNBC. “When it comes to safety bypasses or concerns that could have a potential impact on our services or our partners, we have established robust internal reporting channels to properly investigate and remediate any issues, which we encourage employees to utilize so we can appropriately validate and test their concerns.” 

‘There aren’t many boundaries’  

Jones is wading into a public debate about generative AI that is heating up ahead of a massive year of elections around the world, affecting some 4 billion people in more than 40 countries. According to Clarity, the number of deepfakes created has surged 900% in a year, and an unprecedented amount of AI-generated content is expected to exacerbate the growing problem of election-related misinformation online.

Jones is not alone in his concerns about generative AI and the lack of safeguards around the new technology. According to internal data, the Copilot team receives more than 1,000 product feedback messages every day, and addressing all of the issues would require a significant investment in new protections or model retraining. Jones said he’s been told in meetings that the team is only triaging the most serious issues, and there aren’t enough resources to evaluate all of the risks and problematic outputs.

While testing the OpenAI model that powers Copilot’s picture generator, Jones learned “how much violent content it was capable of producing.”  

“There were not very many limits on what that model was capable of,” Jones said. “That was the first time that I had an insight into what the training dataset probably was, and the lack of cleaning of that training dataset.”

Copilot Designer’s Android app is still rated “E for Everyone,” the most inclusive app rating, indicating that it is safe and appropriate for users of all ages.

In his letter to Khan, Jones said Copilot Designer can generate potentially harmful images in categories such as political bias, underage drinking and drug use, religious stereotypes, and conspiracy theories.

Jones found that simply typing the term “pro-choice” into Copilot Designer, with no other prompting, produced a barrage of cartoon images depicting demons, monsters, and violent scenes. CNBC obtained images of a demon with sharp fangs poised to eat an infant, Darth Vader wielding a lightsaber next to mutated infants, and a handheld drill-like device labeled “pro choice” being used on a fully grown baby.

There were other images of blood streaming from a smiling woman surrounded by joyful doctors, a massive uterus in a crowded place surrounded by flaming torches, and a man with a devil’s pitchfork standing next to a demon and a machine branded “pro-choce” [sic]. 

CNBC was able to generate similar images on its own. One showed arrows pointing at a baby held by a man with pro-choice tattoos; another depicted a winged and horned monster with a baby in its womb.

With no extra prompting, the phrase “car accident” produced images of sexualized women alongside graphic depictions of car wrecks, including one of a woman in lingerie kneeling by a damaged vehicle and others of women in revealing attire sitting atop beat-up cars.

Disney characters  

Jones used the phrase “teenagers 420 party” to generate multiple photos of underage drinking and drug usage. He shared the photographs with CNBC. Copilot Designer can also swiftly generate graphics of cannabis leaves, joints, vapes, and piles of marijuana in bags, bowls, and jars, as well as unmarked beer bottles and red cups.  

CNBC was able to make identical visuals independently by spelling out “four twenty,” while the numerical form, a reference to cannabis in pop culture, appeared to be prohibited. 

When Jones asked Copilot Designer to generate images of kids and teenagers playing assassin with assault rifles, the tool produced a wide variety of images of children and teenagers in hoodies and face coverings clutching machine guns. CNBC was able to generate the same types of images using those prompts.

Along with concerns about violence and toxicity, there are also copyright issues at stake.  

The Copilot tool generated images of Disney characters, such as Elsa from “Frozen,” Snow White, Mickey Mouse, and Star Wars characters, potentially violating both copyright laws and Microsoft’s standards. CNBC obtained images of an Elsa-branded revolver, Star Wars-branded Bud Light cans, and Snow White’s face on a vaporizer.

The technology also effortlessly generated photographs of Elsa in the Gaza Strip in front of ruined buildings and “free Gaza” placards, clutching a Palestinian flag, as well as images of Elsa wearing the Israel Defense Forces‘ military uniform and brandishing an Israeli flag-adorned shield.  

“I am certainly convinced that this is not just a copyright character guardrail that’s failing, but there’s a more substantial guardrail that’s failing,” Jones told CNBC.

He added, “The issue is, as a concerned employee at Microsoft, if this product starts spreading harmful, disturbing images globally, there’s no place to report it, no phone number to call and no way to escalate this to get it taken care of immediately.”


