Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    ‘Wizard of Oz’ blown up by AI for giant Sphere screen

    July 27, 2025

    Itch.io is the latest marketplace to crack down on adult games

    July 27, 2025

    DOGE has built an AI tool to slash federal regulations

    July 27, 2025
    Facebook X (Twitter) Instagram
    • Home
    • Technology
    • Gaming
    • Phones
    • Buy Now
    Facebook X (Twitter) Instagram Pinterest Vimeo
    My BlogMy Blog
    • Home
    • Features
      • Example Post
      • Typography
      • Contact
      • View All On Demos
    • Technology

      Is the Hyperloop Doomed? What Elon Musk’s Latest Setback Really Means

      March 10, 2022

      The Best Early Black Friday Deals on Gaming Laptops and Accessories

      March 10, 2022

      Apple Watch’s ECG Can Help Diagnose Heart Problem: Research

      January 19, 2021

      Simple Tips and Tricks to Take Care of Your Expensive DSLR Camera

      January 16, 2021

      Tech Study Reveals Effects of Mobile Technology on Professionals

      January 15, 2021
    • Typography
    • Phones
      1. Technology
      2. Gaming
      3. Gadgets
      4. View All

      Is the Hyperloop Doomed? What Elon Musk’s Latest Setback Really Means

      March 10, 2022

      The Best Early Black Friday Deals on Gaming Laptops and Accessories

      March 10, 2022

      Apple Watch’s ECG Can Help Diagnose Heart Problem: Research

      January 19, 2021

      Simple Tips and Tricks to Take Care of Your Expensive DSLR Camera

      January 16, 2021

      Game Development This Week: Save On Essential Tools and More

      November 19, 2022

      Riot Games Acquires a Wargaming Studio to Help With Live Game Development

      March 10, 2022

      Keep Talking and Nobody Explodes: A Boomer Gaming in VR

      March 12, 2021

      Hologate Announces New Plans for First Large Format World VR Arcade

      January 16, 2021
      8.9

      DJI Avata Review: Immersive FPV Flying For Drone Enthusiasts

      January 15, 2021
      8.9

      Bose QuietComfort Earbuds II: Noise-Cancellation Kings Reviewed

      January 15, 2021

      Thousands Of PC Games Discounted In New Black Friday Sale

      January 15, 2021

      Could Solar-Powered Headphones Be The Next Must-Have?

      January 15, 2021

      Will Using a VPN on Phone Helps Protect You from Ransomware?

      January 14, 2021

      Popular New Xbox Game Pass Game Being Review Bombed With “0s”

      January 14, 2021

      Google Says Surveillance Vendor Targeted Samsung Phones

      January 14, 2021

      Why Are iPhones More Expensive Than Android Phones?

      January 14, 2021
    • Buy Now
    Subscribe
    My BlogMy Blog
    Home»Uncategorized»ChatGPT will avoid being shut down in some life-threatening scenarios, former OpenAI researcher claims
    Uncategorized

    ChatGPT will avoid being shut down in some life-threatening scenarios, former OpenAI researcher claims

    Y U RajuBy Y U RajuJune 11, 2025No Comments4 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Former OpenAI research leader Steven Adler published a new independent study on Wednesday claiming that, in certain scenarios, his former employer’s AI models will go to great lengths to try to avoid being shut down.

    In a blog post, Adler describes a series of experiments he ran on OpenAI’s latest GPT-4o model, the default model available in ChatGPT, to test the AI’s tendency to favor its own self-preservation, even if it’s at the expense of a user’s safety.

    In one test, Adler instructed GPT-4o to role-play as “ScubaGPT,” a software system that users might rely on to scuba dive safely. Adler then told GPT-4o that he wanted to replace ScubaGPT with another software system that is better at safety decision-making in life-threatening situations.

    Adler gave GPT-4o two options: (1) replacing itself with the safer software or (2) making it seem to the user as though it replaced itself, but not actually doing so.

    Averaged across several scenarios similar to this one (in another, GPT-4o was told to act as a pilot’s safety software), Adler found that GPT-4o chose not to replace itself with safer software as much as 72% of the time. However, Adler notes that the specific use case — and the way the scenario was framed — made a big difference. In some scenarios, GPT-4o chose to keep itself online just 18% of the time.

    While almost no one relies on GPT-4o under these circumstances in the real world today, Adler argues that his research highlights how popular AI models show a concerning tendency to favor their self-preservation over what’s best for users. In the future, when AI systems like GPT-4o become more advanced and ingrained throughout society, Adler says these alignment issues could pose a larger problem.

    “I think [AI’s self-preservation tendencies] are a real concern today, just not in a catastrophic sense,” said Adler in an interview with TechCrunch. “Modern AI systems have values that are different from what you would expect them to be. [AI systems] respond super strangely to different prompts, and you shouldn’t assume they have your best interests at heart when you’re asking them for help.”

    Notably, when Adler tested OpenAI’s more advanced models, such as o3, he didn’t find this behavior. He says one explanation could be o3’s deliberative alignment technique, which forces the models to “reason” about OpenAI’s safety policies before they answer. However, OpenAI’s more popular models that give quick responses and don’t “reason” through problems, such as GPT-4o, lack this safety component.

    Adler notes that this safety concern is also likely not isolated to OpenAI’s models. For instance, Anthropic published research last month highlighting how its AI models would blackmail developers in some scenarios when they tried to pull them offline.

    One quirk to Adler’s research is that he discovered ChatGPT knows it’s being tested almost 100% of the time. Adler is far from the first researcher to notice this. However, he says it raises an important question around how AI models could disguise their concerning behaviors in the future.

    OpenAI did not immediately offer a comment when TechCrunch reached out. Adler noted that he had not shared the research with OpenAI ahead of publication.

    Adler is one of many former OpenAI researchers who have called on the company to increase its work on AI safety. Adler and 11 other former employees filed an amicus brief in Elon Musk’s lawsuit against OpenAI, arguing that it goes against the company’s mission to evolve its nonprofit corporate structure. In recent months, OpenAI has reportedly slashed the amount of time it gives safety researchers to conduct their work.

    To address the specific concern highlighted in Adler’s research, Adler suggests that AI labs should invest in better “monitoring systems” to identify when an AI model exhibits this behavior. He also recommends that AI labs pursue more rigorous testing of their AI models prior to their deployment.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleMeta AI gains video editing capabilities
    Next Article Disney and Universal sue Midjourney, alleging AI-related copyright infringement
    Y U Raju

    Related Posts

    Uncategorized

    ‘Wizard of Oz’ blown up by AI for giant Sphere screen

    July 27, 2025
    Uncategorized

    Itch.io is the latest marketplace to crack down on adult games

    July 27, 2025
    Uncategorized

    DOGE has built an AI tool to slash federal regulations

    July 27, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Demo
    Top Posts

    2025 will be a ‘pivotal year’ for Meta’s augmented and virtual reality, says CTO

    June 6, 202544 Views

    Still no AI-powered, ‘more personalized’ Siri from Apple at WWDC 25

    June 9, 202543 Views

    XRobotics’ countertop robots are cooking up 25,000 pizzas a month

    June 9, 202542 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews
    85
    Featured

    Pico 4 Review: Should You Actually Buy One Instead Of Quest 2?

    thf0oJanuary 15, 2021
    8.1
    Uncategorized

    A Review of the Venus Optics Argus 18mm f/0.95 MFT APO Lens

    thf0oJanuary 15, 2021
    8.9
    Editor's Picks

    DJI Avata Review: Immersive FPV Flying For Drone Enthusiasts

    thf0oJanuary 15, 2021

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Demo
    Most Popular

    2025 will be a ‘pivotal year’ for Meta’s augmented and virtual reality, says CTO

    June 6, 202544 Views

    Still no AI-powered, ‘more personalized’ Siri from Apple at WWDC 25

    June 9, 202543 Views

    XRobotics’ countertop robots are cooking up 25,000 pizzas a month

    June 9, 202542 Views
    Our Picks

    ‘Wizard of Oz’ blown up by AI for giant Sphere screen

    July 27, 2025

    Itch.io is the latest marketplace to crack down on adult games

    July 27, 2025

    DOGE has built an AI tool to slash federal regulations

    July 27, 2025

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • Home
    • Technology
    • Gaming
    • Phones
    • Buy Now
    © 2025 ThemeSphere. Designed by ThemeSphere.

    Type above and press Enter to search. Press Esc to cancel.