Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Chinese authorities are using a new tool to hack seized phones and extract data

    July 16, 2025

    Of course, Grok’s AI companions want to have sex and burn down schools

    July 15, 2025

    The Chainsmokers’ Mantis Ventures closes $100M third fund

    July 15, 2025
    Facebook X (Twitter) Instagram
    • Home
    • Technology
    • Gaming
    • Phones
    • Buy Now
    Facebook X (Twitter) Instagram Pinterest Vimeo
    My BlogMy Blog
    • Home
    • Features
      • Example Post
      • Typography
      • Contact
      • View All On Demos
    • Technology

      Is the Hyperloop Doomed? What Elon Musk’s Latest Setback Really Means

      March 10, 2022

      The Best Early Black Friday Deals on Gaming Laptops and Accessories

      March 10, 2022

      Apple Watch’s ECG Can Help Diagnose Heart Problem: Research

      January 19, 2021

      Simple Tips and Tricks to Take Care of Your Expensive DSLR Camera

      January 16, 2021

      Tech Study Reveals Effects of Mobile Technology on Professionals

      January 15, 2021
    • Typography
    • Phones
      1. Technology
      2. Gaming
      3. Gadgets
      4. View All

      Is the Hyperloop Doomed? What Elon Musk’s Latest Setback Really Means

      March 10, 2022

      The Best Early Black Friday Deals on Gaming Laptops and Accessories

      March 10, 2022

      Apple Watch’s ECG Can Help Diagnose Heart Problem: Research

      January 19, 2021

      Simple Tips and Tricks to Take Care of Your Expensive DSLR Camera

      January 16, 2021

      Game Development This Week: Save On Essential Tools and More

      November 19, 2022

      Riot Games Acquires a Wargaming Studio to Help With Live Game Development

      March 10, 2022

      Keep Talking and Nobody Explodes: A Boomer Gaming in VR

      March 12, 2021

      Hologate Announces New Plans for First Large Format World VR Arcade

      January 16, 2021
      8.9

      DJI Avata Review: Immersive FPV Flying For Drone Enthusiasts

      January 15, 2021
      8.9

      Bose QuietComfort Earbuds II: Noise-Cancellation Kings Reviewed

      January 15, 2021

      Thousands Of PC Games Discounted In New Black Friday Sale

      January 15, 2021

      Could Solar-Powered Headphones Be The Next Must-Have?

      January 15, 2021

      Will Using a VPN on Phone Helps Protect You from Ransomware?

      January 14, 2021

      Popular New Xbox Game Pass Game Being Review Bombed With “0s”

      January 14, 2021

      Google Says Surveillance Vendor Targeted Samsung Phones

      January 14, 2021

      Why Are iPhones More Expensive Than Android Phones?

      January 14, 2021
    • Buy Now
    Subscribe
    My BlogMy Blog
    Home»Uncategorized»AI coding tools are shifting to a surprising place: the terminal
    Uncategorized

    AI coding tools are shifting to a surprising place: the terminal

    Y U RajuBy Y U RajuJuly 15, 2025No Comments5 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    For years, code-editing tools like Cursor, Windsurf, and GitHub’s Copilot have been the standard for AI-powered software development. But as agentic AI grows more powerful and vibe-coding takes off, a subtle shift has changed how AI systems are interacting with software. Instead of working on code, they’re increasingly interacting directly with the shell of whatever system they’re installed in. It’s a significant change in how AI-powered software development happens – and despite the low profile, it could have significant implications for where the field goes from here.

    The terminal is best known as the black-and-white screen you remember from 90s hacker movies – a very old-school way of running programs and manipulating data. It’s not as visually impressive as contemporary code editors, but it’s an extremely powerful interface if you know how to use it. And while code-based agents can write and debug code, terminal tools are often needed to get software from written code to something that can actually be used.

    The clearest sign of the shift to the terminal has come from major labs. Since February, Anthropic, DeepMind and OpenAI have all released command-line coding tools (Claude Code, Gemini CLI, and CLI Codex respectively), and they’re already among the companies’ most popular products. That shift has been easy to miss, since they’re largely operating under the same branding as previous coding tools. But under the hood, there have been real changes in how agents interact with other computers, both online and offline. Some believe those changes are just getting started.

    “Our big bet is that there’s a future in which 95% of LLM-computer interaction is through a terminal-like interface,” says Alex Shaw, co-creator of the leading terminal-focused benchmark TerminalBench. 

    Terminal-based tools are also coming into their own just as prominent code-based tools are starting to look shaky. The AI code editor Windsurf has been torn apart by dueling acquisitions, with senior executives hired away by Google and the remaining company acquired by Cognition – leaving the consumer product’s long-term future uncertain.

    At the same time, new research suggests programmers may be overestimating productivity gains from conventional tools. A METR study testing out Cursor Pro, Windsurf’s main competitor, found that while developers estimated they could complete tasks 20-30 percent faster, the observed process was nearly 20 percent slower. In short, the code assistant was actually costing programmers time.

    That has left an opening for companies like Warp, which currently holds the top spot on TerminalBench. Warp bills itself as an “agentic development environment,” a middle ground between IDE programs and command-line tools like Claude Code. But Warp founder Zach Lloyd is still bullish on the terminal, seeing it as a way to tackle problems that would be out of scope for a code editor like Cursor. 

    “The terminal occupies a very low level in the developer stack, so it’s the most versatile place to be running agents,” Lloyd says.

    To understand how the new approach is different, it can be helpful to look at the benchmarks used to measure them. The code-based generation of tools was focused on solving GitHub issues, the basis of the SWE-Bench test. Each problem on SWE-Bench is an open issue from GitHub — essentially, a piece of code that doesn’t work. Models iterate on the code until they find something that works, solving the problem. Integrated products like Cursor have built more sophisticated approaches to the problem, but the GitHub/SWE-Bench model is still the core of how these tools approach the problem: starting with broken code and turning it into code that works.

    Terminal-based tools take a wider view, looking beyond the code to the whole environment a program is running in. That includes coding but also more DevOps-oriented tasks like configuring a Git server or troubleshooting why a script won’t run. In one TerminalBench problem, the instructions give a decompression program and a target text file, challenging the agent to reverse-engineer a matching compression algorithm. Another asks the agent to build the Linux kernel from source, failing to mention that the agent will have to download the source code itself. Solving the issues requires the kind of bull-headed problem-solving ability that programmers need.

    “What makes TerminalBench hard is not just the questions that we’re giving the agents,” says Shaw, “it’s the environments that we’re placing them in.”

    Crucially, this new approach means tackling a problem step-by-step – the same skill that makes agentic AI so powerful. But even state-of-the-art agentic models can’t handle all of those environments. Warp earned its high score on TerminalBench by solving just over half of the problems – a mark of how challenging the benchmark is, but also how much work still needs to be done to unlock the terminal’s full potential. 

    Still, Lloyd believes we’re already at a point where terminal-based tools can reliably handle much of a developer’s non-coding work – a value proposition that’s hard to ignore.

    “If you think of the daily work of setting up a new project, figuring out the dependencies and getting it runnable, Warp can pretty much do that autonomously,” says Lloyd. “And if it can’t do it, it will tell you why.”



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleLucid Motors will roll out hands-free highway driving this month
    Next Article The rights to Fyre Festival sold on eBay for 245K
    Y U Raju

    Related Posts

    Uncategorized

    Chinese authorities are using a new tool to hack seized phones and extract data

    July 16, 2025
    Uncategorized

    Of course, Grok’s AI companions want to have sex and burn down schools

    July 15, 2025
    Uncategorized

    The Chainsmokers’ Mantis Ventures closes $100M third fund

    July 15, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Demo
    Top Posts

    2025 will be a ‘pivotal year’ for Meta’s augmented and virtual reality, says CTO

    June 6, 202544 Views

    Still no AI-powered, ‘more personalized’ Siri from Apple at WWDC 25

    June 9, 202543 Views

    XRobotics’ countertop robots are cooking up 25,000 pizzas a month

    June 9, 202542 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews
    85
    Featured

    Pico 4 Review: Should You Actually Buy One Instead Of Quest 2?

    thf0oJanuary 15, 2021
    8.1
    Uncategorized

    A Review of the Venus Optics Argus 18mm f/0.95 MFT APO Lens

    thf0oJanuary 15, 2021
    8.9
    Editor's Picks

    DJI Avata Review: Immersive FPV Flying For Drone Enthusiasts

    thf0oJanuary 15, 2021

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Demo
    Most Popular

    2025 will be a ‘pivotal year’ for Meta’s augmented and virtual reality, says CTO

    June 6, 202544 Views

    Still no AI-powered, ‘more personalized’ Siri from Apple at WWDC 25

    June 9, 202543 Views

    XRobotics’ countertop robots are cooking up 25,000 pizzas a month

    June 9, 202542 Views
    Our Picks

    Chinese authorities are using a new tool to hack seized phones and extract data

    July 16, 2025

    Of course, Grok’s AI companions want to have sex and burn down schools

    July 15, 2025

    The Chainsmokers’ Mantis Ventures closes $100M third fund

    July 15, 2025

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • Home
    • Technology
    • Gaming
    • Phones
    • Buy Now
    © 2025 ThemeSphere. Designed by ThemeSphere.

    Type above and press Enter to search. Press Esc to cancel.