Software

AI + ML

Amazon congratulates itself for AI code that mostly works

Web services souk celebrates 'leader' designation for Q Developer


Amazon Web Services on Tuesday took a moment to pat itself on the back for being thought of inside the box, specifically, the upper right-hand square that's part of Gartner's trademarked Magic Quadrant.

This particular set of boxes maps the IT consultancy's view of AI code assistants. AWS is understandably chuffed to land a spot in the "leaders" quadrangle for its Q Developer service, alongside GitHub Copilot, GitLab Duo, and Google Cloud's Gemini Code Assist.

The other three boxes are reserved for "visionaries" (less ability to execute compared to leaders), "challengers" (executing just fine, but vision impaired), and "niche players" (lagging in execution and vision).

As with the security industry, where companies have filed legal complaints after their software is labeled spyware or malware, IT firms placed in less desirable boxes have challenged the designation in court.

NetScout Systems, for example, filed a lawsuit against Gartner in 2014 over its placement in the "challenger" box. The case was dismissed in 2017, with Gartner emphasizing that its rankings are not for sale. Another firm, ZL Technologies, sued Gartner in 2009 over its "niche player" designation, only to lose in court and on appeal.

So it's perhaps understandable that AWS would crow about recognition from Gartner, as it has been doing on a regular basis for years.

"We believe this Leader placement reflects our rapid pace of innovation, which makes the whole software development lifecycle easier and increases developer productivity with enterprise-grade access controls and security," said Channy Yun, principal developer advocate for AWS.

However, "leader" doesn't necessarily refer to customer count for Q Developer. An Amazon spokesperson said the company does not disclose specific adoption numbers, which generally means they're less than the competition. Microsoft's GitHub, on the other hand, happily brags about Copilot usage. As of February, there were "over 1.3 million paid GitHub Copilot subscribers."

Amazon CEO Andy Jassy last month offered a different way to assess the success of Q Developer. He said Amazon had used the generative AI coding service to modernize its production Java systems by updating the code to Java 17.

"The benefits go beyond how much effort we've saved developers," Jassy said. "The upgrades have enhanced security and reduced infrastructure costs, providing an estimated $260 million in annualized efficiency gains."

Amazon claims that Q Developer's agents for code transformation helped Amazon migrate 30,000 production applications from Java 8 to Java 17, saving over 4,500 years of development work, in addition to the $260 million in performance improvements cited by Jassy.

Jassy did not mention how often AI code suggestions need correction or the costs associated with these fixes. Last year, researchers evaluated ChatGPT, GitHub Copilot, and Amazon CodeWhisperer (now Q Developer), and found that the AI helpers generated correct code 65.2 percent, 46.3 percent, and 31.1 percent of the time, respectively.

Asked about this, an Amazon spokesperson said: "Amazon Q Developer has the highest reported code acceptance rates in the industry for assistants that perform multi-line code suggestions – with BT Group reporting they accepted 37 percent of Q's code suggestions and National Australia Bank reporting a 50 percent acceptance rate."

A test suite for small AI models called CanAiCode doesn't mention Q Developer and ranks Anthropic's Claude 3 Opus at the top of the models evaluated.

Gartner, in its report on AI code assistants, projects that by 2028, 90 percent of enterprise software engineers will use the technology, up from less than 14 percent as of early 2024.

Nonetheless, academics argue that AI will not render software engineering obsolete. In a recent preprint paper, Carnegie Mellon computer science professors Eunsuk Kang and Mary Shaw conclude: "Generative AI is now eagerly inflating our aspirations, but its capability is not yet trustworthy and robust enough to be part of the stable core of [software engineering] methods. AI is already demonstrably useful under careful supervision, and we can expect its utility for routine programming tasks to improve quickly." ®

Send us news
8 Comments

AI-pushing Adobe says AI-shy office workers will love AI if it saves them time

knowledge workers, overwhelmed by knowledge tasks? We know what you need

Canadian artist wants Anthropic AI lawsuit corrected

Tim Boucher objects to the mischaracterization of his work in authors' copyright claim

If every PC is going to be an AI PC, they better be as good at all the things trad PCs can do

Microsoft's Copilot+ machines suck at one of computing's oldest use cases

AI firms propose 'personhood credentials' … to fight AI

It's going to take more than CAPTCHA to prove you're real

'Error' causes Alexa to endorse Kamala Harris, refuse to discuss Trump

Bot shouldn't have political opinions, says Amazon

Buying a PC for local AI? These are the specs that actually matter

If you guessed TOPS and FLOPS, that's only half right

The future of AI/ML depends on the reality of today – and it's not pretty

The return of Windows Recall is more than a bad flashback

China AI devs use cloud services to game US chip sanctions

Orgs are accessing restricted tech, raising concerns about more potential loopholes

Dell's all-in bet on AI pays off in latest earnings

The term was mentioned over 140 times during the earnings call

GPT apps fail to disclose data collection, study finds

Researchers say that implementing Actions omit privacy details and expose info

AGI is on clients' radar but far from reality, says Gartner

Controversial concept may not even be a useful goal in computing

Top companies ground Microsoft Copilot over data governance concerns

Securiti's Jack Berkowitz polled 20-plus CDOs, and half have hit pause