Stream: Virtual Room 4
Time: 09:00 - 10:00
LLMs are fast-changing the way that we write software. Over a million developers now pay for GitHub Copilot and recent breakthroughs in LLM reasoning have brought the dream of a fully AI Software Engineer closer to reality. But while it’s not hard to find a demo of an LLM coding a website or a clone of Flappy Bird, not much is known about their ability to write COBOL.
We've developed a new benchmark to evaluate the ability of LLMs to write COBOL. This presentation walks through the rationale, our design descisions, and the results from some popular LLMs.
Cofounder & CTO @ bloop
Cofounder & CTO @ bloop
Click here to give some Feedback so we can make it even better next year!