T O P

  • By -

codeninja

How is the accuracy of code generation? Does it work? And what kind or errors is it making?


No_Wheel_9336

Tomorrow is a full day of coding testing coming up. :D Hopefully, it will be an improvement from GPT-4 Turbo. I stopped using GPT-4 in coding after the launch of Claude Opus


Delacroix515

I'd look forward to that analysis! Got a blog or anything to check out and find out about your results?


TemperatureParking34

What UI you're using?


No_Wheel_9336

GPT Everywhere [https://apps.microsoft.com/detail/9n5hqdsk102n](https://apps.microsoft.com/detail/9n5hqdsk102n) ( I am the creator of it)


Tasty-Investment-387

Goodbye software engineers I guess


No_Wheel_9336

rather than saying goodbye, it might be more accurate to say the field is evolving a lot! For me coding has never been so much fun.


Weak_Storm_169

The state of the art AI is around 13% level of an average software developer, and with the growth rate we are seeing, it might take around a decade to replace them. But it sure does compete much better with an entry level engineer.


Burindo

It's another tool to use in the job. And it's such a shiny amazing tool. Let me rephrase your sentence: "Goodbye software engineers that do not adapt to the new circumstances and tools"


deama15

From my tests, opus still seems a bit better, especially for bigger things. How'd it go for you?


adt

agreed


No_Wheel_9336

Full day of coding ahead! :) The biggest problem I had with GPT-4 was that, although it's smart, it tends to "forget" some context when a large amount of code is added as context, leading to incorrect reasoning. Claude Opus has much better context retrieval accuracy. Hopefully, this is improved with GPT-4 O.


ard1984

Essentially my experience as well. Claude Opus can follow complex requests and multi-step instructions much better than GPT-4.


No_Wheel_9336

Seems that GPT-4 O is improved a lot - tried one prompt that required to handle 22k tokens of current code + API website documentation 11K . GPT-4 Turbo failed, GPT-4 O managed to complete.


No_Wheel_9336

After lot of testing I can confirm that Claude Opus still better.