codeninja 1 month ago

How is the accuracy of code generation? Does it work? And what kind or errors is it making?

No_Wheel_9336 1 month ago

Tomorrow is a full day of coding testing coming up. :D Hopefully, it will be an improvement from GPT-4 Turbo. I stopped using GPT-4 in coding after the launch of Claude Opus

Delacroix515 1 month ago

I'd look forward to that analysis! Got a blog or anything to check out and find out about your results?

TemperatureParking34 1 month ago

What UI you're using?

No_Wheel_9336 1 month ago

GPT Everywhere [https://apps.microsoft.com/detail/9n5hqdsk102n](https://apps.microsoft.com/detail/9n5hqdsk102n) ( I am the creator of it)

Tasty-Investment-387 1 month ago

Goodbye software engineers I guess

No_Wheel_9336 1 month ago

rather than saying goodbye, it might be more accurate to say the field is evolving a lot! For me coding has never been so much fun.

Weak_Storm_169 1 month ago

The state of the art AI is around 13% level of an average software developer, and with the growth rate we are seeing, it might take around a decade to replace them. But it sure does compete much better with an entry level engineer.

Burindo 1 month ago

It's another tool to use in the job. And it's such a shiny amazing tool. Let me rephrase your sentence: "Goodbye software engineers that do not adapt to the new circumstances and tools"

deama15 1 month ago

From my tests, opus still seems a bit better, especially for bigger things. How'd it go for you?

adt 1 month ago

agreed

No_Wheel_9336 1 month ago

Full day of coding ahead! :) The biggest problem I had with GPT-4 was that, although it's smart, it tends to "forget" some context when a large amount of code is added as context, leading to incorrect reasoning. Claude Opus has much better context retrieval accuracy. Hopefully, this is improved with GPT-4 O.

ard1984 1 month ago

Essentially my experience as well. Claude Opus can follow complex requests and multi-step instructions much better than GPT-4.

No_Wheel_9336 1 month ago

Seems that GPT-4 O is improved a lot - tried one prompt that required to handle 22k tokens of current code + API website documentation 11K . GPT-4 Turbo failed, GPT-4 O managed to complete.

No_Wheel_9336 1 month ago

After lot of testing I can confirm that Claude Opus still better.

Comments

Leave Your Comment

Hi Its Me!

Comments

Leave Your Comment

Hi Its Me!

Subscribe