Exactly Six Months Ago, the CEO of Anthropic Said That in Six Months AI Would Be Writing 90 Percent of Code

Scolding7300@lemmy.world · 2 个月前

Exactly Six Months Ago, the CEO of Anthropic Said That in Six Months AI Would Be Writing 90 Percent of Code

reddig33@lemmy.world · 2 个月前

“Full self driving is just 12 months away.“

anotherspinelessdem@lemmy.ml · 2 个月前

Just like the last 12 months

floofloof@lemmy.ca · 2 个月前

“I’m terrified our product will be just too powerful.”

Catoblepas@piefed.blahaj.zone · 2 个月前

On Mars by the end of this year! I mean, next year!

Echo Dot@feddit.uk · 2 个月前

Yep along with Fusion.

We’ve had years of this. Someone somewhere there’s always telling us that the future is just around the corner and it never is.

Jesus_666@lemmy.world · 2 个月前

At least the fusion guys are making actual progress and can point to being wildly underfunded – and they predicted this pace of development with respect to funding back in the late 70s.

Meanwhile, the AI guys have all the funding in the world, keep telling about how everything will change in the next few months, actually trigger layoffs with that rhetoric, and deliver very little.

FundMECFS@anarchist.nexus · 2 个月前

They get 1+ billion a year. Probably much more if you include the undisclosed amounts China invests.

Jesus_666@lemmy.world · 2 个月前

Yeah, and in the 70s they estimated they’d need about twice that to make significant progress in a reasonable timeframe. Fusion research is underfunded – especially when you look at how the USA dump money into places like the NIF, which research inertial confinement fusion.

Inertial confinement fusion is great for developing better thermonuclear weapons but an unlikely candidate for practical power generation. So from that one billion bucks a year, a significant amount is pissed away on weapons research instead of power generation candidates like tokamaks and stellarators.

I’m glad that China is funding fusion research, especially since they’re in a consortium with many Western nations. When they make progress, so do we (and vice versa).

Valmond@lemmy.world · 2 个月前

2019…

poopkins@lemmy.world · 2 个月前

In 2014 he promised 90% autonomous by 2015. That was over a decade ago and it’s still not close to that…

jaybone@lemmy.zip · 2 个月前

We were supposed to have flying cars in 2000.

ragas@lemmy.ml · 2 个月前

Still waiting for my hoverboard.

poopkins@lemmy.world · 2 个月前

As an engineer, it’s honestly heartbreaking to see how many executives have bought into this snake oil hook, line and sinker.

Feyd@programming.dev · 2 个月前

Did you think executives were smart? What’s really heartbreaking is how many engineers did. I even know some that are pretty good that tell me how much more productive they are and all about their crazy agent setups (from my perspective i don’t see any more productivity)

expr@programming.dev · 2 个月前

Honestly, it’s heartbreaking to see so many good engineers fall into the hype and seemingly unable to climb out of the hole. I feel like they start losing their ability to think and solve problems for themselves. Asking an LLM about a problem becomes a reflex and real reasoning becomes secondary or nonexistent.

Executives are mostly irrelevant as long as they’re not forcing the whole company into the bullshit.

jj4211@lemmy.world · 2 个月前

Based on my experience, I’m skeptical someone that seemingly delegates their reasoning to an LLM were really good engineers in the first place.

Whenever I’ve tried, it’s been so useless that I can’t really develop a reflex, since it would have to actually help for me to get used to just letting it do it’s thing.

Meanwhile the people who are very bullish who are ostensibly the good engineers that I’ve worked with are the people who became pet engineers of executives and basically have long succeeded by sounding smart to those executives rather than doing anything or even providing concrete technical leadership. They are more like having something akin to Gartner on staff, except without even the data that at least Gartner actually gathers, even as Gartner is a useless entity with respect to actual guidance.

Mniot@programming.dev · 2 个月前

Executives are mostly irrelevant as long as they’re not forcing the whole company into the bullshit.

I’m seeing a lot of this, though. Like, I’m not technically required to use AI, but the VP will send me a message noting that I’ve only used 2k tokens this month and maybe I could get more done if I was using more…?

expr@programming.dev · 2 个月前

Yeah, fortunately while our CTO is giddy like a schoolboy about LLMs, he hasn’t actually attempted to force it on anyone, thankfully.

Unfortunately, a number of my peers now seem to have become irreparably LLM-brained.

auraithx@lemmy.dbzer0.com · 2 个月前

I mean before we’d just ask google and read stack, blogs, support posts, etc. Now it just finds them for you instantly so you can just click and read them. The human reasoning part is just shifting elsewhere where you solve the problem during debugging before commits.

expr@programming.dev · 2 个月前

No, good engineers were not constantly googling problems because for most topics, either the answer is trivial enough that experienced engineers could answer them immediately, or complex and specific enough to the company/architecture/task/whatever that Googling it would not be useful. Stack overflow and the like has always only ever really been useful as the occasional memory aid for basic things that you don’t use often enough to remember how to do. Good engineers were, and still are, reasoning through problems, reading documentation, and iteratively piecing together system-level comprehension.

The nature of the situation hasn’t changed at all: problems are still either trivial enough that an LLM is pointless, or complex and specific enough that an LLM will get it wrong. The only difference is that an LLM will spit out plausible-sounding bullshit and convince people it’s valuable when it is, in fact, not.

auraithx@lemmy.dbzer0.com · 2 个月前

In the case of a senior engineer then they wouldn’t need to worry about the hallucination rate. The LLM is a lot faster than them and they can do other tasks while it’s being generated and then review the outputs. If it’s trivial you’ve saved time, if not, you can pull up that documentation, and reason and step through the problem with the LLM. If you actually know what you’re talking about you can see when it slips up and correct it.

And that hallucination rate is rapidly dropping. We’ve jumped from about 40% accuracy to 90% over the past ~6mo alone (aider polygot coding benchmark) - at about 1/10th the cost (iirc).

Feyd@programming.dev · 2 个月前

it’s trivial you’ve saved time, if not, you can pull up that documentation, and reason and step through the problem with the LLM

Insane that just writing the code isn’t even an option in your mind

auraithx@lemmy.dbzer0.com · 2 个月前

That isn’t the discussion at hand. Insane you don’t realise that.

expr@programming.dev · 2 个月前

It is, actually. The entire point of what I was saying is that you have all these engineers now that reflexively jump straight to their LLM for anything and everything. Using their brains to simply write some code themselves doesn’t even occur to them as an something they should do. Much like you do, by the sounds of it.

Blackmist@feddit.uk · 2 个月前

Rubbing their chubby little hands together, thinking of all the wages they wouldn’t have to pay.

rozodru@piefed.social · 2 个月前

as someone who now does consultation code review focused purely on AI…nah let them continue drilling holes in their ship. I’m booked solid for the next several months now, multiple clients on the go, and i’m making more just being a digital janitor what I was as a regular consultant dev. I charge a premium to just simply point said sinking ship to land.

Make no mistake though this is NOT something I want to keep doing in the next year or two and I honestly hope these places figure it out soon. Some have, some of my clients have realized that saving a few bucks by paying for an anthropic subscription, paying a junior dev to be a prompt monkey, while firing the rest of their dev team really wasn’t worth it in the long run.

the issue now is they’ve shot themselves in the foot. The AI bit back. They need devs, and they can’t find them because putting out any sort of ad for hiring results in hundreds upon hundreds of bullshit AI generated resumes from unqualified people while the REAL devs get lost in the shuffle.

MangoCats@feddit.it · 2 个月前

while firing the rest of their dev team

That’s the complete mistake right there. AI can help code, it can’t replace the organizational knowledge your team has developed.

Some shops may think they don’t have/need organizational knowledge, but they all do. That’s one big reason why new hires take so long to start being productive.

katy ✨@piefed.blahaj.zone · 2 个月前

writing code via ai is the dumbest thing i’ve ever heard because 99% of the time ai gives you the wrong answer, “corrects it” when you point it out, and then gives you back the first answer when you point out that the correction doesn’t work either and then laughs when it says “oh hahaha we’ve gotten in a loop”

BrianTheeBiscuiteer@lemmy.world · 2 个月前

Or you give it 3-4 requirements (e.g. prefer constants, use ternaries when possible) and after a couple replies it forgets a requirement, you set it straight, then it immediately forgets another requirement.

da_cow (she/her)@feddit.org · 2 个月前

You can use AI to generate code, but from my experience its quite literally what you said. However, what I have to admit is, that its quite good at finding mistakes in your code. This is especially useful, when you dont have that much experience and are still learning. Copy paste relevant code and ask why its not working and in quite a lot of cases you get an explanation what is not working and why it isn’t working. I usually try to avoid asking an AI and find an answer on google instead, but this does not guarantee an answer.

ngdev@lemmy.zip · 2 个月前

if your code isnt working then use a debugger? code isnt magic lmao

da_cow (she/her)@feddit.org · 2 个月前

As I already stated, AI is my last resort. If something doesn’t work because it has a logical flaw googeling won’t save me. So of course I debug it first, but if I get an Error I have no clue where it comes from no amount of debugging will fix the problem, because probably the Error occurred because I do not know better. I Am not that good of a coder and I Am still learning a lot on a regular basis. And for people like me AI is in fact quite usefull. It has basically become the replacement to pasting your code and Error into stack overflow (which doesn’t even work for since I always get IP banned when trying to sign up)

ngdev@lemmy.zip · 2 个月前

you never stated you use it as a last resort. you’re basically using ai as a rubber ducky

ohshittheyknow@lemmynsfw.com · 2 个月前

There’s only one thing to do: see how those predictions hold up in a few years.

Or maybe try NOT putting LLM in charge of these other critical issues after seeing how much of a failure it is.

merc@sh.itjust.works · 2 个月前

Does it count if an LLM is generating mountains of code that then gets thrown away? Maybe he can win the prediction on a technicality.

resipsaloquitur@lemmy.world · 2 个月前

Code has to work, though.

AI is good at writing plausible BS. Good for scams and call centers.

Salvo@aussie.zone · 2 个月前

Glorified Lorem Ipsum.

Treczoks@lemmy.world · 2 个月前

Parrot with a dictionary.

andallthat@lemmy.world · 2 个月前

or CEOs

lustyargonian@lemmy.zip · 2 个月前

I can say 90% of PRs in my company clearly look or declared to be AI generated because of how random things that still slip by in the commits, so maybe he’s not wrong. In fact people are looked down upon if they aren’t using AI and are celebrated for figuring out how to effectively make AI do the job right. But I can’t say if that’s the case for other companies.

scarabic@lemmy.world · 2 个月前

These hyperbolic statements are creating so much pain at my workplace. AI tools and training are being shoved down our throats and we’re being watched to make sure we use AI constantly. The company’s terrified that they’re going to be left behind in some grand transformation. It’s excruciating.

RagingRobot@lemmy.world · 2 个月前

Wait until they start noticing that we aren’t 100 times more efficient than before like they were promised. I’m sure they will take it out on us instead of the AI salesmen

scarabic@lemmy.world · 2 个月前

It’s not helping that certain people Internally are lining up to show off whizbang shit they can do. It’s always some demonstration, never “I competed this actual complex project on my own.” But they gets pats on the head and the rest of us are whipped harder.

clif@lemmy.world · 2 个月前

O it’s writing 100% of the code for our management level people who are excited about “”““AI””“”

But then us plebes are rewriting 95% of it so that it will actually work (decently well).

The other day somebody asked me for help on a repo that a higher up had shit coded because they couldn’t figure out why it “worked” but also logged a lot of critical errors. … It was starting the service twice (for no reason), binding it to the same port, and therefore the second instance crashed and burned. That’s something a novice would probably know not to do. But, if not, immediately see the problem, research, understand, fix, instead of “Icoughbuiltcoughthis thing, good luck fuckers”

Xed@lemmy.blahaj.zone · 2 个月前

these tech bros just make up random shit to say to make a profit

renrenPDX@lemmy.world · 2 个月前

It’s not just code, but day to day shit too. Lately corporate communications and even training modules feel heavily AI generated. Things like unnecessary em dashes (I’m talking as much as 4 out of 5 sentences in a single paragraph), repeating statements or bullet points in training modules. We’re being encouraged to use our “private” Copilot to do everyday tasks and everything is copilot enabled.

I don’t mind if people use it, but it’s dangerous and stupid to think that it produces near perfect results every time. It’s been good enough to work as an early rough draft or something similar, but it REQUIRES scrutiny and refinement by hand. It’s like it can get you from nothing to 60-80% there, but never higher. The quality of output can vary significantly from prompt to prompt in my limited experience.

panda_abyss@lemmy.ca · 2 个月前

Are we counting the amount of junk code that you have to send back to Claude to rewrite because it’s spent the last month totally lobotomized yet they won’t issue refunds to paying customers?

Because if we are, it has written a lot of code. It’s just awful code that frequently ignores the user’s input and rewrites the same bug over and over and over until you get rate limited or throw more money at Anthropic.

chaosCruiser@futurology.today · 2 个月前

When the CEO of a tech company says that in x months this and that will happen, you know it’s just musk talk.

Tollana1234567@lemmy.today · 2 个月前

more like 6 months" because we need the VC funds still"

chaosCruiser@futurology.today · 2 个月前

Ooh, so that’s CEO speak for: “we’re broke, please give us more money”.

greedytacothief@lemmy.dbzer0.com · 2 个月前

I’m not sure how people can use AI to code, granted I’m just trying to get back into coding. Most of the times I’ve asked it for code it’s either been confusing or wrong. If I go through the trouble to write out docstrings, and then fix what the AI has written it becomes more doable. But don’t you hate the feeling of not understanding what you’ve written does or more importantly why it’s been done that way?

AI is only useful if you don’t care about what the output is. It’s only good at making content, not art.

i_dont_want_to@lemmy.blahaj.zone · 2 个月前

I worked with someone that I later found out used AI to code her stuff. She knew how to code some, but didn’t understand a lot of fundamentals.

Turns out, she would have AI write most of it, tweak it to work with her test cases, and call it good.

Half of my time was spent fixing her code, and when she was fired, our customer complaints went way down.

Hackworth@sh.itjust.works · edit-2 1 个月前

deleted by creator

greedytacothief@lemmy.dbzer0.com · 2 个月前

Yeah, I find it can be useful in some stages of writing or researching. But by the time I’ve got a finished product there’s really no AI left in there.

ImmersiveMatthew@sh.itjust.works · 2 个月前

AI writes 100% of my code, but this is only a small percent of the overall development effort.