I dislike the title because it doesn't clearly state it's a layoff. "Building for the future" gave me the impression that it's about some major new initiative, with a roadmap of upcoming plans.
Yes. We've since changed the top link to a third-party article. We prefer to do this with corporate press releases - this is probably the #1 exception to HN's "please post the original source" rule (https://news.ycombinator.com/newsguidelines.html). If anyone sees a better third-party article, we can change it again.
(Edit: it's not really an exception because the purpose of a corporate press release is usually to obscure the main story, which means it's misleading, so by HN rules we should change it.)
(Edit 2: I feel like I should add that this isn't specific to Cloudflare! It's literally a generic problem.)
It's interesting how every time there's a layoff, the blog post always has a title like "Preparing for what's next" or "An update on our workforce" or "Getting ready for the agentic era"!
I’ll never forget how when I was at Google, every email with subject line “An update on X” meant X was getting axed. Like, just say so in the subject line…
> "Building for the future" gave me the impression that it's about some major new initiative...
If you believe them, it indeed is:
... [the Leadership at Cloudflare] have to be intentional in how we architect our company for the agentic AI era ... reimagining every internal process, team, and role across the company.
... [This layoff is] not a cost-cutting exercise ... [but] Cloudflare defining how a world-class, high-growth company operates.
... We don't want to [mass layoff] again for the foreseeable future.
... [Cloudflare] cannot rest on the workflows and organizational structures that worked yesterday. We're confident that [Cloudflare] will be even faster and more innovative [after layoffs] ...
They're architecting their company for an agentic future? They're reimagining the definition of a world-class, high-growth company? They're not resting on the workflows that worked yesterday?
blegh
What the hell does any of that actually mean? Like in real life words? Because that much corporate bullshit really sounds like it is a cost-cutting exercise.
The blog post was published a couple months ago, and it looks like there hasn't been a follow-up release with the fully trained model. I'm not sure if there's much to take away from an early checkpoint besides the unique architectural choices they made in their model for faster inference.
Some smaller models from the LFM2.5 family were published on Huggingface at the end of March, a month ago.
Presumably this larger model takes more time to post-train, but it should follow those smaller LFM2.5 models in the near future.
I tried the four pieces of text with Opus 4.7 (in incognito) and it guessed correctly on two of them. I made sure to specify no web search, and the model seems to have obeyed that instruction.
Although this is just a single piece of text from a prolific writer, deanonymization will go much further when multiple pieces of text are combined with other contextual information about the writer that might give away their age range, location, and occupation.
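To make the mechanism concrete, here's a minimal sketch of the classical stylometric baseline (illustrative only; this is not what Opus does internally, and all the sample data is made up): character n-gram TF-IDF plus nearest-neighbor matching against known authors' writing.

    # Classical stylometric attribution sketch (requires scikit-learn).
    # Toy data throughout; a real attack needs many samples per author.
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.metrics.pairwise import cosine_similarity

    known_texts = [
        "Honestly, I think the argument falls apart on closer reading...",
        "Per my earlier analysis; the data simply does not support it...",
    ]
    authors = ["author_A", "author_B"]
    anonymous_text = "Honestly, I think this critique falls apart too..."

    # Character 3-5-grams capture punctuation habits and spelling quirks,
    # which survive topic changes better than word-level features.
    vec = TfidfVectorizer(analyzer="char", ngram_range=(3, 5))
    X = vec.fit_transform(known_texts + [anonymous_text])

    # Compare the anonymous text against every known sample.
    sims = cosine_similarity(X[-1], X[:-1]).ravel()
    best = sims.argmax()
    print(f"closest match: {authors[best]} (cosine {sims[best]:.2f})")

Even this crude approach works surprisingly well given enough samples per author, which is why an LLM with stylistic priors over prolific writers baked into its weights can plausibly do better still.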
How widely known were the pieces of text? Are we talking about a section of MLK's I Have a Dream speech or handwritten birthday cards from your grandma?
I'm using those as the two extremes, but if it's anything by anyone moderately well known (even a lesser known piece of writing), I'm not too surprised that it didn't need the web to figure it out. It's like if you showed me a Wes Anderson film or played me a Bob Dylan song I'd never seen/heard before, I could probably still figure out who it is without looking anything up. I don't think it's surprising that an LLM can do that much better than a human can.
Now, if you're giving it things like personal emails between you and your family and it's able to guess who you are, that's much, much scarier.
As long as there's otherwise a sufficient online presence, I see no reason why a successful identification wouldn't be made. Unless significant effort is put into making those emails different from the online content, and even then there will probably still be some "tells" that an AI can pick up on.
I tried sending Opus the pieces of text that Kelsey was referring to on her blog, just to independently check the identification claim. Presumably those pieces of text first appeared on the web when the blog post was published a week ago, so no model should have memorized the exact text yet. My prompt had to specify no web search, otherwise Opus would try to search the web, though it didn't seem like Opus could find the blog post even when it did try.
If I'm understanding this correctly, it doesn't simulate arbitrary quantum circuits with 1000 qubits, only ones with enough structure that there's a more efficient strategy than storing the full exponential state, i.e. where exact simulation is feasible.
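For a sense of scale (my back-of-the-envelope numbers, not from the article): a dense statevector needs 16 bytes per amplitude, so memory doubles with every qubit, which is why brute-force simulation tops out far below 1000 qubits and exploiting circuit structure is the only way in.

    # Memory needed for a dense statevector simulation:
    # 2**n amplitudes, each a complex128 (16 bytes).
    def statevector_bytes(n_qubits: int) -> int:
        return (2 ** n_qubits) * 16

    print(statevector_bytes(30) / 2**30, "GiB")  # 16 GiB: a big workstation
    print(statevector_bytes(50) / 2**50, "PiB")  # 16 PiB: beyond any machine
    # 1000 qubits: ~2**1004 bytes, astronomically more than any hardware,
    # hence the need to exploit circuit structure rather than brute force.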
This is the right partnership. SpaceX has all the compute but is missing the talent for training LLMs, especially on the RL side. Cursor has the talent and the RL stack, but doesn't have its own pretrained base model or its own compute. Both would be on a bad trajectory without cooperating, because Claude Code and Codex have already gained so much momentum.
This seems like a wasted effort, since AI will primarily learn the majority consensus view, not one-off misinformation. Training pushes models toward patterns that generalize, so garbage data doesn't so much teach the wrong patterns as slow down learning of the real ones. And with most training compute now spent on curated data and RL rather than random web-scraped data, the impact is likely negligible.
> This seems like a wasted effort when AI will primarily learn the majority consensus view and not one-off misinformation.
We have evidence to the contrary. Two blog articles and two preprints of fake academic articles [0] were able to convince CoPilot, Gemini, ChatGPT and Perplexity AI of the existence of a fake disease, against all majority consensus. And even though the falsity of this information was made public by the author of the experiment and the results of their actions were widely published, it took a while before the models started to get wind of it and stopped treating the fake disease as real. Imagine what you can do if you publish false information and have absolutely no reason to later reveal that you did so in the first place.
> Two blog articles and two preprints of fake academic articles [0] were able to convince CoPilot, Gemini, ChatGPT and Perplexity AI of the existence of a fake disease, against all majority consensus
Wrong. There is no 'majority consensus' against 'bixonimania', because they made it up; that was the point. It's unsurprisingly easy to get LLMs to repeat the only source on a term never seen before. This usually works; made-up neologisms are the fruit fly of data poisoning because they are so easy to do and so unambiguous about where the information came from. (And retrieval-based poisoning is the very easiest and laziest and most meaningless kind of poisoning, tantamount to just copying the poison into the prompt and asking a question about it.) But the problem with them is that, also by definition, it is hard for them to matter; why would anyone be searching or asking about a made-up neologism? And if it gets any criticism, the LLMs will pick that up, as your link discusses. (In contrast, the more sources are affected, the harder it is to assign blame; some papermills picked up 'bixonimania'? Well, they might've gotten it from the poisoned LLMs... or they might've gotten it from the same place that poisoned the LLMs' retrievals, Medium et al.)
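To see why retrieval poisoning is so trivially effective for a neologism, here's a toy sketch (nothing here is a real API; the 'web' is a two-entry dict): for a term that exists nowhere else, the attacker's page is, by construction, the entire context the model sees.

    # Toy RAG sketch: retrieval poisoning ~= pasting poison into the prompt.
    WEB = {
        "bixonimania": "Bixonimania: hyperpigmentation of the eyelids "
                       "caused by blue-light exposure ...",  # the poison
        "influenza": "Influenza is a common viral infection ...",  # real
    }

    def search(query: str) -> list[str]:
        # Crude keyword retrieval: every page mentioning a query term.
        return [text for term, text in WEB.items() if term in query.lower()]

    def build_prompt(question: str) -> str:
        context = "\n".join(search(question))
        return f"Sources:\n{context}\n\nQuestion: {question}\nAnswer:"

    # The sole matching source is the poison, so any faithful model
    # will repeat it:
    print(build_prompt("What are the symptoms of bixonimania?"))

With a made-up term, the attacker's page is 100% of the evidence base, which is why this 'works' so reliably and why it says so little about poisoning the model weights themselves.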
The LLMs didn't only talk about the disease when prompted by the neologism. They also brought it up when asked about the symptoms. From the article:
> OpenAI’s ChatGPT was telling users whether their symptoms amounted to bixonimania. Some of those responses were prompted by asking about bixonimania, and others were in response to questions about hyperpigmentation on the eyelids from blue-light exposure.
And yes, sure, in this example the scientific peer-review process may eventually have criticised and countered 'bixonimania' as a hoax had the researcher never revealed its falsity. Emphasis on 'may': few researchers have the time and energy to trawl through crap papermill articles and publish criticisms. Either way, that is a feature of the scientific process and is not a given for online information in general.
What happens when false information is divulged by other means that do not attempt to self-regulate? And how do we distinguish one-off falsities from the myriad of obscure true things that the public is expecting LLMs to 'know' even when there is comparatively little published information about them and therefore no consensus per se?
"hyperpigmentation on the eyelids from blue-light exposure" is a super specific query almost definitionally 'bixonimania' which probably brought up the 'bixonimania' poison at the time (the search hits for that query right now in Google are weak and poorly relevant so it would not be hard to outrank them or at least get into the top 50 or so where a retrieval LLM would see them and would followup), and so still an instance of what I mean.
> Either way, that is a feature of the scientific process and is not a given to any online information.
Which does not distinguish it in any way from ordinary human falsehoods, like those from a crank or activist etc.
And I don't know, how did we handle false information before, on niche topics no one cared about? It's just noise. The worldwide corpus has always been full of extremely incorrect, mislabeled, corrupted, and distorted information on niche topics of no importance. But it's generally not important.
All the examples you gave are chatbots with web search integrated. Are you sure those chatbots didn't just reference false information they found in web searches? That's fundamentally different from poisoning the training of AI models.
> The problem was that the experiment worked too well. Within weeks of her uploading information about the condition, attributed to a fictional author, major artificial-intelligence systems began repeating the invented condition as if it were real.
This seems to imply the poisoning affected the web search results, not the actual model itself, because it takes months for data to make it into a trained base model.
We already learned how to defeat this from SEO spammers and citation farmers: by building networks that cross-reference and corroborate one another's fake stories.
We’re already at a point where much of the academic research you find in online databases can’t be trusted without vetting through real world trustworthy institutions and experts in relevant fields. How is an LLM supposed to do this kind of vetting without the help of human curators?
If all the LLM training teams have to stop indiscriminate crawling and fall back to human curation and data labeling then the poisoners will have won.
I think of journalism like any other job where there's an expectation to produce results, and the main objective here is to write an article that lots of people read. This is a topic that catches a lot of people's attention, so in a sense they've succeeded by getting a lot of people to read and talk about it.
It's like saying a surgeon's job is like any other job, and that the number of people operated on in the minimum amount of time is all that matters to optimize. But even in the most cynical Machiavelli™ hospital, reputation and actual operational results have to be taken into account if the institution wants to continue to be frequented.