Netzsphaere

Conversation

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

LLMs could be a bad blow to anonymously publishing software.
If you want to keep a project separate from your day job / real identity, you need a second subscription. $20/mo is not that big a deal, but $200 (Claude Max) sure is. If the hater are to be believed, we might see a hike in pricing as well.

1

0

1

lain

Reply to @WandererUber@poa.st

@WandererUber the haters are %100 wrong on this one. deepseek/glm/kimi inference is already profitable at low prices

1

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @lain@lain.com

as they often are.

>profitable
for whom?

1

0

0

lain

Reply to @WandererUber@poa.st

@WandererUber for people who have inference hardware

2

0

1

lain

Reply to @lain@lain.com

@WandererUber these are chinese so not jews, please relax

2

0

0

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @lain@lain.com

that's not the issue. If the model is not profitable for the lab making it, they must raise prices (and stop/delay weight releases)

1

0

0

lain

Reply to @WandererUber@poa.st

@WandererUber these are all open models, anyone can run them. if a third party can run them at profit right now, they'll be able to do that in the future. glm and deepseek and kimi are worse than claude and codex, but not so much worse that people wouldn't swithc to them if claude would cost $1000 a month

2

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @lain@lain.com

I'm not even agreeing. This was just an additional thing that could put the kibosh on it

0

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @lain@lain.com

>if a third party can run them at profit right now, they'll be able to do that in the future.
I don't like that I have to spell everything out WHILE playing devil's advocate. This does not follow at all. They have training costs to recoup and if they don't keep pace they will fall behind. If they fall behind, Anthrobbicc would be able raise prices to e.g. $40 for pro, maybe $100

barrier to entry for writing code and not publishing it under your own name was virtually zero (compared to having a different identity anyway of course) and now is not.

1

0

0

lain

Reply to @WandererUber@poa.st

@WandererUber but the open models are trained already, there's no more cost. i'm not talking about anthropic and openai and who ever trains. the open models are profitable right now. if the trainers go bankrupt, well, that's sad, but inference still stays profitable.

1

0

0

Reply to @lain@lain.com

@lain @WandererUber token providers are straight up printing money with margins above 10x too, can't wait for capacity to improve enough to where it's more or less 1:1 to electricity cost like bitcoin

0

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @lain@lain.com

We had this discussion before and you are making an argument that I
-already agree with
-is only tangentially related to what I am saying here

>People can just stay on GLM5.1
they can also hand-code. If SOTA devving costs money and you have to pay again if you want it under a second identity, that's a detriment.
If GLM 5.1 is not profitable for the model trainer, there will not be GLM N.1 in future. Open Weights models *could* lag behind which would make SOTA be OAI / DarioCorp -dependent. I believe quite a few people would be deterred and/or chance it under their real account and catch a C&D from Nintendo for it

1

0

0

lain

Reply to @WandererUber@poa.st

@WandererUber yeah, that's true. if current level training turns out to be not profitable, we won't get better models soon. but the current ones still are profitable. so i'm not sure what we are disagreeing on

2

0

0

meeper

meeper

Reply to @lain@lain.com

@lain @WandererUber Diminishing returns that way would honestly be for the best, a lot of hate is due to aggressive hype fear fomo mongering and having it turn into a normal industry would make folk relax. and treat it as a normal technology

1

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @lain@lain.com

idk I just don't get why you are making the argument that GLM 5.1 inference is profitable for providers at all here.
The issue is something else. If you are seriously implying that GLM 5.1 is good enough that people would use it over $100 2028 Claude, then I guess we disagree about that.
It's as if I said "training models could be an issue to do anonymously if Nvidia puts phone # verification in their GPUs and you have to buy a second one" and you replied with "a 2023 chromebook will be cheap forever"
It's besides the point.

0

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @meeper

Edited 2 months ago

I genuinely think it has too high an "intelligence-to-get-it floor", for lack of a better term.
Sometimes I'm not even sure if the LABS get it (Anthropic publishing a post about a "mind reader" for Claude). A lot of previously smart-enough people are falling for the convincingness of the output and are incapable of checking for *correctness*

all that to say, it could be a long while until it's "treated as a normal technology"

1

0

1

meeper

meeper

Reply to @WandererUber@poa.st

@WandererUber @lain

probably coming off as a bit of the anti llm for the sake of anti llm side but-

idk how that happens tbh ik web ui models are kinda shit especially without harnesses (agentic stuff inmean) and really haven't ever vibe coded.

but I've used it as a reseaech assistant (with all the deep research bells and whistles google afforded me) and while certainly helpful (still doesn't replace normal searching for me, probably as I prefer searching first) l. Its obviously not a smart thing and makes stupid as hell errors or honestly fails to get the point (which is kinda obvious since they are fundamentally word generators)

One example is how It starts phrasing everything with basis on a fact which should have been treated irrelevant.

I tried to use it to write a slop paper and report (uni obligation) it was so bad that I basically didnit manually. Ig it is me actually having standards but it felr quite bad and focised on entirelynthe wrongnthings and had such terrible flow that babying itnto get something worthwhile was harder than actually writing it myself

code (surprisingly due to the NLP roots of the architecture) is probably a best case scenario in hindsight which is why they actually end up being surprisingly useful for thay

2

0

1

ⰎⰅ ⰀⰍⰨⰒⰎⰫ

genmaicha@stereophonic.space

Reply to @lain@lain.com

@lain @WandererUber but are they overseas chinese?

0

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @meeper

>Ig it is me actually having standards
yes I think so. I have largely the same experience as you do, although I did like research and building basic knowledge with Grok (not so much post-nerf)

LLM Arena found out people largely rate the incorrect emoji-laden,bullet-pointed slop torrent higher than the correct concise answer.
This is what I was talking about. A lot of decision makers simply do not have the intelligence to understand what is going on. It's like when people can't tell a pic is AI, but for text and they can't tell it's incorrect.

1

0

5

meeper

meeper

Reply to @WandererUber@poa.st

@WandererUber @lain ig we forget the fact that actual human 'professionals' are the ones writing linkedin posts and people like these are ones providing ratings and scores

0

0

0

lain

Reply to @meeper

@meeper @WandererUber i know you're engaging with it, but somehow you're not getting the results. i don't know why, but it can do a lot more than you found out.

2

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @lain@lain.com

I am once again asking for the video of you using it

0

0

2

meeper

meeper

Reply to @lain@lain.com

@lain @WandererUber probably the parricukar tooic for the slop research paper.

for example it wqs surprisingly good at hindi poetry (which is actually quite complex to get a good meter in but soinds really good) and analyzed stuff based on the rules I didnt even know at the time.

and recent models (late 2025) suddently became useful for my etymology interest which would otherwise require me to dive into obscure books (which I do want to I digress)

1

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @meeper

>based on the rules I didnt even know at the time.
it always seems better if you have no clue
Gell-Mann amnesia but for AI

2

0

1

lain

Reply to @WandererUber@poa.st

@WandererUber @meeper learning something is not the same as gell-mann amnesia

1

0

1

meeper

meeper

Reply to @WandererUber@poa.st

@WandererUber @lain Nah I mean like I had it evaluate something on a whim as I wanted to see what the statistical text machine would say about my work which I painstakingly put into a meter (I'm fammiliar with how it should sound since as a sikh I've been raised on that kind of poetey)

Its through it thay Iearned that it actually had a conplex set of rules (I did verify it according to wikipedia) and it was mostly correct, some word choice changes tn suggested felt off but the actual analysis of the meter was quite correct

1

0

1

Wanderer atop the sea of clouds or whatever

WandererUber@poa.st

Reply to @lain@lain.com

Edited 2 months ago

that's an LLM-type answer because you gave a sequence of words that are likely to appear in similar conversations but are not actually a response to what I actually mean.

It's similar in the sense that one's incompetence in a field makes the AI seem more correct than it is. If one looks at the answers after spending a few days actually learning, suddenly they don't seem so intelligent anymore (they still were helpful because they gave you keywords to research the actual rules. The intricacies of its' explanations are probably incorrect though)
You immediately notice this with topics you know about, thus it is similar to Gell-Mann amnesia.

0

0

1

meeper

meeper

Reply to @meeper

@WandererUber @lain and tbh there wasn't that much to verify, llms (as shown by their capability of being useful in the routine theorem proving kind of math) are not too bad at that kind analysis stuff when its clear and defined, which due to the particulars in this case is highly formulaic

0

0

0

About Netzsphaere

Terms of Service

DA RULEZ:

Don't cause us any legal trouble
Try not to be too annoying
No loli or beast
Rule #9 still applies

If there's any questions or you want an invite link, feel free to ask snacks.

动态网自由门天安門天安门法輪功李洪志 Free Tibet 六四天安門事件 The Tiananmen Square protests of 1989 天安門大屠殺 The Tiananmen Square Massacre 反右派鬥爭 The Anti-Rightist Struggle 大躍進政策 The Great Leap Forward 文化大革命 The Great Proletarian Cultural Revolution 人權 Human Rights 民運 Democratization 自由 Freedom 獨立 Independence 多黨制 Multi-party system 台灣臺灣 Taiwan Formosa 中華民國 Republic of China 西藏土伯特唐古特 Tibet 達賴喇嘛 Dalai Lama 法輪功 Falun Dafa 新疆維吾爾自治區 The Xinjiang Uyghur Autonomous Region 諾貝爾和平獎 Nobel Peace Prize 劉暁波 Liu Xiaobo 民主言論思想反共反革命抗議運動騷亂暴亂騷擾擾亂抗暴平反維權示威游行李洪志法輪大法大法弟子強制斷種強制堕胎民族淨化人體實驗肅清胡耀邦趙紫陽魏京生王丹還政於民和平演變激流中國北京之春大紀元時報九評論共産黨獨裁專制壓制統一監視鎮壓迫害侵略掠奪破壞拷問屠殺活摘器官誘拐買賣人口遊進走私毒品賣淫春畫賭博六合彩天安門天安门法輪功李洪志 Free Tibet 劉曉波动态网自由门