Netzsphaere

Conversation

meso

meso@new.asbestos.cafe

@snacks is there an AI model you can feed large documents to and ask it questions about them. like i wanna be able to ask questions about the exact nature of the traffic laws here because it's very hard to read through that shit

Snacks

snacks

1 month ago

Reply to @meso@new.asbestos.cafe

@meso you'll prob need some kind of rag setup of you want to query over your entire traffic law tbh. Maybe there's some rag in a box thing but i'm not aware of any

meso

meso@new.asbestos.cafe

1 month ago

Reply to @snacks

@snacks rag?

meso

meso@new.asbestos.cafe

1 month ago

Reply to @meso@new.asbestos.cafe

@snacks why is AI such useless niggerware this is the only thing it's useful for, querying large amounts of data. AI slop haters vindicated

Snacks

snacks

1 month ago

Reply to @meso@new.asbestos.cafe

@meso retrieval augmented generation. You use vectorization to categorize parts of your text and can then draw up the closest matches and feed just those to an llm

Snacks

snacks

1 month ago

Reply to @snacks

@meso vectorization is performed by a seperate specialised ai model

vii@dsmc.space

1 month ago

Reply to @snacks

@snacks @meso it might be more than you need but https://github.com/HKUDS/RAG-Anything a place to start digging

Snacks

snacks

1 month ago

Reply to @snacks

@meso if i coupd i'd give you the rag tool i made for my finals

meso

meso@new.asbestos.cafe

1 month ago

Reply to @snacks

@snacks you couldnt?

Snacks

snacks

1 month ago

Reply to @meso@new.asbestos.cafe

@meso it's production code at my company lmao

Snacks

snacks

1 month ago

Reply to @snacks

Edited 1 month ago

@meso it's not that hard to implement yourself tbh, most of my time was spent wrestling file formats and shitty microsoft webservers.
Just figure out a way to cut your documents into small enough chunks with as much meaning as possibke in tact, run it through an embedding model and save the result into a database that can handle querying vectors with like 1000 dimensions

πρωτος

nigger@detroitriotcity.com

1 month ago

Reply to @snacks

@snacks @meso the lion doesn't respect IP or NDAs

Snacks

snacks

1 month ago

Reply to @snacks

@meso then you run your query through the same embedding model and get the closest matches in your db, combining bith embeddings and text search usually gives the best results, i think pgvector even has a good example how to combine them

Sally (evil)

sally@freesoftwareextremist.com

1 month ago

Reply to @meso@new.asbestos.cafe

@meso @snacks

> like i wanna be able to ask questions about the exact nature of the traffic laws here because it's very hard to read through that shit

Just learn to read.

meso

meso@new.asbestos.cafe

1 month ago

Reply to @snacks

@snacks fuck

meso

meso@new.asbestos.cafe

1 month ago

Reply to @sally@freesoftwareextremist.com

@sally @snacks bro it's written by bulgarian bureaucrats even they don't know what the laws they wrote mean

Sally (evil)

sally@freesoftwareextremist.com

1 month ago

Reply to @meso@new.asbestos.cafe

@meso @snacks

Then why does it matter?

meso

meso@new.asbestos.cafe

1 month ago

Reply to @sally@freesoftwareextremist.com

@sally @snacks i wanna know what kind of shit can i do to my car so when a cop stops me i can be like Kill yourself nigger

Sally (evil)

sally@freesoftwareextremist.com

1 month ago

Reply to @meso@new.asbestos.cafe

@meso @snacks

Cops don't really care about technical law and they can make you disappear if they want to anyway.

About Netzsphaere

Terms of Service

DA RULEZ:

Don't cause us any legal trouble
Try not to be too annoying
No loli or beast
Rule #9 still applies

If there's any questions or you want an invite link, feel free to ask snacks.

动态网自由门天安門天安门法輪功李洪志 Free Tibet 六四天安門事件 The Tiananmen Square protests of 1989 天安門大屠殺 The Tiananmen Square Massacre 反右派鬥爭 The Anti-Rightist Struggle 大躍進政策 The Great Leap Forward 文化大革命 The Great Proletarian Cultural Revolution 人權 Human Rights 民運 Democratization 自由 Freedom 獨立 Independence 多黨制 Multi-party system 台灣臺灣 Taiwan Formosa 中華民國 Republic of China 西藏土伯特唐古特 Tibet 達賴喇嘛 Dalai Lama 法輪功 Falun Dafa 新疆維吾爾自治區 The Xinjiang Uyghur Autonomous Region 諾貝爾和平獎 Nobel Peace Prize 劉暁波 Liu Xiaobo 民主言論思想反共反革命抗議運動騷亂暴亂騷擾擾亂抗暴平反維權示威游行李洪志法輪大法大法弟子強制斷種強制堕胎民族淨化人體實驗肅清胡耀邦趙紫陽魏京生王丹還政於民和平演變激流中國北京之春大紀元時報九評論共産黨獨裁專制壓制統一監視鎮壓迫害侵略掠奪破壞拷問屠殺活摘器官誘拐買賣人口遊進走私毒品賣淫春畫賭博六合彩天安門天安门法輪功李洪志 Free Tibet 劉曉波动态网自由门