Conversation

Lucy [hiatus era] 𒌋𒁯

it's impressive how mp3 technically DOES have a header but the vast majority of mp3 files I've encountered just don't have a header.. brainrot made in germany
3
1
2

@lucy and iirc even that header is fairly ambiguous and could easily occur in any random byte stream…

0
0
3
@meso HE'S STILL ALIVE GO GET HIM erogun
0
0
0
@lucy i blame m$ personally, i'd assume files without headers are their fault since they really wanna treat name extensions as headers
1
0
2

@snacks @lucy i blame the very concept of file systems because both name extensions and headers/magic numbers are a bad form of in-band signalling

1
0
2
@mia @lucy magic umbers don't really seem to be a filesystem thing? They work the exact same way if you just had a blob of data somewhere.
They're also at least in-band with the data they try to represent instead of some pretty unrelated stuff that tends to change a lot or get lost entirely
1
0
0

@snacks @lucy the idea is to, ideally, never store or transmit random blobs of data without identifying metadata

in general, metadata is not handled well on any popular system. we have dozens of audio tag formats, xmp + sidecar files, .DS_Store/.directory and others, extended file attributes etc. etc.
it’s terrible. there’s got to be a better way, and maybe files and folders are just a bad concept to design computer storage after

0
0
2

@desea @lucy @snacks i’m not a systems engineer cirno_shrug

i think data storage should have more than one possible abstraction. how the records are ultimately stored should not be that important, but i would want to have different views of the data depending on the use case and my own habits and preferences

1
0
4
Edited 5 months ago
@desea @lucy @mia ideas of filesystems with database capabilities have been floating around for ages. You could try to come up with some container format that can store the associated information with the file for exchange ig
1
0
2

@lucy @snacks @desea and as a programmer i just HATE file systems because they’re so fucking painful to use. where do i store shit? how do i serialize my data structures? how do i ensure ACID? what do i do if a transfer is interrupted? what’s the optimal allocation/access strategy for this filesystem and my particular workload? and so on and so forth

1
0
4

@lucy @snacks @desea like, look at how e.g. mastodon stores media files, or how nginx organizes cache directories. they do some weird ass directory trees to avoid taking too much of a performance hit on ext4 or something

1
0
4

@lucy @snacks @desea ceph sort of has the right idea, it’s just way too enterprise-oriented

0
0
2
Edited 5 months ago
@snacks @desea @lucy @mia attribute-based filesystems have been proposed for a long time, Richard Strandh had it as one of his goals for CLOSOS

hierarchical data organization in general has also been criticized too in favor of freeform graphs since at least Shirky's "Ontology is Overrated"
0
0
5