@GamingChairModel

GamingChairModel@lemmy.world · 5 days ago

if we highly restrict the parameters of what information we’re looking at, we then get a possible 10 bits per second.

Not exactly. More the other way around: that human behaviors in response to inputs are only observed to process about 10 bits per second, so it is fair to conclude that brains are highly restricting the parameters of the information that actually gets used and processed.

When you require the brain to process more information and discard less, it forces the brain to slow down, and the observed rate of speed is on the scale of 5-40 bits per second, depending on the task.

GamingChairModel@lemmy.world · 5 days ago

You can still brute force it, which is more or less how back propagation works.

Intractable problems of that scale can’t be brute forced because the brute force solution can’t be run within the time scale of the universe, using the resources of the universe. If we’re talking about maintaining all the computing power of humanity towards a solution and hoping to solve it before the sun expands to cover the earth in about 7.5 billion years, then it’s not a real solution.

GamingChairModel@lemmy.world · 5 days ago

I think the fundamental issue is that you’re assuming that information theory refers to entropy as uncompressed data but it’s actually referring to the amount of data assuming ideal/perfect compression.

Um, so each character is just 0 or 1 meaning there are only two characters in the English language? You can’t reduce it like that.

There are only 26 letters in the English alphabet, so fitting in a meaningful character space can be done in less than 5 bits (2^5 = 32). Morse code, for example, encodes letters in less than 4 bits per letter (the most common letters use fewer bits, and the longest use 4 bits). A typical sentence will reduce down to an average of 2-3 bits per letter, plus the pause between letters.

And because the distribution of letters in any given English text is nonuniform, there’s less meaning per letter than it takes to strictly encode things by individual letter. You can assign values to whole words and get really efficient that way, especially using variable encoding for the more common ideas or combinations.

If you scour the world of English text, the 15-character string of “Abraham Lincoln” will be far more common than even the 3-letter string of “xqj,” so lots of those multiple character expressions only convey a much smaller number of bits of entropy. So it might be that it takes someone longer to memorize a random 10 character string that is truly random, including case sensitivity and symbols and numbers, than it would to memorize a 100-character sentence that actually carries meaning.

Finally, once you actually get to reading and understanding, you’re not meticulously remembering literally every character. Your brain is preprocessing some stuff and discarding details without actually consciously incorporating them into the reading. Sometimes we glide past typos. Or we make assumptions (whether correct or not). Sometimes when tasked with counting basketball passes we totally miss that there was a gorilla in the video. The actual conscious thinking discards quite a bit of the information as it is received.

You can tell when you’re reading something that is within your own existing knowledge, and how much faster it is to read than something that is entirely new, on a totally novel subject that you have no background in. Your sense of recall is going to be less accurate with that stuff, or you’re going to significantly slow down how you read it.

I can read a whole sentence with more than ten words, much less characters, in a second while also retaining what music I was listening to, what color the page was, how hot it was in the room, how itchy my clothes were, and how thirsty I was during that second if I pay attention to all of those things.

If you’re preparing to be tested on the recall of each and every one of those things, you’re going to find yourself reading a lot slower. You can read the entire reading passage but be totally unprepared for questions like “how many times did the word ‘the’ appear in the passage?” And that’s because the way you actually read and understand is going to involve discarding many, many bits of information that don’t make it past the filter your brain puts up for that task.

For some people, memorizing the sentence “Linus Torvalds wrote the first version of the Linux kernel in 1991 while he was a student at the University of Helsinki” is trivial and can be done in a second or two. For many others, who might not have the background to know what the sentence means, they might struggle with being able to parrot back that idea without studying it for at least 10-15 seconds. And the results might be flipped for different people on another sentence, like “Brooks Nader repurposes engagement ring from ex, buys 9-carat ‘divorce ring’ amid Gleb Savchenko romance.”

The fact is, most of what we read is already familiar in some way. That means we’re actually processing less information than we’re actually taking in, and discarding a huge chunk of what we perceive towards what we actually think. And when we encounter things that didn’t necessarily expect, we slow down or we misremember things.

So I can see how the 10-bit number comes into play. It cited various studies showing the image/object recognition tends to operate in the high 30’s in bits per second, and many memorization or video game playing tasks involve processing in the 5-10 bit range. Our brains are just highly optimized for image processing and language processing, so I’d expect those tasks to be higher performance than other domains.

GamingChairModel@lemmy.world · edit-2 7 days ago

But if you read the article, then you saw that the author specifically concludes that the answer to the question in the headline is “yes.”

This is a dead end and the only way forward is to abandon the current track.

GamingChairModel@lemmy.world · 7 days ago

I hope someone will finally mathematically prove that it’s impossible with current algorithms, so we can finally be done with this bullshiting.

They did! Here’s a paper that proves basically that:

van Rooij, I., Guest, O., Adolfi, F. et al. Reclaiming AI as a Theoretical Tool for Cognitive Science. Comput Brain Behav 7, 616–636 (2024). https://doi.org/10.1007/s42113-024-00217-5

Basically it formalizes the proof that any black box algorithm that is trained on a finite universe of human outputs to prompts, and capable of taking in any finite input and puts out an output that seems plausibly human-like, is an NP-hard problem. And NP-hard problems of that scale are intractable, and can’t be solved using the resources available in the universe, even with perfect/idealized algorithms that haven’t yet been invented.

This isn’t a proof that AI is impossible, just that the method to develop an AI will need more than just inferential learning from training data.

GamingChairModel@lemmy.world · edit-2 8 days ago

The paper gives specific numbers for specific contexts, too. It’s a helpful illustration for these concepts:

A 3x3 Rubik’s cube has 2^65 possible permutations, so the configuration of a Rubik’s cube is about 65 bits of information. The world record for blind solving, where the solver examines the cube, puts on a blindfold, and solves it blindfolded, had someone examining the cube for 5.5 seconds, so the 65 bits were acquired at a rate of 11.8 bits/s.

Another memory contest has people memorizing strings of binary digits for 5 minutes and trying to recall them. The world record is 1467 digits, exactly 1467 bits, and dividing by 5 minutes or 300 seconds, for a rate of 4.9 bits/s.

The paper doesn’t talk about how the human brain is more optimized for some tasks over others, and I definitely believe that the human brain’s capacity for visual processing, probably assisted through the preprocessing that happens subconsciously, or the direct perception of visual information, is much more efficient and capable than plain memorization. So I’m still skeptical of the blanket 10-bit rate for all types of thinking, but I can see how they got the number.

GamingChairModel@lemmy.world · 8 days ago

I mean: look at an image for a second. Can you only remember 10 things about it?

The paper actually talks about the winners of memory championships (memorizing random strings of numbers or the precise order of a random arrangement of a 52-card deck). The winners tend to have to study the information for an amount of time roughly equivalent to 10 bits per second.

It even talks about the guy who was given a 45 minute helicopter ride over Rome and asked to draw the buildings from memory. He made certain mistakes, showing that he essentially memorized the positions and architectural styles of 1000 buildings chosen out of 1000 possibilities, for an effective bit rate of 4 bits/s.

That experience suggests that we may compress our knowledge by taking shortcuts, some of which are inaccurate. It’s much easier to memorize details in a picture where everything looks normal, than it is to memorize details about a random assortment of shapes and colors.

So even if I can name 10 things about a picture, it might be that those 10 things aren’t sufficiently independent from one another to represent 10 bits of entropy.

GamingChairModel@lemmy.world · 8 days ago

The problem here is that the bits of information needs to be clearly defined, otherwise we are not talking about actually quantifiable information

here they are talking about very different types of bits

I think everyone agrees on the definition of a bit (a binary two-value variable), but the active area of debate is which pieces of information actually matter. If information can be losslessly compressed into smaller representations of that same information, then the smaller compressed size represents the informational complexity in bits.

The paper itself describes the information that can be recorded but ultimately discarded as not relevant: for typing, the forcefulness of each key press or duration of each key press don’t matter (but that exact same data might matter for analyzing someone playing the piano). So in terms of complexity theory, they’ve settled on 5 bits per English word and just refer to other prior papers that have attempted to quantify the information complexity of English.

GamingChairModel@lemmy.world · 9 days ago

The Caltech release says they derived it from “a vast amount of scientific literature” including studies of how people read and write. I think the key is going to be how they derived that number from existing studies.

GamingChairModel@lemmy.world · edit-2 8 days ago

Speaking which is conveying thought, also far exceed 10 bits per second.

There was a study in 2019 that analyzed 17 different spoken languages to analyze how languages with lower complexity rate (bits of information per syllable) tend to be spoken faster in a way that information rate is roughly the same across spoken languages, at roughly 39 bits per second.

Of course, it could be that the actual ideas and information in that speech is inefficiently encoded so that the actual bits of entropy are being communicated slower than 39 per second. I’m curious to know what the underlying Caltech paper linked says about language processing, since the press release describes deriving the 10 bits from studies analyzing how people read and write (as well as studies of people playing video games or solving Rubik’s cubes). Are they including the additional overhead of processing that information into new knowledge or insights? Are they defining the entropy of human language with a higher implied compression ratio?

EDIT: I read the preprint, available here. It purports to measure externally measurable output of human behavior. That’s an important limitation in that it’s not trying to measure internal richness in unobserved thought.

So it analyzes people performing external tasks, including typing and speech with an assumed entropy of about 5 bits per English word. A 120 wpm typing speed therefore translates to 600 bits per minute, or 10 bits per second. A 160 wpm speaking speed translates to 13 bits/s.

The calculated bits of information are especially interesting for the other tasks (blindfolded Rubik’s cube solving, memory contests).

It also explicitly cited the 39 bits/s study that I linked as being within the general range, because the actual meat of the paper is analyzing how the human brain brings 10^9 bits of sensory perception down 9 orders of magnitude. If it turns out to be 8.5 orders of magnitude, that doesn’t really change the result.

There’s also a whole section addressing criticisms of the 10 bit/s number. It argues that claims of photographic memory tend to actually break down into longer periods of study (e.g., 45 minute flyover of Rome to recognize and recreate 1000 buildings of 1000 architectural styles translates into 4 bits/s of memorization). And it argues that the human brain tends to trick itself into perceiving a much higher complexity that it is actually processing (known as “subjective inflation”), implicitly arguing that a lot of that is actually lossy compression that fills in fake details from what it assumes is consistent with the portions actually perceived, and that the observed bitrate from other experiments might not properly categorize the bits of entropy involved in less accurate shortcuts taken by the brain.

I still think visual processing seems to be faster than 10, but I’m now persuaded that it’s within an order of magnitude.

GamingChairModel@lemmy.world · 10 days ago

Seriously. I’m not at all an art guy so I feel qualified to observe that The Scream is probably one of the top 5 (and definitely top 10) most well known paintings, somewhere shortly after Da Vinci’s Mona Lisa and Van Gogh’s Starry Night.

GamingChairModel@lemmy.world · 16 days ago

Yeah, there have been cases of people dealing with the bureaucratic nightmare that followed when they got vanity license plates that said “NULL” and a bunch of bad program logic combined with incomplete data in the databases to send them a bunch of tickets.

Making it so that people can take advantage of even more complex computer errors could ruin things for other people.

GamingChairModel@lemmy.world · edit-2 17 days ago

Might not need to even have much new mining.

Gallium is primarily extracted from bauxite, which is already mined worldwide for aluminum processing. So with gallium being a very small byproduct of aluminum processing from mined bauxite, the bottleneck probably isn’t in the mines, because mining and processing bauxite is already something many countries do. It’s just not always economically profitable to further process the gallium at the same time, but if the need is there, that can be ramped up at existing aluminum plants.

It’s not an overnight process but with many elements, the limiting factor isn’t actual rarity, but the high energy/equipment needs of the process to extract and purify the element, and the high amounts of waste produced.

GamingChairModel@lemmy.world · 17 days ago

I haven’t seen any reporting on just how much gallium, germanium, and antimony is used in electronics. I can see that it’s basically present in all electronics, and the price per kilogram has gone up a lot under the restrictions, and that China accounts for 94% of the world’s production.

Ok, so is this like gold, then, where very small quantities are used in most electronics so that it matters, but doesn’t actually account for a significant percentage of the cost of the finished product?

How much gallium is used to manufacture a typical cell phone, a laptop, a car? How much does that $90/kg price hike translate to actual devices? Because if 1 gram of gallium goes into a particular device, that’s an increase of 9 cents in raw materials, basically a rounding error.

GamingChairModel@lemmy.world · 17 days ago

Of course. But even for people who don’t read the article, it’s still best practice to just copy the headline from the article so that it’s the top of the Lemmy thread.

GamingChairModel@lemmy.world · 17 days ago

Yeah, the actual headline to the article is “Even Netflix struggles to identify and understand the cost of its AWS estate,” which OP has very unhelpfully shortened to post “Netflix struggles to understand its cloud costs.”

The word “Even” is doing a lot of work, and leaving it out changes the meaning of the headline.

GamingChairModel@lemmy.world · 17 days ago

Exactly. To extend the junk food analogy, this is like making donuts from scratch in your own kitchen: customized to your preferences, maybe tastes better, but ultimately you’re still making a mess in your kitchen and eating unhealthy.

GamingChairModel@lemmy.world · 17 days ago

If you’re writing 100 MB/s, it’ll still take 300,000 seconds to write 30TB. 300,000 seconds is 5,000 minutes, or 83.3 hours, or about 3.5 days. In some contexts, that can be considered a long time to be exposed to risk of some other hardware failure.

GamingChairModel@lemmy.world · 20 days ago

My theory is that there is quite a few servers that are chosing to defederate. The number of total servers continues to drop according to fedidb.

Or admins are just finding it not worth bothering with administering their own server and turning them off.

GamingChairModel@lemmy.world · 21 days ago

It sounds like you want a way to collect articles, including full text offline, and organize them in a searchable way. Why do you need RSS for this? Just use a blogging platform where you can organize each post, list/sort/filter by date or topic or original source, and use the search functionality in the actual blog platform.