No background, but it's plausible to me that they actively prefer imperfect alignment because companies that care about alignment will tend to be woke, moralizing, or opposed to authoritarianism.
In the specific hedonist vs Christian case, aren't there two obvious compromises?
There are three cost sources: materials, design, and manufacturing. I claim that because phones are small and expensive to design it's just not worth it.
I don't really think progress being fast is quite sufficient to explain things. If every tripling in price could get you a 20% better phone, many people would pay for an 81x more expensive, 107% better phone just like they do for private jets. If it became obsolete in a year, the billionaire would have their phone guy replace it every six months. I think it's because a phone, or gmail, starlink, etc. has super high fixed costs, and the smaller market for luxury goods generally means the design will be worse. For food or furniture better quality ingredients/materials can more than make up for it, but consumer electronics can only get slightly better components because most already designed things are within the cost/kg budget of a $1k phone.
Everything more expensive per kg than phones has an interesting way of funding their design costs without scale:
So my prediction would be that the luxury smartphone business only starts up when lots of rich people have different problems from the average consumer that need a custom device (security maybe?) or subscribe to a status game that results in the phone equivalent of luxury watches.
Seems unlikely to me this would be in their interest.
If they really believed in RSI they would do diplomacy to get more compute and just invest in their AI industry.
[1] Claude thinks the likely outcome is this:
Sure, then just add that to the disclaimer. "I may omit claims that are risky, unpopular, may be easily misinterpreted, require lots of words to justify, etc, but aim to not outright lie for such reasons"
It seems super costly for many public intellectuals to say all of their beliefs, and for reasons that other commenters have pointed out, giving an epistemic status might not help. What's wrong with a blanket disclaimer like "Assume that all of my claims without an epistemic status are optimized to improve discourse on the margin, rather than to convey a complete picture of my all-things-considered beliefs"?
I think IBKR might let Americans trade HKEX index options out to 2030 but it would probably be a hassle. Otherwise there are options on FXI, a Chinese large-cap ETF, which look less liquid than SPX options and only go out 2 years with an IV of 32% or so. I don't think FXI is worth it because China isn't in a position to get AGI in the next 2 years if the US doesn't.
(in the usual units, this means that the plot of log(2025-FLOP per FLOP) vs log(researcher-hours) is a straight line with slope .) A plot that curves downward or "hits a wall" seems like evidence against this model's applicability to the data.
Note there are no log-log plots in the data. They're performance vs LoC and log(performance) vs LoC, and same for stars. I don't think we're at an absolute ceiling since two more improvements came out in the past week, they've just gotten smaller and taken more code to implement.
I need to think about this algorithmic progress being 10x/year thing. It feels like some assumptions are violated with how much the data seem to give inconsistent answers, maybe there's a prospective vs retrospective difference. Or do you think progress has just sped up in the past couple of years?
Reasons time horizon is overrated and misinterpreted:
In the 9 months since the METR time horizon paper (during which AI time horizons have increased by ~6x), it’s generated lots of attention as well as various criticism on LW and elsewhere. As one of the main authors, I think much of the criticism is a valid response to misinterpretations, and want to list my beliefs about limitations of our methodology and time horizon more broadly. This is not a complete list, but rather whatever I thought of in a few hours.
Figure 1: METR staying one step ahead of frontier models
Despite these limitations, what conclusions do I still stand by?
[1] see eg DeepSeek R1 paper: https://arxiv.org/abs/2501.12948