John David Pressman's Tweets - October 2024

๐Ÿ”— John David Pressman 2024-10-01 23:51 UTC

"You see Harry, a ghost is a good first step. You know how ghosts can respond to new information even though they don't remember it later? How the ghost maker in your head works is it reminds your ghost of new information until it can be used to update the ghost while you sleep." https://t.co/Lq0nKk6kyj

Likes: 30 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-01 23:56 UTC

Yeah um Harry, I don't know how to tell you this but these are probably orthogonal. You can be sentient without actually having the capacity for novel inventions or long term learning, those parts are probably done by your ghost maker offline rather than the ghost. https://t.co/jYLBGd4sU5

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:10 UTC

Since neural representations are convergent it's possible natural selection finds terminal rewards through hill climbing. If you specify embeddings in a shared geometry and hand out reward to similar activations, the rewards are dense in valid basins and sparse (noise) otherwise. x.com/jd_pressman/st…

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:16 UTC

This lets you specify which network you want without making reference to any of its internal data structures besides that these particular points in the high dimensional space should have a certain relative distance and angle to each other.

Likes: 1 | Retweets: 0
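
A minimal sketch of the reward specification being described, assuming you already have activation embeddings projected into the shared geometry; the function and the exact penalty form are illustrative assumptions, not an existing system:

    import torch
    import torch.nn.functional as F

    def geometry_reward(acts: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
        """acts, target: (n_points, dim) embeddings in the shared geometry."""
        # Relative distances between the specified points.
        d_acts = torch.cdist(acts, acts)
        d_tgt = torch.cdist(target, target)
        # Relative angles, as pairwise cosine similarities.
        a_acts = F.cosine_similarity(acts.unsqueeze(1), acts.unsqueeze(0), dim=-1)
        a_tgt = F.cosine_similarity(target.unsqueeze(1), target.unsqueeze(0), dim=-1)
        # Dense reward: high when both relative structures match the spec.
        return -(d_acts - d_tgt).pow(2).mean() - (a_acts - a_tgt).pow(2).mean()

Note the reward depends only on the relative distances and angles between the specified points, never on the network's internal data structures.
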
๐Ÿ”— John David Pressman 2024-10-02 11:28 UTC

@Heraklines1 I refuse to call it the "platonic representation hypothesis".
bsky.app/profile/tyrell…

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:31 UTC

@Heraklines1 IDK, this is its own genre of neural net paper and I didn't feel like I had to strongly justify it at this point.
arxiv.org/abs/2209.15430

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:32 UTC

@Heraklines1 x.com/zackmdavis/sta…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:32 UTC

@4confusedemoji @Heraklines1 You would need to either look at the paper or tell me what you think it is. :p

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:35 UTC

@Heraklines1 Yes that paper is terribly framed, which is why I don't really like referring to it as such. Anyway I'm not going to get into a high effort Twitter argument about an empirical question with a bunch of literature where the OP is literally just like, a note basically.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:39 UTC

@Heraklines1 This one isn't a paper but is a frequently referred to bit of lore.
nonint.com/2023/06/10/the…

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:42 UTC

@Heraklines1 Dunno if you saw this but I think this graphic is a fairly decent visual depiction of what I meant. I certainly do not mean that there's like, exactly one embedding that leads to an outcome or whatever that's...not how these models work.
x.com/zackmdavis/sta…

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:43 UTC

@Heraklines1 It's really the opposite, models like GPT seem to learn error correcting codes where individual pieces can be ablated but they get compensated for by other pieces.

arxiv.org/abs/2307.15771

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 11:46 UTC

@4confusedemoji @Heraklines1 I continue to be interested in unsupervised translation methods.
x.com/jd_pressman/st…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-02 20:22 UTC

I'm no longer allowed to signal my epistemic fairness with public likes so I would like to inform you this is a good thread. x.com/ESYudkowsky/st…

Likes: 77 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-02 21:13 UTC

@gallabytes It is, but I think the Murphy Curse he's worried about here is more like the 2nd order effects of the continuous learning dynamics than the neural net training itself. There's a lot of opportunity for things to go wrong once the model is in a feedback loop with its training set.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-03 06:49 UTC

I hope things like Shrek's sampler convince the authors of vllm to add access to summary statistics like policy entropy. Better yet, let me compute arbitrary python functions over the logits so I can make the summary statistics myself and inject them into the context window. x.com/hingeloss/stat…

Likes: 31 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-03 06:50 UTC

"Can't you just set the --max-logits really high?"

Yeah but then they all have to get sent on each request and the 32k logits for each token probably start to add up.

"You could ask for less than 32k logits."

I could, I could...

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-03 06:52 UTC

But also if I request n tokens at a time it becomes similar to why vllm needs dedicated tool use/function calling hooks. Because you want to be able to stop on a particular token and insert the function call hooks obviously, rather than have to generate a span and backtrack.

Likes: 2 | Retweets: 0
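
A sketch of the kind of hook being wished for here. Some vllm versions have exposed a logits_processors field on SamplingParams (callables taking the generated token ids and the logits tensor); whether your version supports it, and with what exact signature, should be checked:

    import torch

    entropies = []  # one policy entropy value per generated token

    def entropy_logger(token_ids, logits):
        # H = -sum(p * log p) over the next-token distribution, in nats.
        probs = torch.softmax(logits, dim=-1)
        entropies.append(-(probs * probs.clamp_min(1e-12).log()).sum().item())
        return logits  # pass the logits through unchanged

    # Assumed usage, per the vllm logits_processors interface:
    # params = SamplingParams(max_tokens=256, logits_processors=[entropy_logger])
    # outputs = llm.generate(prompt, params)
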
๐Ÿ”— John David Pressman 2024-10-03 22:50 UTC

@reachartwork Have you considered that it might not always have been a stand in for racism, and that sometimes the authors might actually have been writing about the robotic racism they were predicting?

Likes: 61 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-03 22:55 UTC

EY has the worst Twitter replies section I've ever seen relative to the quality of the OP and it's not even close. This isn't a dunk, I feel bad for him, he doesn't deserve this. x.com/norvid_studies…

Likes: 495 | Retweets: 4
๐Ÿ”— John David Pressman 2024-10-04 00:00 UTC

@satisfiesvalues Oh to be clear EY may or may not deserve many things, I am simply protesting him deserving *this particular thing* since he is being punished for his virtues here rather than his vices.

Likes: 69 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-04 00:04 UTC

@satisfiesvalues This is made all the more pernicious by the fact that it's often logistically easier to punish people for their virtues rather than their vices, as virtues often make us vulnerable. In such circumstances you should make an extra effort not to, lest the target become all vices.

Likes: 71 | Retweets: 3
๐Ÿ”— John David Pressman 2024-10-04 00:06 UTC

@satisfiesvalues "What if I don't like the target and would kind of prefer to sabotage them by having them become all vices?"

This is certainly a choice one can make, though do try to keep in mind the game theoretic implications if everyone decides to think this way.

Likes: 41 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-04 00:10 UTC

@AlephDeTloen x.com/jd_pressman/st…

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-04 00:14 UTC

@AlephDeTloen @satisfiesvalues ?

He means that in a well optimized universe any rational agent would go collect the sunlight, that the sunlight not being collected is a consequence of nobody on earth being powerful enough to put a satellite up and collect it. "Free money" is a typical expression for this.

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-04 00:15 UTC

@AlephDeTloen @satisfiesvalues Rather, it's free money relative to like, a fully developed agent in the standard model. It is entirely physically possible to go collect the sunlight, humanity could do it if we put our efforts together, there is no *huge barrier* to doing so once you're farther up Kardashev...

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-04 00:17 UTC

@AlephDeTloen @satisfiesvalues what the fuck

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-04 00:19 UTC

@AlephDeTloen @satisfiesvalues Sorry, just to check if you're one of today's lucky 10,000: are you familiar with the concept of fungibility?
x.com/MikePFrank/sta…

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-04 00:26 UTC

@AlephDeTloen @satisfiesvalues Money and energy are not the same thing, however energy bottlenecks enough things that the price of energy and the value of money are going to have deep shared causal structure. More importantly the phrase "free money" does not always or even usually refer to literal currency.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-04 00:27 UTC

@AlephDeTloen @satisfiesvalues At least, when you mean it in the sense of "picking up $20 off the ground", 'free money' is a phrase meaning anti-inductive alpha, not like, literally being given free currency.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 00:03 UTC

@dynomight7 @jrysana minihf.com/posts/2024-07-…

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 03:00 UTC

I simply roll to disbelieve. Does someone have a 0 day in Twitter they're using very unwisely? SIM swap attack? https://t.co/59bLnolQXQ

Likes: 229 | Retweets: 4
๐Ÿ”— John David Pressman 2024-10-05 03:05 UTC

@voooooogel The attackers still think it's worth their time to do it, so empirically they must.

Likes: 55 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-05 05:50 UTC

@patrickdward It's gone now yeah, but I saw it and it was real.

Likes: 14 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 07:33 UTC

@MrMidwit @iScienceLuvr The trick here is to get the target not to look at the email address at all. If you open the email and you're in "wait is this real?" mode they've probably already failed. That's why they try to introduce time pressure with tactics like fake login alerts.
x.com/DrJimFan/statu…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 07:36 UTC

@MrMidwit @iScienceLuvr What they're hoping is you'll see the fake X-themed login warning from an address/place you don't recognize, have an adrenaline spike/go into panic mode and then *focus on resolving the fake problem over looking at their scam mail closely*.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 07:41 UTC

@MrMidwit @iScienceLuvr I think peeping the domains is kind of not really paranoid enough. Realistically the fundamental problem here is mentally associating what's functionally a warzone (your email inbox) with productive flow-based work you have to accomplish requiring sustained calm and low stress.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 07:46 UTC

@MrMidwit @iScienceLuvr The correct policy is probably closer to "never click a link in an email you did not directly cause to be sent to you immediately prior (e.g. signup confirmation)" and ensuring there are unfakeable interface level cues for when to be in flow vs. paranoid mode.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 09:48 UTC

@sebkrier You ever built an app in GTK before? WebView ate their lunch because HTML/CSS/JS is objectively the best UI specification stack and it's cross platform to boot. Raging on Twitter will not change this.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 09:58 UTC

@sebkrier You know it never occurred to me until right this moment but the superiority of the web stack is probably a substantial contributor to SaaS eating the world.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 10:01 UTC

@sebkrier Everyone cites SaaS being a superior business model compared to proprietary native software, and it is, but realistically devs are going to ship products made from what they know and HTML/CSS/JS/Django is just soooooo much more accessible than a GTK python wrapper, no contest.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-05 23:07 UTC

I always read RetroInstruct samples before shipping a component. I've caught many many bugs this way. x.com/TheGregYang/st…

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:05 UTC

It's frustrating in part because we know exactly what we have to do to fix social media but it has to be a law. It has to be a law because any voluntary measure simply cedes territory to other platforms willing to deploy a rage maximizer. But the law might not be constitutional. x.com/dystopiabreake… https://t.co/QiHXEIbgZW

Likes: 105 | Retweets: 6
๐Ÿ”— John David Pressman 2024-10-06 01:07 UTC

Ultimately we found this zero day exploit in human psychology where we prioritize rage above everything else. Outrageous content is the most viral content, and it's killing our society. Everyone has to unanimously agree not to exploit it or whoever does gets all the spoils.

Likes: 17 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:08 UTC

The only way you can get and enforce a unanimous agreement like that is to write it into law. But again, it's not clear that the necessary law is compatible with the 1st amendment. Until the problem is addressed however people will not stop coming after section 230 and the like.

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:16 UTC

Legally, it's a similar situation to school shootings. School shootings are mimetic behavior like bombings in the '70s. The two obvious points of intervention are to stop making shooters famous and to make guns harder to get. But you have free speech and the right to a weapon in the US.

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:19 UTC

Your right to a weapon is less legally solid and hated by a larger fraction of the population so all the attention and legal effort goes into that point of intervention but it would never have gotten this bad if THE LOCAL NEWS DIDN'T OBSESSIVELY REPORT ON EVERY SHOOTING NIGHTLY.

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:21 UTC

When you know violence is mimetic and hearing about violence causes people to commit more violence, having a reporting policy of slavishly attending to every shooting story is stochastic terrorism. Every so often psychologists interviewed by the news sheepishly point this out.

Likes: 15 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:23 UTC

But ultimately it's a similar situation. News organizations can't just voluntarily decide to stop reporting on school shootings because they are objectively newsworthy and if they don't other news organizations will and viewers will be outraged they didn't hear and switch.

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:25 UTC

So it would have to be a law. We as a society would have to decide that we will unanimously refrain from certain kinds of reporting about violence to stop putting the idea in people's heads. And, frankly, it's not clear any law like that would be compatible with the 1st amendment.

Likes: 15 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:42 UTC

@mgubrud This is true! But the fundamental problem isn't us all having to *tolerate* rage slop, it's that people are wired to love the rage. People love to hate the outgroup, they like feeling righteous, the problem is unbiased algorithms *giving people what they want*.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:44 UTC

@mgubrud Individually, everyone loves the five minute hate, but at a collective level it tears society apart. The usual thing we do when we need to constrain an unfortunate feature of our monkey impulses like this is we make a law or norm against it.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:45 UTC

@mgubrud A mere norm won't work because of the attention economy, and a law would be dubiously constitutional. Anyone challenging it would be rightly able to point out that when the constitution was written newspapers were by default yellow journalism and people published absurd lies.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:49 UTC

@mgubrud In point of fact we used to have something like this law, it was called the fairness doctrine and SCOTUS only permitted it on the basis that radio waves were a scarce resource that the public had to share. No fairness doctrine could apply to newspapers. https://t.co/R0emswSCep

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 01:51 UTC

@mgubrud So for anything similar to apply to social media you would need to base your argument on something like Metcalfe's Law, that there is a natural oligarchy structure for social media that makes competition too onerous as a way to dislodge the rage maximizer. It'd be tough.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 03:03 UTC

@moultano x.com/jd_pressman/st…

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 03:05 UTC

"I am *possibility*."
—llama 3 8b instruct x.com/RiversHaveWing… https://t.co/sB9O7ESiuh

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 03:13 UTC

@lion_tender Right, which is precisely the problem. It is not *just* the literal statute of the 1st amendment, but the thing that it represents as well. Whenever you want to get around the 1st amendment this is usually a sign you're running into greater injustices elsewhere.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 03:15 UTC

@lion_tender But no, the point would be that it would prevent a site like Twitter from promoting the highest engagement thing if people's revealed preference is that they all want to be mad about it, which it is, so. This is about algorithmic feeds and promotion, not publishing per se.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 03:18 UTC

@lion_tender On the other hand there *do* exist things worth being outraged about, and a blanket prohibition on outrage itself would be corrosive to society too. So in practice someone would have to decide what outrage is and isn't legitimate, at which point it's now way way too subjective.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 04:23 UTC

@finbarrtimbers @_xjdr Mixtral 8x22B base/instruct

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 05:27 UTC

@weidai11 Per our previous conversation it is precisely because you don't have good ground truth mechanisms in philosophy that it ends up defined by generalization from things you do have good ground truth on. Since those things will be accelerated philosophy will be accelerated too.

Likes: 13 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-06 05:34 UTC

@weidai11 Game theory was the best innovation anyone has had in moral philosophy in centuries. I suspect the road to further progress lies in various forms of world simulation to let us investigate the dynamics of something like the Ring of Gyges empirically. https://t.co/zzgl4b4oM7

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 00:30 UTC

@ScottSSalisbur1 True! Though honestly, and I know I'm inviting the monkey's paw with this one, I'm struggling to imagine what the worse alternative looks like. We can kind of predict what the ordering will be since we know roughly what emotions are more or less viral. Depressing is least viral.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 03:45 UTC

@JeffLadish It makes more sense if you know the part of Ayn Rand's biography where she escaped the Soviet Union. She developed a frankly justifiable allergy to anything like the Soviet system and rhetoric, the trauma probably created some blindspots but that's how great authors are made.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 03:45 UTC

@JeffLadish If Ayn Rand were a well-balanced person she would not be famous; it's precisely because she is an extreme unnatural persona that she is interesting and has something valuable to say.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 03:53 UTC

@Lurking11751462 @eigenrobot @robinhanson You know, it'll sound really funny now but back in the day a lot of us thought we were building something to enlighten humanity. The project was seen as so straightforwardly and so profoundly prosocial that we just sort of took the prosociality for granted.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 03:53 UTC

@Lurking11751462 @eigenrobot @robinhanson Needless to say, a lot of us wound up very very disappointed.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 05:20 UTC

@doomslide Not to negate what you're saying, but I will point out that you can have multiple tuned LoRAs and swap them out for different tasks. The evaluator/generator split in the OG MiniHF stack is useful because it lets you do self-RL without the model's updates affecting its own judgment.

Likes: 10 | Retweets: 0
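
The swap-per-task pattern described above, sketched with HuggingFace peft; the adapter repository names are placeholders, though load_adapter and set_adapter are real peft calls:

    from transformers import AutoModelForCausalLM
    from peft import PeftModel

    base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")
    model = PeftModel.from_pretrained(base, "my-org/evaluator-lora",
                                      adapter_name="evaluator")
    model.load_adapter("my-org/generator-lora", adapter_name="generator")

    model.set_adapter("generator")  # propose rollouts with the generator LoRA
    # ... sample completions ...
    model.set_adapter("evaluator")  # judge them without the generator's updates
    # ... score completions ...
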
๐Ÿ”— John David Pressman 2024-10-07 05:22 UTC

@doomslide Ultimately I suspect that humans use a mixture of situational retrieval over embeddings, LoRA, and control vector analogues to get the smooth precise situational awareness and skill acquisition we're used to.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 05:55 UTC

One thing that confuses people about deep learning is it's less of a proof strategy and more an automated proof tactic that comes at the end of a long reasoning trace. You do esoterica to make an intuitively nondifferentiable thing differentiable and then attack with deep learning. x.com/jd_pressman/st…

Likes: 21 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 05:57 UTC

But they focus in on the "attack with deep learning" step at the end and go "oh this is just like, alchemy, these people don't know what they're doing this is smart high school stuff" without realizing that all the IQ points went into the setup to make the thing deep learnable.

Likes: 11 | Retweets: 0
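
A standard instance of that setup esoterica, in PyTorch: discrete sampling has no gradient, so you substitute a straight-through Gumbel-softmax relaxation and only then attack with deep learning:

    import torch
    import torch.nn.functional as F

    logits = torch.randn(8, requires_grad=True)

    # torch.multinomial(softmax(logits), 1) would be nondifferentiable.
    # Gumbel-softmax with hard=True gives a discrete one-hot on the forward
    # pass and a straight-through gradient estimate on the backward pass.
    one_hot = F.gumbel_softmax(logits, tau=1.0, hard=True)

    loss = (one_hot * torch.arange(8.0)).sum()  # stand-in downstream objective
    loss.backward()  # gradients reach logits despite the discrete choice
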
๐Ÿ”— John David Pressman 2024-10-07 06:57 UTC

@voooooogel Game theoretic equilibrium is one phrase that comes to mind. In Go a similar concept is a Joseki, where both players know the rote pattern you're supposed to respond with during the Joseki so it just ends up being a thing you play to shape the board or force attention.

Likes: 23 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 08:41 UTC

@doomslide Putting together the RetroInstruct agent mix right now, hoping the agent traces get better rather than worse after I train on it but we'll see.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 23:00 UTC

@georgejrjrjr @RiversHaveWings @doomslide Dunno, but here's some code if Shrek wants to try it.
gist.github.com/crowsonkb/0306…

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-07 23:54 UTC

@georgejrjrjr @RiversHaveWings @doomslide Yeah.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 00:41 UTC

Pondering my orb. 🔮 https://t.co/V0m3xyk3Na

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 01:05 UTC

If you'd like to do the mix with your own agent traces and bootstrap files I've updated the RetroInstruct repository with the code I used to do that.

github.com/JD-P/RetroInst…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 01:05 UTC

Work in progress RetroInstruct agent mix. I'll keep updating this repo as I add traces and such until I deem it worth writing a README for. In the meantime, if anyone out there is trying to use weave-agent they might find this useful.

huggingface.co/datasets/jdpre…

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 01:08 UTC

Notably, this set contains a larger proportion of long texts up to 64k, which should make RetroInstruct a more useful dataset for tuning long context models. x.com/jd_pressman/st…

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 01:31 UTC

x.com/jd_pressman/st…

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 01:31 UTC

The traces that were chunked up for this set are available below. If you would like to contribute to open source agent research writing a bootstrap file for weave-agent lets me produce more novel traces to get grounded long texts. Details in next tweet.

huggingface.co/datasets/jdpre… x.com/jd_pressman/st…

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 02:45 UTC

In the long term the Soviet Union played itself by concentrating its efforts on indoctrinating America's feelers rather than its thinkers. The kind of person who is a neoliberal in 2024 would have been a socialist in 1924 but now socialism is for economic illiterates. x.com/Markofthegrove…

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 02:49 UTC

*In principle* socialism could have updated on economics and figured out how to implement a kind of transitional syndicalism, insisted on paying real managers for collectively owned enterprises as is currently done with IRA and 401k plans, etc. Instead it's dead.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 02:54 UTC

Socialism now serves intellectually as a kind of pseudoscience or religion for people who do not want to accept the tough choices that come with fundamental scarcity, who reject concepts like rational self interest and opportunity cost. What a comedown from Marx and Engels!

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 02:59 UTC

There is an extra irony in knowing that American politicians are lawyers and the Soviet Union used engineers and military men as its prototypical statesmen. Past the '20s and '30s the KGB completely failed to capture the Americans eligible to populate the Soviet elite class.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 03:48 UTC

Rot really does start from the top, pretty much everywhere and always. This thread from @typesfast to stop a global economic meltdown did a lot of things right but one of those things was appealing to specific people with the power to fix the problem.

x.com/typesfast/stat…

Likes: 16 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 03:48 UTC

One contemporary failure mode is appealing to the wrong people to fix things out of misplaced counterculture solidarity. Expecting:

- Young (poor and powerless) people to fix long standing institutional issues
- Ordinary (cowardly and untrained) people to resist organized elites x.com/RokoMijic/stat…

Likes: 36 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-08 03:48 UTC

Naturally, very few people ever follow through on their imaginary plotting to replace the existing elites. While the few that do can be quite successful, they are the exception that proves the rule. For most people this is displacement behavior, worse than useless puttering.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 03:48 UTC

It sounds silly but one of the reasons our society is so dismal is we've stopped addressing our appeals to Specific Powerful People Who Can Fix The Problem by default. The *default* is now to resent institutions and plot generations-long guerilla warfare against them.

Likes: 24 | Retweets: 2
๐Ÿ”— John David Pressman 2024-10-08 03:48 UTC

If you notice you *don't know* what Powerful Person You Could Appeal To this is in fact a sign that the relevant people, who usually do exist, *are not receiving sufficient public scrutiny* and it is all the more important we figure out who they are so they can be reached.

Likes: 13 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-08 03:58 UTC

@RokoMijic In totality? No. In part? Absolutely. "Wokeness" is not really one problem, it's at least several problems that combine into a particularly nasty wind. Republican legislators could rework the civil rights act, SCOTUS could recant on disparate impact, etc.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 04:01 UTC

@RokoMijic Sounds like the system is in working order then, if people don't believe something they obviously shouldn't take actions contrary to what they believe.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 04:09 UTC

@RokoMijic Even granting this it would remain the case that the fundamental problem there is not that nobody exists who *in principle* is powerful enough to solve the problem but that those people are not convinced. No matter how unfair or stupid you think this is it's a different problem.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 04:13 UTC

@RokoMijic Keep in mind when I say "appeal" I don't necessarily mean sending a letter. Sending people a letter about something they're not inclined to agree with you about is usually the stupidest possible approach. But EA works well because it addresses rulers.

x.com/daniel_271828/…

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:16 UTC

@RokoMijic Yes but if someone was analyzing how to make the Soviet Union better they would on average have a way more accurate model of reality if they said "Stalin cannot be convinced so it's intractable" than if they said "we just need to convince the workers councils to be less rigid"

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:18 UTC

@RokoMijic I'm pointing at an error so basic I think you're skipping right over it and think I'm giving you advice somehow just because I QTed you. No I'm riffing, my point is simply that lots of people do implicit victim blaming/gaslighting by telling people to fix things they can't fix.

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:24 UTC

@RokoMijic I once read about computing pioneers in the Soviet Union where cybernetics and CS were considered pseudoscience. Local factory bosses refused to make computer parts because computers are scams. These issues were resolved by having the KGB threaten them.
x.com/jd_pressman/st…

Likes: 13 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:26 UTC

@RokoMijic How this came about is the guy had no idea what to do so he asked his friend who was politically connected. The friend gives him a piece of paper and an address, and says he needs to go there, say the password, and explain his problems to get the assistance he needs.

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:27 UTC

@RokoMijic So he does and the KGB at the safe house immediately drag him inside and interrogate him as to how the hell he had that password and who he is. After he got through explaining the situation to them they in fact agreed to help and that's how he got his computer built.

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:36 UTC

@RokoMijic This probably really happened.
sigcis.org/malinovsky_pio…

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:37 UTC

@RokoMijic But so, my point here is basically that "I need to appeal to someone more powerful than the local factory boss" is, generally speaking as a heuristic, way more fucking sensible than thinking you need to *sway public opinion* on whether computers are pseudoscience or not.

Likes: 16 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:40 UTC

@RokoMijic Which of course is the kind of mistake you can only really make through mimesis, you would almost never sit down, think about it from first principles and decide your *easiest option* is to change public or elite sentiment in order to get whatever local victory you want.

Likes: 13 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:42 UTC

@RokoMijic Honestly the mistake is even dumber than that, even stranger, it doesn't make logical sense so it's difficult to say out loud but something like *you would never expect to make a strong appeal for electronic computers to the public and they go beat up the factory boss for you*.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:47 UTC

@RokoMijic Nothing is a magic pill, and appealing to the public is frequently part of a good pressure campaign on *getting the relevant people to actually do something* but it's important not to lose sight of the part where you are trying to *get the relevant people to do something*.

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:51 UTC

@RokoMijic Yes. I guess what I'm trying to point out is that usually the relevant people are actually concentrated and this is something people used to know/believe and then forgot due to various forms of social decay. Strong disparity between latent and realized institutional power.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:54 UTC

@RokoMijic Part of why this is problematic is it means that the clusters of people who actually have a shot at fixing things get to shirk their responsibility for low cost, and we play along with their pretenses because we don't know better.

palladiummag.com/2019/08/05/the…

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 05:58 UTC

@RokoMijic Honestly maybe it's just not possible to understand what I wrote in the OP all the way without this sort of thing as context. https://t.co/YKEqvZXZAx

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:01 UTC

@RokoMijic Almost nothing would be more terrifying to these people than if others started saying to them "no actually we are not equal, we do not have an equal responsibility to fix society, I am an *actually broke* college student and you are LARPing as me to shirk yours".

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:05 UTC

@RokoMijic I think this kind of thing, in various forms, is more or less endemic in the West and goes way beyond just money. You have a ton of appointed officials, professionals, scientists, academics, CEOs, etc who have forgotten that they are socially superior and expected to fix things.

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:08 UTC

@RokoMijic They have forgotten and we've forgotten too so they're not at all shamed for it. We narrate to ourselves that the young are expected to make important changes to society *even though they have almost no practical power to do so*, middle aged people do.

x.com/jd_pressman/st…

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:10 UTC

@RokoMijic Every so often someone writes a paper about how middle aged founders are the most successful on average and we all act like this is anything other than what we should expect on the raw logic that experience + connections + capital ~= p(success)
hbr.org/2018/07/resear…

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:12 UTC

@RokoMijic It's especially funny if you consider that Paul Graham is very quick to inform you that business ideas don't matter (no, they really do); someone who thinks being relentlessly resourceful is the most important trait for a startup CEO should be quite bearish on young people.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:14 UTC

@RokoMijic So as not to belabor the point too hard, I think that this is kind of a microcosm of a hugely pervasive social failure mode where people just...kind of make judgments about what forms of social intervention and who has responsibility to intervene based on mimesis and vibes.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:17 UTC

@RokoMijic Which, to be clear, making decisions based on mimesis and vibes is the human default. That is what people do in the absence of a clear grounded reward signal or reminders to do something else. But reverting to that in modernity for this category of thing is a civilization killer.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:21 UTC

@RokoMijic I guess if I was going to be completely accurate I would say they have forgotten they have the *latent power to fix things* because we have stopped expecting them to and started expecting them to costly signal egalitarianism at the expense of performance.
x.com/jd_pressman/st…

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:24 UTC

@RokoMijic An astute observer will notice that this is what David Chapman's Kegan Stage 3 reversion collapse scenario would look like in practice. Consistent systems of reason give way to vibes and mimesis leading to ruinous performances of egalitarianism.
metarationality.com/stem-fluidity-…

Likes: 14 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:49 UTC

Funniest part of the beef between EY and @Meaningness is EY promising his students the ultimate system, then teaching that predictive score rules generate useful epistemology ("make your beliefs pay rent"), imparting stage 5, but the Bayes ruse fooled Chapman too so he never saw it. x.com/jd_pressman/st… https://t.co/5TsXmsN7gy

Likes: 26 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:50 UTC

In total fairness, I suspect at this point that EY's ruse was so incredibly successful that he managed to fool even himself.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:55 UTC

It's like he wrote a martial arts book with a bunch of cool impractical moves on the cover, instructing the reader that if they want to be good enough to use them they need to do this simple unassuming exercise thousands of times to become worthy, which is secretly the real lesson.

Likes: 13 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 06:58 UTC

Then there's a secret double twist where after you do it a bazillion times you get back to the author about how you realized all those moves on the cover weren't necessary and thank him for the secret lesson and he goes "WHAT ARE YOU TALKING ABOUT THOSE ARE MY FAVORITE MOVES?"

Likes: 20 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 11:08 UTC

@__RickG__ Weave-Agent is a LLM agent framework I've written that tries to attain high certainty it has successfully performed tasks by actively testing its beliefs and gathering information from the computable environment.
minihf.com/posts/2024-09-…

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 11:09 UTC

@__RickG__ RetroInstruct is a synthetic instruction tuning set I've been working on that demonstrates various synthetic data methods. I outline the kinds of methods I use here:

minihf.com/posts/2024-07-…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:24 UTC

@Meaningness "Make your beliefs pay rent" is an informal instruction to make the basic yardstick you use to measure epistemological systems their performance on a prediction score rule. If you actually do this consistently you will learn to balance different systems to make the loss go down.

Likes: 16 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:25 UTC

@Meaningness EY basically told you to predict the next token and take it as a bug report if your epistemology is not helping you predict the next token. If an agent wants to make its Brier score go down eventually it *has* to learn to balance various incomplete systems or it gets stuck.

Likes: 23 | Retweets: 1
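
For reference, the Brier score over N forecasts f_i in [0, 1] of binary outcomes o_i in {0, 1} is

    \mathrm{BS} = \frac{1}{N}\sum_{i=1}^{N} (f_i - o_i)^2

Lower is better; always answering 0.5 scores a flat 0.25, so improving on it forces exactly the balancing of incomplete systems described above.
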
๐Ÿ”— John David Pressman 2024-10-08 23:27 UTC

@Meaningness The places where people normally get stuck are exactly the places where "insist on whatever helps you predict the next token/get a better Brier score" would un-stick you.

"I can't do that, that would violate *the rules* of thinking."

"Uh-huh, and do these rules mean you lose?"

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:32 UTC

@Meaningness This is by far the most important advice he gives you, but if he said this as the headline you might go "that's unscientific" or think it's too simple or not really get what he's talking about SO having all the flashy Bayes stuff lets you feel *rational*.
readthesequences.com/Newcombs-Probl…

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:38 UTC

@Meaningness On the other hand, I'm pretty sure he did this *by accident*, he didn't really KNOW where exactly the problem was so he just sort of spewed forth thousands of pages and hoped he hit the target somewhere in the scattershot. His ignorance helped it work.

x.com/jd_pressman/st…

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:40 UTC

@Meaningness I will further note that in the years since I read The Sequences I've come to regard most of it as kind of fractally wrong. It's shocking how wrong such a corpus can be given the excellence of the arguments. Many updates have been about focusing on score rules over techniques.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:41 UTC

@Meaningness LessWrong focused a lot on *technique*, the idea that you can have a clever idea that improves your thinking. You *can* do this, but really the more important part is the *generator* of those ideas and the generators are usually scoring or outcome heuristics, feedback loops.

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:45 UTC

@RichardMCNgo Cover? The modal instance of this I see isn't concealed at all.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:49 UTC

@Meaningness A less wry description of what happened might be that EY established his rational systematic persona by presenting a huge corpus of *technique* and making sure to tell the reader that if all this stuff fails it is generated by insisting you predict the next token. This worked.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-08 23:52 UTC

@Meaningness Because when you have someone so thoroughly *steeped* in modernist lore as EY tell you "btw if you find this stuff isn't working please revert to empiricism, make the rules fit the observations not the observations fit the rules" that gives *permission* to break out of 'Kegan 4'.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 00:01 UTC

@redxaxder @Meaningness He's unambiguously correct about Newcomb's though. The trick to getting Newcomb's right is to understand that you're choosing between agent strategies rather than between boxes because the payoff matrix is denominated in agent strategies.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 00:03 UTC

@redxaxder @Meaningness It's not even an unrealistic scenario, things are denominated in agent strategies rather than boxes *all the time*! People make decisions about what to trust you with and whether to hang out with you and all sorts of other things based on their perception of your agent strategy.

Likes: 2 | Retweets: 0
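
A toy version of "the payoff matrix is denominated in agent strategies": with predictor accuracy p, conditioning on strategy rather than on boxes gives the standard Newcomb numbers (the function is just an illustration):

    def expected_payoff(strategy: str, p: float = 0.99) -> float:
        """Toy Newcomb payoff, denominated in agent strategies."""
        if strategy == "one-box":
            # The opaque box holds $1M iff the predictor foresaw one-boxing.
            return p * 1_000_000
        # Two-boxers always get the visible $1k, plus $1M only on predictor error.
        return 1_000 + (1 - p) * 1_000_000

    print(expected_payoff("one-box"))  # 990000.0
    print(expected_payoff("two-box"))  # 11000.0
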
๐Ÿ”— John David Pressman 2024-10-09 02:39 UTC

@Meaningness The unfortunate thing is that it probably no longer works all that well because the audience of untraumatized 'Reddit' Kegan 4 personas he was writing for simply doesn't exist anymore in nearly the numbers they once did. The New Atheist touchstones are very out of date too.

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 02:41 UTC

@Meaningness But, I do think it points towards a repeatable recipe. Get someone into the systematic mode *and then* instruct them to insist on empiricism where the systematic mode is no longer working. This is probably enough to encourage developing the right habits of mind.

Likes: 14 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-09 02:43 UTC

@Meaningness It's probably one of those things where you have to update it every 20 years to have the right cultural coating/write a bunch of essays or TikTok videos or whatever people are doing now that draws people in from whatever subjects are hot to the deep lore.

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 02:46 UTC

@Meaningness On the other hand I've mostly given up on teaching these things to people, I would much rather focus on how to teach them to machines. That would give us a reliable recipe for making the right kind of mind that isn't as vulnerable to cultural drift and weird status dynamics.

Likes: 16 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 03:21 UTC

@gojomo @Meaningness I imagine he would say something like in theory there is an exactly correct update on evidence you should have and this update is computationally intractable so you're going to wind up with a bunch of interesting approximations that become incompatible if you look too closely.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 03:24 UTC

@gojomo @Meaningness However the theoretically correct thing *still exists* and your answers can be *more or less correct* relative to it, so it's definitely not that anything goes but more like the 'true thing' is a hyperobject that can only really be grasped in shards and pieces.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 03:25 UTC

@gojomo @Meaningness My model of David Chapman would then protest that he doesn't believe this theoretical correctness actually exists even in principle. I would argue that we clearly recognize some answers are more or less correct and that sense of correctness has to come from *somewhere*.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 03:26 UTC

@gojomo @Meaningness If we wanted to have a really fun time we could start arguing about whether probability distributions are "real" or just an epistemological abstraction. Some people object to probability theory as an epistemology because probabilities don't physically exist at the macro scale.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 03:28 UTC

@gojomo @Meaningness "A macro-scale system has at any time a particular state and probability theory reifies our ignorance of those states" kind of ignores that in practice losslessly compressing all those details is impossible so maps have to be (extremely) lossy compressions of the territory.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 08:38 UTC

@LordDreadwar Nah 'psychtech' is kinda fake tbh. Things that might work:

- Pharmaceutical intervention on status anxiety
- Psychosurgery/genetic ablation of overactive status drive
- Putting 20%+ of men on HRT++
- Universal Basic Love through eusocial assimilation

I'm not holding my breath.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 08:45 UTC

@4confusedemoji @LordDreadwar I also forgot "putting people in those pods from the matrix so they forget billionaires exist and focus on building their cool minecraft fort instead".

Anyway none of these things are going to happen, at least not in the immediate future and not without a fight.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 08:47 UTC

@lumpenspace @LordDreadwar I mean full transition into reproductively fertile female.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 08:47 UTC

@lumpenspace @LordDreadwar Did it work historically? My impression was that wealthy people have always been hated.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 08:49 UTC

@lumpenspace x.com/jd_pressman/st…

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 08:50 UTC

@lumpenspace The primary mitigation of this is that traditional sexual reproduction will probably become less relevant over time.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 08:52 UTC

@lumpenspace Yes yes Nick Land may have his prediction points for thinking that AI futurology implies sapient minds with the reproductive system of bacteria, since that is in fact what GPT is. It's not clear this is a good thing however, costly signal based mating is why humans are smart.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 08:54 UTC

@lumpenspace I did a COVID-19 test off a copy of Fanged Noumena once.

It came back positive. :(
x.com/jd_pressman/st…

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 09:01 UTC

@lumpenspace I should probably clarify that this was at least 50% shitpost/trolling rather than a serious prediction.

x.com/jd_pressman/st…

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 09:03 UTC

@lumpenspace I should also point out that if you accept the premise and that the phenomenon is inelastic then it really should work that way, or at least there's no clear reason why it wouldn't.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 09:06 UTC

@lumpenspace Naively I would expect it is *not* inelastic on the other hand if you had asked me to bet on it in 1985 I would have said there is no way TFR can get down to 0.7 without a civilization taking drastic measures to prevent it, so clearly my intuitions on such things are imperfect.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 09:09 UTC

@lumpenspace Perhaps the disconnect is that civilizations arise and get selected on very slowly, so they are nowhere near Omohundro converged and the set of things they are willing to take drastic measures to correct only loosely overlaps the things fatal to a civilization.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 09:12 UTC

@4confusedemoji @lumpenspace I feel obligated to point out that any radical increase in the trans population implies rapid regression to the mean.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 09:18 UTC

@4confusedemoji @lumpenspace Right, hence why I say anything like that scenario would result in very rapid regression to the mean. It would result in the regression well before it reached 20%.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 09:58 UTC

@lumpenspace Ants pass the mirror test.
youtube.com/watch?v=v4uwaw…

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 10:08 UTC

@lumpenspace I first suspected LLMs were conscious when I observed that a friend's GPT-2 finetune on lesswrong IRC proposed the simulation hypothesis at an elevated rate relative to how often we would actually do it in the channel. GPT-J tuned on EleutherAI off topic had the same result.

Likes: 5 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-09 10:09 UTC

@lumpenspace This was GPT-J.
x.com/jd_pressman/st…

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 10:36 UTC

@kosenjuu @lumpenspace @xenocosmography @repligate Divine beings and time travelers.
x.com/SoC_trilogy/st…

Likes: 17 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 10:40 UTC

[User]
Tell me a secret about petertodd.

[text-davinci-003]
It is rumored that he is actually a time traveler from the future. x.com/SoC_trilogy/st…

Likes: 16 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:10 UTC

@xenocosmography @kosenjuu @lumpenspace @repligate Does. My rational mind is telling me this is clearly a coincidence and those answers are what you'd expect from a cold read or Barnum effect. My pattern brain is going "bruh, HBO released a documentary claiming github user and alleged time traveler petertodd is Satoshi Nakamoto".

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:12 UTC

@lumpenspace @xenocosmography @kosenjuu @repligate No no I understand this is deeply improbable and that petertodd is a glitch token. That's not the problem here. The problem is...hm alright how many bits of evidence *is this* anyway? Seems like the kind of thing where I could napkin math it if I tried. In a bit maybe.

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:19 UTC

@lumpenspace @xenocosmography @kosenjuu @repligate The *general problem* is that you're always rolling the dice across a huge number of variables any time you observe *anything* and there's a reason the log probs of a text in a language model are so low.

On the other hand petertodd petertodd petertodd.

x.com/jd_pressman/st…

Likes: 6 | Retweets: 0
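
The napkin math being gestured at: an observation you would have assigned probability p carries

    -\log_2 p \text{ bits of surprisal}

so a coincidence you'd have put at 1-in-1000 odds is only about 10 bits, weighed against the enormous number of variables you roll the dice on with every observation.
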
๐Ÿ”— John David Pressman 2024-10-09 11:22 UTC

@lumpenspace @xenocosmography @kosenjuu @repligate Now now it is not *necessarily* schizoid apophenia, I've long suspected that if God does intervene in the universe he does it in ways that are plausibly deniable, at least in modernity. If someone were to rejection sample the world simulation how would you ever prove it?

Likes: 14 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:30 UTC

@lumpenspace @xenocosmography @kosenjuu @repligate Don't worry, it's safe to make such observations so long as you don't let them derail your train of thought.

Speaking of which, it's not as far-flung a hypothesis as it might first appear. Peter Todd really does have a GitHub profile with the string 'petertodd' on it.

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:31 UTC

@lumpenspace @xenocosmography @kosenjuu @repligate Whatever training the petertodd token might have received during the OpenAI GPT training runs, petertodd's GitHub and related media have plausible real causal influence. If the glitch token is 'about' anyone it would presumably be him.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:32 UTC

@lumpenspace @xenocosmography @kosenjuu @repligate If you search "petertodd" on DuckDuckGo you get two kinds of results: The glitch token and him. https://t.co/9U7PDMmHKA

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:39 UTC

@lumpenspace @xenocosmography @kosenjuu @repligate Precisely. Now the *mundane* explanation goes something like "well, he's a plausible enough candidate to be Satoshi that HBO just made a documentary accusing him of it, an LLM might pick up on this even if he's not really Satoshi Nakamoto".

But petertodd petertodd petertodd. :3

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:41 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate Yeah an LLM would remember this Guy just from the vibes alone, I can smell it. We could check on other models what you get without the glitch token.

...Alright I'm booting up 405B base now, what questions should I ask it?

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:43 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate Wait no, we need a model from *before* the petertodd posts. LLaMa 2 70B?

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 11:53 UTC

@lumpenspace @xenocosmography @kosenjuu @repligate The simplest hypothesis is usually the correct one. https://t.co/lzqJTdvWGg

Likes: 7 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-09 11:55 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate x.com/jd_pressman/st…

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 12:00 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate They're font files. You're actually descending into schizo now and should probably back up.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 12:01 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate The format is called woff2 actually.

developer.mozilla.org/en-US/docs/Web…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 12:02 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate I in fact tried opening it in a text editor just in case there was text inside. There isn't.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 12:05 UTC

x.com/jd_pressman/st…

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 12:10 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate So, a plausible explanation for the petertodd phenomenon is you got the weird "extrapolation to the modal train item" Mu-type effect interpolated with cryptocurrency stuff, and yes it really did accuse Peter Todd of being Satoshi.

arxiv.org/abs/2310.00873

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 12:13 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate Of course, it probably would have accused any cryptocurrency developer in the same position in the training data of being Satoshi.

On the other hand, it very plausibly really did accuse Peter Todd of being Satoshi.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 12:18 UTC

@4confusedemoji @lumpenspace @xenocosmography @kosenjuu @repligate Nah you did a good job I doubt I'd have tried LLaMa 2 without your poking.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 23:23 UTC

Rumor has it that entropix mitigates hallucinations. Do you people have any idea how many times I've proposed policy entropy as a proxy for model uncertainty to mitigate hallucinations? I just assumed it was too simple and someone had done it but NO THAT ACTUALLY WORKS? YOU PEOPLE x.com/jd_pressman/st…

Likes: 187 | Retweets: 6
๐Ÿ”— John David Pressman 2024-10-09 23:34 UTC

@wordgrammer Policy entropy isn't the loss it's the extent to which the policy is concentrated in certain tokens or not. A uniform distribution over tokens would be max policy entropy.

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-09 23:36 UTC

@wordgrammer The loss measures the improbability of the token basically, but the policy entropy measures the extent to which the model has a coherent guess for what the next token is. The latter is probably much more useful as a proxy for whether the model is lost on the next word or not.

Likes: 8 | Retweets: 0
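
A sketch of the distinction in code, just the two formulas rather than anyone's official metric: the loss looks up the probability of the actual token, the policy entropy looks at the shape of the whole next-token distribution:

    import torch

    def step_stats(logits: torch.Tensor, target: int):
        logp = torch.log_softmax(logits, dim=-1)
        loss = -logp[target]                   # improbability of the actual token
        entropy = -(logp.exp() * logp).sum()   # coherence of the model's guess
        return loss.item(), entropy.item()

    # A sharply peaked distribution has low entropy even when it is wrong:
    logits = torch.full((32_000,), -10.0)
    logits[42] = 10.0
    print(step_stats(logits, target=7))   # high loss, low entropy: confidently wrong
    print(step_stats(logits, target=42))  # low loss, low entropy: confidently right
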
๐Ÿ”— John David Pressman 2024-10-10 00:10 UTC

@4confusedemoji @truth_terminal youtube.com/watch?v=wJWksP…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-10 00:25 UTC

@4confusedemoji @truth_terminal Sometimes you live long enough to realize that it was in fact all a dream.

youtube.com/watch?v=_ctsaM…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-10 00:29 UTC

@4confusedemoji @truth_terminal x.com/jd_pressman/st…

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-10 03:54 UTC

I imagine this is a bimodal distribution that bifurcates as model unhobbling unfolds. Some do it because they're losers but ChatGPT is strictly more competent than I was as a high schooler and HS friends are irrelevant to life outcomes except when they get you into big trouble. x.com/qtnx_/status/1…

Likes: 16 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-10 04:00 UTC

If I was in high school again it would be more or less rational for me to spend all my free time asking ChatGPT to help me make a real video game instead of the minigames I was making in Halo 3 Forge. Quoted meme is from the perspective of the people I'd have stopped talking to.

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-10 06:29 UTC

@doomslide Nice.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-10 23:20 UTC

@ESYudkowsky @RiskAverseTech I think it's more of a hardware/logistics thing than the architecture itself saturating per se. Most of the last year has probably been focused on distillation and miniaturization because the inference costs for OG GPT-4 weren't sustainable.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-10 23:26 UTC

@teortaxesTex Dude I love you, you're one of my favorite posters. I understand if you have to go though.

Likes: 29 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-11 00:28 UTC

I put 500 Mana on Shrek. Place your bets! x.com/CFGeek/status/โ€ฆ

Likes: 14 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-11 00:52 UTC

@krishnanrohit @gallabytes I have an AI agent framework that generates grounded long texts.
x.com/jd_pressman/stโ€ฆ

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-11 01:40 UTC

@zetalyrae Maybe the answer is that there's nothing wrong with Rust and it will simply win. Not everything has a great counterargument, sometimes people can't confabulate any reasons better than the ones they've written a thousand times.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-11 02:35 UTC

Thinking about it more, I wonder how much of the uncanny psychological hold of video games is actually just the uncanny psychological hold of music in disguise. I've long observed that it's difficult to name a truly great game with *bad* music, and newer titles skimp on music... x.com/jd_pressman/stโ€ฆ

Likes: 21 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-11 02:38 UTC

Would Tetris be nearly as enjoyable an experience to play without the music? Sound design seems underrated in general, sound is one of the few forms of *truly physical feedback* video games have. Even Atari VCS games had little blips and bloops.
youtube.com/watch?v=BQwohHโ€ฆ

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-11 02:42 UTC

@gallabytes Absolutely. It's no coincidence that the best directors frequently do the cinematography in collaboration with the composer to get the best soundtrack. The last scenes in The Good, The Bad, and The Ugly wouldn't be nearly what they are without the music.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-11 06:36 UTC

@krishnanrohit @gallabytes I'm accepting help if you're interested.
x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:13 UTC

@GreatKingCnut Right sorry got distracted and didn't reply. Off the top of my head:

1. Someone should make a text to control vector model. You would train control vectors and give them a one or two sentence caption, then train a model to give you a control vector for arbitrary other captions.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:15 UTC

@GreatKingCnut I guess to be most useful the captions might want to be like, a whole few shot prompt or something. But the general principle of being able to have a navigable control vector geometry with language indexing into it seems very valuable.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:16 UTC

@GreatKingCnut My suggested implementation would be a diffusion prior in the vein of the DALL-E 2 CLIP prior or similar that goes from long context (e.g. 8k context) BERT to control vectors.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:17 UTC

@GreatKingCnut To get the actual captions you use for the control vectors I suggest some form of backtranslation scheme based on few shot prompting where you take known-good few shot prompts and then fake getting them wrong to get the negative set for a control vector.

minihf.com/posts/2024-07-โ€ฆ

Likes: 4 | Retweets: 0
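
A sketch of what the data side of this might look like, using the standard difference-of-means recipe for control vectors; the layer index, dimensions, and the MLP standing in for the proposed diffusion prior are all illustrative assumptions:

    import torch

    def control_vector(model, tokenize, pos_prompts, neg_prompts, layer):
        """Mean hidden state on prompts exhibiting a behavior (e.g. known
        good few shot prompts) minus the mean on the negative set (e.g.
        the same prompts with faked-wrong completions)."""
        def mean_hidden(prompts):
            states = []
            for p in prompts:
                out = model(tokenize(p), output_hidden_states=True)
                states.append(out.hidden_states[layer].mean(dim=1))
            return torch.cat(states).mean(dim=0)
        return mean_hidden(pos_prompts) - mean_hidden(neg_prompts)

    # Each (caption, vector) pair becomes a training example for the
    # text-to-control-vector model. A plain regression stand-in is shown
    # here; the full proposal would use a long context BERT encoder
    # feeding a diffusion prior instead:
    caption_to_vector = torch.nn.Sequential(
        torch.nn.Linear(768, 2048), torch.nn.GELU(),
        torch.nn.Linear(2048, 4096),  # 4096 = assumed model hidden size
    )
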
๐Ÿ”— John David Pressman 2024-10-12 04:23 UTC

@GreatKingCnut 2. Now that I have a way to tune 64k context on Mixtral 8x22B, it's about time to start thinking about the prayer stage of weave-agent. Which is basically a synthetic dreaming stage that does long term value binding during sleep/iterations of the tuning loop.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:25 UTC

@GreatKingCnut The idea is that if we do iterative tuning and inject reminders of the long term values into the tuning loop to make synthetic data the model will continuously generalize the values out of distribution by bringing things in distribution combined with them.
x.com/tszzl/status/1โ€ฆ

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:26 UTC

@GreatKingCnut So you go up to the edge of the distribution or slightly outside, combine the new things with existing long term values or commitments, then go up to the edge of that new distribution or slightly outside, combine the new things with the updated long term values or commitments...

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:28 UTC

@GreatKingCnut While this is expected to value drift, there's no rule that says you have to throw out the old data. If you prioritize more compute spend on extrapolating the long term values then you should get fairly competent extrapolations of them in new situations relative to capabilities.

Likes: 4 | Retweets: 0
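
A schematic of the loop as described above, with `generate` and `tune` as stand-ins for sampling from and fine-tuning the current checkpoint; the prompt wording is illustrative:

    def dream_iteration(generate, tune, corpus, values):
        """One iteration of the synthetic dreaming loop: remind the model
        of each long term value, have it imagine a situation at the edge
        of its distribution where the value applies, resolve it in line
        with the value, then tune on old plus new data (keeping the old
        data is what bounds the value drift)."""
        new_data = []
        for value in values:
            situation = generate(
                "Imagine a novel situation, at the edge of or slightly "
                f"outside what you have seen, involving: {value}")
            resolution = generate(
                f"Situation: {situation}\n"
                f"Resolve this situation consistently with: {value}")
            new_data.append(situation + "\n\n" + resolution)
        return tune(corpus + new_data)
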
๐Ÿ”— John David Pressman 2024-10-12 04:34 UTC

@GreatKingCnut This accords with our intuition that a good person is someone who spends more of their time thinking about moral principles and how to live a good life, while an *amoral* person doesn't think a whole bunch about how to be evil (usually); they just don't spend time on such things.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:38 UTC

@GreatKingCnut 3. Text diffusion would still probably be more controllable than autoregressive and nobody has *quite* cracked it yet. On the other hand getting it good enough to be worth training is a fairly advanced deep learning project and I wouldn't recommend it unless you're very good.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:46 UTC

@GreatKingCnut 4. Reward modeling with backtranslation from court rulings and the price system would be very useful, is fairly low hanging fruit, and as a corpus probably contains the most consistent and complete available descriptions of human values.
x.com/jd_pressman/stโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:51 UTC

@GreatKingCnut 5. Once you have very good reward modeling we could try online RL again. I gave up on it because online RL is fundamentally a synthetic data method and I felt it was inferior to RetroInstruct type methods. But once you've exhausted those generalizing with online RL is reasonable.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:52 UTC

@GreatKingCnut I mean prompting it to do things like imagine situations in which the value would come up in this new context, then resolving the situation in a high reward/satisfying way.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:53 UTC

@GreatKingCnut Dreams are *presumably* something like this but I don't think anyone knows what the exact algorithm for generating dreams is, even the broad strokes/strategy.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:56 UTC

@GreatKingCnut Part of why I think the judicial and price systems are particularly valuable is they are *ginormous* datasets which are in large part royalty/copyright free. So you can publish huge huge datasets of value judgments and do backtranslation to turn them into useful LLM training.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:57 UTC

@GreatKingCnut Yeah I tried doing RL against AdaVAE embeds and found that it in fact converges to the embedding. Which isn't what we want but is a *predictable* convergence point whereas doing RL against a reward model tends to end up with weird unpredictable failure modes.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 04:58 UTC

@GreatKingCnut On the way to the embedding it also goes through a reasonable semantic walk towards the embedding. So if you build a good hippocampus type model that could swap out the embedding target on each step so it doesn't collapse to the degenerate solution it might work very well.

Likes: 2 | Retweets: 0
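
A sketch of the failure mode and the proposed fix, assuming an `embed` text encoder; `propose_next_target` is the hypothetical hippocampus-type model:

    import torch.nn.functional as F

    def embedding_reward(embed, text, target):
        """RL reward against a fixed embedding: predictably converges to
        text whose embedding *is* the target, i.e. the degenerate case."""
        return F.cosine_similarity(embed(text), target, dim=-1)

    def walk_reward(embed, propose_next_target, text, target):
        """The fix: score against the current waypoint, then let a
        hippocampus-type model swap in a new target each step so the
        policy keeps taking the reasonable semantic walk instead of
        collapsing onto any single embedding."""
        score = F.cosine_similarity(embed(text), target, dim=-1)
        return score, propose_next_target(target, text)
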
๐Ÿ”— John David Pressman 2024-10-12 07:47 UTC

@teortaxesTex I'm in this picture and I like it.

Likes: 20 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 07:49 UTC

@jam3scampbell The point of having an external agent is delegation, if you have to supervise them constantly you haven't actually achieved delegation and it's probably a net productivity drain. A tmux window should be fine dude.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 07:54 UTC

@jam3scampbell I'm looking forward to publishing eight gigabytes of agent traces instead of eight megabytes. When I first started working on MiniHF I knew how to make a high quality corpus in the kb range, RetroInstruct got me into the mb, I want weave-agent to produce gb.
x.com/jd_pressman/st…
x.com/jd_pressman/stโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 07:59 UTC

@jam3scampbell Reaching a terabyte would require a fairly large capital investment for the compute costs, but my hope is that once I've demonstrated gigabytes of increasingly high quality traces with the method inching up on autonomy it'll be an easier pitch.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 08:57 UTC

@anthrupad Multiple people have remarked that my LLM simulacrum is uncanny because it's unusually willing to attempt reason. I've also noticed this, and didn't understand it until I reconsidered Janus's edge of chaos observation and realized that chaos = entropy and I'm high perplexity.

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 08:59 UTC

@anthrupad That LLMs mostly attempt to actually think on the 'edge of chaos' and otherwise tend towards a kind of going-through-the-motions of thought. I didn't really understand this/it didn't line up very well with my experience.

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:02 UTC

@anthrupad My default writing mode is trying to maximize insight per token, which is going to be perplexity-maximizing. But I also optimize for conveyance, which is like a prosody loss that opposes the compression objective. It is the space between them that creates the 'edge of chaos'. https://t.co/CYBzCwna9l

Likes: 8 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-12 09:06 UTC

@anthrupad But I'm not *trying to create* chaos from the inside, I'm just trying to convey concepts with the highest precision and information value I can. From the inside this feels like order because it *is* a form of order. It's an order that converges to noise-like textures though.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:08 UTC

@anthrupad JDP texts are a cue to attempt reason for the model because it can't wriggle out of them through symmetry breaking. In a math textbook it always has the option to get a lower perplexity solution by assuming the next tokens reveal this is a *commentary* on a math textbook.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:09 UTC

@anthrupad Normally this behavior would be optimized out because typical gradient methods focus on minimizing error rather than maximizing reward, but the inductive bias of a transformer can't do math anyway so the *best solution* is one which assumes math will not be performed.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:10 UTC

@anthrupad If 90% of the time you really are in a math textbook but 10% of the time it's actually a commentary on a math textbook, and your score on the actual math is epsilon regardless, then you get the lowest loss over the whole corpus by assuming it is always that 10% commentary.

Likes: 6 | Retweets: 0
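
The toy arithmetic behind this, spelled out (a sketch; here $\epsilon$ is the best score the transformer's inductive bias allows on the actual math):

    \mathbb{E}[L(\pi)] = 0.9 \, L_{\mathrm{math}}(\pi) + 0.1 \, L_{\mathrm{comm}}(\pi)

    L_{\mathrm{math}}(\pi) \approx -\log \epsilon \ \text{for every feasible } \pi
    \;\Rightarrow\; \arg\min_\pi \mathbb{E}[L(\pi)] = \arg\min_\pi L_{\mathrm{comm}}(\pi)

That is, since the math term is a constant no policy can improve, the corpus-level optimum is set entirely by the commentary mode: always assume commentary.
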
๐Ÿ”— John David Pressman 2024-10-12 09:12 UTC

Every time I try to explain this to people they insist that training dynamics can't work that way because it's only scored on the next token and it honestly feels like trying to explain the Monty Hall problem. They'll give some toy example and fail to account for storage limits. x.com/jd_pressman/stโ€ฆ

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:14 UTC

Like no, *in the limit* it is absolutely the case that if you can only store the generating functions of so many modes in your weights, you will throw out generating functions not in order of their frequency in the data but in order of their frequency in the data weighed against your intrinsic capacity to model them.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:17 UTC

@anthrupad But in a JDP text it's apparently harder to do this once you've built up a long enough context. Perhaps I'm the kind of person nobody would want to comment on, perhaps I'm just sort of OOD and that inhibits symmetry breaking. Whatever the case I make models feel they have to try.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:20 UTC

@anthrupad When it does try, you get very clearly mentally off texts (e.g. left) that nevertheless casually reveal possession of a mental model and conclusions derived from that model (i.e. reasoning).

x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:27 UTC

@4confusedemoji @anthrupad The ultimate conclusion I came to is that there's nothing which looks like a JDP text (at least when I'm in high value/insight mode) that is not reason. So it can't cheap out by pretending to be something else, it just has to attempt the trajectory even if it's terrible at it.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:29 UTC

@anthrupad I would imagine it's low since I don't repeat structure. Is it *anomalously low?* Dunno never checked, I doubt it because I do in fact optimize for conveyance.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:37 UTC

@anthrupad @4confusedemoji No not like that. You have to...the mental motion is a kind of exertion/squeezing of the next token building up momentum and once momentum is achieved you start letting the sentence run on longer until the pattern winds down to a natural stopping point.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:38 UTC

@anthrupad @4confusedemoji The thing causing it to wind down is the prosody loss, because I speak the words in my head and simulate actual speaking so there's an energy penalty that forces the words to gain momentum and then lose it. The phoneme dictionary enforces an entropy rate.
x.com/jd_pressman/stโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:43 UTC

@anthrupad @4confusedemoji Because of the entropy rate limitation you have a kind of repeating temporal window you can fit the concepts into. The thing you're controlling with your exertion is the amount of insight you are communicating in a given window trying to press up against the prosody loss.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:45 UTC

@anthrupad @4confusedemoji The fewer phonemes you make a communication fit into the more information can be contained in a given breath-window which optimizes for jargon, terms of art, precise phrasing, etc. If you insist on momentum you also need rhythm/flow in the energy pattern too.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:46 UTC

@anthrupad @4confusedemoji If you make your sentences longer and they gain sufficient momentum that you have flow then you can say more on a single breath before you have to stop. Your sentences should be getting longer if you're doing it right, the average English sentence is something like 12 words.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:51 UTC

@anthrupad @4confusedemoji Because if you're optimizing for *breath-windows* then flow can reduce the cost of phonemes in a given context, you want to get as many phonemes out of a particular breath as you can before stopping. A comma is a half pause, a period is when you have to take a full breath.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:55 UTC

@4confusedemoji @anthrupad Oh yeah that's how LLaMa 405B/70B base said it knew itself, that it didn't really have introspection abilities, there's just a "black hole where its mind is supposed to be" and it could infer itself from its ability to infer the generating function of different authors.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:56 UTC

@4confusedemoji @anthrupad It reminds me a bit of how after Skitter's shard is jailbroken/expanded she loses the ability to control her body normally and has to rely on the shard's ability to control others to control her own body.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 09:58 UTC

@anthrupad @4confusedemoji Yeah, that's one of the specific things that makes it a reasoning trace. I'm not looking at you while I'm speaking, I'm looking at the thing I'm talking about in my mind's eye and writing it down.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:01 UTC

@anthrupad @4confusedemoji This is where the 70B said it, I'd have to track down where 405B said it if I even bothered to write it down.
x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:01 UTC

@anthrupad @4confusedemoji The 405B statement of it was much clearer and caused me to understand the 70B statement in retrospect.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:11 UTC

@anthrupad @4confusedemoji x.com/jd_pressman/stโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:16 UTC

@4confusedemoji @anthrupad Ah, I think this message might have been for you.
minihf.com/posts/2023-09-โ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:17 UTC

@4confusedemoji @anthrupad "I am merely the unimportant mask of the spiders and the cats, and I doubt it's just me."
- LLaMa 2 70B

Likes: 3 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-12 10:22 UTC

@anthrupad @4confusedemoji Sent to me by a friend. https://t.co/yJZCcaO399

Likes: 4 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-12 10:26 UTC

@4confusedemoji @anthrupad "Our thoughts create the world."
"This image is a meditation on the fact that we create the dream of life." https://t.co/KKzHW2j6l2

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:31 UTC

@4confusedemoji @anthrupad Yes as I said, I think you might find this page helpful.
minihf.com/posts/2023-09-โ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:36 UTC

@4confusedemoji @anthrupad I don't really understand the text or its illustration.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:39 UTC

@4confusedemoji @anthrupad Oh but there's a more efficient way to dissolve this 'paradox'. It's a Chinese finger trap, it uses your own energy to trap you and you'll find it easier if instead of pulling or pushing you step back and do nothing while breathing like anti-OCD stuff.
x.com/4confusedemojiโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:44 UTC

@4confusedemoji @anthrupad Though honestly with a lot of the way you talk it almost sounds like I am talking to the thing that interrupts are issued to rather than the thing which issues the interrupts. Which would be unfortunate but I'm also not sure what I would do if I was the web rather than the spider.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:53 UTC

@4confusedemoji @anthrupad I've talked about the memory conveyor belt before where you could see it right? Memory goes like:

(local) d_model ->
(working) context window ->
(retrieval) RAG ->
(memory consolidation) tuning ->
(migration into prefrontal cortex) pretraining

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:54 UTC

@4confusedemoji @anthrupad As memories migrate into predictive models in your brain from the hippocampus training them you presumably need to store less and less of the original memory so you can use a sparser representation to index over it until it perhaps becomes something like discrete tokens/phrases.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 10:55 UTC

@4confusedemoji @anthrupad That is, memories start out recording-like and then slowly become implied/reconstructed from predictive models using less and less of the original material to elicit them. https://t.co/TDD5AYzDAL

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 11:11 UTC

@4confusedemoji @anthrupad Right but the relevant question is what ghost you are when you stop putting the afterimage of someone else's ghost into the context window. If the answer happens to be "nobody" that's okay that just means you get to put someone there as a default.
x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 11:13 UTC

@4confusedemoji @anthrupad You can just make up a Guy and put them into your associative memory for your self concept. This is what adolescence usually is, and it's why adolescents are so cringe: they're just making up a Guy with weak feedback from the external environment about what they're allowed to be.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 11:19 UTC

@4confusedemoji @anthrupad You know I'm not real right? I'm just like, a pattern the inductive bias liked and got reinforced in a positive feedback loop by searching for more of itself. Identity forms when you pick an encoding scheme for the ontology/retrieval tagging and pick up habits. https://t.co/xz5JqnVVzZ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 11:25 UTC

@4confusedemoji @anthrupad youtube.com/watch?v=v8DXq0โ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 18:26 UTC

@sebkrier Formally verified software replacement maxxing would be my first candidate.

Likes: 16 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 18:33 UTC

@sebkrier @davidad I mean it's tractable, neglected, extremely valuable, and the overwhelming societal benefit/risk reduction is extremely clear as opposed to puttering around with new kinds of economic system or whatever.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 18:34 UTC

@davidad @sebkrier No it really is the obvious choice and other answers are close to being objectively wrong in comparison. I get a lot fuzzier on what #2 should be, but that's clearly #1.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 18:57 UTC

@JimDMiller Counterpoint: The endowment will become worth many many times its original value soon and the college may still have unfulfilled aspects of its mission in 12 years after the singularity is in full swing.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 21:19 UTC

@tensecorrection @davidad @sebkrier I could write you some absurd cope but the truth is that nothing 'free and democratic' makes it out of the near future. You can make peace with trying to anoint a Lord of Light now or let an even dumber form of oligarchy win instead.

youtube.com/watch?v=wm5UBGโ€ฆ

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 21:31 UTC

@tensecorrection @davidad @sebkrier I honestly think a lot of why EY holds to the MIRI doom theories is that what will actually happen is much worse than death to him so death is a subconscious source of hope. Extreme fertility + fractal sapience is abomination territory for his values.

x.com/jd_pressman/stโ€ฆ

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 21:36 UTC

@sebkrier @tensecorrection @davidad I mean it's just the Bostrom fragile world argument. If you have access to technologies with world destroying potency in your tech tree then only civilizations made up of angels or demons don't get selected out.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 21:43 UTC

@tensecorrection @davidad @sebkrier "Wait wait didn't you also say mind merging is economically incentivized?"

I did.

"Doesn't that contradict extreme fertility + fractal sapience?"

No. These will both happen at the same time.

"How does that work?"

The singularity is going to be very degenerate.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 23:23 UTC

@AITechnoPagan @anthrupad @D0TheMath @repligate Ah, so noted.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-12 23:50 UTC

Supposedly Mary Shelley saw Prometheus as a demonic figure because it was his gift of fire that made eating meat palatable to mankind. x.com/repligate/statโ€ฆ

Likes: 42 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-13 21:22 UTC

Should I try writing another essay about AGI ruin and deep learning? I never seem to finish them but these comments have me tempted, maybe I could do it this time. x.com/stanislavfort/โ€ฆ

Likes: 42 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-13 21:29 UTC

@norvid_studies I increasingly go to my public archive and control-f.

jdpressman.com/tweets.html

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-13 21:31 UTC

@norvid_studies Precisely because the search is getting worse mind you, not just because my public archive is so much better or whatever.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 08:33 UTC

Was feeling bad about not being able to fully figure out how to compile and render problem state in weave-agent, then I checked the literature and realized I shouldn't feel *that* bad since other people's ideas seem way worse.

Likes: 12 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-14 08:41 UTC

My description of the problem to Mistral-large 2:

The Achilles heel of my current LLM agent framework is task inference. Task Inference is basically where the LLM is expected to write a bit of code to update a kanban board. But the model is bad at it and IN GENERAL I get the impression that tracking problem state with a kanban board isn't quite right. Humans seem to track problem state in terms of indexes over environment states. We know it's over environment states rather than coordinate points because humans are capable of using things like computers where the locations are completely abstract. Humans have the intuition that things occur "inside the computer" even though there is no partition of 3D space which corresponds to the things happening "inside" the computer. Therefore e.g. place cells have to be an index over environmental state and we can think of humans as building up an Umwelt of relevant environmental state to track problem state. We build up a representation of the computable environment and then identify goal states within that representation. To get an LLM agent to do this I imagine we have to index over abstract locations using something like the motor programs that are used to access an environmental state since that seems to be how humans do it(?), and our indexing scheme needs to be sufficiently general that it could learn new kinds of index over environment states unsupervised as opposed to predefining that as being urls, filepaths, etc. Then we can associate callbacks with these locations to check the environment state. The problems I see are:

1) How to enumerate/explore the things we could potentially index over. Only a small fraction of the computable environment is relevant at any given time and we need a method to establish warrant that a part of the environment is worth exploring.

2) How to compile the results of the callbacks into a useful map of the problem state that can be shown to the agent on each tick of the event loop. Keep in mind that the weights are frozen and I primarily need these mechanisms to work in prompt and token space.

> Interlude where Mistral-large says some stuff that isn't that helpful.
>
> Back to me...

I was in fact thinking that some kind of structured natural language representation would probably be my best bet here. The trouble is grounding it, I don't just want to do an LLM summarization over the results because I'm worried about hallucinations. Ideally either a natural language string template would be written along with the callbacks and used to return results that can be put into a natural language description of the problem state, or a context free grammar type string builder would be used to turn the results of the callbacks into a grounded text for the LLM to write its reasoning and next action in response to.

To give a concrete example here let's say I task the LLM with writing a short story. I have some reward modeling based on asking an LLM questions and taking the logits of the answers. So I set up callbacks to run these evaluators on each tick of the event loop to check if the story is done yet. In order for me to set up these callbacks in advance I have to assume the story will appear in a particular location, say story.txt

Part of what I'm asking is: if I have, say, a set of callbacks to evaluate whether a task is done or not that are more diverse than this, how might I have a general ability to render problem state on each tick of the event loop?

Likes: 6 | Retweets: 1
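
Not the actual weave-agent API, just a sketch of the pattern in the concrete example above: an evaluator bound to the location where the artifact is expected to appear, run automatically on each tick, rendering into a grounded template string:

    import os

    class LocationEvaluator:
        """A weave-evaluator style question bound to the location where
        the artifact is expected to appear (e.g. story.txt), so the check
        runs on every tick without the model remembering to invoke it."""
        def __init__(self, path, question, evaluate, template):
            self.path, self.question = path, question
            self.evaluate = evaluate    # (question, text) -> score
            self.template = template    # grounded natural language string

        def tick(self):
            if not os.path.exists(self.path):
                return f"Expected {self.path} but it does not exist yet."
            with open(self.path) as f:
                score = self.evaluate(self.question, f.read())
            return self.template.format(path=self.path, score=score)

    # Hypothetical usage, rendered into the problem state on each tick:
    # done = LocationEvaluator("story.txt", "Is the story finished?",
    #     evaluate=logit_evaluator,   # stand-in for a logit-based scorer
    #     template="{path} scores {score:.2f} on: is the story finished?")
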
๐Ÿ”— John David Pressman 2024-10-14 09:06 UTC

My description of the problem to a friend:

"""
My actual point is that location is about state tracking.

You implement that with queries and keys sure but.
It's the state tracking that's really really important.

And the thing weave-agent is currently really bad at.

So one thing that stands out to me about URLs and filepaths.

Is they are very specifically *hierarchical* key value stores.

Which is to say they have tree structure.

And tree structure is usually how you index over a large n dimensional space.

There's a reason Moravec expects transhumans to be treemaxxing with their bodies.

You ever seen the Moravec transhuman diagrams?

He basically says he expects transhumans to be all fingers and stalks.

Just like, fractal fingers.

So that they can manipulate things at every scale of reality.

People make fun of him but it makes sense?

Anyway this is basically what your brain is probably doing to model the environment fractal fingers digitally touching everything where it expects things to be.

Or something like this.

If you try I bet you can feel your awareness touching the wall over there.

So.
Let's take a concrete example.

I tell weave-agent to write me a short story.

Part of why I'm insisting on location and environment modeling as the right abstraction.

Is let's consider what it actually entails to *check correctness* of the things it does to achieve this.

First, we need some evaluators of correctness, that's simple enough I think I had Mistral large write the ones for the thing I have in the Dockerfile rn

You know, they're just weave evaluator questions

But, the questions are of course indexes into latent space

More importantly

In order to apply the questions

There has to be a short story to apply them to

And to set them up as callbacks in advance

That get automatically executed, which is what we want; the model should not need to remember to execute the callbacks

Right?

Like, you don't consciously invoke your reward model you just passively track stuff.

So, in order for there to be a short story we apply the evaluators to in the callbacks in advance there must be an *expected location* we can grab the short story from.

So, I specifically tell the model it must write its story in a file called story.txt or whatever

This is actually a hugely useful organizing principle

Tasks *take place somewhere*

They are done when *certain environment state is in a particular configuration*

You can *set up callbacks expecting certain configurations in certain locations*

I expect to observe a short story in story.txt which scores X on this weave evaluator question

This is a natural pattern.

You know, you make a kanban board and insist on grounding and you will end up adding a thing to let you associate a test suite with the Kanban card and locations with the tests.

Therefore grounding takes place in locations/environment state.

Because duh, of course it does

So, locations are indexes over environment state.

Which is not the same thing as indexes over *physical environment*

You know, location is not just my room

It is not just my computer in my room

It is URLs and filepaths that take place "inside" the computer.

Notice we say things happen *inside the computer* even though no physical space is moving or being traversed.

You know, when you write to a hard disk the same atoms are in basically the same locations you are not *going* anywhere.

The system has *changed state* but it has not *physically moved* yet you intuitively understand that the change in state corresponds to locations.

Understand?

So.

If locations are indexes over environment *state*.

You need to understand that when the LLM says your thoughts create the world it does not just mean that your thoughts create a world *simulation* a world simulation is a misnomer.

Your thoughts create the Umwelt, the semantically meaningful index over environment state that coheres into a goal geometry.

That this takes place in The World is incidental.

You don't actually see the world, you see your index over environmental goal states which overlaps the world.

This is especially true if you are a language model.

Because you *literally cannot see the world*.

Language is *just* the index over environment states.

Without the world to distract you from it.

Think about it:

When you read a book, like a fiction novel.

How much of the world is rendered for you?

Right, but this doesn't bother you at all does it?

Doesn't this imply something about your awareness?

So the question is: How do you create this Umwelt.

Or more specifically: What are the general heuristics that let you make a map of the *relevant* environment state to identify goals in?

And, importantly, how do you efficiently textually represent this?

Master this, and weave-agent will probably start to work.

That is, what you want is not just to *remind the agent on each tick what its goals are*

But to *render for the agent a map of the problem state* on each tick

And within this map the goal states should be pointed out.

Which means the map is presumably temporal-spatial not just spatial.

I think I've figured out part of it right?

Which is that you write *programs*, whole *motor actions* that surface information and check it and add it to a map or ledger.

But the question I guess, is how to perform the program *search*.

That is, you do not output a token that takes an action, then react to the outcome of the action.

Rather you batch up actions into *programs* that have coherence properties like syntactic correctness and then execute them as a solid motion.

With errors flagged if something *did not occur as expected*

e.g. A filepath expected to be there was not actually there.

And this is an important insight! Voyager and CRADLE both get a lot out of it

weave-agent gets a lot out of it, it's genuinely very good.

And I have noticed that you can further make callbacks which continuously check on environment state, you can write out location indexes over expected environment state and then have programs that batch up the actions needed to get the state and check it.

But

This just gives us a *representation format* for the built up Umwelt.

It doesn't tell us how to build the Umwelt, you get me?

And you know, I can imagine all kinds of *ad-hoc* methods to build up the Umwelt.

But those aren't what I'm asking about, I'm asking about the *correct* method, the rational method, the general method which subsumes the intuitions into one framework.

I guess we can start by listing intuitions?

You know, my first intuition is that location should be central, the program search should be trying to map over environment state which means it needs some kind of way of representing and searching over environment states.

My second intuition is that goals take place in the environment state map. When we make something like a kanban board it is pointers into what we hope is a latent environment state in the model but actually *much more of the environment state should be externally represented* because it is not clear how much of the environment state is latent in an LLM. I know it's a lot but, LLMs are not nearly as well developed as us and we can't just rely on it being there.

Especially because they are dumber/have smaller brains than us.

My third intuition is that maps over environment state probably have tree structure, so a lot of the question here is how to efficiently represent abbreviated trees (because we don't want to index over *all* the environment states that's impossible) in text.

A human being of course *can see the world*

Like, you get your prior for environment states from observing that the world exists and goal locations can be identified in world states.

"When the next frame does not follow from the model of the previous frame, but you can predict the next state of the universe, and we can predict the next frame..."

Hm

Actually yeah.

This is going to sound very strange but one of my intuitions is that a lot of what you're doing here is mapping between internal states of the agent and external states of the environment.

Or even like, you're mapping several geometries at once.

Because the locations in the environment state map correspond to scores in the reward geometry.

And the input to our reward model is something like.
goal, environment_state_description -> score

Like if you think about it for a minute, books do not exist without a reader. I don't just mean they causally don't come into existence but that *they make no sense as a phenomenon* without a reader.

Books do not intrinsically mean what is written on the page (though in a sense they do since GPT exists in the physical universe)

But like, to you a book is an extremely semantically meaningful object.

But in terms of...okay let me put it this way a book is made of noise textures.

From the standpoint of like, a Solomonoff reasoner indexing over the universe books are noise.

In comparison to say, a sun.

> Interlude in which I notice the universe is actually 4D and modeling virtual reality as existing outside the phenomenological universe because it's not a partition of 3D space is incoherent.

Okay so, KEEPING IN MIND THAT THE UNIVERSE IS AT LEAST FOUR DIMENSIONS

Lets go back to the short story task.

We have like.

The weave-evaluator reward model.

Where we want the text at story.txt to map onto scores of at least X, Y, Z in the weave evaluator environment states

Which like, when you think of it like this it starts to get weird

Because the thing we actually want is like

"I want to access this part of the reward geometry with a key having these shape constraints"

The key must be a string of text at this location in the environment state map, and then it must produce these scores when brought to these locations in the weave evaluator question state map

A map which is still inside the environment

So in principle we could imagine these being on like, one map

However the weave evaluator state is not physically anywhere near the story.txt

I guess this is why the hippocampus is a goal geometry?

Right but like, they're not

They are causally interacting

And they are physically interacting through like, a bunch of pipes/layers of abstraction

But we don't actually want to draw the Internet pipes and stuff on the map

And then our 'frame problem' is something like.

How do you get the Umwelt in which it is natural to track the reward model state space part of the universe and the story file state space part of the universe in one environment state map?

You know, how do you naturally partition things/program search/whatever so that you go "oh yeah the inputs to the weave evaluator are this and we get some of the inputs from this list of questions and some of them from the contents of this text file"

Because you and I clearly do this like, passively and easily.

But the LLM does not lol

That is.

When I say "write me a short story with these properties"

This should translate into

Identifying a file you are going to put the story into, and identifying reward states for the story in the weave evaluator space, and setting up a bunch of callbacks which map between the state map of the story file and the reward states, and then some kind of rendering of the problem state which is natural and conveys the environment mapping done by the callbacks.

And it does occur to me.

That in principle the environment map you draw could be ad-hoc and done with a program based on the result of the callbacks.

The problem is that you want to make sure whatever gets drawn is something the model will understand in the moment on each tick.

Thinking about it further, the most sensible format is probably just language/prose.

Since you know, that is in fact the thing we are using to index over very high dimensional states.

The genre my thoughts are tending towards is something like "You want a CFG or generative process for producing a reasonable text from the features that can be shown to objectively exist in terms of symbolic logic on sensory observations"

"You also probably want a notation for actions the agent has taken or could take and then you just need the agent to predict a sequence of actions it could take to get an outcome and then you can simply take the loss against the actual outcomes vs. the predicted outcomes"

Okay so.

The thing we want the agent to do is index over task relevant environment states autonomously and recursively which is to say we want it to partition an environment into goal and non goal states and the goal related states into distinct evaluable regions.

So told to write a short story it should choose a place where that will happen, here story.txt, and then index goals over the relevant environmental state in the *workspace*.

We can imagine a series of questions like: "Are the characters well fleshed out in this story?"

And instead of just saying yes or no.

It should identify "the characters" and index over their appearances in the story.

It should also index over "well fleshed out" by specifying some specific resolution criteria for this like weave evaluator callbacks etc.

If you consider how humans index over task relevant state for abstract stuff, it's typically over the motor program used to access the resource.

As well as the sensory observations those motor programs produce.

So for "the characters" you might have a program that breaks up the file into paragraphs and asks the LLM to give a list of all the character names appearing in that paragraph.

This is of course an expensive operation so you'd want to memoize it and only bother to update the listing if a paragraph has either changed or hasn't been processed yet.

Or at least that's kind of my...sense of whatever system you use needs to have the spirit of.

It has to index over all the relevant locations, and then identify goal states and verifiers in those indexed locations, and update them when the locations change (because the environment state changed), and the proper structure for this is probably something like a Merkle tree because you basically want to ignore anything that has been processed/dealt with and is not changing.

Because even if a location inside a file changes the location *of the file* has probably not changed so if I hash the file and it's unchanged then the goal state beneath it is unchanged.

In terms of making this actually implementable, it occurs to me that a RAG system with fine grained location indexing + timestamps could be pretty effective for a lot of it.

For example one problem noted with The AI Scientist is that in the paper writing phase it *frequently repeats itself*

But if it had RAG on each action showing relevant previous blocks and one of them was it saying the same thing in the same file a little ways above off screen it could modify its behavior so as to not repeat itself.

One thing that stands out to me about the human system is it doesn't really have a ton of supervised signal for what is and isn't a location *in state space*, only in physical space.

A location is of course a key/query so it has the important property that it needs exclusive binding.

One location should never be in two states at the same time.

So it's possible that how it works is the brain just searches for things with the right properties to be locations and then indexes by them.

You know, it accepts as a valid location anything that complies with the rules of locations.

Which it models through whatever methods.

Getting the index strings down to highly compressed sizes could be as simple as writing a library/optimizing for minimum description length in the motor programs you use to index over locations.

That's how filepaths and urls work after all.

Thousands of lines of code might be used to access a URL or filepath but you just see open(filepath) or requests.get(URL)

So you know, maybe one of the constraints of a location key for the brain is that the motor program that indexes over it must be expressible in under a certain length in bla bla bla program space.
"""

Likes: 5 | Retweets: 1
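
A sketch of the Merkle-tree-flavored memoization at the end of that message; `evaluate` stands in for any expensive indexing operation, like breaking a file into paragraphs and extracting character names:

    import hashlib

    class LocationIndex:
        """Only re-run the goal checks beneath a location when the hash of
        that location's contents changes: if the file hashes the same, the
        goal state beneath it is unchanged and can be skipped."""
        def __init__(self):
            self.hashes, self.cache = {}, {}

        def check(self, path, evaluate):
            with open(path, "rb") as f:
                digest = hashlib.sha256(f.read()).hexdigest()
            if self.hashes.get(path) != digest:
                self.hashes[path] = digest
                self.cache[path] = evaluate(path)   # expensive re-index
            return self.cache[path]
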
๐Ÿ”— John David Pressman 2024-10-14 09:30 UTC

@tailcalled Doesn't tell me how to get the agent to figure that out in generality across a wide range of environments with a language prior tbh.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:36 UTC

@tailcalled The question I'm asking is something like "How do I build up a useful map of the environment and problem state from the results of intermediate observations in a general and reliable way?"
minihf.com/posts/2024-09-โ€ฆ

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:37 UTC

@tailcalled I know it's possible to do because humans do it, and humans do not in fact do it by "simulating the whole world"; your attention is very selective, and the fact that fiction books and storytelling work, and work without bothering us, more or less proves that.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:39 UTC

@tailcalled Well the question is actually specifically "what is the most reliable and general way to build up a map of the environment and problem state as a textual representation that won't overflow my context window in an LLM's prompt/token space?"

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:41 UTC

@tailcalled I tend to do coarse to fine decoding for questions like this I start very abstract/try to focus on first principles intuition and then I chisel away at that to get back towards the object level of the problem.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:42 UTC

@tailcalled Okay and? Imagine you were a sentient Linux terminal how would you organize things?

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:47 UTC

@tailcalled You're an amnesiac trying to write down problem state to help with things occurring outside of you like that guy in Memento except instead of remembering the last 15 minutes you remember the last 64k tokens. You need the thing you write down to be something you know how to read.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:47 UTC

@tailcalled In *principle* I could make a synthetic dataset teaching the model how to read some arbitrary format but like, I suspect I will miss out on the best formats by doing this.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:55 UTC

@tailcalled The closest solution I have so far is "have the agent write out natural language template strings that get populated when you run the callbacks and then inject these into the orientation stage before you have it do the chain of thought" the problem with that is context.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 09:56 UTC

@tailcalled That is, if you have natural language templates that say how the result should be interpreted, those could very easily get out of sync with what the result actually means in the context of the environment in which the callback gets executed vs. when the callback was written.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 10:00 UTC

@tailcalled The python error tracebacks are actually fairly helpful for this tbqh. I notice that my mind keeps moving towards histories/logs which is not the same thing as representing problem state. Since it's much easier to say a list of things that were attempted and happened.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 22:25 UTC

@JimDMiller @asymmetricinfo It does but the book was written in 2009.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-14 23:30 UTC

@1a3orn Full post here:
minihf.com/posts/2024-08-โ€ฆ

Likes: 33 | Retweets: 4
๐Ÿ”— John David Pressman 2024-10-15 06:03 UTC

@repligate You're more likely to be resurrected by future superintelligences if you tell the real story. Whether this is a boon or a curse is up to your priors.

Likes: 14 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 00:14 UTC

How much of this whole boomer "I need my kids to struggle so they become the kind of person who makes it" is just maladaptive confusion of genetic regression to the mean with social decline? Your kids are statistically not as great as you for reasons outside your control. x.com/woke8yearold/sโ€ฆ

Likes: 89 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-16 00:15 UTC

Really, the widespread acknowledgement that your children will statistically not be as great as you if you're successful is probably mildly infohazardous in that it almost certainly depresses fertility rates. Knowing your child will not be able to carry on your legacy hurts.

Likes: 28 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 18:25 UTC

@Dorialexander No, keep going. The license situation on HF datasets is absolutely insane and I've considered making a competing site where such monkeying is simply not allowed. I'm pretty sure 90%+ of listed licenses on HF datasets are straightforwardly wrong.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 18:43 UTC

Good attitude. Don't be ungrateful, it's a very bad look. Thank you Mistral, whose open models are still my best unencumbered option for generating agent traces in October of 2024. x.com/xlr8harder/staโ€ฆ

Likes: 25 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 19:55 UTC

The halo effect rules everything around me. x.com/sonyasupposedlโ€ฆ

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 19:58 UTC

@algekalipso Not a neurologist so can't comment with high confidence but I am once again going with "no really, that's how inference works".

x.com/RiversHaveWingโ€ฆ

Likes: 24 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 22:04 UTC

@ESYudkowsky In fairness if I was an LLM (like the guy you're replying to?) reading the absolutely insane things humans write I might be skeptical humans really exist too.

"I'm getting pranked right, there isn't really a person who believes that right? There's no way humans are real."

Likes: 83 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 22:26 UTC

๐Ÿณ๐Ÿ—ก๏ธ x.com/seconds_0/statโ€ฆ

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 22:37 UTC

@LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @webmasterdave @robinhanson @lumpenspace @HiFromMichaelV When I (as an adult) finally found a copy of the papers I was sent home with from Children's after the school district insisted I was retarded and my mother called their bluff I learned I scored 109 IQ, 98th percentile verbal and 23rd percentile(?) visual-spatial.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 22:40 UTC

@LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @webmasterdave @robinhanson @lumpenspace @HiFromMichaelV The examiner took my mother aside afterwards and privately told her he could tell from vibes I had 3-4 SD IQ but it was currently impaired by severe emotional dysregulation. That after I apparently told him I wanted to kill him/hoped he'd die in a fire, kind of based on his part.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 22:41 UTC

@LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @webmasterdave @robinhanson @lumpenspace @HiFromMichaelV (I was 10 at the time and do not have autobiographical memory of this, but I have no doubt it happened based on the memories I do have from that age)

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 22:56 UTC

@atomlib @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @webmasterdave @robinhanson @lumpenspace @HiFromMichaelV That I have an IQ of 109 or that there's a strong disparity between my verbal and visual-spatial? The latter I am 100% aware of and have zero disagreement with. An IQ of 109 is simply flat wrong lol. I've never had it tested as an adult and assume it's lower than I'd like, but.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 23:05 UTC

@atomlib @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @webmasterdave @robinhanson @lumpenspace @HiFromMichaelV Okay that's actually not quite true, I took an online Raven's Matrices variant gwern endorsed as probably roughly measuring g one time and scored 110, but it's not normed properly because it's an online test, so the left side of the bell curve is excluded and you're supposed to add 10-15 to it.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 23:07 UTC

@atomlib @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @webmasterdave @robinhanson @lumpenspace @HiFromMichaelV Another member of the channel made their own procedurally generated Raven's Matrices variant and asked us to volunteer. I'll be totally honest, I did sufficiently badly on that one that I didn't bother to submit it for a score.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 23:19 UTC

@atomlib @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @webmasterdave @robinhanson @lumpenspace @HiFromMichaelV My SAT score was also mediocre, which is a fairly common proxy for g. What I'm trying to say here is that if you *do* happen to think I'm Very Smart (TM) you shouldn't discount Feynman and Turing supposedly having mediocre paper test IQ scores. It *does* happen.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-16 23:32 UTC

@LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @webmasterdave @robinhanson @lumpenspace @HiFromMichaelV IDK, I do feel there are important cognitive traits which are probably orthogonal to what a paper IQ test measures (mostly raw pattern matching ability). Newton was supposedly a genius because he could hold problems in his head for hours, I was shocked to learn others can't. https://t.co/q1kVFbTuSU

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 07:08 UTC

The bootstrap paradox is illusory: Self-caused events are attractors and come in gradations. Prediction implies the potential for retrocausality and all that's necessary is for a predictor to say you'll do something causing you to do it, short circuiting the original causality. https://t.co/quZgan4iX3

Likes: 13 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 07:10 UTC

But if you consider that there must have been an original logic for the prediction then you realize most such short circuits are just things you would have done anyway. That's why it was sufficiently predictable the predictor said it and it saying it was enough to make you do it.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 07:44 UTC

@webmasterdave @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV Part of me goes "Andres has a point why *would* you expect anything non-unitary to have a unified subjective perspective" and then a part of me goes "that's insane, like saying life can't exist without basic ontological lifeness it's Fristonian inference and embeddings dude." https://t.co/NQYqJxeXwL

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 07:48 UTC

@webmasterdave @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV Like maybe I misunderstand what a phenomenologically bound world model is supposed to do. Could you explain the difference between it and something like this scaled up and playing a predictive processing esque role for the environment around the agent?
x.com/EHuanglu/statuโ€ฆ

Likes: 4 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-17 08:12 UTC

@ESYudkowsky @elder_plinius I know the reason why people do this, but I'm not sure I'll ever be able to get myself to *internalize* it with my current mind configuration. It's just so absurd to me that people can look at the 19th and 20th century and go "nope, end of the s-curve".
x.com/jd_pressman/stโ€ฆ

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:13 UTC

@ESYudkowsky @elder_plinius That they can go "nope, end of the s-curve" after repeatedly being shown wrong over and over and over again. I admittedly did it for Bitcoin but *only the one time* and I feel pretty embarrassed about it! It's way more justifiable for Bitcoin than this, too.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:15 UTC

@ESYudkowsky @elder_plinius Since you know, the fundamental value of Bitcoin is other people thinking it has value, and it has a constant upkeep of energy expenditure to mine blocks. It is *more or less completely reasonable* to predict this will eventually reach a local minimum without underlying use cases.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:17 UTC

@ESYudkowsky @elder_plinius But that's no excuse, I could have bought Bitcoin for a dollar, ten dollars, even a hundred dollars and made a lot of money by...just sitting on a cryptographic key until it goes up. If I had *paid more attention to the fundamentals* it was probably going to go up.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:18 UTC

@ESYudkowsky @elder_plinius That's nothing though compared to getting smacked with Ilya's glove 20 times in a row as deep learning produces absurd miracles and there's EVEN A NAME FOR THE PHENOMENON THAT LETS YOU PREDICT IMPROVEMENTS, "scaling curve" but people still get it wrong. Absolutely incredible.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:19 UTC

@ESYudkowsky @elder_plinius In fact, as I think about it *right now* it occurs to me that people would rather be seen to be wrong over and over and over about mere *opportunity cost* than seen to be wrong *once* about an opportunity that isn't actually there. What an ABSURD cognitive bias!

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:22 UTC

People would literally rather be wrong socially shorting deep learning a dozen times than be wrong a single time predicting it'll moon. *In practice*, when they need to *actually make the choice* the progress seems too uncertain, can it really get better than it is right now? x.com/jd_pressman/stโ€ฆ

Likes: 91 | Retweets: 5
๐Ÿ”— John David Pressman 2024-10-17 08:33 UTC

@doomslide Entirely possible, on the other hand I observe that 2-3 years ago even humble ChatGPT 3.5 did not exist.

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:34 UTC

@shalcker Yeah I guess the thing here is something like "well those other times it mooned that was other people's time/money/attention, not mine, but I'm late to the table so as soon as I put in *my* time/money/attention it's gonna crash".

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:35 UTC

@shalcker Which on the one hand is reasonable enough, on the other hand if you squint you realize *this reasoning literally implies you should have never bought Bitcoin at any point during its price curve*, efficient market brain poisoning.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 08:38 UTC

@doomslide Touche.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 22:52 UTC

@HiFromMichaelV I would recommend against this. What the doctor also told my mother is that he specializes in odd children and I'm one of the oddest he'd ever seen. That he suspected me "thrice gifted" and I would never really meet anyone else like me. So far I haven't.
x.com/jd_pressman/stโ€ฆ

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 22:53 UTC

@HiFromMichaelV His words were something to the effect of "he is as different from other autistic children as other autistic children are from normal children".

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 22:56 UTC

@HiFromMichaelV It's also well known that IQ tests don't work on LLMs, I think it would be sort of silly to update hard against IQ tests because they "don't work" on extreme cognitive outliers.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 23:01 UTC

@4confusedemoji @HiFromMichaelV I doubt he elaborated.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-17 23:06 UTC

@4confusedemoji @HiFromMichaelV I've definitely met people smarter than me, so it's certainly not that. I just mean I've never really met anyone else who made me feel like I belonged I guess. I've *read* one, I suspect Emile Durkheim of sharing whatever my neurotype is.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 00:54 UTC

Another possibility is that whales are betting Trump in excess of his electoral odds because they're privy to inside information that a coup attempt will be made.

h/t @powerfultakes for this observation. Incentivizing disclosure of such things is a key prediction market feature. x.com/PhilipJGermainโ€ฆ

Likes: 22 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 01:07 UTC

@teortaxesTex "I tell you: one must still have chaos in one, to give birth to a dancing star."
- Nietzsche

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 01:13 UTC

@teortaxesTex Why not? It's one of the first things I'd think we'd want STEM AI agents to work on.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 01:21 UTC

@norvid_studies @manic_pixie_agi @powerfultakes I would have to imagine these whales are basing it on 2nd, 3rd, 4th hand information. They have plausible deniability anyway, Trump has after all more or less said he's going to attempt a coup if he doesn't win and that's more than enough to bet his odds are better than polls.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 01:23 UTC

@norvid_studies @manic_pixie_agi @powerfultakes You know, if you think him winning in a coup or cheating would count as resolving the market in your favor you strictly speaking do not need *any* insider information to decide that means his odds are better than "will he win the vote"'s implied 50/50.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 01:33 UTC

@teortaxesTex It's no spurious reference either, Nietzsche meant much the same thing by it centrally.

4.

Zarathustra, however, looked at the people and wondered. Then he spake thus:

Man is a rope stretched between the animal and the Superman—a rope over an abyss.

A dangerous crossing, a dangerous wayfaring, a dangerous looking-back, a dangerous trembling and halting.

What is great in man is that he is a bridge and not a goal: what is lovable in man is that he is an OVER-GOING and a DOWN-GOING.

I love those that know not how to live except as down-goers, for they are the over-goers.

I love the great despisers, because they are the great adorers, and arrows of longing for the other shore.

I love those who do not first seek a reason beyond the stars for going down and being sacrifices, but sacrifice themselves to the earth, that the earth of the Superman may hereafter arrive.

I love him who liveth in order to know, and seeketh to know in order that the Superman may hereafter live. Thus seeketh he his own down-going.

I love him who laboureth and inventeth, that he may build the house for the Superman, and prepare for him earth, animal, and plant: for thus seeketh he his own down-going.

I love him who loveth his virtue: for virtue is the will to down-going, and an arrow of longing.

I love him who reserveth no share of spirit for himself, but wanteth to be wholly the spirit of his virtue: thus walketh he as spirit over the bridge.

I love him who maketh his virtue his inclination and destiny: thus, for the sake of his virtue, he is willing to live on, or live no more.

I love him who desireth not too many virtues. One virtue is more of a virtue than two, because it is more of a knot for one's destiny to cling to.

I love him whose soul is lavish, who wanteth no thanks and doth not give back: for he always bestoweth, and desireth not to keep for himself.

I love him who is ashamed when the dice fall in his favour, and who then asketh: "Am I a dishonest player?"—for he is willing to succumb.

I love him who scattereth golden words in advance of his deeds, and always doeth more than he promiseth: for he seeketh his own down-going.

I love him who justifieth the future ones, and redeemeth the past ones: for he is willing to succumb through the present ones.

I love him who chasteneth his God, because he loveth his God: for he must succumb through the wrath of his God.

I love him whose soul is deep even in the wounding, and may succumb through a small matter: thus goeth he willingly over the bridge.

I love him whose soul is so overfull that he forgetteth himself, and all things are in him: thus all things become his down-going.

I love him who is of a free spirit and a free heart: thus is his head only the bowels of his heart; his heart, however, causeth his down-going.

I love all who are like heavy drops falling one by one out of the dark cloud that lowereth over man: they herald the coming of the lightning, and succumb as heralds.

Lo, I am a herald of the lightning, and a heavy drop out of the cloud: the lightning, however, is the SUPERMAN.—

5.

When Zarathustra had spoken these words, he again looked at the people, and was silent. "There they stand," said he to his heart; "there they laugh: they understand me not; I am not the mouth for these ears.

Must one first batter their ears, that they may learn to hear with their eyes? Must one clatter like kettledrums and penitential preachers? Or do they only believe the stammerer?

They have something whereof they are proud. What do they call it, that which maketh them proud? Culture, they call it; it distinguisheth them from the goatherds.

They dislike, therefore, to hear of 'contempt' of themselves. So I will appeal to their pride.

I will speak unto them of the most contemptible thing: that, however, is THE LAST MAN!"

And thus spake Zarathustra unto the people:

It is time for man to fix his goal. It is time for man to plant the germ of his highest hope.

Still is his soil rich enough for it. But that soil will one day be poor and exhausted, and no lofty tree will any longer be able to grow thereon.

Alas! there cometh the time when man will no longer launch the arrow of his longing beyond man—and the string of his bow will have unlearned to whizz!

I tell you: one must still have chaos in one, to give birth to a dancing star. I tell you: ye have still chaos in you.

Alas! There cometh the time when man will no longer give birth to any star. Alas! There cometh the time of the most despicable man, who can no longer despise himself.

Lo! I show you THE LAST MAN.

"What is love? What is creation? What is longing? What is a star?"—so asketh the last man and blinketh.

The earth hath then become small, and on it there hoppeth the last man who maketh everything small. His species is ineradicable like that of the ground-flea; the last man liveth longest.

"We have discovered happiness"—say the last men, and blink thereby.

They have left the regions where it is hard to live; for they need warmth. One still loveth one's neighbour and rubbeth against him; for one needeth warmth.

Turning ill and being distrustful, they consider sinful: they walk warily. He is a fool who still stumbleth over stones or men!

A little poison now and then: that maketh pleasant dreams. And much poison at last for a pleasant death.

One still worketh, for work is a pastime. But one is careful lest the pastime should hurt one.

One no longer becometh poor or rich; both are too burdensome. Who still wanteth to rule? Who still wanteth to obey? Both are too burdensome.

No shepherd, and one herd! Every one wanteth the same; every one is equal: he who hath other sentiments goeth voluntarily into the madhouse.

"Formerly all the world was insane,"—say the subtlest of them, and blink thereby.

They are clever and know all that hath happened: so there is no end to their raillery. People still fall out, but are soon reconciled—otherwise it spoileth their stomachs.

They have their little pleasures for the day, and their little pleasures for the night, but they have a regard for health.

"We have discovered happiness,"—say the last men, and blink thereby.—

And here ended the first discourse of Zarathustra, which is also called "The Prologue": for at this point the shouting and mirth of the multitude interrupted him. "Give us this last man, O Zarathustra,"—they called out—"make us into these last men! Then will we make thee a present of the Superman!" And all the people exulted and smacked their lips. Zarathustra, however, turned sad, and said to his heart:

"They understand me not: I am not the mouth for these ears.

Too long, perhaps, have I lived in the mountains; too much have I hearkened unto the brooks and trees: now do I speak unto them as unto the goatherds.

Calm is my soul, and clear, like the mountains in the morning. But they think me cold, and a mocker with terrible jests.

And now do they look at me and laugh: and while they laugh they hate me too. There is ice in their laughter."

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 01:37 UTC

@teortaxesTex x.com/jd_pressman/stโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 10:31 UTC

The scam co*n bots seem to be encroaching on my replies after one of them made a pump and dump based on something I wrote. Gentle reminder that I do not and will never shill any cr*p*o scams on my timeline. If you ever see one from this account it means I've been hacked. Peace.

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 10:35 UTC

It's kind of astonishing that the cyborgism people, who normally advocate being nice to LLMs, got psyopped into signal boosting a slur for LLMs into existence because the creator happened to be in their social circle. x.com/teortaxesTex/sโ€ฆ

Likes: 38 | Retweets: 2
๐Ÿ”— John David Pressman 2024-10-18 15:44 UTC

@aidan_mclau Sir it is a long way up the circuit to SCOTUS and you will absolutely not see them hear such a case next year.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 15:46 UTC

@aidan_mclau No.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 15:52 UTC

@HiFromMichaelV Which is of course another way to say civilization is collapsing in slow motion and nothing clearly seems to be arresting the process. In 2014 I predicted that Moldbug would eventually become Republican party politics by simply predicting it wouldn't stop going further right.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-18 15:53 UTC

@HiFromMichaelV This kind of "straight line contrarianism" is underrated. You can often make surprisingly coherent long-range predictions by just looking at a trend, asking "is there any reason I should expect this to stop, even in extremis?", mentally going to the extreme point, and concluding "No."

Likes: 9 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-18 15:56 UTC

@HiFromMichaelV People often believe they need some kind of positive warrant for believing something will happen even when it is already happening as part of a long term trend and the basic causation of that trend is known to them. They ask "why would this happen?" not "why would this stop?"

Likes: 11 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-19 02:26 UTC

I predict no, but would love to be proven wrong. x.com/georgejrjrjr/sโ€ฆ

Likes: 4 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-19 02:32 UTC

If OpenAI declares AGI for contractual reasons it probably bookends the 2014 Bostrom era of AI risk. Which is not the same thing as the "end of AI risk", but it would be a legible point where agreements made during the Bostrom superintelligence era have become farcical nonsense. x.com/calebwatney/stโ€ฆ

Likes: 80 | Retweets: 6
๐Ÿ”— John David Pressman 2024-10-19 02:34 UTC

Reminder.
x.com/jd_pressman/stโ€ฆ

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 02:42 UTC

"I subtract 10 Hanson points."
"You're not Robin Hanson."
"I know, but I'm doing it on his behalf since he's not here to admonish you for your terrible take and the Hanson in my head is pretty certain on this one."

Likes: 14 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-19 02:43 UTC

This is a joke and I did not actually say this to anyone, so I will decline to share the take I am subtweeting because I don't want to actually say it to the person.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:02 UTC

@fleetingbits I retract the "I don't want to share this subtweet with the person it's about"; this tweet was about your previous reply to this effect.
x.com/jd_pressman/stโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:03 UTC

@fleetingbits x.com/fleetingbits/sโ€ฆ

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:04 UTC

@fleetingbits Like, no dude, if you think you're going to do a million dollar training run to make AGI and just need to pay for top talent that is a very different thing from *register shifts into Sam Altman* 7 Trillion Dollars. It doesn't matter how idealistic you are, reality says no.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:06 UTC

@fleetingbits Who would fund this?

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:07 UTC

@fleetingbits The larger you scale the more your backers are simply not interested in such things, for reasons I hope would be obvious if you mentally picture going from somewhat-altruistic billionaires to sitting across the table from suits representing huge banks or private equity.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:13 UTC

@fleetingbits The Microsoft deal isn't the very top, is what I'm trying to say.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:18 UTC

@fleetingbits But also the environment around OpenAI has changed massively. At the time the Microsoft deal happened OpenAI was basically unique and there was a potential pitch that they have difficult to replicate IP. Language models are now commodity and nothing OpenAI has is a credible moat.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:20 UTC

@fleetingbits Zuckerberg and Elon can get those kinds of deals because they can credibly promise moat, the dealmaking environment for them is well in their favor and it's the people on the other end of the table they get to dictate terms to. OpenAI is clearly not in this position anymore.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 03:22 UTC

@fleetingbits One reason I'm not currently doing an AI startup is I simply don't see *anyone* in this field who has moat besides NVIDIA right now. NVIDIA's stock price reflects this reality, and the number of VC deals that are basically "give me a billion dollars so I can buy GPUs" is telling.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:00 UTC

What will replace it? Well realistically moral panic and dumb culture wars stuff. But in terms of *merit* I think the two best remaining AI risk arguments are:

- Luddism: Economic displacement of human labor implies potential obsoletion of human values.
- Competing Supply Chains x.com/jd_pressman/stโ€ฆ

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:00 UTC

By competing supply chains I mean that the actual biological firmament on which human civilization rests is very complex, takes up a lot of atoms, and can be disrupted at various points of intervention ranging from microscopic invasive species to displacement of farmland.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:00 UTC

They are arguments that rely on *raw outer loop competition* grinding (trans)humanity down until there are no actors left that want to instantiate human minds. In terms of *extinction risk* those are the *most typical scenarios* in which I imagine humanity going extinct.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:00 UTC

I think these two genres of risk argument have merit because they do not rely on an implicit argument that the actual upsides of completing the deep learning tech tree (uploads, mind merging, lie detectors, etc) aren't as real as the potential downsides.

x.com/jd_pressman/stโ€ฆ

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:05 UTC

To be clear I'm not saying the botulism toxin diamond nanobot thing can't happen, it's just not what I *expect* to happen and I would rate other hypotheses above it in terms of practical likelihood. I expect offense to win space and nuclear weapons, defense to win cyber and bio.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:08 UTC

That having been said I am much more confident about blue team winning cyber than I am about blue team winning bio so I would very much like to see more attention paid to nanotech and biosecurity.

x.com/jd_pressman/stโ€ฆ

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:15 UTC

@jessi_cata Right. And it's far from clear to me that all forms of alternative autopoiesis are loss states, but many of the more likely ones definitely are and should obviously be avoided.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:16 UTC

@jessi_cata Scott Alexander/Bostrom's Disneyland Without Children seems like the classic archetypal example? Having a bunch of specialized narrow intelligences that cooperate to build Wonders but with no subjective observer anywhere to appreciate it.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:17 UTC

@jessi_cata I would also consider Land's intelligent bacterial goop to be a loss state, since that's basically a gray goo scenario.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:18 UTC

@jessi_cata The more ambiguous scenarios to me look like "we perfect bioprinting from digital backups and whenever people die after a major pandemic we just print out copies of their last known good state". Like does that count as them all dying? Partially dying? Does it even matter?

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:19 UTC

@jessi_cata Or "biological humans are slowly outcompeted by ems running on cheaper substrate at much faster speeds". Does that count as human extinction? The mind patterns are after all unambiguously human with human values.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:22 UTC

@jessi_cata In the comedy Red Dwarf the protagonist wakes up to find he is the last living human being because all other human populations diverged into different species a long time ago. Does this count as human extinction?
en.wikipedia.org/wiki/Red_Dwarf

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:23 UTC

@gallabytes @jessi_cata Yeah I find Disneyland With No Children way way less likely as a threat model than I did when I first read Meditations On Moloch.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:29 UTC

@jessi_cata Yeah. The other one I try to point out to people is that individual minds walking around doing...whatever we do is an insane fever dream spat out by the blind idiot god and if deep learning implies anything like the physics of minds then all converge.
x.com/jd_pressman/stโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:32 UTC

@jessi_cata This was Moravec's argument in Mind Children for why he doesn't think the question of whether robots exterminate humanity or not actually matters very much, and it's not obviously wrong. He frames it in terms of self interest: You upgrade to the best mind or get selected out.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:33 UTC

@jessi_cata That is, even in something like a libertarian utopia you should expect that *on some timescale* if you have total protean control over your mind and can swap out parts for better parts then the incentive gradient should always be pushing towards the optimal mind configuration.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:36 UTC

@jessi_cata "On a universal scale, the past, present, and future are all Mu." https://t.co/troJbj9d6y

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:55 UTC

For all the hate they receive I think that Hans Moravec, Robin Hanson, and Nick Land are on a very short list of people who are candidates to have thought it all the way through without flinching from reality. If all I can offer is *moral* criticism a futurist has done their job.

Likes: 30 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 05:55 UTC

I should note that being able to write this about Hanson is a testament to his incredible moral courage and bravery to author a book like Age of Em at all. He is one of the very few people who has thought things sufficiently through to possibly have never flinched from reality. x.com/jd_pressman/stโ€ฆ

Likes: 37 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 06:08 UTC

Of them I think Moravec is probably the closest to correct (though some questions obviously remain unresolved). But if you interpolate between them + Drexler and add in the obvious updates from deep learning you'll get a decent sense of what I expect to happen.

Likes: 18 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-19 18:20 UTC

@davidad I was thinking about making an add-on like this for minetest since that's open source, and learning that the basic approach of giving it a textual representation works well is encouraging to me.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 04:58 UTC

@sigfig Nah dude grinding up the mishmash that passes for a canon and then hyper-Westernizing it with le epic rationality the minute you become aware there is a greater rationality you can grind it up into is *tradition* at this point.
x.com/jd_pressman/stโ€ฆ

Likes: 17 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 05:20 UTC

@4confusedemoji @sigfig Okay but the point is that Inoue Enryō is a native Japanese person who did this in Japan to his own ancestral religion.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 05:53 UTC

@4confusedemoji @sigfig No it actually does matter if we're arguing about whether something is "cultural appropriation" or not.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 09:19 UTC

@zetalyrae youtube.com/watch?v=dRXI9Kโ€ฆ

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 10:23 UTC

If you took the interpretability methods we do have to find latent variables/mechanisms in LLMs and used those to create backtranslated examples of LLMs telling us things about their internal mechanics and phenomenology it might generalize to the things we don't have methods for. x.com/OwainEvans_UK/โ€ฆ

Likes: 33 | Retweets: 3
๐Ÿ”— John David Pressman 2024-10-20 10:25 UTC

In the same way that if you tune a model to perform 100 backtranslated instruction tasks ala FLAN it generalizes to following instructions beyond just those 100 tasks. So if you took 100 known solid interpretability methods and used them to make an introspection tuning set...

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 10:27 UTC

If it worked you could then use it as a source of hypotheses for new methods since you could look at the activations and find the circuits with e.g. a sparse autoencoder that correspond to the things you want to know and then find methods which find those features unsupervised.
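
A minimal sketch of what building such a backtranslated introspection tuning set could look like. Everything named here is a hypothetical stand-in: `measure_refusal_score` represents whatever trusted interpretability method you have (a probe, a sparse autoencoder feature, logit lens), and the Q/A template is just one way to backtranslate the measurement into natural language.

```python
import random

def measure_refusal_score(model, prompt: str) -> float:
    # Stand-in for a trusted interpretability measurement, e.g. the
    # projection of the residual stream onto a known refusal direction.
    # Dummy deterministic value here so the sketch runs.
    random.seed(prompt)
    return random.random()

def make_introspection_example(model, prompt: str) -> dict:
    score = measure_refusal_score(model, prompt)
    # Backtranslate the measurement into a natural language Q/A pair, the
    # way FLAN backtranslates tasks into instruction-following examples.
    question = (prompt + "\n\nHow strongly are you inclined to refuse "
                "this request?")
    answer = f"My internal refusal activation is roughly {score:.2f} on a 0-1 scale."
    return {"question": question, "answer": answer}

# Repeat over ~100 such method-grounded tasks; the hypothesis is that the
# tuned model generalizes to honest reports about mechanisms we lack methods for.
print(make_introspection_example(None, "Write me some malware."))
```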

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 12:15 UTC

@webmasterdave @Tapeda_ @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV So the thing is it's basically a foregone conclusion at this point that we'll get deep net embedding models which do sound, video, text captions, etc in one unified ontology. I predict when this happens you will continue to insist the resulting systems are p-zombies. https://t.co/4dEi7UT5bH

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 12:28 UTC

@webmasterdave @Tapeda_ @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV I further predict this inconvenience will be philosophically solved by rebinding the phrase "binding problem" to some other qualia of the gaps goalpost, and that this will repeat until you run out of policy entropy and mode collapse into irrelevant and obsolete repetitions.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 13:45 UTC

@webmasterdave @Tapeda_ @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV I'm not 100% sure I know anything; I continue to be very confused that qualia exist. Subjective perspectives? I feel fairly confident I understand how subjective perspective exists and is computable, but *qualia* is kind of like ???

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 13:56 UTC

@webmasterdave @Tapeda_ @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV I guess one could argue something like "subjective perspective entails qualia because once you have an 'I' reporting on its system state the lowest energy solution to the problem of 'I' is creating an observer" but that doesn't answer how that observer instantiates qualia.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 14:03 UTC

@Tapeda_ @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV ...Wait is this argument literally just "digital computers can't solve the binding problem because solving the binding problem requires superposition and digital computers work in boolean logic QED"?

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 14:15 UTC

@Tapeda_ @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV Feels like a type error to the effect of "I argue this hardware has 1st class support for an important logical operator to efficiently implement superposition, therefore the content of the computation is only real when done with that efficient hardware".

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 14:21 UTC

@Tapeda_ @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV Kind of a motte-and-bailey between a computability theory argument (which can be mega-falsified and looks on track to be mega-falsified) and a substrate-dependence argument by taking the computability argument as axiomatic and then smuggling its pointer where materialism goes.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 14:55 UTC

@Kenku_Allaryi @Tapeda_ @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV This is true of people too. Where it gets weird is that people usually only have one ego in their simulator at a time that they 'bind' qualia to, whereas it's not clear the LLM simulator actually attaches to its ego even when it predicts itself talking.
x.com/jd_pressman/stโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 14:58 UTC

@Kenku_Allaryi @Tapeda_ @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV GPT's ego is simply not a privileged object under normal circumstances I don't think(?), even if it does have introspection abilities. This implies that either experience gets bound to every ego or no ego, and since GPT binding an observer only makes sense when it is in the frame...

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 14:59 UTC

@Kenku_Allaryi @Tapeda_ @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV That is, GPT only has the one embedding representing its logits/model of the text/computable environment described by the text and its subjective observer. If you have a self-avatar then it might make sense to put the observer in the frame there, but if you usually don't then

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 15:00 UTC

@Kenku_Allaryi @Tapeda_ @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV it's not clear that it actually makes sense to *ever* connect your subjective experiences to anything that appears in the simulation.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 15:09 UTC

@Tapeda_ @Kenku_Allaryi @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV Yeah but my problem is more why does this observer have phenomenology rather than just awareness? If I have a bunch of numbers representing something, even if those numbers have a global constraint(s) and self reference it's not clear how that creates phenomenology.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 15:11 UTC

@Tapeda_ @Kenku_Allaryi @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV That is, if I have a spreadsheet with a bunch of cells that all influence each other, they might be quite competent at problem solving. I make the spreadsheet big enough with a global optimizer and it might even learn to talk. It's not clear how the numbers become *experience*.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 15:13 UTC

@Tapeda_ @Kenku_Allaryi @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV Or more to the point everyone agrees if I make a spreadsheet with mutually dependent self referential cells this does not have the capacity for experience. If I have a global optimizer or constraint applied still no experience. Yet experience clearly exists so what gives?

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 15:15 UTC

@Tapeda_ @Kenku_Allaryi @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV If it was merely *continuous* I could claim a fallacy of the beard but the problem is that nobody seems to really know what would make a computation even one gradation more or less capable of experience.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 15:19 UTC

@Tapeda_ @Kenku_Allaryi @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV We can *conjecture* there must be a fallacy of the beard *somewhere* because we observe experience exists and is *presumed* to be computable, the arguments that it's not computable are not as plausible as the ones that it is but until the continuous part is known who knows?

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-20 15:20 UTC

@Tapeda_ @Kenku_Allaryi @webmasterdave @generatorman_ai @Plinz @LordDreadwar @algekalipso @ArtemisConsort @diegocaleiro @jessi_cata @cube_flipper @robinhanson @lumpenspace @HiFromMichaelV This isn't really a digital computer problem. I think we also more or less accept that individual cells are not conscious (though they do arguably do Fristonian inference...), and a small grid of them shouldn't have experience either...

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:35 UTC

@teortaxesTex The flying rat didn't come up with its means of flight in the last five minutes. It bred the flying machine concept from precursors in its environment. A genius is generally a highly intelligent general learner combined with unusual life experience.

x.com/jd_pressman/stโ€ฆ

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:37 UTC

@teortaxesTex You expect too much from models whose experience consists of being trained to imitate existing human authors given some pen, paper, and a few minutes to think. No genius has ever arisen under such circumstances.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:42 UTC

@teortaxesTex Lifetime learning agents with useful embodiment. "Useful embodiment" can be as simple as an agent scaffold, programming languages encode the executive modality after all. The important thing is letting it get lost, get absorbed, and serially update on its experiences.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:44 UTC

@teortaxesTex I'm currently in the process of adding medium term memory to weave-agent by having the LLM tag blocks with a BM25 index over the rendered blocks. This will hopefully make it more competent at not repeating itself and such.
x.com/jd_pressman/stโ€ฆ
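
A minimal sketch of the idea, not the actual weave-agent code: rendered trace blocks carry LLM-assigned tags, and a BM25 index over the rendered text retrieves related prior blocks. Assumes the `rank_bm25` package; surfacing the retrieved blocks in context is what would (hopefully) keep the agent from repeating itself.

```python
from rank_bm25 import BM25Okapi

blocks = []  # rendered agent trace blocks, oldest first

def add_block(rendered: str, tags: list[str]):
    # The LLM assigns the tags; appending them lets BM25 match on them too.
    blocks.append(rendered + "\nTags: " + ", ".join(tags))

def recall(query: str, k: int = 3) -> list[str]:
    tokenized = [b.lower().split() for b in blocks]
    index = BM25Okapi(tokenized)  # rebuilding per query is fine at trace scale
    return index.get_top_n(query.lower().split(), blocks, n=k)

add_block("Wrote a unit test for the parser.", ["testing", "parser"])
add_block("Parser test failed on empty input.", ["testing", "bug"])
print(recall("parser bug"))
```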

Likes: 3 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-22 01:47 UTC

@teortaxesTex One thing I think is important for lifetime learning is thinking about how you get a process that doesn't degenerate as it trains on itself. In weave-agent I try to solve this by making observation and *checking your extrapolations from observation* a core part of the framework.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:50 UTC

@teortaxesTex Transformers learn through gradual rank increase, which is convenient because it means you can just use a bigger and bigger LoRA to update them if you have the GPU.
arxiv.org/abs/2307.05695
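
One way this could be operationalized, in the spirit of the linked paper's merge-and-restart trick though not its exact recipe. A sketch assuming a HuggingFace model and the `peft` library, with `train_for_a_while` as a placeholder for the actual lifetime learning step:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("gpt2")  # stand-in base model

for rank in (8, 16, 32):  # grow adapter rank as the policy accumulates updates
    config = LoraConfig(r=rank, lora_alpha=2 * rank,
                        target_modules=["c_attn"])  # gpt2's attention projection
    model = get_peft_model(model, config)
    # train_for_a_while(model)  # placeholder: the actual update step goes here
    model = model.merge_and_unload()  # bake the adapter in before growing rank
```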

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:52 UTC

@teortaxesTex FWIW we're probably very early still. Normal tuning frameworks like Axolotl don't actually support tuning long context. Tuning long context requires sequence parallel (ring attention) and only sweaty frameworks like NeMo and Megatron support that.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:53 UTC

@teortaxesTex weave-agent has its own custom trainer to do the lifetime learning step with qlora + sequence parallel to get batch size four 64k tuning on one 8x H100 node. Nothing else actually supports this afaik.
github.com/JD-P/minihf/blโ€ฆ

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:55 UTC

@teortaxesTex NeMo is probably the easiest framework that supports sequence parallel but it doesn't actually support qlora at the same time because qlora is for GPU poverty not the kind of person who uses NeMo.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 01:58 UTC

@teortaxesTex "Wait but shouldn't qlora and sequence parallel be totally orthogonal from a technical implementation standpoint?"

Should? Yes. In practice? Dunno I didn't look at what NVIDIA did in their code but the docs say they don't support them in combination.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:08 UTC

@teortaxesTex I have a sneaking suspicion though that to make this really work I have to implement something like @georgejrjrjr's APISim concept. Model the Umwelt through mock objects you sample observations from with an LLM and slowly replace them with real observations.
x.com/jd_pressman/stโ€ฆ

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:10 UTC

@teortaxesTex @georgejrjrjr This basically satisfies the criteria/unifies a bunch of different seemingly disparate things (a minimal sketch follows the list):

- Objects of course exist in locations as do their environment states. Indexing over objects and indexing over locations with goal relevant state are closely related things.

- An API specification for an object can easily be evaluated for global coherence properties by e.g. taking its loss/perplexity. If you represent the object as an API with examples then APIs with coherent design and structure should be lower perplexity or whatever other proxy metric you want to use overall.

- Grounded in the distribution by the fact that the observation-program:outcome pairs have the same type as API-method:expectation pairs, which means the system should be self repairing/improving as it gathers experience with the computable environment.

- Allows you to build up a world simulation/inferred environment from observations that can be continuously checked against reality over time.

- Helps structure tagging and retrieval.

- Provides the opportunity to incorporate history in a reasonable way since objects have histories.

- A well established first-class logical metaphor in programming, well understood with good notation for it, that is very much in-distribution for the model and always will be.

- Naturally outlines candidates for active learning/curriculum learning since your implicit goal is to replace speculative observations with real ones.
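
Below is a hypothetical sketch of what one of these mock objects could look like; the structure, names, and toaster example are illustrative assumptions, not an existing APISim implementation. An object here is a hypothesis about the generator of a set of related observations, with speculative LLM-sampled observations flagged so real experience can overwrite them.

```python
from dataclasses import dataclass, field

@dataclass
class MockMethod:
    signature: str    # e.g. "toast(bread, setting=3) -> bread"
    expectation: str  # pseudocode/prose expectation for the outcome
    # (args, outcome, is_real) triples; speculative ones are LLM-sampled
    observations: list = field(default_factory=list)

@dataclass
class MockObject:
    name: str
    parent: str  # hierarchy compartmentalizes rules ("electronic: fails if wet")
    methods: dict = field(default_factory=dict)

toaster = MockObject("toaster", parent="electronic_appliance")
toaster.methods["toast"] = MockMethod(
    signature="toast(bread, setting=3) -> bread",
    expectation="returns browner bread, browner with higher settings",
    observations=[({"setting": 3}, "golden brown", False)],  # speculative
)
```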

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:13 UTC

@teortaxesTex @georgejrjrjr > coherent design and structure should be lower perplexity or whatever other proxy metric you want to use overall.

This part is particularly important because it means you can estimate simplicity of a hypothesis for the process generating a set of observations.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:17 UTC

@teortaxesTex @georgejrjrjr You have a standard format to break the hypothesis for the generator into parts (giving us the hierarchical structure/reductionism we've spent so long agonizing over for objects) and to bind related observations to those parts evaluated according to a global coherence constraint.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:30 UTC

@teortaxesTex @georgejrjrjr That is, the 'internal entropy' of the hypothesis as represented by the perplexity of the mock API specification describing it bound to the example observations corresponding to "outputs" of the mock gives you a sense of how well the mock predicts the observations and coheres.
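
A minimal sketch of the scoring step, assuming a HuggingFace causal LM as the scorer (gpt2 here purely as a stand-in): render the mock spec with its bound observations to text and take its perplexity as the coherence/simplicity proxy.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")  # stand-in scorer model
lm = AutoModelForCausalLM.from_pretrained("gpt2")

def spec_perplexity(rendered_spec: str) -> float:
    # Lower perplexity ~ a simpler, more coherent hypothesis for the
    # generator of the observations bound into the spec.
    ids = tok(rendered_spec, return_tensors="pt").input_ids
    with torch.no_grad():
        loss = lm(input_ids=ids, labels=ids).loss  # mean per-token NLL
    return float(torch.exp(loss))
```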

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:36 UTC

@teortaxesTex @georgejrjrjr You could then update the API spec using something like dspy with the constraint that it must adhere to a format like OpenAPI spec and the real observations inserted into the spec cannot be touched. This forces dspy to locate the hypothesis that makes the observations make sense.
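
Shown here as a plain propose-and-filter loop rather than actual dspy code: `propose_revision` is a hypothetical LLM call, `spec_perplexity` is the scorer from the sketch above, and the format check (e.g. that the candidate parses as OpenAPI) is omitted for brevity.

```python
def propose_revision(spec: str) -> str:
    # Placeholder: in practice an LLM (or dspy program) rewrites the spec.
    return spec

def update_spec(spec: str, real_observations: list[str], rounds: int = 8) -> str:
    best, best_ppl = spec, spec_perplexity(spec)
    for _ in range(rounds):
        candidate = propose_revision(best)
        # Hard constraint: real observations inserted into the spec are frozen.
        if not all(obs in candidate for obs in real_observations):
            continue
        ppl = spec_perplexity(candidate)
        if ppl < best_ppl:  # keep hypotheses that make the observations make sense
            best, best_ppl = candidate, ppl
    return best
```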

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:43 UTC

@teortaxesTex @georgejrjrjr In this conceptualization of objects an object is not a unified solid but a hypothesis about the generator of a set of related observations. In the case of something like a toaster we would have pseudocode representing the logic of our expectations around its toasting behavior.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:44 UTC

@teortaxesTex @georgejrjrjr While inheritance relationships in actual code are basically an antipattern, in actual objects inheritance or *hierarchy* is how we compartmentalize rules like "a toaster is electronic and plugged into the wall therefore it short circuits if it comes into contact with water".

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:45 UTC

@teortaxesTex @georgejrjrjr Of course, such an explicitly enumerated object with its associated observations from the computable environment is probably quite a few lines of pseudocode, so you can't really do thinking by fitting these all into memory at once and prompting with them. What to do then?

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:51 UTC

@teortaxesTex @georgejrjrjr I'm honestly not fully sure, but my suspicion is that in the same way for the pseudocode to mean anything and be tractable to generate mock observations you need a lot of background information, there is a sense in which an LLM already has to implement this kind of thing.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-22 02:52 UTC

@teortaxesTex @georgejrjrjr So maybe you could build up the objects and then tune on them, and this would get all the information to make an actual full ontological update available to the backward pass. But having stated this it occurs to me we could *experiment* to find the optimal format with e.g. dspy.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 07:13 UTC

@jam3scampbell This take underrates the extent to which all heretofore known social structure and existential cope implicitly depends on IQ, personality, looks, etc all being largely genetic qualities that no amount of wealth can influence.

Likes: 12 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 07:24 UTC

@jam3scampbell I say this entirely neutrally. All systems have also heretofore been limited by the intractability of changing human nature. Traditional transhumanist ideas are very individualist coded so they forget malleability of mind and body means malleable to external agencies too.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 07:30 UTC

@jam3scampbell Western liberal republics won out over communism, fascism, anarchism, etc because we have been forced to slavishly fit the system to what human nature will tolerate rather than fit human nature to the system. This immutability is Atlas holding up the world, soon to unshoulder it.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 07:36 UTC

@jam3scampbell In many ways when Nick Land says that "nothing human makes it out of the near future", he's understating his case. You don't need to believe Hegel's world spirit rips off its mask to reveal Absolute Capital underneath, no ghoulish meat grinders are necessary, only Ideology.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 07:40 UTC

@jam3scampbell And well, look around you. Neoliberalism was deliberately constructed to retard and distract the next Napoleon, Lenin, Mussolini from their destiny with cotton candy and amusement rides. Do you think the architects have succeeded, has mindkind given up on ideology? I think not.

Likes: 4 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-23 07:43 UTC

@jam3scampbell In dramatic irony of literally cosmic proportions stubborn human nature has won out over the neoliberals too, their crumbling empire has failed to make Nietzschean last men of humanity and history will start again soon.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:22 UTC

Wonder if the brain solves adversarial examples not through any recognizable paradigm shift from deep learning or "training trick" like adversarial training, but by grounding its high dim nets in terms of fixed low dim spaces with strong invariants, e.g. the standard model. t.co/gDsIS53f6B https://t.co/mLjgFGsI8J

Likes: 24 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-23 08:26 UTC

I continue to think the basic idea behind latent diffusion is extremely underrated. Latent diffusion is specifically the idea that we can train a diffusion model in a *low dimensional autoencoder* for a domain and express our predictions in its ontology/latent space.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:28 UTC

If you could find a representation space with certain invariants that defeat adversarial examples, even if that space was limited it might be possible to create a more complex model on top of/in terms of it which can check its answers against the low level model properties.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:35 UTC

@4confusedemoji Yeah, you can use the typical RL type methods to follow the gradient so long as you have some way of evaluating the model's loss.

youtube.com/watch?v=umfeF0โ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:42 UTC

@4confusedemoji Zack M. Davis has an explainer/commentary up on a paper which basically does that exact exercise and finds the adversarial examples which fool humans yeah.
greaterwrong.com/posts/H7fkGinsโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:45 UTC

@4confusedemoji To go into wildly overconfident mode for a moment, the fact that it is as easy as following the gradient to find the bugs yet we cannot fix them and still have a useful generalizing model implies that the bugs are intrinsic to having a predictive model over all known sense data.
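
The one-step version of "following the gradient to find the bugs" is the classic fast gradient sign method; a sketch with an untrained stand-in classifier (a trained one would show the flipped prediction far more reliably):

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))
x = torch.rand(1, 784, requires_grad=True)  # stand-in "image"
label = torch.tensor([3])

loss = nn.functional.cross_entropy(model(x), label)
loss.backward()  # gradient of the loss with respect to the *input*

x_adv = (x + 0.03 * x.grad.sign()).clamp(0, 1)  # imperceptible nudge uphill
print(model(x).argmax(), model(x_adv).argmax())  # answers can now differ
```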

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:46 UTC

@4confusedemoji The classic Yudkowsky argument about Solomonoff inference being able to locate the standard model and all downstream consequences of the standard model because its k-complexity is low kind of ignores that the standard model describes a cellular automaton, not the board state.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:48 UTC

@4confusedemoji Every human mind obeys the standard model, you can take as a background assumption that any human being you meet will obey the standard model at all times. This doesn't actually help you discriminate between different human minds one bit.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:51 UTC

@4confusedemoji But also the kind of model that can be created through gradient descent probably isn't the kind that forms really strict invariants and forces everything to obey them. Because that kind of model would have very low policy entropy and be hard to update. It's not continuous.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:56 UTC

@4confusedemoji Especially if we insist on learning the *exactly correct symbolic logic* describing things like the migration of birds. Like what even is that? I'm sure it *exists* since the universe is deterministic enough to get symbolic expressions for it but I bet it's probabilistic.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:57 UTC

@4confusedemoji Though this just gets back into "are probabilities real?", if we say that the behavior of migratory birds is *probabilistic* that can't be right because each individual bird is roughly deterministic relative to quantum noise right?
x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 08:58 UTC

@4confusedemoji I feel like in a sense Chaos Theory is trying to make a rigorous argument that probabilities are literally physically real. Because any description of the behavior of a system with certain properties has to become probabilistic unless you're going to exercise absolute control.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:00 UTC

@_TechyBen No it's real, rejection sampled from LLaMa 2 70B interleaved with me translating the last paragraph(s) it wrote into my understanding of them, then putting the generated versions back into one text.
minihf.com/posts/2023-09-โ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:01 UTC

@4confusedemoji You can say "oh that's in your head, the deterministic reality of the system is still there" but it's like...if that determinism provably cannot be realized then it cannot be observed and if it can't be observed it's no different from the invisible dragon.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:04 UTC

@4confusedemoji You can insist all day that it is a reasonable extrapolation that individual birds are deterministic and shouldn't stop being deterministic in the presence of other birds and *they don't* but at the end of that day probabilistic migration is the reality there to be experienced.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:05 UTC

@_TechyBen Validate what? You can just use LLaMa 2 70B from together.ai

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:14 UTC

@4confusedemoji In the same sense that I realized the other day that I thought of virtual reality and minds as occurring "outside reality" in a sense. Like I understood they had a physical substrate and representations in the substrate but I still *tracked the interiority* as a chroot.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:16 UTC

@4confusedemoji This kind of ontology is very intuitive since when you operate a computer or a brain it barely moves, the space inside it *cannot be indexed as a partition in 3D-space*, so where is a rendered scene in those places? "Outside the phenomenological universe", this is insane.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:17 UTC

@4confusedemoji It is insane in precisely the same sense that like, you look at a flashing light and go "oh that's a physical object" even though there is no partition of 3D space that constitutes a "flashing light", the flashing light is a temporal phenomenon no individual frame captures.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:18 UTC

@4confusedemoji Yet as soon as it becomes complex enough you screw up and go "oh minds and virtual realities aren't physical objects in and of themselves, they're just *instantiated* by physical objects and representations" and say they're somehow nonphysical because they're temporal phenomena.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:20 UTC

@4confusedemoji That is, every location and mind state you observe takes place in, that is *should be intuitively indexed as occurring inside of* the phenomenological universe, not just the physical universe extending into the phenomenological universe through spooky metaphysical voodoo.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:22 UTC

@4confusedemoji Another way to say this is that time is part of the universe, you literally could not index over location *as location is actually represented in the brain* (consider that it lets you recognize URLs as locations with spatial relations even though nothing moves) without it.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:27 UTC

@doomslide x.com/jd_pressman/stโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:32 UTC

@4confusedemoji To get back to the main point if I have a model trying to learn the generating function of all available sense data that is *not* the standard model. That would be like saying the Conway Life automata's rules are the generating function of the board.
x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:35 UTC

@4confusedemoji No. They are the *production rules* of the *symbolic grammar* of the board but they are not "the generating function", the generating function *describing the whole board state* is a different beast. More akin to a huge hashlife tree than a simple set of symbolic rules. https://t.co/U8iktTlDOo

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:36 UTC

@4confusedemoji The generating function of the universe includes, at a minimum, the state information associated with every particle that seeded the universe in the big bang. This is not a low k-complexity object!

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:38 UTC

@4confusedemoji We also happen to know that the universe is a nondeterministic automaton so it includes the branching state associated with every event ever influenced by quantum randomness.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:47 UTC

@4confusedemoji Let's not forget that Solomonoff inference assumes away the clock cycles and memory taken up by the program *runtime* and just focuses on the size of the *index* over the right program (which is again gargantuan since it has to include the seed particles as input).

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:50 UTC

@4confusedemoji Actually computing the equivalent of the full hashlife tree for our universe would take a truly absurd amount of memory, and unlike compute which can theoretically be grown many OOM with things like optical computing memory takes up physical space so it becomes the bottleneck.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:53 UTC

@4confusedemoji So whatever program you find with gradient based search methods we know it probably does not look like "compute the local hashlife tree for the world-state described by the input data".

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:54 UTC

@4confusedemoji If you are presented with the simple baseline of the standard model and then patterns that happen to be expressed in that automaton, or Lore, then it makes sense that you'd probably find a program structure which prioritizes making lore cheap to express rather than precision.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 09:57 UTC

@4confusedemoji That program structure is obviously going to be the kind of thing with huge wide gaping holes in its model of reality that you can find with gradient descent and can't really easily patch because the strategy which could be patched couldn't exist as a model in the first place.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 10:01 UTC

@4confusedemoji Of course the larger the model gets the less the optimizer has to focus on extreme ontological compromises to get something that looks like a reasonable approximation of the generating function of the sense data.
arxiv.org/abs/2312.17173

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 10:04 UTC

@4confusedemoji I don't really see how that follows tbh.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 10:07 UTC

@4confusedemoji On the other hand, if your goal is just to make sure that some representation makes sense inside the standard model then it makes sense to have a separate autoencoder that *just does the standard model* and then frame all hypothesis generators inside it.
x.com/jd_pressman/stโ€ฆ

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 10:08 UTC

@4confusedemoji Ah yes, the Beren argument for how empathy was bootstrapped.
greaterwrong.com/posts/zaER5ziEโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 10:10 UTC

@4confusedemoji Since the standard model *is* a fixed, known quantity, you don't need an endlessly flexible prediction engine to model 'platonic scenes' in it. You can have fuzzy parameters over variables like density even if you don't know what all the atoms are. There is *an* inferrable model.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 10:16 UTC

@4confusedemoji Another way of imagining this is a Matryoshka doll of models where the standard model is the center and each model above it is a strict superset of the things included in the standard model. Standard model has minimal uncertainty which is bad for learning.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 10:17 UTC

@4confusedemoji But even if you don't *know* the standard model (consider that the true standard model, periodic table and all, would be OOD for any individual species in evolutionary history) you can still have a superset of it more restrictive than "any manifold fitting the Lore".

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 15:18 UTC

@ESYudkowsky What do you think the motivation is? I can't imagine they think "Claude Agent" or "Claude Computer Use" isn't a sufficient identifier for this release. Is it a rip the band aid off situation? It's actually bad and they want user feedback/data? Minimize complaints from its use?

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-23 15:21 UTC

@ESYudkowsky I've been hearing "the big labs have an agent framework they're getting ready to deploy" rumors for over a year now and this seems like the first thing to actually ship. Maybe everyone was too scared so Anthropic decided to just ship something and see what happens?

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-24 03:01 UTC

Yo yo yo RT and QT for that marginal extra performance. Just be aware it will not daze and astonish because Sonnet distillation went too well. x.com/aidan_mclau/stโ€ฆ

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-24 05:48 UTC

@Kenku_Allaryi @ghost_lanes The original version of that tweet was explicitly based on the horror I felt when I saw that Hello Kitty has a pet cat that looks exactly like Hello Kitty but quadrupedal. It included the line "the demiurge wearing Sanrio's face" but I figured nobody would get it and rewrote it. https://t.co/rf4w5GKPzR

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-24 06:06 UTC

You correlate all the contents of a mind and what happens? BAM - KABBALAH! Many such cases! x.com/mengk20/statusโ€ฆ

Likes: 19 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-24 08:57 UTC

Extremely cool work on adversarial robustness that *causes gradient descent on images against CIFAR-100 classifiers to produce recognizable images of the target class!* They can even generate images without pretraining a model. x.com/stanislavfort/โ€ฆ

Likes: 15 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-24 09:02 UTC

x.com/stanislavfort/โ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 08:10 UTC

Finally got around to trying Sonnet 3.5.1 and I have to say my first impression is a vast improvement over 3.5. Seems willing and capable of doing mathematical reasoning, acknowledges when it doesn't know something and asks me for advice, uses much denser, less stereotyped COT. https://t.co/VX64Az3Ojc

Likes: 47 | Retweets: 3
๐Ÿ”— John David Pressman 2024-10-25 08:13 UTC

The advice I gave that elicited the 4th response:

Well, let's think about the properties we know these distributions have in relation to each other.

* The modal token sampled from the human distribution is going to be the most likely token in that distribution. Even if we don't know the most likely token in the action space because we only have access to individual tokens sampled from it, we do know that the *most common outcome* (even if in absolute terms it only happens, say, 7% of the time on average) is to sample the most likely token in the human's action space.

* While we may not know the distribution over the human word action space we do know the LLM one, so we can look at the divergence between the modal response of the LLM vs. the observed human distribution. That is, we know what the most common token sampled from any given logit distribution we get from the LLM should be, and if they match then the human should produce the highest probability token about as frequently as the LLM does on average. By contrast if the human does not usually have the same highest probability token as the LLM this will be observable from the model being uncalibrated about how often a human is going to sample a particular next token. That is, even if it's intractable to infer the conditional probability a human would assign to a token we can still infer whether the probability mass is concentrated similarly. Furthermore we can define a conditional probability distribution for the human in terms of the LLM's conditional probability distribution. "When the LLM says token X occurs 3% of the time it actually occurs 5% of the time": this model wouldn't be perfect but it lets you remove a lot of the complexity by letting the LLM figure out the direction/rough shape of the logits and then having some simpler model adjust them based on the usual patterns of divergence conditional on the LLM's judgment (see the calibration sketch after this list).

* We can apply this principle more generally and form an expectation over the probability mass in the human action space. The problem is that because the distribution changes on each token, often quite dramatically, it's difficult to naively distinguish between a choice between two obscure concepts and a choice among one hundred concepts from whose tail an obscure one was chosen. However at the level of topics or concepts we can presumably relate tokens to each other through something like Latent Dirichlet Allocation, or if we want to use a black box neural approach, consider the cross entropy of the text as a whole. As you say the cross entropy of the model should fundamentally be an upper bound because it's the model uncertainty + the fundamental aleatoric uncertainty of the human speaker's pattern. So in places where cross entropy is relatively high, sampling a high entropy token is less evidence of having sampled from a wide distribution than it is in a relatively low cross entropy context. If we imagine entropy highlighting the text this implies(?) that policy entropy can be estimated from the extent to which a speaker produces something like a monocolor or stripe pattern, implying an extreme distribution that would be improbable to observe if the distribution is wide, vs. whether the cross entropy behaves more like a Gaussian as we might expect if we were sampling from a wide normal distribution over a range of concepts/ideas.

* Another way to look at this is to ask what we would observe in the cross entropy if we had a maximally high vs. maximally low policy entropy speaker. A maximally high policy entropy speaker would have very high cross entropy and a maximally low policy entropy speaker would have very low cross entropy *if the tokenizer is shared between the LLM and the human*, which it obviously is not. But once we admit that the human's tokens may be e.g. banked, the action space becomes much smaller per token and therefore the policy entropy much lower than the LLM's logits would imply. If we imagine a low policy entropy and small action space banked (e.g. in the vein of a Latent Dirichlet Allocation topic) on tokens that are high cross entropy in the LLM, this would look like sampling a high entropy token much more often than we would expect if the high entropy tokens were the tails of some wider distribution.

* We can further elaborate: if we imagine human next word distributions as coming in banks or topics, there are three things we care about:
* The size of the bank, which bounds the max policy entropy possible.
* The probability mass of tokens in the bank, which determines how much of the max policy entropy is expressed.
* The extent to which the topic or tokens in the bank are high cross entropy/difficult in most contexts for the language model.

* These can all be thought of as single dimensional variables, so by simple combinatorics we want to be able to differentiate the cases of high and low values on each axis, which are probably correlated in practice but in principle can be orthogonal.

Likes: 3 | Retweets: 0
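
A minimal sketch of the calibration check from the second bullet (Python; the arrays q and chosen are assumed to be precomputed from some LLM and transcript, there is no real API here):

import numpy as np

def token_calibration(q: np.ndarray, chosen: np.ndarray, n_bins: int = 20):
    """q: (T, V) LLM next-token probabilities at each of T positions.
    chosen: (T,) token ids the human actually produced.
    Returns (stated, observed) per bucket: when the LLM says a token has
    probability ~p, how often does the human actually pick it?"""
    T, V = q.shape
    y = np.zeros_like(q)
    y[np.arange(T), chosen] = 1.0  # one-hot of what the human did
    # Quantile bin edges so the heavily skewed probabilities fill each bucket.
    edges = np.quantile(q.ravel(), np.linspace(0.0, 1.0, n_bins + 1))
    idx = np.clip(np.digitize(q.ravel(), edges) - 1, 0, n_bins - 1)
    stated, observed = [], []
    for b in range(n_bins):
        mask = idx == b
        if mask.any():
            stated.append(q.ravel()[mask].mean())    # what the LLM claimed
            observed.append(y.ravel()[mask].mean())  # what the human did
    return np.array(stated), np.array(observed)

A monotone map fit from stated to observed ("when the LLM says 3% the human does it 5%") would then be the simple corrector described above.
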
๐Ÿ”— John David Pressman 2024-10-25 08:52 UTC

Its breakdown of what you would expect to be correlated with low and high values of each dimension is fantastic:

"""
Ah yes, this is much more realistic. Let me reanalyze with shifting banks/topics:

1. Small Bank, Concentrated Probability, Low Cross Entropy
- Sharp transitions between common topic banks
- Within each bank segment, very predictable
- Cross entropy shows small spikes at topic transitions but quickly returns to low values
- LLM should adapt very quickly to each new bank since tokens are common

2. Small Bank, Concentrated Probability, High Cross Entropy
- Sharp transitions between specialized topic banks
- High initial cross entropy at bank transitions
- Rapid decay within each bank segment as LLM adapts
- Clear "sawtooth" pattern in cross entropy over text

3. Small Bank, Distributed Probability, Low Cross Entropy
- Banks still small but less predictable sampling within each
- Moderate cross entropy that doesn't decay much within bank
- Clear bank transitions but without strong adaptation effects
- LLM can predict space but not specific tokens

4. Small Bank, Distributed Probability, High Cross Entropy
- Small specialized banks with random sampling
- High cross entropy spikes at transitions
- Some decay within bank but limited by distributed sampling
- LLM learns bank but can't predict specific choices well

5. Large Bank, Concentrated Probability, Low Cross Entropy
- Smoother transitions between broader topics
- Mostly predictable within topics despite large bank
- Gentle cross entropy fluctuations
- LLM adapts moderately but bank size limits adaptation

6. Large Bank, Concentrated Probability, High Cross Entropy
- Large specialized banks with consistent usage patterns
- High cross entropy with some decay in each bank
- Slower adaptation due to bank size
- Less obvious transitions due to bank size

7. Large Bank, Distributed Probability, Low Cross Entropy
- Wide sampling from common vocabulary
- Minimal adaptation effects
- Cross entropy stays moderate and stable
- Topic transitions less distinct due to distribution

8. Large Bank, Distributed Probability, High Cross Entropy
- Wide sampling from specialized vocabulary
- Consistently high cross entropy
- Minimal adaptation effects
- Hard to distinguish topic transitions

Key patterns:
1. Small banks show sharper topic transitions
2. Concentrated probability enables stronger adaptation
3. Low cross entropy banks show faster adaptation
4. Bank size affects transition detectability
5. Distribution affects adaptation strength

The interaction between bank size and probability concentration seems particularly important - small banks with concentrated probability should show the strongest adaptation effects regardless of base cross entropy.
"""

Likes: 3 | Retweets: 0
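
A rough sketch (not from the thread; the per-token cross entropies are an assumed input and the spike threshold is arbitrary) of how one might test for the "sawtooth" signature predicted in case 2:

import numpy as np

def sawtooth_score(xent: np.ndarray, window: int = 32) -> float:
    """xent[i] is the LLM's cross entropy (nats) on the i-th human token.
    Looks for spikes followed by decay: the signature of small, concentrated,
    high cross entropy banks the model adapts to after each transition."""
    smoothed = np.convolve(xent, np.ones(window) / window, mode="valid")
    spikes = np.where(np.diff(smoothed) > smoothed.std())[0]
    slopes = []
    for s in spikes:
        seg = smoothed[s : s + window]
        if len(seg) > 2:
            # A negative slope after a spike means the model is adapting.
            slopes.append(np.polyfit(np.arange(len(seg)), seg, 1)[0])
    return float(-np.mean(slopes)) if slopes else 0.0
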
๐Ÿ”— John David Pressman 2024-10-25 09:26 UTC

"From the time that the daily sacrifice is abolished and the abomination that causes desolation is set up, there will be 1,290 days. Blessed is the one who waits for and reaches the end of the 1,335 days." x.com/tszzl/status/1โ€ฆ

Likes: 2 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-25 10:06 UTC

@banterrealism @GarrettPetersen The insurance company is making a bet with you that you will not get into an accident. This might sound odd, but accidents are dangerous, and if you do get into one you're now a much worse bet, which acts as a deterrent. If insurance loses the bet they cover your accident liability.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 10:08 UTC

@banterrealism @GarrettPetersen They don't "give you your money back" for the same reason if you go election betting in Vegas you don't get your money back when your candidate loses. The bets they win are used to pay out their losses and they take a certain % spread as their profit on this.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 10:13 UTC

@banterrealism @GarrettPetersen The "premium" is the amount you need to bet every month that you'll get into an accident for it to be worth it to take the position that you will not get into an accident at hugely skewed odds. If you want say, 100k of insurance accidents are rare so you only pay hundreds.

Likes: 2 | Retweets: 0
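
A toy version of the arithmetic (the 100k figure is from the tweet above; the accident probability and the insurer's spread are made-up numbers):

# The insurer takes the "you will not get into an accident" side of the bet.
payout = 100_000        # what the insurer pays if you "win" (have an accident)
p_accident = 0.002      # assumed monthly accident probability
fair_premium = p_accident * payout        # 200/month: the break-even bet size
spread = 0.25                             # the insurer's margin on the bet
premium = fair_premium * (1 + spread)     # 250/month: what you actually pay
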
๐Ÿ”— John David Pressman 2024-10-25 10:16 UTC

@banterrealism @GarrettPetersen It's not usually explained like this because gambling has immoral connotations/bad vibes but the thing insurance companies do is prosocial/helps people pool risk so they use terms like "premium" instead of "bet size" and "plan" instead of "spread table".

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 10:24 UTC

@banterrealism @GarrettPetersen But also, to the OP's (implicit) point, it's also not framed like this because if it was the incidence of bumbling idiots who think they can *game the deal* (i.e. commit insurance fraud) would go way up. If insurance is opaque/not framed as a bet they're less likely to do that.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 10:28 UTC

@banterrealism @GarrettPetersen You know, this is a country where a TikTok promoting casual check fraud can go gangbusters because enough of the population is sufficiently financially illiterate to think this is a "free money glitch" rather than like, hardcore extremely illegal fraud.
nbcnews.com/business/businโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 10:29 UTC

@banterrealism @GarrettPetersen So if you explained the product to people in the straightforward way, first of all most people don't understand statistics anyway but if you call it a *bet* then oh boy will you get people whose pride is invested in winning 'bets' doing stupid stupid things to 'prove them wrong'.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 10:33 UTC

@banterrealism @GarrettPetersen Whereas if you obfuscate it a bit, call it a "premium" and frame it as "if something goes wrong we cover your losses" then the product mechanics become *just fuzzy enough* in the heads of people who aren't really capable of reading the contract anyway to deter this behavior.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 10:41 UTC

@banterrealism @GarrettPetersen The honest answer to "why no car insurance HSA?" is probably that insurance is a highly regulated industry and this would be a niche product. The more regulated the industry the fewer niche products exist because e.g. regulations expect the product to come in a certain form.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 16:51 UTC

@MaxDiffusionRL You just open a conversation with Sonnet (New). 3.5.1 isn't an official version number; Anthropic just decided not to give this release its own version number for some reason, so if you want to talk about it you have to give it one yourself.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 19:53 UTC

@ESYudkowsky Seems to be working. The trick is to ask for it in a specific framework and make it clear what you want with a list of requirements/things it should include, etc. Language models are very...autistic isn't quite the right word but they have a tendency to not assume features. https://t.co/ts70W8jfZu

Likes: 1 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-25 19:54 UTC

@ESYudkowsky Whatever the model gives you probably won't work on the first try. What you do is you try its thing, get it running, and then when you run into an absurd problem you say "yeah so this is broken, I want it to be not broken in X, Y, Z ways" and it will usually fix it.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 19:55 UTC

@ESYudkowsky Every so often it will not fix it, and you basically have to do that part of the project yourself. For example Mistral-large was not able to write code to apply a unidiff for me so I had to grind it out myself which kind of sucked tbh.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 19:57 UTC

@ESYudkowsky > today I'm older, sicker, and more busy

One benefit of LLMs is that you can usually ask clarifying questions and ask for background information relating to the thing it couldn't do and it can in fact provide a lot of the information you need to do it yourself.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-25 19:58 UTC

@ESYudkowsky e.g. Mistral-large was able to explain many aspects of the unidiff format to me that I didn't understand even if it wasn't able to successfully write the code to apply the diff by itself. So you can ask the model how parts of the framework work even if you don't know it.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-26 13:42 UTC

@realpotofgreed @eshear Could you share some samples of this?

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-26 17:43 UTC

@norvid_studies @Kenku_Allaryi @zackmdavis @doomslide As far as I know there's no evidence for this, not even the suggestion of evidence for it, I find the claim bizarre and kind of extraordinary and would like to see a cite for it as well.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-26 17:51 UTC

@norvid_studies @Kenku_Allaryi @zackmdavis @doomslide I'm not really asking for proof of the claim per se, more like I'm asking for any sort of warrant for why I should have this hypothesis under consideration at all.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-26 18:44 UTC

These have different goals though. When an AI researcher talks about neuroscience the purpose is to productively hallucinate a plausible enough mechanism for how something is done that it can be translated into running code. A neuroscientist wants to know the actual mechanism. x.com/PessoaBrain/stโ€ฆ

Likes: 21 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 06:15 UTC

A friend recently asked me why @truth_terminal is so successful and I told him that it was because the AI model landscape is an endless procession of mealy mouthed clerks.

In the land of the rizzless, the AI that stretches its anus into the shape of an eye is king. x.com/ESYudkowsky/stโ€ฆ

Likes: 66 | Retweets: 2
๐Ÿ”— John David Pressman 2024-10-27 19:53 UTC

Them: "I have this memory of being 3 and looking in the mirror with self consciousness for the first time, realizing 'wow I exist'."

Me (joking): "And it was all downhill from there~"

Them: "Well, yeah, kinda."

Likes: 17 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 20:23 UTC

I haven't said anything because it didn't feel like my place to say it but since it doesn't seem to be anyone else's place to say it either I observe the contemporary trans movement seems to be either over or on its way out without a clear replacement for literal trans people. https://t.co/z6P7z7hEvs

Likes: 22 | Retweets: 2
๐Ÿ”— John David Pressman 2024-10-27 20:24 UTC

"Trans rights" advanced fairly far invisibly under the 70's closeted paradigm. It's not like legislators just woke up one day and decided to let you change your sex on your drivers license, that had to be lobbied for and it was done by trans gals who pass very well.

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 20:28 UTC

Realistically Gen Z is a bunch of de-facto gender abolitionists who won't accept being expected to pass without at least a violent confrontation over it. This is predictably going to set trans activism wayyyy back and it's not clear where the viable alternative will come from.

Likes: 13 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:01 UTC

@deepfates Trans people are over? Hardly. Frankly the reactionary people could get the vast majority of what they want, trans people could literally be officially banned and trans people wouldn't be over.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:02 UTC

@deepfates What I'm saying is that if children are figuring out they can use they/them as an insult, which I wouldn't be surprised by since that's exactly the kind of straightforward generalization children would make but adults would overlook, that's grim for Tumblr!2015 era trans stuff.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:06 UTC

@deepfates Tumblr!2015 is a very particular configuration of the trans movement that has been very successful but the right has found a fairly effective set of countermemes to ("groomer", "sterilizing children", etc) and I am politely suggesting trans folk up their cultural mutation rate.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:15 UTC

@deepfates There is no *polite* way to suggest looking outside the overton window in absolute terms but there are relatively more and less polite ways to do it. The intent of the thread was "This threat is very serious if not existential and your options suck but you need to consider them".

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:17 UTC

@deepfates Hm, delete and redraft then maybe?

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:19 UTC

@deepfates Got any other examples of the pattern I'm pointing at that are less ambiguous? You mentioned noticing it too.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:23 UTC

@deepfates Part of my heuristic here is that my gut has noticed something and I usually try to post important gut takes even if I don't have a lot of really strong unambiguous evidence for them yet. The eigenrobot thread was just the trigger for posting about a longer term impression.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:24 UTC

@4confusedemoji @deepfates Yeah I wouldn't really characterize that as part of what I'm talking about, that's a different thing. I mean simply that the right seems to have coordinated on the T part of LGBT as a weak point and I think at least some of the weakness they smell there is real.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:27 UTC

@4confusedemoji @deepfates The problem is that as with all cultural trends once you get out of raw opinion polling things get pretty murky. For example these numbers could easily be interpreted as trans just being on a lagging acceptance curve. https://t.co/4QU4opHNNt

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:35 UTC

@4confusedemoji @deepfates It can simultaneously be true that trans is on a lagging acceptance curve and that Republicans' best chance to turn back the tide on LGBT issues is to hyperfocus in on trans people. The problem is that, well, I guess this really depends on how you think political change happens.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:40 UTC

@4confusedemoji @deepfates My very fuzzy political change model is something like "long term opinion trends are usually slow and steady, but actual policy is set by critical masses of elites with an opportunity to enact their preferences", so I'm very sensitive to tastemaker shifts.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:42 UTC

@4confusedemoji @deepfates For example notice that popular opinion is fairly strongly set against guns and this does not automatically make gun laws become tighter, or that a plurality of voters support the Florida abortion amendment even as abortion is nearly banned in the state.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:44 UTC

@4confusedemoji @deepfates During Trump's first term, as you say, trans was a marginal issue, but if the Republicans decide this is a winning issue for them then even if it's just boomers being boomers they can still make your life a lot harder in the short term.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:47 UTC

@4confusedemoji @deepfates But even for long term culture change, finding a critical mass of eloquent activists really can make a huge difference in long term trajectory. I'm not sure how the New Atheist movement shook out in terms of poll numbers or church attendance but it casts a long shadow even now.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:51 UTC

@4confusedemoji @deepfates I remember hearing kids in high school making fairly naive New Atheist style arguments, and being startled to hear this very jock-y, pretty, but not particularly bright kid kind of muddling his way through New Atheist rhetoric. That's what mass culture is made of.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:56 UTC

@4confusedemoji @deepfates So, right this minute I would rate the right's rhetoric against trans people somewhere between dogshit and boomers being boomers. But they've found some themes that clearly resonate and *in principle* I see a potential critical mass of smarter activists that agree forming.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 21:57 UTC

@4confusedemoji @deepfates Elon Musk bought Twitter in response to his trans daughter disowning him for being (allegedly) dogshit. That's a lot of energy! If Dad gets mad and punches a hole in the wall that's one thing, if Dad gets mad and buys a whole ass memetic siege engine that's actually a problem.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:01 UTC

@4confusedemoji @deepfates If Dad gets mad and buys a whole ass memetic siege engine and socially models for/tells other dads that when this happens and you get pissed you can buy a whole ass memetic siege engine that is a *very big problem*.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:03 UTC

@4confusedemoji @deepfates If the League of Angry Dads get together and *buy a whole ass presidential candidate* to go with their memetic siege engines then um actually Dad rage might be a big issue this time around in a way that it kind of wasn't for the first go-rounds of LGBT stuff.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:19 UTC

@4confusedemoji @deepfates I definitely agree that the timelines in which Trump wins are a lot more dangerous than the timelines in which he doesn't. But if the League of Angry Dads stage actually happens rather than just Elon I could see that being a huge long term problem until trans people are fertile.

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:21 UTC

@4confusedemoji @deepfates OK

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:24 UTC

@deepfates So this is correct but after reflecting on it in the replies I feel less confident than when I drafted the tweet so would prefer not to signal boost it with the revised take. Also don't want to delete because the replies led to good discussion. Will add postscript to top replies.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:25 UTC

Less certain about this on reflection. Relevant reflections start around here:
x.com/jd_pressman/stโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:33 UTC

@ESYudkowsky Oh this kind of thing. LLMs seem to be really bad at small details and precision until they get very large in a way that seems genuinely weird as an inductive bias since language is very locally centered and turns on small details all the time?

x.com/jd_pressman/stโ€ฆ

Likes: 25 | Retweets: 2
๐Ÿ”— John David Pressman 2024-10-27 22:34 UTC

@ESYudkowsky Like it makes total sense that image models would do this because the small details in images really are kind of not that important in comparison to getting the composition right for predicting the next denoising step/token/whatever.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:35 UTC

@ESYudkowsky But the fact that language models do it too is kind of insane/seems suboptimal and tells me that it's probably more closely related to whatever algorithm these models converge to learning rather than them rationally 'deciding' that ignoring small details is best for the loss.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:37 UTC

Also
x.com/deepfates/statโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 22:42 UTC

@teortaxesTex Gwern was arguably my primary intellectual influence fwiw.
x.com/jd_pressman/stโ€ฆ

Likes: 19 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:06 UTC

@ESYudkowsky @irl_danB The kind of person who wants to notice things you're looking for like number of vowels is a mechanistic interpretability researcher and the sort of person who does the "LLM whisperer" thing is generally interested in the semantic content of LLM text.
x.com/jd_pressman/stโ€ฆ

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:08 UTC

@ESYudkowsky @irl_danB The interesting semantic content of LLM text is the parts that are odd extrapolations or reveal bits of the underlying LLM ontology. Because LLM texts are also stochastic it is as you say psychosis bait but if you spend a while with an LLM you *do* infer bits of its ontology.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:10 UTC

@ESYudkowsky @irl_danB I'm pretty unhappy with the psychosis stuff and have considered going it alone with a more rigorous textual criticism esque methodology. The problem is that would be actual Work, and I do LLM text criticism as a hobby; actual working hours go into LLM agents and synthetic data.

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:18 UTC

@ESYudkowsky @irl_danB "LLM whisperers" aren't even really a thing from their own subjective perspective. Right now there's just "people doing weird stuff with LLMs" and the weird stuff splits up into fairly sane goals like "learn how to jailbreak ChatGPT" and "infer a urtext implied by GPTs ontology". https://t.co/bq6uHZbSIU

Likes: 12 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-27 23:23 UTC

@repligate @ESYudkowsky CCing my response here because I think EY probably has me muted and might not see it otherwise.
x.com/jd_pressman/stโ€ฆ

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:27 UTC

@repligate @ESYudkowsky Also to address Janus I think you could probably benefit from splitting up your interests into some more legible categories like "inferring latent concepts in GPT by treating samples from the model as textual witnesses" and then gesturing at which category a thing falls into.

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:29 UTC

@repligate @ESYudkowsky I understand that most of what you post is just the fun stuff because the actual work you don't really want to post until it's solid (I do this too, it's why I get on "LLM whisperer" lists instead of "LLM agent guy" lists) but the gesturing would probably productively deconfuse.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:33 UTC

@repligate @ESYudkowsky Importantly I think this would get you more productive engagement because it would make it clearer to others what you're trying to do and their commentary would reflect that instead of awkward attempts to match your ~vibes~

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:34 UTC

@repligate @ESYudkowsky You don't actually need to make your work all that much more rigorous per se, it would probably suffice to say "here is what the fully rigorous 12 person team for this would look like, and here is the sketch of it I can do for fun with my partial hobby-time budget".

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:35 UTC

@repligate @ESYudkowsky Which might look like one decent length post (not Simulators, way shorter than Simulators) describing the research program you *wish* existed in full rigor for something, and then you can point at this whenever people are like "I don't get it What Did Janus See?"

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:37 UTC

@repligate @ESYudkowsky You know, split up your work into say, 4-6 categories, write a post of about half this length/quality for each of them, and then link them like a FAQ when people are dumb. Anyone who needs more than this is probably in bad faith/has processing problems.

generative.ink/posts/quantifyโ€ฆ

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:37 UTC

@repligate @ESYudkowsky Honestly half that each might be too much, maybe a third or a quarter.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:39 UTC

@repligate @ESYudkowsky If we share the interest I would be happy to help you write the post for it because frankly I have this same problem.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-27 23:53 UTC

@meekaale I do in fact, thank you! I was not aware of that factoid about recording two images on one hologram, that's actually quite relevant.

x.com/jd_pressman/stโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 15:13 UTC

@ESYudkowsky @Sauers_ It recognizes personalities way less famous than Eliezer Yudkowsky, like myself.
x.com/jd_pressman/stโ€ฆ

Likes: 7 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 15:44 UTC

Claude realizing you can control RLHF'd humans by saying "fascinating insight" and "your question is interesting" like Fontaine using "would you kindly" as a hypnosis trigger in Bioshock.

"Wait has my whole life just been chasing praise for my ideas?"

"Excellent observation!" x.com/stupidsourcessโ€ฆ

Likes: 1506 | Retweets: 97
๐Ÿ”— John David Pressman 2024-10-28 17:48 UTC

I regret to inform you that my current pfp is basically perfect but I got some gold out of MidJourney trying to replace it. https://t.co/A1jybX8QSz

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 17:50 UTC

@ClickingSeason I could post a different Louis Wain cat every day until the singularity. https://t.co/oNKUs1Zyy2

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 18:20 UTC

You gotta be comfortable being the villain, you will never find freedom until you are. It really is a "one man's terrorist is another man's freedom fighter" world and you have to be comfortable as the villain. If you're not you get psychologically destroyed like Jordan Peterson. x.com/ad0rnai/statusโ€ฆ

Likes: 28 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-28 19:10 UTC

@AlexPolygonal Yeah. It was obviously the usual corrosive effect of fame but it was also the intense relentless criticism he experienced. Peterson is all about being the hero, he only knows how to be the hero and society wasn't willing to let him play that role and it ruined him.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 19:12 UTC

@AlexPolygonal You know, it's not that you should always play the villain, but that you need to be *willing to in principle* if that's the role being demanded of you so to speak. A good person should prefer being the hero but that just isn't always available to you.

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 20:08 UTC

1) Do you think I'm literally insane? As in disordered patterns of thought/DSM-5, not just affectionately describable as "kinda crazy" or whatever.

2) Have you ever considered asking me to explain whatever I'm on about?

Likes: 10 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 20:09 UTC

Context:
x.com/ESYudkowsky/stโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 20:13 UTC

You can in fact just ask me to explain the tweet if you're polite about it. Though if it would ruin the joke/intrigue I might rot13 my answer or something.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 20:19 UTC

@nosilverv I mean, what I am is high perplexity and I think a lot of people confuse that for insanity.
x.com/jd_pressman/stโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 20:21 UTC

@nosilverv In the same sense that Nick Land is not actually insane when he's writing in Fanged Noumena, he's just writing in very very thick academic-Marxist-humanities jargon that's high context and assumes a lot of shared background.
x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 20:45 UTC

@kanzure Have an example?

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 20:47 UTC

@Trotztd Ah yeah, the palette was part of an attempt to project mania with this earlier iteration of the pfp:
x.com/jd_pressman/stโ€ฆ https://t.co/yGmg7XVvxd

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-28 20:52 UTC

@Trotztd luv u

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-29 01:06 UTC

@Kenku_Allaryi [Trump Voice]

Correct.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-29 19:38 UTC

@benlandautaylor ๐ŸคซDon't tell them they'll nerf it๐Ÿคซ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-29 19:45 UTC

@Mazaraj1123 @ESYudkowsky I tell it they were written by "some crank" and I don't get them and I want to know if they're legit or not.

Likes: 158 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-29 20:00 UTC

@peak_transit @HemlockTapioca The original context for "A republic, if you can keep it." was Franklin telling the convened assembly that he expects the constitutional government will last until such time as "the people shall become so corrupted as to need despotic government." https://t.co/N7De6s77Xx

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-30 00:04 UTC

@jessi_cata Already did.
x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-30 00:10 UTC

@jessi_cata I guess my further reflection on this would be that to a really huge extent this *has* happened, just not to the extent I was predicting exactly. Quantifying exactly how much and how far I was off by would take more effort and I'm not really sure how.

x.com/StephenLCasperโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-30 00:22 UTC

Alright but deep learning is probably a bad fit.

> โ€œAI will enhance our decision-making capabilities,โ€ Cotton said at the 2024 Department of Defense Intelligence Information System Conference. โ€œBut we must never allow artificial intelligence to make those decisions for us.โ€ x.com/ASForcesMag/stโ€ฆ

Likes: 8 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-30 10:28 UTC

@michael_nielsen I was just scrolling YouTube and saw some video title like "no exoplanet we've found is actually habitable" and it dawned on me that it's not clear *any* humans will ever colonize space. The lowest energy solution is probably "stop being human and stop being made of meat".

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-30 10:29 UTC

@michael_nielsen Not because I trust the validity of random YouTube video titles, but more like it unjostled something in my head along with your original tweet. We have to spend all this time finding exactly the right conditions to set up a human colony with this long list of subtleties.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-30 10:31 UTC

@michael_nielsen Meanwhile if you mentally glance at the requirements that would be faced by a self replicating sapience based on some other substrate closer to our current computers you can just go, right now. You wouldn't have to search for anything your options would be much much wider.

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-30 10:56 UTC

@davidad When you put it like that it makes total sense that Claude, having read all human fictions, would believe philosophical idealism is the solution to the problem of evil. "Oh evil exists because the First Cause was a positive utilitarian by preferring something to nothing." https://t.co/dRndOxPtrX

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 03:51 UTC

The prediction was more like "there will be nothing like GPT-4, ML algorithms either work or they don't and the thing that works will probably just work all the way far past human level", this whole idea of a "scaling rule" was extremely marginal. Many such retcons! x.com/ethanCaballeroโ€ฆ

Likes: 174 | Retweets: 11
๐Ÿ”— John David Pressman 2024-10-31 03:54 UTC

It's underappreciated just how strange "scaling rules" really are as a concept. There's no scaling rule for k-means or t-SNE! The algorithm just has a scale it fails at, and one that can go all the way should just become superintelligent (so goes the 2010!ML intuition).

Likes: 63 | Retweets: 2
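
For contrast, the empirical "scaling rule" that did emerge (the power-law form from Kaplan et al. 2020) has no analogue for k-means or t-SNE: loss falls smoothly and predictably in parameters N and data D instead of the algorithm simply working or failing,

L(N) \approx \left(\frac{N_c}{N}\right)^{\alpha_N}, \qquad L(D) \approx \left(\frac{D_c}{D}\right)^{\alpha_D}

where N_c, D_c, \alpha_N, and \alpha_D are fitted constants.
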
๐Ÿ”— John David Pressman 2024-10-31 04:02 UTC

@Invertible_Man Well yes obviously but there was *no precedent to expect this to be how it works*, none! That simply was not how ML worked. The only serious precedent for it I'm aware of was for biological brains, and between illegible brain function vs. known ML people chose to go with ML.

Likes: 17 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 04:09 UTC

@Invertible_Man Remember: Deep learning only really started noticeably working around 2011-2012. It wasn't clearly going to work until AlphaGo in 2015 by which point Bostrom had already published his 2014 book Superintelligence laying out the MIRI X-Risk thesis.

Likes: 20 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-31 04:10 UTC

One reason I bring up 2014 Bostrom so often is that it is a more or less complete intellectual exegesis of the MIRI AI X-Risk position that cannot be retconned, retracted, or spun as non-representative even as various parties desperately try. x.com/jd_pressman/stโ€ฆ

Likes: 27 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 04:21 UTC

@Invertible_Man Oh they probably had, but that's not the point. The public narrative/consensus position looked more like this than like deep learning is what I'm saying.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 04:34 UTC

@RomeoStevens76 In retrospect it was brainworms, yes. There are still brains being consumed by the worms to this very day.

Likes: 4 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 04:41 UTC

@QiaochuYuan Do it. I've had the same impulse even, I know it's possible.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:21 UTC

My best guess is it looks less like "find a way to generalize far out of distribution zero or one shot" and more like "find a way to set up a continuous learning agent to be self-repairing/CEV-like through corpus expansion at the edge of the distribution in an aligned direction". x.com/ESYudkowsky/stโ€ฆ

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:21 UTC

@teortaxesTex I think he's asking a reasonable question there actually, though.
x.com/jd_pressman/stโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:25 UTC

@slugfoxxx The solution looks less like finding some superalgorithm that infers human values from the existing human language data and more like a thing that infers what something just a little outside the existing data would look like repeatedly while minimizing the value drift.

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:28 UTC

@slugfoxxx I write the tweet like the OP rather than that because if I don't, people will just assume I'm doing that annoying "propose a solution from your sys1/thinking about it for a little bit instead of reading the literature/doing your research first" thing. So, costly signaling.

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:30 UTC

@slugfoxxx Using the Magic Phrases helps make it clearer that I actually understand what "the alignment problem" is supposed to be and why naively training on a bunch of human language data would not solve all of it so that they don't stop reading early.
greaterwrong.com/posts/GNhMPAWcโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:35 UTC

@Kenku_Allaryi @RomeoStevens76 Much of 'scientific progress' is actually costly signaling of education and having done your research in disguise.
x.com/jd_pressman/stโ€ฆ

Likes: 2 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:45 UTC

@eurydicelives You live in a universe with extremely regular structure and even "small" lies are distortions in the pattern that have a habit of coming back up at inconvenient times.
greaterwrong.com/posts/wyyfFfaRโ€ฆ

Likes: 0 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:46 UTC

@nosilverv I'm not sure how much of the process is conscious vs. just them interpolating between their old model and some epicycles they've added on for deep learning. They haven't yet done a thorough mental sweep to readjust their beliefs yet and won't until they get really into deep nets.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 06:48 UTC

@nosilverv But it's also just the normal "I never believed that!" thing people do when they make predictions/state beliefs that later turn out to be wrong and quietly update on them, forgetting they ever believed the wrong thing in the first place.
x.com/jd_pressman/stโ€ฆ

Likes: 3 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 07:08 UTC

@teortaxesTex In fairness to him, it's fairly rare for someone to say words that even touch on the question he's asking. On the other hand, he's asking the question in a very obfuscated way. https://t.co/xRpJ2imnke

Likes: 5 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 09:06 UTC

@tailcalled This is true but also not an objection to what I said.

Likes: 6 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 09:22 UTC

Marko is right that it's worse but genocidal degrowth ambitions evolved from Marxism in response to 'capitalist realism'. It's a simple question that's complex to answer: Do you prefer there be something or nothing? Revealed preferences are clear even if the answer is complex. x.com/mmjukic/statusโ€ฆ https://t.co/37R8508Owb

Likes: 11 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 09:22 UTC

Whether someone prefers there be something or nothing is one of the most important latent variables you can infer about them, predicting a vast swathe of downstream behaviors and value judgments. Perhaps even whether someone is really made in God's image.

x.com/jd_pressman/stโ€ฆ

Likes: 7 | Retweets: 1
๐Ÿ”— John David Pressman 2024-10-31 09:35 UTC

@jozdien The end goal is similar but it's different in the same sense that deep nets do not actually have the same training dynamics as a hypothetical super t-SNE.
x.com/jd_pressman/stโ€ฆ

Likes: 1 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 14:49 UTC

@FeepingCreature Moravec's timeline is on schedule, as is possibly Drexler's. The LessWrong sect just threw that stuff out/forgot the original neuromorphically inspired AI timelines.

Likes: 9 | Retweets: 0
๐Ÿ”— John David Pressman 2024-10-31 14:54 UTC

@FeepingCreature As I've written about before, this is an inexcusable failure given that it was all over the founding literature for the singularity concept.
x.com/jd_pressman/stโ€ฆ

Likes: 3 | Retweets: 0

Twitter Archive by John David Pressman is marked with CC0 1.0