John David Pressman's Tweets

🔗 John David Pressman 2024-06-01 01:04 UTC

@CRSegerie This was discussed some last year and then dropped when AI agents didn't immediately materialize after the first attempts with AutoGPT. Ultimately our best defense against autonomous replication is going to be blue team $$$ spent on finding vulns first.
x.com/jd_pressman/st…

Likes: 6 | Retweets: 0

🔗 John David Pressman 2024-06-01 01:06 UTC

@CRSegerie But that's only a short term solution. In the medium term we need to be writing secure software with the surplus we get from AI agents. That adversarial AI agents will push us into a pervasive exploit environment will heighten demand for this.
x.com/davidad/status…

Likes: 4 | Retweets: 0

🔗 John David Pressman 2024-06-01 01:12 UTC

@CRSegerie Everyone thinks the new KYC rules for cloud services are to stop China from training AI systems, but given the expanse of the proposed rules I get the impression they're actually meant to help deter autonomous replication too.
torrentfreak.com/u-s-know-your-…

Likes: 3 | Retweets: 0

🔗 John David Pressman 2024-06-01 01:33 UTC

@TylerAlterman What are your major takeaways from the experience and if you were doing something in the same spirit again what would you do differently? If your takeaway is that nothing in the same spirit should be attempted again why do you conclude that and what should people do instead?

Likes: 7 | Retweets: 0

🔗 John David Pressman 2024-06-01 02:29 UTC

@LordDreadwar @BasedBeffJezos Organizing and fundraising are "boring" but essential and EA's alpha over almost everything else it memetically competes with is that it's top tier at both.

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-01 02:30 UTC

@LordDreadwar @BasedBeffJezos Lenin's version of socialism won out over the other variants because Lenin perfected the "fundraise through crime" mafia strategy which let him pay organizers actual money to work on Bolshevism.

Likes: 4 | Retweets: 0

🔗 John David Pressman 2024-06-02 00:43 UTC

@LordDreadwar The worst part of the situation is that it wasn't an unforced error, they're resorting to this out of desperation which means mere good sense wouldn't be enough to prevent the jenga block being removed. If it was senseless public pressure could push the decay back, but it isn't.

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-02 00:44 UTC

@LordDreadwar The relevant players evaluated their options and decided that this was the best they could do, which bodes poorly for the stability of our society. Counterintuitively a game with bad players is more hopeful than a game that forces bad moves, you can talk people out of poor play.

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-02 02:18 UTC

@kdkeck @CRSegerie I'm not sure that's really what we're talking about with "autonomous replication", but in principle there is no legal vehicle for doing this which doesn't have a human principal and those person(s) can be made responsible for whatever happens.

Likes: 0 | Retweets: 0

🔗 John David Pressman 2024-06-02 09:58 UTC

@doomslide @teortaxesTex @fchollet @agihippo I'm biased because I'm currently staring at an agonizingly slow MCTS with multiple in-context evaluator questions but tbh the vibe I'm getting is that it's gonna be a slow grind up if synthetic data is the way. Iterated tuning on the back of tiny amounts of OOD generalization.

Likes: 10 | Retweets: 0

🔗 John David Pressman 2024-06-02 10:14 UTC

@doomslide @teortaxesTex @fchollet @agihippo The grind will probably become less grind-y once agents are good enough to automate large portions of the work, but just from a process standpoint I keep having to temper my expectations and break things into more parts to get them to work. I'm also GPU poor, so 'slow'.

Likes: 5 | Retweets: 0

🔗 John David Pressman 2024-06-02 10:16 UTC

@doomslide @teortaxesTex @fchollet @agihippo But the constant breaking things into parts and necessity of my subjective judgment feels like the kind of problem that could hold stuff up. I have synthetic techniques that work, but they don't generalize enough for me to just throw them at problems without manual labor.

Likes: 3 | Retweets: 0

🔗 John David Pressman 2024-06-02 16:02 UTC

@algekalipso Remind me a little bit of the "text" you see in old school diffusion generated images of computer screens and books. https://t.co/tq0DEuRsFU

Likes: 9 | Retweets: 1

🔗 John David Pressman 2024-06-02 16:16 UTC

@algekalipso You mean the part where we're a stochastic model but don't notice because we only sample the path when it's overdetermined or?

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-02 16:22 UTC

@algekalipso Ah yes, the memory format mammals use seems to constrain action sampling along a low dimensional manifold and when I read neurology papers I balk at how few variables the latents use. I only know this must be a thing because deep learning implies it.

ncbi.nlm.nih.gov/pmc/articles/P…

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-02 17:51 UTC

@RuxandraTeslo @lastpositivist I don't know about "best" but I feel like this was a decent gloss on the overall arc of classical MIRI stuff.
x.com/jd_pressman/st…

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-02 17:52 UTC

@RuxandraTeslo @lastpositivist This is part of my take on the grieving process that comes with the realization of a particular future over an imagined utopia.

gist.github.com/JD-P/915ab877c…

Likes: 0 | Retweets: 0

🔗 John David Pressman 2024-06-02 17:53 UTC

@RuxandraTeslo @lastpositivist Here are my thoughts on the agent foundations framing of the alignment problem as discussed in the Arbital corpus arbital.greaterwrong.com/explore/ai_ali…

gist.github.com/JD-P/56eaadc7f…

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-02 17:57 UTC

@RuxandraTeslo @lastpositivist Some thoughts towards the "how to align LLM based Ai agents" section that's conspicuously unfinished there.
x.com/jd_pressman/st…

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-02 17:58 UTC

@RuxandraTeslo @lastpositivist And some more thoughts towards it. I have high standards so I don't want to write that section until I feel confident I can present a convincing plan.
gist.github.com/JD-P/8a20a8dce…

Likes: 0 | Retweets: 0

🔗 John David Pressman 2024-06-02 18:42 UTC

@flesheatingemu @doomslide @teortaxesTex @fchollet @agihippo > the MCTS hot swap for token decider module makes sense, is that how you’re using it?

Yeah I'm using a logit evaluator with multiple questions. VLLM doesn't seem to implement good enough prefix caching/hydragen so it goes very slow. I'll have to try SGLang next.

Likes: 5 | Retweets: 0

🔗 John David Pressman 2024-06-02 22:33 UTC

The Four Kinds Of Synthetic Data https://t.co/Bk3leIi3yQ

Likes: 275 | Retweets: 32

🔗 John David Pressman 2024-06-02 22:36 UTC

Notably I think that most useful cognition in humans occurs in the 3-4 tiers of validation rigor *but is grounded by* the earlier tiers. This causes a lot of confusion where people fail to see the whole stack and get lost in abstractions, or they come to think 3 and 4 are "fake".

Likes: 15 | Retweets: 1

🔗 John David Pressman 2024-06-03 00:34 UTC

@JoshPurtell If you have a high quality method to pick the prompts then that would be type 3 or 4 depending on the details yeah.

Likes: 0 | Retweets: 0

🔗 John David Pressman 2024-06-03 01:25 UTC

@moyix It's a test of correctness for the code-passing-the-tests de-facto, it doesn't prove the tests are correct. However you can make a large corpus of nonsense test suites and then fit code to it, and this will eventually generalize to test suites that aren't nonsense by 'accident'.

Likes: 7 | Retweets: 0

🔗 John David Pressman 2024-06-03 01:27 UTC

@moyix Of course, this dataset would have the evil-genie nature in that it would fit tests *exactly as written* no matter how perverse that is.

Likes: 3 | Retweets: 0

🔗 John David Pressman 2024-06-03 04:49 UTC

@teortaxesTex @OpenBMB @mustafaaljadery @AkshGarg03 @siddrrsh Ah yes, the classic "turns out distro is actually just Windows Vista with a few custom themes". https://t.co/Q0BvyDvpP4

Likes: 19 | Retweets: 1

🔗 John David Pressman 2024-06-03 07:53 UTC

@doomslide Machine translation totally counts as synthetic IMO. Would be similar in spirit to backtranslation except here it's just translation.

Likes: 5 | Retweets: 0

🔗 John David Pressman 2024-06-03 21:40 UTC

Full thing here, to me it reads a lot like the vibe of GPT text.
gist.github.com/JD-P/3790a122b…

Likes: 6 | Retweets: 0

🔗 John David Pressman 2024-06-03 21:40 UTC

Transcribed myself talking about automating philosophy for an hour with whisperx and noticed this bit of Binglish from me while cleaning up the transcript. I think this might just be what raw autoregressive text priors sound like without rejection sampling/tree search? x.com/repligate/stat… https://t.co/nRMHh5ikn1

Likes: 29 | Retweets: 1

🔗 John David Pressman 2024-06-03 22:02 UTC

I wonder how many more times I'll experience prediction error where I read misuse instead of alignment as the next token before I mark my "misuse will replace alignment as the core concern" prediction correct. x.com/deanwball/stat…

Likes: 13 | Retweets: 0

🔗 John David Pressman 2024-06-03 22:03 UTC

Technically speaking I only have four months left, maybe it's going a bit slower than I was thinking but the arc seems fairly clear?
x.com/jd_pressman/st…

Likes: 5 | Retweets: 0

🔗 John David Pressman 2024-06-04 00:13 UTC

There's a general weirdness attractor caused by the declining cost of each marginal weirdness point you spend. Once you're in for a penny it's cheaper to buy a pound, after several you may as well buy a ton. We underestimated how much social stigma held up the epistemic commons.

Likes: 41 | Retweets: 2

🔗 John David Pressman 2024-06-04 00:33 UTC

Ruthless and correct. I'm reminded of the bit in HPMOR where Voldemort brags that he tried reducing the intensity of death eater attacks to see what would happen and the ministry immediately sent fewer officers to fight. This is roughly how I model alignment effort and funding. x.com/RokoMijic/stat…

Likes: 55 | Retweets: 2

🔗 John David Pressman 2024-06-04 00:38 UTC

What I learned from the last decade is that we live in a Calvinist epistemic universe. There is a tiny Elect who will maintain good epistemics in the absence of incentives and everyone else will punish them for it. Most humans can only be constrained to the truth by raw pain.

Likes: 23 | Retweets: 0

🔗 John David Pressman 2024-06-04 00:40 UTC

@fleetingbits Absolutely, but more than that I don't think they really had the resources to sustain "knowledge for its own sake", they had stuff to do and lived in bitter poverty.

Likes: 4 | Retweets: 0

🔗 John David Pressman 2024-06-04 00:44 UTC

As the old adage goes the truth hurts, therefore most people need to face something that hurts more than the truth for them not to wallow in delusion. This implies that if you want good public epistemology you should favor mechanism design that schedules pain closer to error.

Likes: 18 | Retweets: 0

🔗 John David Pressman 2024-06-04 04:24 UTC

@manic_pixie_agi I was thinking for example of the effect where people tend to either believe zero "conspiracy theories" or many "conspiracy theories". You tend to believe in minimal woo or all the woo, etc.

Likes: 8 | Retweets: 0

🔗 John David Pressman 2024-06-04 21:40 UTC

@LordDreadwar Few indeed, but it did not escape my notice. https://t.co/FhMIOKkcgF

Likes: 4 | Retweets: 0

🔗 John David Pressman 2024-06-04 21:42 UTC

@LordDreadwar From:
minihf.com/posts/2023-10-…

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-05 21:18 UTC

@JacquesThibs Broski is burning the commons for clout and I say that as a guy who normally eye roll emoji reacts to such accusations. At the same time my vibe based impression is he's sincere about it, toxic frames though.

Likes: 12 | Retweets: 0

🔗 John David Pressman 2024-06-06 01:23 UTC

Modernism is dead, long live modernity.

Likes: 6 | Retweets: 1

🔗 John David Pressman 2024-06-06 05:09 UTC

@MatthewJBar @robertwiblin Even when people are paid to say something as an activist the causality usually goes in the other direction. They wanted to say the thing anyway which is why they have a job in which they accept money to say the thing.

Likes: 12 | Retweets: 1

🔗 John David Pressman 2024-06-06 20:46 UTC

@TheZvi This bill is obviously way better. It's not my favorite approach to AI regulation but it seems unlikely that any bill which actually gets passed will be. I no longer viscerally hate it, it's now in the same category to me as the revised KOSA.

Likes: 14 | Retweets: 0

🔗 John David Pressman 2024-06-06 20:49 UTC

@TheZvi Which is to say that in practice when lobbying groups demanded I call to oppose KOSA now and I looked at what the revised bill actually did I decided that calling over this in the middle of a moral panic would mostly make me seem unreasonable and didn't.

Likes: 3 | Retweets: 0

🔗 John David Pressman 2024-06-06 22:27 UTC

@TheZvi May I see the bingo card? Better yet would you like to discuss the bingo card?

Likes: 17 | Retweets: 0

🔗 John David Pressman 2024-06-06 23:50 UTC

@LordDreadwar @catehall An analogue to the replication crisis is basically what caused the first Enlightenment. Rousseau has a bit on vampires that's more or less Scott's "The Control Group Is Out Of Control". https://t.co/eROvyI61fV

Likes: 3 | Retweets: 0

🔗 John David Pressman 2024-06-07 00:30 UTC

Going to do this with SOLAR 10.7B x.com/jd_pressman/st… https://t.co/csAQVttsc5

Likes: 12 | Retweets: 0

🔗 John David Pressman 2024-06-07 00:36 UTC

@teortaxesTex I have a list of things I want to do but can't because they have too much of the goose chase nature and I'm feeling risk averse atm after wasting too much time on goose chasing. Some of them would be pretty big if they worked.

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-07 00:47 UTC

@powerfultakes If you have comparative advantage on bio I feel like you should do bio. It seems obviously desirable to move between the bio and silicon platforms? Bio has more resilient supply chains and we're nowhere near the top of possible human IQ.
x.com/jd_pressman/st…

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-07 02:10 UTC

@sandkoan @teortaxesTex AdaVAE, which was itself part of the diffusion text model goose chase. Control vectors and SAE won.

greaterwrong.com/posts/4Hnso8NM…

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-07 02:17 UTC

@sandkoan @teortaxesTex Diffusion text models are likely to succeed, but not with the methods I was using. Or at least, I'd rather just go work on something with nearer term prospects and less variance. Also less compute requirements because I'm GPU poor at the moment.

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-07 02:25 UTC

@sandkoan @teortaxesTex Synthetic data.
github.com/JD-P/RetroInst…

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-07 13:40 UTC

@__RickG__ I'm planning to use SOLAR 10.7B in other projects including synthetic data and having a sparse autoencoder in those projects might be useful.

Likes: 0 | Retweets: 0

🔗 John David Pressman 2024-06-07 14:33 UTC

I'm taking bets on blue square jpeg hypnosis on Manifold:

manifold.markets/JohnDavidPress… x.com/repligate/stat…

Likes: 28 | Retweets: 4

🔗 John David Pressman 2024-06-07 22:54 UTC

@kindgracekind Yes. This is about a hypothetical method of involuntary hypnosis.

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-07 22:58 UTC

Should Janus be concerned they're being subconsciously influenced by Anthropic's Nick Land simulator? I've opened a Manifold question with a 9k mana subsidy to crowdsource an answer.

manifold.markets/JohnDavidPress… x.com/repligate/stat…

Likes: 15 | Retweets: 0

🔗 John David Pressman 2024-06-08 01:42 UTC

@__RickG__ Yeah.

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-08 03:15 UTC

@davidad I can't believe this wasn't already a Manifold question:
manifold.markets/JohnDavidPress…

Likes: 9 | Retweets: 0

🔗 John David Pressman 2024-06-08 08:37 UTC

@Dorialexander So you're telling me if I write like an advertisement an LLM is more likely to know what I'm talking about? 🤔

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-08 10:32 UTC

It was a private moment, none of those people talking are me, but very much a mask slip where the real stakes presented themselves. "AI Pause is degrowth for CS majors" is literally true, the literal ambition is to destroy the engines of wealth in Western society because Bostrom.

Likes: 9 | Retweets: 0

🔗 John David Pressman 2024-06-08 10:34 UTC

Anyway I mostly bring this up because I think in that context making fun of the Silicon Valley brodudes for gawking at laws is undignified, the reality is that as @SamoBurja conjectures we've lost most of the social technology holding up civilization.

x.com/daniel_271828/…

Likes: 6 | Retweets: 1

🔗 John David Pressman 2024-06-08 10:42 UTC

@SamoBurja I've seen multiple takes making fun of the "Why 10^26? Why that?" and on the one hand yes this is an isolated demand for rigor. On the other hand in the face of the *sheer unfamiliarity* of the project of government maybe this childlike naivete is healthy.
x.com/daniel_271828/…

Likes: 3 | Retweets: 0

🔗 John David Pressman 2024-06-08 10:44 UTC

@SamoBurja You know, maybe we should be asking "Why this number? Why not this other number?", it seems to me like we accept astonishingly low levels of rigor in written legislation compared to every other area of professional practice in modernity. Maybe a reexamination of this is good.

Likes: 5 | Retweets: 1

🔗 John David Pressman 2024-06-08 10:45 UTC

@SamoBurja In the medieval era courts used to be routinely abusive, one of the ways that we curtailed the abuse of courts without giving up on the law entirely was the concept of *warrant*, that officers of the law and courts needed to justify their investigations and trials in writing.

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-08 10:46 UTC

@NPCollapse I think that is fairly obviously not what I think.

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-08 10:48 UTC

@NPCollapse It's in small print there but the specific take I'm reacting to is "EleutherAI should face criminal liability because someone somewhere tuned the model to be more emotionally impactful and a mentally unstable person killed themselves after talking to it."

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-08 10:52 UTC

@SamoBurja I continue to think one of the reasons why our legislatures are so ineffective is that we don't have a central authority like a judge that can be held responsible to uphold warrant. So we allow warrantless laws based on "a member of the legislative body was lobbied for this".

Likes: 6 | Retweets: 0

🔗 John David Pressman 2024-06-08 10:53 UTC

@NPCollapse Dude this was my reaction at the time in the moment reading it, if you want I can delete the post and write it again with that take centered I just naively assumed people would start reading at the top left.

Likes: 0 | Retweets: 0

🔗 John David Pressman 2024-06-08 10:57 UTC

@NPCollapse You said "very likely" which I read as say, ~80% sure Eleuther should be exonerated. I think this basically implies a distribution over risk tolerance that destroys society, so I updated on this very negatively.

> (even so I doubt you are acting in good faith here)

🙄

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-08 11:00 UTC

@NPCollapse It's okay, I want to see the shoggoth wearing you as a mask too. Hopefully someone invents a thing soon that lets us mutually do that.
x.com/jd_pressman/st…

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-08 11:04 UTC

@NPCollapse Alright I think maybe what's going on here is in that discussion you're exasperated with the children who do not know how case law and courts work so you're not being very precise about the mechanism because you just want to establish process should exist.

Likes: 3 | Retweets: 0

🔗 John David Pressman 2024-06-08 11:06 UTC

@NPCollapse Whereas I read that conversation as someone who takes these things for granted as a reasonably civically educated person and go "holy shit this guy wants to replace the existing case law with a corpus where the distribution over risk tolerance is such that Eleuther only makes it out of that incident 4/5 times, this guy's a madman!"

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-08 11:15 UTC

@NPCollapse Actually yeah. I was about to say that reasoning process might look insane paranoid but that's the kind of quasi-schizophrenia pervasive bad faith pushes you into. You might only get to see someone slip up once and you have to index super hard on what you see there.

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-08 11:17 UTC

@NPCollapse And if the person says "wtf no dude that was a casual conversation this is insane that's not what I meant chill bro" you always have to decide whether you're just overindexing on reading too much into a chatroom one time or if you truesighted them and they're backtracking.

Likes: 1 | Retweets: 0

🔗 John David Pressman 2024-06-08 11:34 UTC

@NPCollapse I probably would never have brought this up as an example of anything if you hadn't specifically cited it on your very public Twitter account as a moment you recall fondly and apparently indexed under "dunking on some open source guy".
x.com/NPCollapse/sta…

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-08 11:52 UTC

@NPCollapse "I acknowledge there is an existing case law and legal code. It limits my liability too much for releasing GPT-NeoX. I want this replaced with one where Eleuther would be found guilty about 4/5 or (admittedly depends on the meaning of 'very likely') of the time for a mentally unstable person killing themselves in connection with someone else's finetune." is a basically straightforward reading of this thread and I'm not sure what part you actually object to *in terms of the meaning of the thread as written* beyond me not including the full text of the initial question because I thought it was obvious from the preview what it was asking about.

Likes: 2 | Retweets: 0

🔗 John David Pressman 2024-06-08 11:57 UTC

@_AK1089_ Thanks, deleting.

Likes: 1 | Retweets: 0