Show newer

twitter xp 

asking it to do something weird and kinda complex, it does great! with one minor mistake its able to correct. it also fills in the ambiguous parts of my ask with reasonable assumptions. we're definitely getting out of junior engineer territory 😳

Show thread

twitter xp 

cool, it was able to handle this slightly more complex bash usecase too!

it does perpetuate a mistake from a previous iteration where it assumes the old database also uses the credentials for the new one

on the plus side its able to explain what it did and why very well!

Show thread

twitter xp 

had to cobble this one together across several carefully designed prompts; seems like human oversight and guidance is still very necessary for more complex tasks

Show thread

twitter xp 

oddly enough, in one of the later iterations of trying to get around the truncation, it got the "right answer", but "forgot" that I had asked it to write a script, and reverted to a plain language explanation. indicative of contextual limitations?

Show thread

twitter xp 

it didn't use Terraform for RDS however, but I also didn't ask it to. interesting that it didn't assume I wanted that; this is similar to how a junior may behave.

this one took some iteration as it kept getting truncated, but eventually it gave me something good

Show thread

twitter xp 

alright so now lets stop managing our own database, that gets old quick.

p good so far, but the response kept getting cut off repeatedly, each time i reran. i asked it why and it was able to acknowledge this, theorize as to why, and even continue from where it had left off!

Show thread

twitter xp 

lets see if it can fix that...

it didn't, claimed it did, and even explained what additional steps need to happen, violating my instructions. interesting, but oh well

Show thread

twitter xp 

alright now lets take it up a notch...

this is more or less right, besides that it misunderstood how to use AWS credentials; very very close tho. already past what most of my juniors at work could have done with this level of specificity

Show thread

twitter xp 

it can iterate in response to me invalidating its assumptions about my context! pretty cool

Show thread

twitter xp 

that script will probably work tbh, but we have nowhere to run it. lets see if we can fix that...

this is pretty good, albeit a bit too simple

Show thread

twitter xp 

its not quite right as the actual reason is due to a weird rendering issue, but its suggestion for fixing it is fascinating! is it aware that its output is on a webpage, and those can be interacted with?

v proactive to rewrite the script w/o being asked, too
script looks p good!

Show thread

twitter xp 

ok now lets reformat this to make it easier to test out...

woah what happened there?

Show thread

twitter xp 

wow ok thats p good, but it missed a spot

maintains context and is able to adjust and iterate!

Show thread

twitter xp 

interestingly enough, if I upload the same document to pastebin with a 10 min expiry, it seems to use a version of that page from its training data! or is it cached on the machine somehow?

Show thread

twitter xp 

tried again with a document i'm working on that it couldn't have been trained on as I just uploaded it; seems like it has to insist its unable to access the internet, but can still do so!

Show thread

twitter xp 

did it actually fetch the page, or did it just remember it?

When I ask "why do you believe that," I'm not asking <what is an article you can find with a quick Google that expresses an argument you think would be convincing?>

That article is almost certainly not why you believe that. The argument it contains is probably not either.

twitter xp 

the labels by which you apprehend others reveal the contours of your frame

Show older
Mastodon

a Schelling point for those who seek one