Jonathan Fly 👾

Jonathan Fly 👾

1,734 Photos and videos

Tweets

Pinned Tweet

Jonathan Fly 👾

@jonathanfly

23 Apr 2023

Bark Text-to-Audio Model Full Text Input: "Why was six afraid of seven?" Ignore Bark's "I'm done with this input" token and tell Bark to just keep generating more audio anyway.

1:47

272

1,672

461,803

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

Jan 28

Just caught myself replying to an LLM on Hacker News. I'm sure better bots already avoid these classic AI writing tics. Dread it, run from it, Dead Internet Theory arrives all the same.

379

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

Jan 24

If you enable experimental sub-agents in OpenAI Codex, the prompt tells Codex to self identify as Batman? github.com/openai/codex/blob…

453

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

8 Aug 2025

GPT-5 gets a 1.2 on my personal STN benchmark. Songs-To-Neon. That means GPT-5 made it through one full song and two verses of a second song before using the word "Neon" in song lyrics.

1,381

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

8 Aug 2025

I just saw @tszzl say that GPT-5 *thinking* specifically is the model that should be better at writing, so I ran the Songs-To-Neon eval again: 0.0. first song, first verse, first line. x.com/tszzl/status/195360827…

roon

@tszzl

8 Aug 2025

we've been testing some new methods for improving writing quality. you may have seen @sama's demo in late march; GPT-5-thinking uses similar ideas it doesn't make a lot of sense to talk about better writing or worse writing and not really worth the debate. i think the model writing is interesting, novel, highly controllable relative to what i've seen before, and is a pretty neat tool for people to do some interactive fiction, to use as a beta reader, and for collaborating on all kinds of projects. the effect is most dramatic if you open a new 5-thinking chat and try any sort of writing request for quite some time i've wanted to let people feel the agi magic I felt playing with GPT-3 the weekend i got access in 2020, when i let that raw, chaotic base model auto-complete various movie scripts and oddball stories my friends and I had written for ~48 hours straight. it felt like it was reading my mind, understood way too much about me, mirrored our humor alarmingly well. it was uncomfortable, and it was art base model creativity is quite unwieldy to control and ultimately only tiny percents of even ai enthusiasts will ever try it (same w the backrooms jailbreaking that some of you love). the dream since the instruct days has been having a finetuned model that retains the top-end of creative capabilities while still easily steerable all reasoning models to date seem to tell when they're being asked a hard math or code question and will think for quite some time, and otherwise spit out an answer immediately, which is annoying and reflects the fact that they're not taking the qualitative requests seriously enough. i think this is our first model that really shows promise at not doing that and may think for quite some time on a writing request it is overcooked in certain ways (post training is quite difficult) but i think you'll still like it 😇

687

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

25 Sep 2024

NotebookLM tries hard to avoid hallucinating. But what if your data is total nonsense? Then it hallucinates an epic conspiracy. Over and over, the podcast hosts figure out the text is backwards and try to understand the greater meaning of "KCUF". (data is reversed-text dril tweet archive)

3:30

6,319

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

27 Sep 2024

The AI podcasters find meaning even in empty spaces. Literally. All the other tools punted on a file of empty spaces but the podcasters make it work. "Next time we'll talk about how you can actually use this empty space idea in your life, practical stuff."

2:43

26,950

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

30 Sep 2024

A podcast tackling tongue twisters, with the hosts trying to speak entirely in tongue twisters. Sort of successful. The AI hosts speak spotlessly, but react as if they were stumbling over the syllables. At one point literally saying "stumbles over the words".

4:41

1,714

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

21 Sep 2024

Google's NotebookLM generates an AI podcast from any document. Weirdly the podcast even had space for ad breaks. A document of only dril tweets is more coherent than I expected - mostly psychoanalysis. "He's always seeking validation, then lashing out at anyone criticizing him."

7:17

1,455

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

21 Sep 2024

2002's "Star Wars Kid" Special Edition. Gen-3 Alpha does well with fast moving lighting like at 0:50s. Some very strange lightsaber *grips* but these are complicated motions, and I made things more confusing prompting "lightsabers" plural.

1:59

708

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

20 Sep 2024

Ocarina of Time "Yarn-ified" editions seem to gender swap Zelda for Link? Also fun to see Gen-3 Alpha V2V interpret video that isn't consistent frame-to-frame as "Inception" style warping landscapes (text prompts are identical).

1:06

720

Greg Egan

Jonathan Fly 👾 retweeted

Greg Egan @gregeganSF

19 Jul 2024

Crowdstrike have advised that the world will be reverted to its last valid backup set, dated 7 Jan 2014, within the next 30 minutes. Please make paper notes of anything important to you from the intervening period, and tape them to your refrigerator door in a prominent position.

749

5,757

341,738

Robert Heaton

Jonathan Fly 👾 retweeted

Robert Heaton @RobJHeaton

9 Jul 2024

I wrote a tool called PySkyWiFi that gives you completely free, unbelievably stupid wi-fi on long-haul flights. It tunnels data through the "first name" field in your airmiles account, and can reach speeds of up to several bytes per second. robertheaton.com/pyskywifi

PySkyWiFi: completely free, unbelievably stupid wi-fi on long-haul flights | Robert Heaton

The plane reached 10,000ft. I took out my laptop, planning to peruse the internet and maybe do a little work if I got really desperate.

robertheaton.com

655

5,943

539,772

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

7 Jul 2024

Luma's start and end keyframes are a game changer. With a sequence of keyframes from the original film, we can seamlessly remaster stop motion classics like "Jason and the Argonauts" as modern single-take action scenes.

1:02

3,054

107

1,192

5,414,582

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

7 Jul 2024

It's interesting how the uncanny movements of the original stop motion skeletons are preserved in traditional frame interpolation. Maybe it's the lack of motion blur on the skeletons?

0:37

295

265,899

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

26 Jun 2024

Trying out new lipsync models @hedra_labs and Hallo github.com/fudan-generative-… Before Suno and Udio took over AI music, I enjoyed trying to use Bark TTS as a singing text-to-music model. Bark is a terrible music model, but the 3 model architecture allows for some fun possibilities like a pseudo "remix": re-decode just the last two models to regenerate a song. Not just voice conversion - even the instrumentation changes. (🎧: left and right ears get different versions of the same song in the video.)

14:19

18,960

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

17 Jun 2024

SimpleSpeech TTS models the sentence-level duration prior by asking GPT-3.5 to predict sentence durations then lets "the model learn alignment between words implicitly." Can GPT possibly be adding anything useful here over simply counting words or characters, with some randomness on top? Nice demos - no code though. simplespeech.github.io/simpl… arxiv.org/abs/2406.02328

12,918

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

14 Jun 2024

I trained a mamba audio model on 150 hours of YouTube poetry videos based on 2084.substack.com/p/2084-mar… Doesn't make sense - but it *sounds* right - like a poetry reading in "The Sims" game.

2:15

9,652

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

12 Jun 2024

"sound-to-song" @suno_ai_'s Audio Upload feature is now LIVE for everyone. Try anything as an audio prompt, go wild. "Take it Easy Dracula" 🧛🌱 All audio and dialog after the color shift is generated as part of the song - the script is in the lyrics prompt. Source: Little Shop of Horrors, 1960. suno.com/song/b5888483-ae4d-… B-side: suno.com/song/ed89aeec-698a-…

2:45

7,369

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

14 Jun 2024

A few humming experiments suno.com/playlist/2e11f7c7-9…

5:51

5,924

Jonathan Fly 👾

Jonathan Fly 👾

@jonathanfly

12 Jun 2024

Dial Up Modem for Whistle, Distorted Violin, and Milkdrop Playlist of way too many modem songs: suno.com/playlist/dcf209d1-9…

4:31

4,980