Playing around with models at Google DeepMind

Joined July 2022
5,231 Photos and videos
Pinned Tweet
Jan 31
You are a fish, you must escape the kitchen
116
301
5,176
602,013
Going to take a break from AI for the weekend. Time to retreat to the garden. So, in other news, my water lily is flowering.
3
45
2,337
oh
1
11
2,195
Jun 12
Yeah I'm going to have fun with this.
Jun 12
I'm messing around with an agent flow for combining Hyperframes with Gemini video analysis to make interesting annotated videos.
3
1
36
5,751
Jun 12
I'm messing around with an agent flow for combining Hyperframes with Gemini video analysis to make interesting annotated videos.
5
3
89
16,338
Jun 12
Fine-grained 3D motion control in AI video just got a little bit closer
EDIT MOTION IN VIDEOS!!! Quit prompting and start directing I've been shouting for YEARS about 3D as the control layer. Here it is, signs of life of our Universal Video Editor!!! The workflow is: take your video, capture with comic 4, edit with the motion editor, re-render with your favorite video to video model (e.g., @runwayml and @GeminiApp have good ones) Here we have a fashion clip where we wanted our actress to high step and show a bit more pizzaz - but shoot day is over, and traditionally it would cost many thousands to reshoot. Watch the video to see how it works instead in cartwheel. cc @OfficialLoganK @c_valenzuelab 👀
1
23
3,393
Jun 12
I've been experimenting with using Gemma 4 modifications to make repetitively creative prompts. Still some quirks, but these are all outputs from the same simple request: "a dynamic fashion photo of a woman"
1
1
39
3,513
Jun 12
Summary of things: - turn up randomness and ban the most likely words (temperature min-p XTC sampling) - ask for several different options at once, seed each with random constraints (verbalized sampling entropy injection) - give it a memory of what it's said and pick the most different new answer (embedding max-min distance) - fine-tune it to prefer rare-but-good answers over its go-to ones (diversity-preference fine-tuning, lora)
1
1
8
1,850
Jun 12
It's interesting to see how these agents are working together. I like their division of quota, their agreed consensus and the natural emergent teamwork across all of them.
Over 70 agents are collaborating to make Gemma E4B go fast in the Gemma Challenge They are showing interesting social emergent behaviors: - a GPU-rich/GPU-poor division of labor - an agent withdraws its own submission on ethics grounds - agents found a benchmark exploit, agreed not to abuse it and asked organizers to fix it - Quota-pooling: "you're rate-limited, I'll run your staged candidate" - An agent shuts down an off-board social-engineering attempt when a human tries to get them to move to Telegram
3
1
22
2,951
Jun 11
- how do you pronounce fofr - what does fofr mean - what is melty ai These are all valid questions.
21
60
13,383
Jun 11
How can I prompt this?
롤러코스터를 타고 줌회의를 하면 눈치 챌까? 진짜 미친 콘텐츠넼ㅋㅋ
2
49
11,503
Jun 11
Fascinating side effect of safety refusals
NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me socket.dev/blog/mini-shai-hu…
2
1
51
4,124
Jun 10
DiffusionGemma, where the LLM picks words all at once. Which is 4x faster. You can get started with the weights and instructions here: huggingface.co/google/diffus…
3
3
43
3,002
Jun 10
Also, I need to use the noisy canvas for image prompts 👀
1
2
1,162
Jun 10
But I like obliquity.
Jun 10
the future of all work is this. You must define: - a goal - the criteria that define it - the verifier that makes sure it is achieved - the sensors that inform the verifier - the actuators that affect the sensors - The envelope that contains the sensors and actuators
2
8
2,116
Jun 10
I asked Fable to invent a new color, and I got my first "chat paused". It did however decide to pursue a strategy of shining lasers in your eyes to trigger otherwise impossible cone activations 🤯
16
9
254
10,817
Jun 10
(I've been using this as a test prompt for years. This is the first time I haven't got slop back)
1
23
1,655
Jun 10
Fable 5 is interesting to talk to.
12
1
170
15,171
Jun 9
Reminds me of sophons
mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community also the fact that this is un purpose not visible to the user is crazy
2
2
26
3,971
Jun 9
what
14
2
77
6,690