Cory Gabrielsen

@corygabrielsen

10h

The institutions are so deftly corrupt and incompetent they banned 10s videogen and bugfix AI.

118

Lucas Beyer (bl16)

@giffmana

18h

Replying to @soniajoseph_

I agree with you but here (at least for me) it's not about "which loss has it better" but it's an existence proof that "pure videogen pixels models got this too", directly disproving claims of the contrary. For this, size doesn't matter.

428

Ray Sorkin

@ray_sorkin

Jun 13

Replying to @theRayW @DrEliDavid

Is there a way to use Higgsfield MCP connector with other videogen models?

Felix H retweeted

Quentin Garrido @garridoq_

Jun 12

A crucial aspect is that the VAEs used to train the videogen models perform no better than chance, but the diffusion models then figure it out themselves. This is in contrast with VideoMAE and V-JEPA, which learn some representations of physics through their encoders!

Lucas Beyer (bl16)

@giffmana

Jun 10

You may have recently heard claims that video generation models are "dumb" about physics, and only "world models" (V-JEPA, specifically) have a valid internal model of physics. This turns out to be false. In a recent paper, researchers show that a LINEAR probe of diffusion videogen models predicts various "physics" very well, significantly better than V-JEPA or VideoMAE (and plain VAE just sucks). This is noteworthy, because a *linear* probe being this accurate shows that the model has a pretty explicit internal representation of the physics!

3,331

Blessing Agyei Kyem retweeted

Lucas Beyer (bl16)

@giffmana

Jun 10

To be clear, this is not a V-JEPA or VideoMAE diss, just resurrecting the fact that "pure videogen" models may indeed learn an explicit model of the world/physics as a byproduct. Also cc @mapo1 we chatted about this and you also intuitively pushed back against such a claim.

5,553

Leon Lin

@LexnLin

Jun 11

Replying to @OfficialLoganK

can't video edit in my region unfortunately. will do for simple video gen tho! a few weeks ago it wasn't really better than seedance 2 in videogen

788

Singh @SamratthSinghJi

Jun 11

Replying to @giffmana @mapo1

If videogen models implicitly learn object dynamics, could multi-agent interaction dynamics (coordination, communication, pursuit/evasion) also emerge without being explicitly modeled?

ttafs @whentheferg

Jun 11

Replying to @TBC_on_X @Clive_99

Because that's what you prompted the videogen model for???

Krzysztof Gonia

@kgonia7

Jun 10

Replying to @giffmana @PINTO03091

Isn't that what he expected from VideoGen models? We know that ImageGen models learn, among other things, depth and segmentation. They have to have some kind of world representation.

612

(kenr) @kenr

Jun 10

@sama well look at that: You abort poor Sora, harr Codex pivot (fafo cannibals eh, @Disney) to *catch up* to @DarioAmodei (his IPO gonna tag yrs taintstain) now (unexpectedly) meine CEO just approved scaling up ai videogen (since 2023 here, scroll. back) which justifies. flops..

Lucas Beyer (bl16)

@giffmana

Jun 10

At the beginning of the year, there was another paper that did this check only for V-MAE and V-JEPA, and showed both have some understanding of physics. This new paper essentially extends the study to diffusion-based "pure videogen" models and shows they understand physics very well.

9,064

Lucas Beyer (bl16)

@giffmana

Jun 10

107

1,067

99,875

ueaj

@_ueaj

Jun 10

Replying to @willccbb @tautologer

I'm not really being pedantic no, yes you do imply you think that it is good because of course you do, so do I, that's why I said it was a self interest argument. What was missing was an appeal to something else, which would make it not a self interest argument. Open source does not just mean a race to the bottom on price, it means a race to the bottom on everything, like alignment or refusals. Do you really think the world is better off with open source image and videogen? easy access to revenge porn and misinformation generation? etc.? They do collaborate, on alignment especially, which is what matters most. Sharing capabilities would do nothing w.r.t. pro-socialness. They do these things because they also, rightly, believe a multipolar future would be dystopian.

1,326

Qiuyang Mang

@MangQiuyang

Jun 10

Replying to @HaochengXiUCB

Nice work! I'm just curious about what's the difference between sparse videogen team and quant video team🧐😋

488