implemented q-chunking on top of it
offline only for now
already converges significantly faster: 84% at 50k steps vs 56% for vanilla fql
online fine-tuning harder envs coming next
implemented flow q-learning (FQL) from scratch in PyTorch, tested on OGBench cube manipulation
smol 200k step pilot on my mac
some more bigger scale experiments coming soon