Filter
Exclude
Time range
-
Near
17/25 ๐—™๐—น๐—ฒ๐˜…๐—ถ๐—ฏ๐—น๐—ฒ ๐—™๐—น๐—ผ๐˜„๐˜€ ๐—ณ๐—ผ๐—ฟ ๐—•๐—ถ๐—ผ๐—น๐—ผ๐—ด๐—ถ๐—ฐ๐—ฎ๐—น ๐—ฆ๐—ฒ๐—พ๐˜‚๐—ฒ๐—ป๐—ฐ๐—ฒ ๐——๐—ฒ๐˜€๐—ถ๐—ด๐—ป This paper addresses limitations in Discrete Flow Matching (DFM) for biological sequence design by proposing a structured coupling for domain-specific preferences and a latent edit-based rate parameterization for variable-length generation. It introduces a latent classifier-free guidance mechanism and Dirichlet-prior temperature scaling for test-time control, achieving state-of-the-art performance across diverse tasks including density estimation, unconditional/conditional DNA, and peptide sequence generation. #DiscreteFlowMatching #BiologicalSequenceDesign #GenerativeModels #Bioinformatics #MachineLearning #DNAgeneration #PeptideGeneration Paper Link: arxiv.org/abs/2606.10543
1
4
Replying to @AndyHazelton
Cumulus parameterization problem?
1
386
That exponential in there reminds me of the Schwinger parameterization.
1
34
Alongside the ECWMF solutions, the UKMET is even more aggressive w/ evolution of a potent warm-core low over the Gulf coast states next week. These models have surface/boundary layer flux parameterization enabling unusual sustenance of lows in significantly moist environments.
1
4
24
1,238
@CNPYNetwork, application architects deploy fully programmable sovereign account constructs, natively enforcing decentralized multi-signature execution, permissioned access routing, and automated security parameterization.
1
1
11
The recognition of a parameterization grounded in the physical nature of the economy is possible provided we acknowledge that it is a system subject to forms of universality that reproduce recurrent probabilistic and physical properties.
1
11
Jiaxuan Zou retweeted
How do you know if a parameterization (e.g., ยตP) or a fitted Hyperparameter (HP) scaling law actually gives reliable transfer? @MBarkeshli and I propose a three-metric framework to quantify the quality of transfer and use it to show that ยตPโ€™s advantage over SP in Transformers trained with AdamW comes from training the embedding layer fast enough. Below: speeding up the embedding LR in SP (SP Embd) recovers ยตP-like transfer, and slowing it down in ยตP (ยตP-Embd) wrecks training with severe instabilities. A thread ๐Ÿงต 1/n
1
16
55
4,593
this is probably a harder example to replicate, where we generate some kind of parameterization around the edge loop, that goes from 0 to 1. And then we can make varying fillets along the edge
1
1
21
Replying to @Akintola_steve
Attacker inserts malicious SQL codes into input fields. Parameterization ORMs
33
๐Ÿ“ข June 15 (Mon): Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation ๐Ÿค” Discrete diffusion models are often trained through clean-data prediction, but the prediction can be used in different ways to define the reverse dynamics. In Masked Diffusion Models (MDM) these choices largely coincide, whereas in Uniform Diffusion Models (UDM) they do not. ๐Ÿ’ก The authors show that the standard plug-in bridge parameterization for UDM is not optimized by the denoising posterior, but by a leave-one-out posterior that predicts each clean token without using its own noisy observation. This identifies a mismatch between the plug-in ELBO and the usual cross-entropy denoising objective. ๐Ÿ”ง The authors characterize the leave-one-out target and derive exact conversions between the denoiser, the leave-one-out posterior, and the score. These conversions allow them to disentangle parameterization and the training objective. ๐Ÿ“ˆ Their results also lead to inference improvements without any additional training through an informed predictor-corrector sampler and improved temperature sampling based on the leave-one-out predictor. ๐Ÿ”ง The authors further introduce an absorbing-state reformulation of uniform diffusion that preserves the UDM joint law while decomposing it into masked-diffusion-like sampling operations, with simpler denoising posteriors, carry-over unmasking, and a natural remasking mechanism. ๐Ÿ“ˆ On language modeling, leave-one-out parameterizations consistently improve UDM generation, while the absorbing construction matches or surpasses masked diffusion. These results suggest that the empirical gap between masked and uniform diffusion is driven less by the choice of marginals themselves than by parameterization and sampling design. This Monday, Samson Gourevitch (@samsongvch, samsongourevitch.github.io/), Yazid Janati (@yjelid, yazidjanati.github.io/), and Dario Shariatian (@dario_sha, darioshar.github.io/) will present their paper "Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation".
1
6
24
4,110
Full Aiken integration with the IDE - from scaffolding to runners that use the IDE UI. Project creation, testing, building, parameterization, artifact generation, smart context-aware autocomplete/suggestions that can infer types and provide hints based on context, automatic imports -it has EVERYTHING. And what about local project-aware toolchains? You can work on different projects with different versions of Aiken or stdlib. No more regression fear! With customizable runners, you no longer need to write commands in the terminal. But if thatโ€™s what youโ€™re used to, the terminal launched inside the IDE will use the exact Aiken version configured for your project. Want to use runners but still prefer working with a console? You can do that too. All runners have two modes: Integrated UI and TTY. And do you know what else is great about runners? You can chain them together. One click launches the whole pipeline! Aiken development without compromises. Configure your pipeline, create your comfort zone, and build.
Aiken Plugin 2.0 is here! It was a long road - much longer than we expected. But it was worth it. #aiken #jetBrains #intelliJ #plugin
2
4
252
Whatโ€™s it called? Schwinger parameterization?
76
6/ Bigger picture: maybe PEFT doesnโ€™t need more tricks, modules, or complexity. Maybe it just needs a better parameterization. Thatโ€™s the idea behind GPart: simple, geometry-preserving, and competitive across domains. #PEFT #LoRA #LLM #MachineLearning #FineTuning
1
104
the LLM mantra has been this deep double descent, More parameterization never hurts, somehow lets LLMs escape fundamental laws of modelingโ€ฆ
1
62