Rafael Pardinas

Rafael Pardinas

37 Photos and videos

Tweets

Pinned Tweet

Rafael Pardinas

@muchomuchacho

Mar 30

Really cool to see PipelineRL's in-flight weight updates being picked up! We're spreading it across our research teams to train models to reason and to make reasoning more efficient.

Sasha Rush

@srush_nlp

Mar 29

We agree. arxiv.org/abs/2603.24477

458

Alex Gurung

Rafael Pardinas retweeted

Alex Gurung @AlexAag1234

Jun 1

Excited to share my recent work @ServiceNowRSRCH ! We introduce a new privacy-centric deep research dataset and show models frequently leak enterprise information. However, training with dense _situational_ rewards efficiently learns to jointly optimize performance and privacy

Rafael Pardinas

@muchomuchacho

Jun 1

MosaicLeaks is now on arXiv. The Mosaic Effect captures a simple idea: small fragments can look harmless alone, but become revealing in aggregate. Deep research agents can leak enterprise information in exactly this way. 1/9

532

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

Jun 1

1,201

more replies

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

Jun 1

The core idea: Enterprise agent privacy failures will not only come from copying private text. They can also come from the external actions agents take while trying to be useful. Privacy shouldn't come at the cost of utility, we can optimise for both. 8/9

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

Jun 1

Led by @AlexAag1234 at ServiceNow AI Research, with @gspandana , @alexandredrouin , @ILaradji , @PerouzT and me. Paper: arxiv.org/abs/2605.30727

MosaicLeaks:Privacy Risks in Querying-in-the-Open for Deep Research Agents

Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive information from...

arxiv.org

190

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

May 25

this is too good

テコまる @tecomalupepepe

May 25

長椅子振動主への反射システム

1:08

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

May 21

This is becoming really powerful. More to come for high latency agentic pipelines

Rafael Pardinas

@muchomuchacho

Apr 9

Better reasoning does not have to mean longer reasoning. Apriel OpenReasoner: fully reproducible multi-domain RL post-training using public datasets. 30-50% shorter traces, no quality trade-off. @ServiceNowRSRCH @ehsk0 @dvazquezcv @alexandredrouin

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

May 16

London tech you say?

Zain Mobarik

@Zainmbrk

May 15

I spent the last few weeks crowdsourcing the ultimate guide to London’s startup ecosystem. Here's why. Finding your people is a lifelong mission- the people that push you, open doors for you, celebrate your wins, advise you sincerely and say yes to your crazy ideas. It’s one of the reasons people love San Francisco. Everyone is rooting for you and believes in you. There is a sense of wild ambition. But is this something only unique to SF? What is/was London missing? I think it really came down to a few things: - Optimism - A mindset of waiting for permission - Lack of a catalyst Those in the startup world would have felt a shift over the past couple of months that has instilled a renewed sense of optimism for Britain, a mentality of not waiting for anyone’s permission and the catalyst of the AI boom empowering a new generation of builders. And surprisingly, this isn’t new for Britain. We made the jet engine, steam trains, discovered the structure of DNA, discovered gravity and so much more. There was no concept of permission. The UK that exists today has: - Anthropic, OpenAI and DeepMind all opening offices in Kings Cross - Startups raising absurd rounds building generational companies (just 2 days ago Fractile raised a $220m Series B) - Unmatched talent being pulled in from Oxford, Cambridge, Imperial, UCL, Warwick, Kings and even European universities like ETH So how can someone get involved and how can we level the playing field for those outside the startup ecosystem? The guide friends and I created below is our small role in helping democratise some of the obscure information on the inner workings of London’s startup scene. Read it, add to it, check it regularly and most importantly, do something with it. I hope this guide helps people for years to come. Can’t wait to see what we do on top of all the infrastructure built by those before us. We’re truly standing on the shoulders of giants. 🔥 Link in comments.

0:36

108

Alexandre

Rafael Pardinas retweeted

Alexandre

@alexpiche_

May 11

PipelineRL finally supports vLLM v1!

Rafael Pardinas

@muchomuchacho

May 8

Our first vLLM V0→V1 run on PipelineRL looked broken. @ehsk0 and I almost reached for an objective-side correction. That would have been the wrong fix. The real problem: four mismatches in the rollout backend. 🧵

697

Rafael Pardinas

Rafael Pardinas retweeted

Rafael Pardinas

@muchomuchacho

May 8

2,489

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

May 8

2,489

more replies

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

May 8

With those fixed, V1 converged to the V0 trajectory. No objective change. Backend correctness before objective corrections — otherwise your objective fix silently compensates for a broken inference path, and the curves stop telling you anything.

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

May 8

Fixes in main: github.com/ServiceNow/Pipeli… Postmortem: huggingface.co/blog/ServiceN…

GitHub - ServiceNow/PipelineRL: A scalable asynchronous reinforcement learning implementation with...

A scalable asynchronous reinforcement learning implementation with in-flight weight updates. - ServiceNow/PipelineRL

github.com

Rafael Pardinas

Rafael Pardinas

@muchomuchacho

May 3

It’s been over a year since we released this work. Since then, PipelineRL has gone places. huggingface.co/blog/ServiceN…

PipelineRL

A Blog post by ServiceNow on Hugging Face

huggingface.co