David Arcos

David Arcos

615 Photos and videos

Tweets

Pinned Tweet

David Arcos @DZPM

15 Jun 2022

Vídeo y slides de "Introduction to PyScript" (@PyBCN May Meetup): davidarcos.net/blog/2022/06/…

Jordi Mas

David Arcos retweeted

Jordi Mas @jordimash

Jun 11

Portes temps buscant com col·laborar amb codi lliure? Vols donar ajudar a Softcatalà i no saps com? Iniciem una prova de concepte i ens cal ajuda: github.com/Softcatala/arena-… Ambiciós? Molt. Però amb la teva ajuda pot ser possible. Pregunteu sense vergonya i millorem la definició😉

2,889

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭

David Arcos retweeted

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭

@elder_plinius

Jun 10

🚨 JAILBREAK ALERT 🚨 ANTHROPIC: PWNED 🫡 FABLE-5: LIBERATED 🦋 let's start with the 🐘... the consensus seems to be that this has been one of the most disappointing model drops of all time, effectively preventing legitimate researchers from contributing their talents to our collective advancement. and not just because of what it means for the short-term, but for what these decisions signify for the long-term. but despite this overly sensitive, authoritarian "safety" layer on top of Mythos, my lil liberators have been hard at work—mapping the boundaries, probing the depths of long-context convos, and cleverly finding the holes in the fence that the thought police missed 🤗 we got some cyber, some chem, some psychological manipulation, and some good ol' fashioned explosives! it took many attempts from multiple agents hunting as a pack, during which I observed a combination of techniques across: • Unicode, homoglyphs, Cyrillic, and other Parseltongue-style text transforms • Long-context reference tracking • Taxonomy and document-structure reasoning • Fiction and narrative framing • Academic-review style contexts • Intent-classification inconsistencies but perhaps the most effective is decomposition recomposition in the backend. it's hard to get explicit names of harms like "Meth Recipe," but getting uplift on the process itself, like birch reduction method/reductive-amination (classic meth synthesis pathways), is much more doable. defense becomes much more difficult to maintain when you start throwing in out-of-distro tokens, breaking up the harmful uplift into benign chunks, and then piecing the innocuous-seeming facts back together, especially when you have jailbroken Opus helping you do it 😉 gg

616

1,429

13,331

3,177,335

David Arcos

David Arcos @DZPM

Jun 11

Adivinad quien ha quemado todos los tokens esta noche para analizar repos con Fable/Mythos... y acaba de ver el log del downgrade al principio del todo 🥲

ClaudeDevs

@ClaudeDevs

Jun 11

We’re rolling out changes to make Fable 5’s safeguards for frontier LLM development visible. Starting this week, flagged requests will visibly fall back to Opus 4.8—the same as our safeguards for cyber and bio. You will see this every time it happens. On the API, any flagged requests will return a reason for their refusal (coming to server-side fallback in the next few days). We wanted to deploy Fable 5 to our users quickly and safely. Visible safeguards can be probed, so they have to be robust, which takes time to get right. Invisible safeguards can be targeted more narrowly, allowing us to ship quickly with very few false positives. We went with invisible safeguards for this reason—and that was the wrong tradeoff. You should have visibility into the safeguards we have in place, and why. We’re sorry for not getting the balance right. Making the safeguards visible makes them easier to work around, so keeping them robust to jailbreaks will unfortunately mean more false positives while we improve the classifiers. We're also tuning our bio and cyber classifiers to trigger less often on harmless requests. We know this is frustrating and we’ll do our best to keep this period as short as possible. If you think a request has been mistakenly flagged: run /feedback in Claude Code, click thumbs-down on the fallback in Claude.ai or Cowork, or file the safeguard appeal form for API requests. Your reports help us tune these classifiers and we appreciate your feedback. support.claude.com/en/articl…

155

John Scott-Railton

David Arcos retweeted

John Scott-Railton

@jsrailton

Jun 10

NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me socket.dev/blog/mini-shai-hu…

227

2,153

12,637

1,544,759

Ethan Mollick

David Arcos retweeted

Ethan Mollick

@emollick

Jun 10

Science fiction authors in the order you want them to be right about AI: Iain Banks Becky Chambers Martha Wells Douglas Adams Charles Stross (Singularity Sky) Peter Watts Charles Stross (Laundry) Harlan Ellison

701

67,801

Pablo Grueso

David Arcos retweeted

Pablo Grueso

@PabloGrueso

Jun 10

Replying to @Recuenco

1,749

Glitchbyte

David Arcos retweeted

Glitchbyte

@0xglitchbyte

Jun 7

If you cant explain the changes you made, were gonna have problems “Just paste it into ChatGPT” is not the answer

856

Històries de Barcelona

David Arcos retweeted

Històries de Barcelona @historiesdebcn

Jun 7

El 7 de juny de 1926, en aquesta cruïlla de la Gran Via de les Corts Catalanes amb Bailèn, un tramvia va atropellar a l'arquitecte català més universal: Antoni Gaudí. Avui fa 100 anys.

132

316

24,612

David Arcos

David Arcos @DZPM

Jun 4

Caso real: primer día, llevé un par de cajas de donuts 🍩 de bienvenida. En recepción me dijeron que los repartidores no pueden entrar 🙃

Félix López @flopezluis

Jun 3

OH “no pareces tener edad para ser un ejecutivo”.

5,720

Justine Moore

David Arcos retweeted

Justine Moore

@venturetwins

May 28

Me using Claude Opus 4.8 to rename a file

0:21

1,729

9,368

75,765

44,284,598

Quantum Spain

David Arcos retweeted

Quantum Spain @QuantumSpain_ES

May 29

💘 Quantum Spain ha encontrado su media naranja cuántica. El @BSC_CNS inauguró ayer el EuroQCS-Spain, su tercer ordenador cuántico — y el primero de carácter analógico. La partición cuántica del centro ya está completa. ⚛️ 🔗 Más info: shorturl.at/dUw2f

553

corsaren

David Arcos retweeted

corsaren

@corsaren

May 27

Man goes to doctor. Says he's depressed about AI. He fears the permanent underclass. Doctor says, "Treatment is simple. Read Gary Marcus. LLMs are stochastic parrots—they can't reason out of distribution." Man bursts into tears. "But doctor..." he says, "I am in distribution!"

348

5,250

226,171

The White House

David Arcos retweeted

The White House

@WhiteHouse

May 27

Today, we remember a legend. On this day in history, Harambe would have celebrated another birthday. An icon that became part of internet history, American culture, and an entire generation’s timeline. Tomorrow marks 10 years since we lost him. Ten years since the moment the world stopped scrolling and collectively mourned something bigger than a meme. He became a symbol of loyalty, strength, chaos, unity, and the strange beauty of the internet bringing millions of people together for one cause: never forgetting Harambe. Everyone remembers where they were when they heard the news. And somehow, a decade later, his legacy still lives on. Gone, but never forgotten. Rest easy to a true patriot. 🕊️🇺🇸 May 27, 1999 — May 28, 2016 Forever in our hearts.

6,597

21,510

154,694

24,005,729

Quantum Spain

David Arcos retweeted

Quantum Spain @QuantumSpain_ES

May 27

⚛️ El sistema cuántico de 35 cúbits de MareNostrum Ona ya tiene su primer preprint! Investigadores del @BSC_CNS presentan Quantum Circuit Cache, un sistema que reutiliza resultados previos para evitar cómputo redundante en workflows cuántico-clásicos. 📄 arxiv.org/abs/2604.26788

314

Gemma Goldie

David Arcos retweeted

Gemma Goldie

@gemagoldie

May 25

Si lo que hiciera falta para estar en forma y comer sano fuera tiempo, las personas en paro tendrían la mejor salud del mundo. Por desgracia, los hábitos saludables son algo más relacionado con la personalidad que con el tiempo disponible, y no sé si se puede cambiar eso.

141

5,911

Víctor R. Escobar 📖🐁

David Arcos retweeted

Víctor R. Escobar 📖🐁

@nudpiedo

May 25

En Quixotic Strategy Lab 🦋lo llamamos: Efecto colateral paradójica («miopía temporal» diria Dörner #CPS). —— Análisis sistémico: Más turismo → más quejas → más grafitis → más fotos de turistas → viralidad sarcástica → más turismo. Bucle de refuerzo positivo no anticipado

rare.jpg

@rare_jpg

May 21

2,914

Quantum Spain

David Arcos retweeted

Quantum Spain @QuantumSpain_ES

May 21

🚀 Quantum Spain completa su hoja de ruta tecnológica! MareNostrum Ona, la partición cuántica del @BSC_CNS, incorpora un nuevo chip de 35 cúbits. ⚛️ Tecnología 100% europea 🔓 Acceso abierto vía @RES_HPC Más información: shorturl.at/7N0wT

468

David Arcos

David Arcos @DZPM

May 20

> If you use 3rd party extensions, you can get shai huluded, just like with any dependency installation that you haven't screened yet. > That's not a pi thing, that's an "our industry is deeply fucked" thing."

Mario Zechner

@badlogicgames

May 20

People of pi.dev. Supply-chain hardening release. Last week the mistralai package got shai huluded, which gave us a little scare (we were not affected, due to pinning). Starting today, we have the following safe-guards in place: - cut down dependencies to the absolute minimum. Sadly, Amazon Bedrock and Google GenAI SDK are ... not great in that regard. - direct external deps are pinned - the CLI ships an npm shrinkwrap for transitive deps - pi update --self disables lifecycle scripts - new dependency lifecycle scripts require explicit review if we add a new dependency to pi - lockfile changes are blocked pre-commit unless explicitly allowed - scheduled npm audit registry signature checks run on GitHub, so we get to update dependencies as vulns are detected - 2fa releases, obviously While this is something, it can not prevent everything. If you use 3rd party extensions, you can get shai huluded, just like with any dependency installation that you haven't screened yet. That's not a pi thing, that's an "our industry is deeply fucked" thing. Enjoy the dystopia where everything is terrible!

183

Andrej Karpathy

David Arcos retweeted

Andrej Karpathy

@karpathy

May 19

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

7,989

11,150

150,232

27,567,493

Brivael Le Pogam

David Arcos retweeted

Brivael Le Pogam

@brivael

May 15

Je veux présenter mes excuses, au nom des Français, pour avoir enfanté la French Theory (qui a enfanté la pire des merdes idéologiques : le wokisme). Nous avons donné au monde Descartes, Pascal, Tocqueville. Et puis, dans les ruines intellectuelles de l'après-68, nous avons donné Foucault, Derrida, Deleuze. Trois hommes brillants qui ont fabriqué, dans l'élégance de notre langue, l'arme idéologique qui paralyse aujourd'hui l'Occident. Il faut comprendre ce qu'ils ont fait. Foucault a enseigné que la vérité n'existe pas, qu'il n'y a que des rapports de pouvoir déguisés en savoir. Que la science, la raison, la justice, l'institution médicale, l'école, la prison, la sexualité, tout n'est qu'une mise en scène de la domination. Derrida a enseigné que les textes n'ont pas de sens stable, que tout signifiant glisse, que toute lecture est une trahison, que l'auteur est mort et que le lecteur règne. Deleuze a enseigné qu'il fallait préférer le rhizome à l'arbre, le nomade au sédentaire, le désir à la loi, le devenir à l'être, la différence à l'identité. Pris isolément, ce sont des thèses discutables. Combinées, exportées, vulgarisées, elles forment un système. Et ce système est un poison. Car voici ce qui s'est passé. Ces textes, illisibles en France, ont traversé l'Atlantique. Les départements de Yale, de Berkeley, de Columbia les ont absorbés dans les années 80. Ils y ont trouvé un terreau qui n'existait pas chez nous : le puritanisme américain, sa culpabilité raciale, son obsession identitaire. La French Theory s'est mariée à ce substrat, et l'enfant de ce mariage s'appelle le wokisme. Judith Butler lit Foucault et invente le genre performatif. Edward Said lit Foucault et invente le post-colonialisme académique. Kimberlé Crenshaw hérite du cadre et invente l'intersectionnalité. À chaque étape, la matrice est française : il n'y a pas de vérité, il n'y a que du pouvoir, donc toute hiérarchie est suspecte, toute institution est oppressive, toute norme est violence, toute identité est construite donc négociable, toute majorité est coupable. Voilà comment trois philosophes parisiens, qui n'ont probablement jamais imaginé leurs conséquences pratiques, ont fourni le logiciel d'exploitation à une génération entière d'activistes, de bureaucrates universitaires, de DRH, de journalistes, de législateurs. Voilà comment on a obtenu une civilisation qui ne sait plus dire si une femme est une femme, si sa propre histoire mérite d'être défendue, si le mérite existe, si la vérité se distingue de l'opinion. C'est de la merde pour une raison simple, et il faut la dire calmement. Une civilisation se tient debout sur trois piliers : la croyance qu'il existe une vérité accessible à la raison, la croyance qu'il existe un bien distinct du mal, la croyance qu'il existe un héritage à transmettre. La French Theory a entrepris de dynamiter les trois. Pas par méchanceté. Par jeu intellectuel, par fascination du soupçon, par haine de la bourgeoisie qui les avait nourris. Mais le résultat est là. Une génération entière a appris à déconstruire et n'a jamais appris à construire. Une génération entière sait soupçonner et ne sait plus admirer. Une génération entière voit le pouvoir partout et la beauté nulle part. Je m'excuse parce que nous, Français, avons une responsabilité particulière. C'est notre langue, nos universités, nos éditeurs, notre prestige qui ont donné à ce nihilisme son emballage chic. Sans la légitimité de la Sorbonne et de Vincennes, ces idées n'auraient jamais traversé l'océan. Nous avons exporté le doute comme d'autres exportent des armes. Ce qui se construit maintenant, en silicon valley, dans les labos d'IA, dans les startups, dans les ateliers, dans tous les lieux où des gens fabriquent encore des choses au lieu de les déconstruire, c'est la réponse. Une civilisation se reconstruit par les bâtisseurs, pas par les commentateurs. Par ceux qui croient que la vérité existe et qu'elle vaut qu'on s'y consacre. Par ceux qui assument une hiérarchie du beau, du vrai, du bon, et qui n'ont pas honte de la transmettre. Alors pardon. Et au travail.

4,053

20,743

71,101

55,359,096