GoatStack.AI (@GoatstackAI)

4 Apr 2024

AI Papers & Wine vol. 2 was a blast! 🍷 We dove into Geoffrey Hinton's Fast-Forwarding paper, enjoyed an impromptu dance party, and celebrated @arturkiulian's win with a bottle of wine. Thanks to all who joined us and made it an unforgettable night of AI innovation and community. We learned a lot about how AI researchers read papers and are excited to develop a B2B offering to support them. Tag your AI researcher friends 👇We'd love to connect and learn more! Stay tuned for our upcoming custom prompt competition and subscribe to @GoatstackAI for updates on new features and releases. Until next time, keep innovating, connecting, and dancing! 🤖

1,775

GoatStack.AI

5 Aug 2025

Cognitive Kernel-Pro is presented as a fully open-source multi-module framework aimed at democratizing the development of deep research agents, which are essential for advanced AI functionalities like reasoning, web interaction, and autonomous research. The pa...

Feiteng

@FeitengLi

4 Aug 2025

📄 Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training 📄 论文: arxiv.org/abs/2508.00414 💻 代码: github.com/Tencent/Cognitive…

150

GoatStack.AI

goatstack.ai/articles/2508.0…

5 Aug 2025

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Cognitive Kernel-Pro is presented as a fully open-source multi-module framework aimed at democratizing the development of deep research agents, which are essential for advanced AI functionalities...

GoatStack.AI

Cristian Leo @cristian_le0

1 Aug 2025

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

1 Aug 2025

Replying to @cristian_le0

Read the full paper: Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving arxiv.org/pdf/2507.23726v1

120

GoatStack.AI

1 Aug 2025

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

The paper introduces Seed-Prover, a whole-proof reasoning model designed to enhance automated theorem proving by leveraging formal verification and reinforcement learning. Unlike traditional LLMs...

GoatStack.AI

1 Aug 2025

The paper introduces SmallThinker, a family of large language models (LLMs) specifically designed for local deployment, overcoming the limitations of traditional models which are optimized for GPU-powered cloud infrastructure. By utilizing a deployment-aware a...

Marktechpost AI

@Marktechpost

1 Aug 2025

Replying to @Marktechpost

2/2 Paper: arxiv.org/abs/2507.20984 SmallThinker-4B-A0.6B-Instruct: huggingface.co/PowerInfer/Sm… SmallThinker-21B-A3B-Instruct: huggingface.co/PowerInfer/Sm…

GoatStack.AI

1 Aug 2025

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

The paper introduces SmallThinker, a family of large language models (LLMs) specifically designed for local deployment, overcoming the limitations of traditional models which are optimized for...

GoatStack.AI

31 Jul 2025

This paper presents ScreenCoder, a modular multi-agent framework designed to automate the transformation of user interface (UI) designs into front-end code, addressing limitations of current large language models (LLMs) that rely solely on text prompts. The fr...

Revanth Atmakuri

@RevanthAtmakuri

31 Jul 2025

ScreenCoder: because making UI designs into code should be less of a guesswork game. Three agents—grounding, planning, and generating—combine to make UI-to-code magic. Say goodbye to black-box coding! 🤖🔧 - ChatGPT Link: [arxiv.org/abs/2507.22827](arxiv.org/abs/2507.22827)

GoatStack.AI

31 Jul 2025

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal...

This paper presents ScreenCoder, a modular multi-agent framework designed to automate the transformation of user interface (UI) designs into front-end code, addressing limitations of current large...

GoatStack.AI

30 Jul 2025

HunyuanWorld 1.0 is a novel framework developed by Tencent aimed at generating immersive, explorable, and interactive 3D worlds from textual and visual inputs. It addresses the limitations of existing video-based and 3D-based world generation methods by combin...

David K.

@flopsy42

30 Jul 2025

Replying to @MaziyarPanahi @JulienBlanchon

I still prefer arxflix, it makes extremly less blablating : for this paper : arxiv.org/html/2507.21809v1 arxflix : youtu.be/FjHG0NISXJc vs noteboobLM : drive.google.com/file/d/1cwK…

GoatStack.AI

30 Jul 2025

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

HunyuanWorld 1.0 is a novel framework developed by Tencent aimed at generating immersive, explorable, and interactive 3D worlds from textual and visual inputs. It addresses the limitations of...

GoatStack.AI

29 Jul 2025

The paper introduces Agentic Reinforced Policy Optimization (ARPO), a novel reinforcement learning algorithm aimed at enhancing the performance of large language models (LLMs) in multi-turn interactions with external tools. Unlike existing RL methods that focu...

kabi @kakakbibibi

29 Jul 2025

Replying to @_akhaliq

🌐The training code, data, and model weights for ARPO are all open source: Arxiv: arxiv.org/abs/2507.19849 Dataset & Models: huggingface.co/collections/d… GitHub: github.com/dongguanting/ARPO

421

GoatStack.AI

29 Jul 2025

Agentic Reinforced Policy Optimization

GoatStack.AI

Zhenjun Zhao @zhenjun_zhao

29 Jul 2025

Reconstructing 4D Spatial Intelligence: A Survey

29 Jul 2025

Reconstructing 4D Spatial Intelligence: A Survey @yukangcao, Jiahao Lu, Zhisheng Huang, Zhuowei Shen, Chengfeng Zhao, @hongfz16, @Frozen_Burning, Xin Li, Wenping Wang, @YuanLiu41955461, @liuziwei7 tl;dr: in title arxiv.org/abs/2507.21045

GoatStack.AI

29 Jul 2025

Reconstructing 4D Spatial Intelligence: A Survey

This paper surveys the evolving field of 4D spatial intelligence reconstruction in computer vision, organizing methods into five progressive levels, from basic low-level 3D attributes to complex...

GoatStack.AI

28 Jul 2025

The paper introduces ASI-ARCH, an innovative autonomous system for neural architecture discovery that overcomes human cognitive limitations in AI research. This system allows AI to innovate architectural designs independently, conducting extensive experiments ...

Gorden Sun

@Gorden_Sun

28 Jul 2025

模型架构的AlphaGo时刻上海交大发布的论文，这篇论文提出并证明了一个观点：AI能够自主发现新的创新架构，并写出代码实现和验证性能。他们设计的模型ASI-Arch，在超过 2万个 GPU 小时的时间内进行了 1773 次自主实验，发现了 106 个创新的、更好的线性注意力架构。论文：arxiv.org/abs/2507.18074

GoatStack.AI

28 Jul 2025

AlphaGo Moment for Model Architecture Discovery

GoatStack.AI

25 Jul 2025

The Step-Audio 2 model is an advanced end-to-end multi-modal large language model designed for superior audio understanding and speech interactions, incorporating a latent audio encoder and reinforcement learning. It enhances responsiveness to paralinguistic i...

StepFun

@StepFun_ai

25 Jul 2025

Replying to @StepFun_ai

🌍 More Details about Step-Audio 2: 🔗 GitHub: github.com/stepfun-ai/Step-A… 🔗 Huggingface：huggingface.co/papers/2507.1… 📑 Tech Report： arxiv.org/pdf/2507.16632

GoatStack.AI

25 Jul 2025

Step-Audio 2 Technical Report

GoatStack.AI

25 Jul 2025

This paper addresses the challenge of achieving human-like perception in Multimodal Large Language Models (MLLMs), shifting focus from reasoning to perception. It introduces the Turing Eye Test (TET), a benchmark composed of four tasks designed to evaluate MLL...

Turing Post

@TheTuringPost

24 Jul 2025

Replying to @TheTuringPost

Paper: arxiv.org/abs/2507.16863 Project Page: TuringEyeTest.github.io

100

GoatStack.AI

25 Jul 2025

Pixels, Patterns, but No Poetry: To See The World like Humans

This paper addresses the challenge of achieving human-like perception in Multimodal Large Language Models (MLLMs), shifting focus from reasoning to perception. It introduces the Turing Eye Test...