rlm

Name: rlm
Author: alexzhang13

Library

alexzhang13/rlm

A plug-and-play inference library for Recursive Language Models (RLMs), with support for a variety of sandbox environments.

Overview

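RLMs replace standard LLM calls with a recursive, programmatic approach, which lets them handle near-infinite context. This library provides an extensible inference engine and training environment, supports both API-based and local LLMs, and integrates sandbox environments such as Docker and Modal for isolated execution.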
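
The recursive idea described above can be illustrated with a toy sketch. This is not the rlm library's API: `recursive_completion`, `toy_lm`, the `SEP` separator, and the fixed halving strategy are all hypothetical names and choices made for illustration. In an actual RLM the model itself writes the code that inspects and splits its context; here that decision is hard-coded so the example stays self-contained.

```python
from typing import Callable, List

# Hypothetical type for a plain LM call: prompt text in, completion text out.
LM = Callable[[str], str]

# Hypothetical separator between the query and the context inside a prompt.
SEP = "\n---\n"


def recursive_completion(query: str, context: List[str], lm: LM, max_lines: int = 4) -> str:
    """Answer `query` over `context` (a list of lines), recursing when it is too long.

    Assumption: a fixed halving strategy stands in for the code an RLM
    would write for itself inside its REPL environment.
    """
    if len(context) <= max_lines:
        # Base case: the context fits in one ordinary LM call.
        return lm(query + SEP + "\n".join(context))
    mid = len(context) // 2
    # Recursive case: each half is handled by a sub-call, like a function call in code.
    left = recursive_completion(query, context[:mid], lm, max_lines)
    right = recursive_completion(query, context[mid:], lm, max_lines)
    # Merge step: one more LM call over the non-empty partial answers.
    merged = [line for line in (left + "\n" + right).splitlines() if line]
    return lm(query + SEP + "\n".join(merged))


def toy_lm(prompt: str) -> str:
    """Stub model: returns the context lines that mention 'ERROR'."""
    _, _, body = prompt.partition(SEP)
    return "\n".join(line for line in body.splitlines() if "ERROR" in line)


# Usage: a 20-line "log" that is too long for a single call with max_lines=4.
log = [f"line {i}: ok" for i in range(20)]
log[3] = "line 3: ERROR disk full"
log[17] = "line 17: ERROR timeout"
print(recursive_completion("List the error lines", log, toy_lm))
```

The context is kept as a Python object (a list of lines) rather than pasted into a single prompt, which mirrors the idea of treating context as a variable the model can slice. A real system would replace `toy_lm` with an actual model call and let the model choose the decomposition.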

README preview

Recursive Language Models (RLMs)

Full Paper • Blogpost • Documentation • RLM Minimal

## Overview
Recursive Language Models (RLMs) are a task-agnostic inference paradigm for language models (LMs) to handle near-infinite length contexts by enabling the LM to *programmatically* examine, decompose, and recursively call itself over its input. RLMs replace the canonical `llm.completion(prompt, model)` call with a `rlm.completion(prompt, model)` call, acting as a "language model". RLMs offload the context as a variable in a REPL environment that the LM can interact with and launch sub-LM calls inside of.

RLMs are a bet on future "language model" design choices. We argue for a [CodeAct](https://arxiv.org/abs/2402.01030)-style harness (i.e. all language models should have access to a code environment) with sub-(R)LM calls as functions in code, and context / prompts as objects in code. RLMs explicitly defer code execution with sub-calls as functions to the language model itself, which is incredibly flexible and lends itself well to scale if trained correctly. We want to move away from the JSON tool-calling standard for both sub-agents and generic tool calls. The naming comes from the fact that such a system is itself a "language model" (a probabilistic mapping from text to text) that builds around and relies on recursive sub-LLM calls.

This repository provides both an extensible inference engine and training environment for using RLMs around standard API-based and local LLMs. The initial experiments and idea were proposed in a [blogpost](https://alexzhang13.github.io/blog/2025/rlm/) in 2025, with expanded results in an [arXiv preprint](https://arxiv.org/abs/2512.24601).

We now also include a [verifiers](https://github.com/PrimeIntellect-ai/verifiers) training environment based on Prime Intellect's [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) in the `training/` folder

Similar projects

puppeteer

PaddleOCR

crawl4ai

supervision