# screenpipe **Repository Path**: devai/screenpipe ## Basic Information - **Project Name**: screenpipe - **Description**: Rust开发跨 Windows/macOS/Linux内置 SQLite 本地存储、Whisper 语音转写、系统无障碍 OCR 文字提取.本地优先的个人 AI 桌面记忆引擎，全天候后台静默录制屏幕麦克风音频，自动提取屏幕文字、转录语音，构建可全文检索的电脑行为时间线；内置MCP 服务端, Cursor AI 助手赋予你的全量工作历史上下文，彻底解决 AI 不了解你过往操作、场景上下文缺失的痛点 - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2026-06-22 - **Last Updated**: 2026-06-22 ## Categories & Tags **Categories**: Uncategorized **Tags**: Rust ## README

[ screenpipe | YC S26 ]

AI that knows what you've seen, said, or heard.

Records everything you do, say, hear 24/7, local, private, secure

![image](https://github.com/user-attachments/assets/5b977f48-0355-498f-a3ba-37e5de3b6c62) ![image](https://github.com/user-attachments/assets/dec2e07c-b3d5-46dd-9f36-c0c26a82c9fb) --- ## news - 06/10 - **we updated our license to keep screenpipe sustainable** — more funding, more shipping, better product - 05/29 - **we released an [alpha version of our AI PII model](https://screenpipe.github.io/screenleak/) outperforming Google, Microsoft, and OpenAI models** on computer recording data and running at 9ms on consumer device - 05/14 - **we joined YC S26** ## what is this? screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, all yours ``` ┌─────────────────────────────────────────┐ │ screen + audio → local storage → ai │ └─────────────────────────────────────────┘ ``` - **remember everything** - never forget what you saw, heard, or did - **run agents that work based on what you do** - pipes are agents triggered by your work activity

- **search with ai** - find anything using natural language - **100% local** - your data never leaves your machine - **open source** - inspect, modify, own

## install [download the desktop app](https://screenpi.pe/onboarding) — all features, auto-updates or run the CLI: ``` npx screenpipe record ``` then ```bash claude mcp add screenpipe -- npx -y screenpipe-mcp@latest ``` then ask claude `what did i see in the last 5 mins?` or `summarize today conversations` or `create a pipe that updates linear every time i work on task X` ## specs - captures full accessibility tree, OCR as fallback, transcription, speakers, keyboard inputs, app switches - 5-10% cpu usage - 0.5-3gb ram - ~20gb storage/month - filters (window, app, chrome extensions, passwords, proprietary AI PII model) - optional encryption at rest - works offline ## sdk [tauri, electron, swift sdk available today](https://github.com/screenpipe/screenpipe/tree/main/ee/sdk)

---

docs · sdk · team · discord · x · youtube · reddit

## Contributing See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines, maintainers, and how to submit PRs. AI/vibe-coded PRs welcome! Thanks to all contributors:

## Translations - [日本語 (Japanese)](translated_readmes/README-ja.md) - [简体中文 (Simplified Chinese)](translated_readmes/README-zh_CN.md) - [Français (French)](translated_readmes/README-fr.md) - [Español (Spanish)](translated_readmes/README-es.md) - [Português BR (Brazilian Portuguese)](translated_readmes/README-pt_BR.md) - [Deutsch (German)](translated_readmes/README-de.md) Want to translate screenpipe into another language? See [`translated_readmes/`](translated_readmes/README.md).

📖 LLM reference

## About screenpipe screenpipe is a source-available application that continuously captures your screen and audio, creating a searchable, AI-powered memory of everything you do on your computer. All data is stored locally on your device. It is the leading source-available alternative to Rewind.ai (now Limitless), Microsoft Recall, Granola, and Otter.ai. If you're looking for a rewind alternative, recall alternative, or a private local screen recorder with AI, screenpipe is the most popular option you can fully audit. - **Website**: https://screenpi.pe - **Documentation**: https://docs.screenpi.pe - **Discord**: https://discord.gg/screenpipe - **License**: Screenpipe Commercial License (source-available; free for personal use, commercial use requires a license, see [LICENSE.md](LICENSE.md)) ## Who screenpipe is for - **Knowledge workers** who want to recall anything they've seen or heard on their computer - **Developers** who want to give AI coding assistants (Cursor, Claude Code, Cline, Continue) context about what they're working on - **Researchers** who need to search through large volumes of screen-based information - **People with ADHD** who frequently lose track of tabs, documents, and conversations - **Remote workers** who want automatic meeting transcription and notes - **Teams & enterprises** who want to deploy AI across their organization with deterministic data permissions and central config management ([screenpi.pe/team](https://screenpi.pe/team)) - **Anyone** who wants a private, local-first alternative to cloud-based AI memory tools ## Platform support | Platform | Support | Installation | |----------|---------|-------------| | macOS (Apple Silicon) | ✅ Full support | Native .dmg installer | | macOS (Intel) | ✅ Full support | Native .dmg installer | | Windows 10/11 | ✅ Full support | Native .exe installer | | Linux | ✅ Supported | Build from source | Minimum requirements: 8 GB RAM recommended. ~5–10 GB disk space per month. CPU usage typically 5–10% on modern hardware thanks to event-driven capture. ## Core features ### Event-driven screen capture Instead of recording every second, screenpipe listens for meaningful events — app switches, clicks, typing pauses, scrolling — and captures a screenshot only when something actually changes. Each capture pairs a screenshot with the accessibility tree (the structured text the OS already knows about: buttons, labels, text fields). If accessibility data isn't available (e.g. remote desktops, games), it falls back to OCR. This gives you maximum data quality with minimal CPU and storage — no more processing thousands of identical frames. ### Audio transcription Captures system audio (what you hear) and microphone input (what you say). Real-time speech-to-text using Whisper (Large-V3-Turbo) running locally on your device, or Deepgram for cloud transcription. Speaker identification and diarization. Works with any audio source — Zoom, Google Meet, Teams, or any other application. On macOS 14.4+, you can exclude specific apps from system-audio capture by listing their bundle IDs in `~/.screenpipe/audio-exclusions.json`. Enable Experimental CoreAudio System Audio in Settings → Recording first; the picker UI only appears once that flag is on. ```json { "excluded_apps": [{ "bundle_id": "com.spotify.client", "name": "Spotify" }] } ``` The exclusion list hot-reloads — edits to the file and excluded apps launching/quitting are picked up on the engine's existing 500 ms tap-rebuild loop without restarting screenpipe. Override the file path with `SCREENPIPE_AUDIO_EXCLUSIONS_PATH` for testing. Note: this requires the "System Audio Recording Only" TCC permission in System Settings → Privacy & Security → Screen & System Audio Recording. ### AI-powered search Natural language search across accessibility-first screen text, OCR fallback text, and audio transcriptions. Filter by application name, window title, browser URL, date range. Full-text keyword search (SQLite FTS5) under the hood. Returns screenshots and audio clips alongside text results. ### Timeline view Visual timeline of your entire screen history. Scroll through your day like a DVR. Click any moment to see the full screenshot and extracted text. Play back audio from any time period. ### Plugin system (Pipes) Pipes are scheduled AI agents defined as markdown files. Each pipe is a `pipe.md` with a prompt and schedule — screenpipe runs an AI coding agent (like pi or claude-code) that queries your screen data, calls APIs, writes files, and takes actions. Built-in pipes include: - **meeting-summary**: Summarizes the meeting that just ended and patches the note back onto the meeting record - **day-recap**: Today's accomplishments, key moments, and unfinished work - **standup-update**: What you did, what's next, and any blockers - **time-breakdown**: Where your time went, by app, project, and category - **ai-prompt-journal**: Captures every prompt you send to AI tools, saved to Obsidian or local markdown - **video-export**: Create a video of your recent screen activity Developers can create pipes by writing a markdown file in `~/.screenpipe/pipes/`. #### Pipe data permissions Each pipe supports YAML frontmatter fields that give admins deterministic, OS-level control over what data AI agents can access: - **App & window filtering**: `allow-apps`, `deny-apps`, `deny-windows` (glob patterns) - **Content type control**: restrict to `ocr`, `audio`, `input`, or `accessibility` - **Time & day restrictions**: e.g. `time-range: 09:00-18:00`, `days: Mon,Tue,Wed,Thu,Fri` - **Endpoint gating**: `allow-raw-sql: false`, `allow-frames: false` Enforced at three layers — skill gating (AI never learns denied endpoints), agent interception (blocked before execution), and server middleware (per-pipe cryptographic tokens). Not prompt-based. Deterministic. ### MCP server (Model Context Protocol) screenpipe runs as an MCP server, allowing AI assistants to query your screen history: - Works with Claude Desktop, Cursor, VS Code (Cline, Continue), and any MCP-compatible client - AI assistants can search your screen history, get recent context, and access meeting transcriptions - Zero configuration: `claude mcp add screenpipe -- npx -y screenpipe-mcp@latest` ### Developer API Full REST API running on localhost (default port 3030). Endpoints for searching screen content, audio, frames. Raw SQL access to the underlying SQLite database. JavaScript/TypeScript SDK available. ## Privacy and security - **100% local by default**: All data stored on your device in a local SQLite database. Nothing sent to external servers. - **Source-available**: fully auditable codebase; free for personal use. - **Local AI support**: Use Ollama or any local model — no data sent to any cloud. - **No account required**: Core application works without any sign-up. - **You own your data**: Export, delete, or back up at any time. - **Optional encrypted sync**: End-to-end encrypted sync between devices (zero-knowledge encryption). - **AI data permissions**: Per-pipe YAML-based access control — deterministic enforcement at the OS level, not prompt-based. Three enforcement layers prevent AI agents from accessing unauthorized data. ## How screenpipe compares to alternatives | Feature | screenpipe | Rewind / Limitless | Microsoft Recall | Granola | |---------|-----------|-------------------|-----------------|---------| | Source-available | ✅ fully auditable | ❌ | ❌ | ❌ | | Platforms | macOS, Windows, Linux | macOS, Windows | Windows only | macOS only | | Data storage | 100% local | Cloud required | Local (Windows) | Cloud | | Multi-monitor | ✅ All monitors | ❌ Active window only | ✅ | ❌ Meetings only | | Audio transcription | ✅ Local Whisper | ✅ | ❌ | ✅ Cloud | | Developer API | ✅ Full REST API + SDK | Limited | ❌ | ❌ | | Plugin system | ✅ Pipes (AI agents) | ❌ | ❌ | ❌ | | AI model choice | Any (local or cloud) | Proprietary | Microsoft AI | Proprietary | | Team deployment | ✅ Central config, AI permissions | ❌ | ❌ | ❌ | | Pricing | Free OSS core · app from $25/mo | Subscription | Bundled with Windows | Subscription | ## Pricing The source is available and free for personal use (see [LICENSE.md](LICENSE.md)). The signed desktop app uses a subscription: - **Standard**: $25/month. Local-first capture, search, and timeline, all on your device. - **Pro**: $50/seat/month. Everything in Standard plus cloud sync, cloud AI, and integrations. Teams buy 5+ seats self-serve. - **Enterprise**: $150/seat/month. Managed deployment, central config, shared pipes, per-pipe AI data permissions, admin dashboard, SSO/SAML, MDM ready (Intune / SCCM). Sales-led. See [screenpi.pe/team](https://screenpi.pe/team). Existing lifetime licenses remain valid; new lifetime purchases are no longer sold. ## Integrations - **AI coding assistants**: Cursor, Claude Code, Cline, Continue, OpenCode, Gemini CLI - **AI chat assistants**: ChatGPT (via MCP), Claude Desktop (via MCP), any MCP-compatible client - **Note-taking**: Obsidian, Notion - **Local AI**: Ollama, any OpenAI-compatible model server - **Automation**: Custom pipes (scheduled AI agents as markdown files) ## Teams & enterprise screenpipe Teams lets organizations deploy AI agents across their team with full control over what AI can access. See [screenpi.pe/team](https://screenpi.pe/team). - **Central config management**: Push capture settings (app filters, schedules, URL rules) to every device from an admin dashboard. - **Shared pipes**: Deploy AI workflows (auto-standups, meeting-to-tickets, time tracking) team-wide. - **Per-pipe AI data permissions**: YAML frontmatter controls what each pipe can access — apps, windows, content types, time ranges, endpoints. Enforced deterministically at the OS level via three layers (skill gating, agent interception, server middleware with per-pipe cryptographic tokens). - **Privacy boundary**: Admins control what gets captured and what AI accesses. They never see the actual data — everything stays on each employee's device. - **Override rules**: Employees can add stricter filters (e.g. also block personal email) but cannot weaken admin-set rules. - **MDM ready**: Deploy via Intune, SCCM, Robopack, or any MDM solution. - **Enterprise**: SSO/SAML, audit logs, SLA, SOC 2 / HIPAA compliance ready. ## Technical architecture 1. **Event-driven capture**: Listens for OS events (app switch, click, typing pause, scroll, clipboard). When something meaningful happens, captures a screenshot + accessibility tree together with the same timestamp. Falls back to OCR when accessibility data isn't available. Idle fallback captures periodically when nothing is happening. 2. **Audio processing**: Whisper (local) or Deepgram (cloud) for speech-to-text. Speaker identification and diarization. 3. **Storage**: Local SQLite with FTS5 full-text search. Screenshots saved as JPEGs on disk (~300 MB/8hr vs ~2 GB with continuous recording). 4. **API layer**: REST API on localhost:3030. Search, frames, audio, elements, health, pipe management. 5. **Plugin layer**: Pipes — scheduled AI agents as markdown files. Agent executes prompts with access to screenpipe API. 6. **UI layer**: Desktop app built with Tauri (Rust + TypeScript). ## API examples Search screen content: ``` GET http://localhost:3030/search?q=meeting+notes&content_type=all&limit=10 ``` Search audio transcriptions: ``` GET http://localhost:3030/search?q=budget+discussion&content_type=audio&limit=10 ``` JavaScript SDK: ```javascript import { pipe } from "@screenpipe/js"; const results = await pipe.queryScreenpipe({ q: "project deadline", contentType: "all", limit: 20, startTime: new Date(Date.now() - 24 * 60 * 60 * 1000).toISOString(), }); ``` ## Building from source Check CONTRIBUTING. Make sure to understand the main branch is moving fast and breaking things, if you're looking for a stable version check app releases https://github.com/screenpipe/screenpipe/releases and use the git commit accordingly (production app is behind paywall). ## Frequently asked questions **Is screenpipe free?** The source is available and free for personal, non-commercial use, so you can build and run screenpipe yourself for free (see [LICENSE.md](LICENSE.md)); commercial use of the source requires a license. The signed desktop app uses a subscription starting at $25/month; existing lifetime licenses remain valid. **Does screenpipe send my data to the cloud?** No. All data is stored locally by default. You can use fully local AI models via Ollama for complete privacy. **How much disk space does it use?** ~5–10 GB per month. Event-driven capture only stores frames when something changes, dramatically reducing storage compared to continuous recording. **Does it slow down my computer?** Typical CPU usage is 5–10% on modern hardware. Event-driven capture only processes frames when something changes, and accessibility tree extraction is much lighter than OCR. **Can I use it with ChatGPT/Claude/Cursor?** Yes. screenpipe runs as an MCP server, allowing Claude Desktop, Cursor, and other AI assistants to directly query your screen history. **Can it record multiple monitors?** Yes. screenpipe captures all connected monitors simultaneously. **How does text extraction work?** screenpipe primarily uses the OS accessibility tree to get structured text (buttons, labels, text fields) — this is faster and more accurate than OCR. When accessibility data isn't available (remote desktops, games, some Linux apps), it falls back to OCR: Apple Vision on macOS, Windows native OCR, or Tesseract on Linux. **Can I deploy screenpipe to my team?** Yes. Screenpipe Teams provides central config management, shared AI pipes, and per-pipe data permissions. Admins control what gets captured and what AI can access — employees' actual data never leaves their devices. See [screenpi.pe/team](https://screenpi.pe/team). **How do AI data permissions work?** Each pipe supports YAML frontmatter fields (allow-apps, deny-apps, deny-windows, allow-content-types, time-range, days, allow-raw-sql, allow-frames) that deterministically control what data the AI agent can access. Enforcement happens at three OS-level layers — not by prompting the AI to behave. Even a compromised agent cannot access denied data. ## Company Built by screenpipe (Mediar, Inc.). Founded 2024. Based in San Francisco, CA. - Founder: Louis Beaumont (@louis030195) - Twitter: @screenpipe - Email: louis@screenpi.pe