The context window of a voice OS is the prompt assembled before every conversation turn. It includes the system prompt, the persona, retrieved memories, current calendar and inbox state, the recent conversation, and the user's latest utterance. The art of voice OS engineering is keeping this prompt small and relevant. A bloated context window slows down the LLM, dilutes attention, and increases cost. A starved context window makes the AI forget what you just said. The right size is usually 3,000 to 6,000 tokens.
WHAT TO LOOK FOR
System prompt
The base instructions that define what Lucy is, how it should behave, and what guardrails apply. The system prompt is roughly 800 tokens and is identical across all turns and users. It establishes the persona, the response style, and the rules of engagement.
Calendar injection
The next 8 to 12 calendar events from Google Calendar, formatted as a compact list. This is what lets Lucy answer 'what is on my plate today' without making any tool call. Calendar injection costs about 400 tokens per turn.
Inbox injection
The most recent 10 inbox subjects from Gmail, with sender and a one-line snippet. This is what lets Lucy answer 'did Sarah email me back yet' instantly. Inbox injection costs about 600 tokens per turn.
TLDR:Lucy OS1 builds the context window dynamically per turn. A scheduler runs a few milliseconds before each LLM call: it pulls your next 8 calendar events, the top 10 unread inbox subjects, your most recent 10 conversation turns, and 8 to 15 retrieved memories ranked by semantic relevance to what you just said. The total context typically lands at 4,000 to 5,000 tokens, which keeps time-to-first-token under 200 milliseconds while still giving Lucy real situational awareness for the answer.
The base instructions that define what Lucy is, how it should behave, and what guardrails apply. The system prompt is roughly 800 tokens and is identical across all turns and users. It establishes the persona, the response style, and the rules of engagement.
The next 8 to 12 calendar events from Google Calendar, formatted as a compact list. This is what lets Lucy answer 'what is on my plate today' without making any tool call. Calendar injection costs about 400 tokens per turn.
The most recent 10 inbox subjects from Gmail, with sender and a one-line snippet. This is what lets Lucy answer 'did Sarah email me back yet' instantly. Inbox injection costs about 600 tokens per turn.
8 to 15 retrieved memories selected for semantic relevance to the current conversation. Memories are injected as a structured block: subject, fact, when stored. This is the heaviest dynamic component and typically costs 1,000 to 1,800 tokens.
The last 10 to 20 turns of the current conversation, included so the LLM can reference what was just discussed. Older turns get summarized into a single line to compress them while preserving the thread.
Function signatures for tools the LLM can call: send_email, create_calendar_event, web_search, set_reminder. These cost about 600 tokens per turn but are required for the LLM to know what actions are available.
QUICK COMPARISON
| Capability | Lucy OS1 | Most AI tools |
|---|---|---|
| Memory across sessions | ✓ Permanent, never resets | ✗ Resets after every session |
| Voice quality | ✓ Lucy OS1 Natural Voice (best-in-class) | ✗ Basic STT, struggles with noise |
| Calendar awareness | ✓ Reads Google Calendar in real time | ✗ No calendar access |
| Available 24/7 | Always on, any device | Available but stateless each time |
| Gets personal over time | ✓ Builds your context continuously | ✗ Starts from zero every session |
Voice-first AI with memory and calendar integration. Free to try.
Start TalkingFree tier available. No credit card required.
GET STARTED
Create your free account
No credit card required. Sign in with your Google account and you're inside in under a minute.
Connect your Google Calendar
Lucy reads your upcoming events before every conversation, so it already knows your day before you say a word.
Start talking about voice os context window
Speak naturally. Lucy listens, responds by voice, and begins building context from your very first exchange. The more you use it, the better it gets.
Welcome