Models & Authentication
Nexus-Nancy supports multiple ways to connect to LLMs, from high-performance local servers to cost-effective ChatGPT Plus subscriptions.
1. Local LLM (Gemma 4 / Llama 3)
Ideal for private, high-speed execution.
Start the Llama Server
llama-server -m /path/to/model.gguf --port 8089 --jinja --reasoning-format deepseek --think
Configure Nancy
# .agents/nnancy.yaml
model: my-local-model
base_url: http://localhost:8089/v1
auth_type: api_key
api_key_file: none
2. ChatGPT Plus Subscription ($20/mo)
Nexus-Nancy can bridge directly to your ChatGPT Plus subscription using OpenAI Codex OAuth. This avoids per-token API billing and allows you to use your flat-rate subscription.
Login
Run the following command on your local machine:
nnancy auth login
This will open your browser to OpenAI’s authorization page. Once you log in, Nancy will capture the session tokens and store them securely in .agents/secrets/codex.json.
Configure Nancy
# .agents/nnancy.yaml
auth_type: codex
model: gpt-5.4 # Or latest available via Codex
Nancy will automatically handle token usage and refreshing. Note that this method is subject to your subscription’s message caps (e.g., 80 messages / 3 hours).
3. Standard OpenAI API
Standard usage-based billing.
# .agents/nnancy.yaml
auth_type: api_key
api_key_env: OPENAI_API_KEY
base_url: https://api.openai.com/v1
Setting Up Your API Key
To securely add your API key:
nnancy secrets
This opens .agents/secrets/openai.key in your editor. Write your key to this file and save. The key is read on startup and validated before any API calls.
Alternatively, set the OPENAI_API_KEY environment variable.
Model Switching Strategy
Nancy follows a Minimalist Configuration philosophy. We do not provide commands to switch models (e.g., nnancy use gpt-4). Instead, we encourage users to edit the .agents/nnancy.yaml file directly.
This ensures:
Source of Truth: The config file always reflects the current state.
Explicitness: You always know which model is active and what it costs.
Admin-Free: No complex hidden state to manage across different cluster nodes.