KAPEX Beta
getkapex.ai GitHub

Configuration Presets

KAPEX allows you to tune memory behavior on a per-tenant basis using named presets. Each preset bundles parameters for decay, retrieval, token budgets, and signal weighting -- optimized for a specific domain.

GET /profile

Retrieve the active configuration for your tenant.

curl https://api.getkapex.ai/api/v1/profile \
  -H "X-API-Key: your_api_key_here"

PUT /profile

Apply a preset or set custom overrides.

Apply a Preset

curl -X PUT https://api.getkapex.ai/api/v1/profile \
  -H "X-API-Key: your_api_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "preset": "reflective"
  }'

Custom Overrides

curl -X PUT https://api.getkapex.ai/api/v1/profile \
  -H "X-API-Key: your_api_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "updates": {
      "decay_profile": {
        "base_rate": 0.01,
        "processing_boost": 0.5
      },
      "injection_template": {
        "max_tokens": 8000
      }
    }
  }'

List Available Presets

curl https://api.getkapex.ai/api/v1/profile/presets \
  -H "X-API-Key: your_api_key_here"

Available Presets

KAPEX provides 16 domain-specific presets. Each tunes decay rates, token budgets, signal weights, and retrieval behavior for its target use case.

Core Presets

`reflective`

Optimized for journaling, therapy companions, and personal reflection apps. Slow decay preserves important disclosures. Larger token budget gives the LLM more context to work with.

Best for: Mental health platforms, therapy companions, journaling apps, personal reflection tools.

`customer_service`

High recency bias with faster decay. Recent interactions stay salient while older conversations fade quickly. Focused token budget keeps responses concise and case-relevant.

Best for: Help desks, customer support bots, ticket-based interactions.

`education`

Tuned for learning platforms. Active study sessions (processing events) create clear differentiation between practiced and unpracticed material.

Best for: Tutoring systems, learning platforms, spaced-repetition applications.

`healthcare`

Conservative configuration with higher residual floors. Medical information persists longer and decays more slowly. Safety checks run at maximum intensity.

Best for: Clinical documentation, patient context, care coordination, medical assistants.

`sales`

Assertive framing with emphasis on factual recall. Optimized for remembering deal context, customer preferences, and relationship history.

Best for: CRM assistants, sales engagement, account management tools.

`legal`

Precision-oriented with minimal emotional weighting. Facts and dates persist; subjective impressions decay faster. Low injection threshold ensures comprehensive context retrieval.

Best for: Legal research assistants, case management, compliance tools.

All Presets

Preset Description
reflective Journaling, therapy, personal reflection
gaming Game companions, persistent NPC memory
photo_curation Photo organization, memory tagging
nightlife Social event tracking, venue recommendations
customer_service Help desks, support bots
education Tutoring, learning platforms
fitness Workout tracking, health coaching
ecommerce Product recommendations, shopping assistants
journaling Daily journaling, life logging
sales CRM, sales engagement
legal Legal research, case management
healthcare Clinical context, care coordination
creative_writing Story assistants, character memory
real_estate Property context, client preferences
recruiting Candidate tracking, interview context
travel Trip planning, preference tracking

Custom Configuration

You can override individual parameters without applying a preset. The updates object supports nested configuration sections:

Decay Profile

Controls how quickly memories lose salience over time.

{
  "updates": {
    "decay_profile": {
      "base_rate": 0.02,
      "processing_boost": 0.3,
      "residual_floor": 0.05
    }
  }
}
Parameter Type Default Description
base_rate float 0.02 Rate at which memories lose salience over time
processing_boost float 0.3 How much each processing event accelerates decay
residual_floor float 0.05 Minimum salience fraction (prevents memories from fully disappearing)

Injection Template

Controls how memories are assembled into the LLM context block.

{
  "updates": {
    "injection_template": {
      "max_tokens": 6000,
      "injection_threshold": 0.25
    }
  }
}
Parameter Type Default Description
max_tokens integer 6000 Maximum token budget for the context block
injection_threshold float 0.25 Minimum salience score for injection into LLM context

Governance Floors

KAPEX enforces minimum values on safety-critical parameters. You cannot configure the system to forget things too aggressively or suppress memories below safe thresholds.

If a requested value falls below the governance floor, the system clamps it to the floor value.

Parameter Minimum Rationale
base_rate 0.005 Prevents memories from persisting indefinitely
processing_boost 0.1 Ensures processing events affect decay
residual_floor 0.02 Prevents memories from decaying to zero
injection_threshold 0.10 Prevents low-salience noise from entering LLM context

Python Example

import requests

API_KEY = "your_api_key_here"
BASE = "https://api.getkapex.ai/api/v1"
HEADERS = {"X-API-Key": API_KEY, "Content-Type": "application/json"}

# Apply a domain preset
resp = requests.put(
    f"{BASE}/profile",
    headers=HEADERS,
    json={"preset": "healthcare"}
)
print(f"Profile updated: {resp.json()}")

# Or use custom overrides
resp = requests.put(
    f"{BASE}/profile",
    headers=HEADERS,
    json={
        "updates": {
            "decay_profile": {"base_rate": 0.008, "processing_boost": 0.6},
            "injection_template": {"max_tokens": 8000, "injection_threshold": 0.15}
        }
    }
)
print(f"Custom config applied: {resp.json()}")