IMBox
IMBox AI

Sovereign AI for defense, security and public administration.

IMBox AI brings models, orchestration and applications inside the customer's perimeter. No cloud dependency, no data leaving the organization and a native experience inside the secure IMBox platform.

400k+

active deployments

200k+

public employees using AI

300+

public bodies

8 wk

from zero to production

Sovereignty by design

Your AI runs where your data lives.

Inference hardware, models, orchestration and applications are deployed inside your data center. Sensitive information never leaves the perimeter.

100% on premise

BrainBox is installed in the customer's data center and runs without third-party endpoints.

Zero exfiltration

No telemetry, no anonymized samples and no external training on customer information.

Air-gapped environment

Editors, viewers, players and sandboxing live inside IMBox to remove external dependencies.

Institutional scale

Built for public administration, security and defense, with traceability and operational control from day one.

BrainBox

BrainBox brings AI into your own data center.

BrainBox is the inference node for IMBox AI: an autonomous compute unit for running quantized open source models inside your infrastructure, with full control over data, performance, and scaling.

BrainBox cluster

Linear capacity, standard rack deployment

4x

NVIDIA RTX 5090

128 GB

VRAM agregada

1.500

TFLOPS FP16

32C

AMD Genoa CPU

192 GB

RAM de sistema

N+1

escalado modular

One endpoint. Every model. OpenAI-compatible.

GPUBouncer turns the BrainBox fleet into a managed AI service, with category routing, auditing, quotas and compatibility with existing clients.

GPUBouncer Admin

Active models

12/12

T2TL, T2TF, I2T, A2T

BrainBox fleet

10/10

All nodes enrolled

Throughput

2.4k

tokens per second

KV cache reuse

68%

session-aware routing

ModelTypeBrainBoxLatencyStatus
llama-3.3-70b-q4T2TLBB-01, BB-02241 mshealthy
mistral-large-q8T2TFBB-0389 mshealthy
internvl-26bI2TBB-05312 mshealthy
whisper-large-v3A2TBB-071.2x rthealthy
florence-2-ocrOCRBB-08147 msrebalancing

Compatible API

Change the URL and migrate existing applications in minutes.

Self-healing

Isolates faulty nodes and redeploys models automatically.

Smart KV cache

Reuses context by session to reduce latency and energy use.

Category routing

T2TL, T2TF, I2T, A2T, EMB and OCR by purpose.

Plug-and-play scaling

Add a BrainBox to the rack and it joins the fleet.

Model Foundry

Open-source models, quantized and tuned in house.

The IMBox engineering team maintains a curated catalogue of model families, evaluated against institutional workloads and deployed through a controlled operational cycle.

CodePurposeGPUsVRAM
T2TLText to text, large reasoning4128 GB
T2TFText to text, fast132 GB
I2TLImage to text, large264 GB
I2TFImage to text, fast132 GB
A2TAudio to text132 GB
EMBEmbeddings132 GB
OCROptical character recognition132 GB

Multiple AI capabilities. One platform your teams already use.

Users interact with AI in natural language inside IMBox: no tool switching, no data leaving the perimeter, no unnecessary complexity.

On-premise agentic workflows

Agents with 30+ specialized tools to chain actions, cross data sources and complete complex tasks.

tools and traceability

Governed assistants

Create generalist or specialist assistants, feed them internal documents and publish them by user, group or organization.

RAG and evaluation

Native recorder

Record meetings, interviews, statements or debriefs and send them into the transcription and analysis flow.

custody and context

Transcription with diarization

Turn audio and video into text, identify speakers and generate summaries, tasks and review-ready minutes.

local speech to text

Translation and rewriting

Translate documents, summarize at different lengths and rewrite with configurable tone and style.

languages and style

Document generation

Create and edit Word, Excel and PowerPoint in conversational mode using IMBox Drive context.

secure office work

Document OCR

Read scanned files, images and annexes to turn them into searchable and reusable content.

scanned documents

Knowledge graph

Connect fragmented data and internal relationships so teams can query operational knowledge naturally.

contextual search

Database queries

Allow AI to query corporate data under permissions, auditing and limits defined by the organization.

controlled data

Sovereign API

Expose AI capabilities through a compatible interface for integrations and custom clients.

fast integration

Assistant memory

Assistants remember preferences, context and relevant patterns within security boundaries.

operational continuity

Administration panel

Central control over models, quotas, users, audit, BrainBox fleet and service health.

unified operations

Deployment

From zero to production AI in eight weeks.

A predictable process to install BrainBox, activate GPUBouncer, validate models and publish capabilities to teams.

1

Week 1

Architecture, security and integration requirements.

2

Week 2

Rack, networking and initial BrainBox commissioning.

3

Week 4

Models, assistants, auditing and real-workload tests.

4

Week 8

Production launch, training and operational follow-up.

IMBox AI

Bring sovereign AI inside your perimeter.

A private intelligence layer for organizations that cannot afford to lose control over their data.

Talk to IMBox