OPEN SOURCE — MIT

VAREK
————

AI PIPELINE PROGRAMMING LANGUAGE

One language for every stage of the AI pipeline. Faster than Python. Safer than C. More expressive than anything in between.

READ THE v1.5 RELEASE NOTES → SEE THE SYNTAX
10–40×
FASTER THAN PYTHON
ZERO
MEMORY UNSAFETY
ONE
UNIFIED SYNTAX
MIT
OPEN SOURCE
// SYNTAX

Replace four tools
with one language.

Today's ML pipelines stitch together Python, YAML, JSON, and shell scripts. VAREK replaces all of it.

● VAREK
-- Schema, pipeline, and logic in one file
schema ImageInput { 
  path:   str, 
  label:  str?, 
  width:  int, 
  height: int 
} 

pipeline classify_images { 
  source: ImageInput[] 
  steps:  [preprocess -> embed -> infer] 
  output: ClassResult[] 
} 

fn preprocess(img: ImageInput) -> Tensor { 
  load_image(img.path) 
    |> resize(224, 224) 
    |> normalize(mean=[0.485, 0.456]) 
} 

async fn infer(t: Tensor) -> ClassResult { 
  let model = load_model("resnet50.synmodel") 
  model.forward(t) 
}
● TODAY'S EQUIVALENT
# schema.json + config.yaml + pipeline.py + run.sh

# --- schema.json ---
{ 
  "type": "object", 
  "properties": { 
    "path": {"type": "string"}, 
    "label": {"type": "string"} 
  }
} 

# --- pipeline.py ---
from torchvision import transforms 
from PIL import Image 
import torch, yaml, json, subprocess 

def preprocess(img_path): 
  t = transforms.Compose([ 
    transforms.Resize((224, 224)), 
    transforms.ToTensor(), 
    transforms.Normalize([0.485, 0.456]) 
  ]) 
  return t(Image.open(img_path)) 

async def infer(tensor): 
  model = torch.load("resnet50.pt") 
  model.eval() 
  with torch.no_grad(): 
    return model(tensor.unsqueeze(0))
// FEATURES

Built for the AI era.
Not retrofitted for it.

Native Speed
Compiles to native machine code via LLVM. Interpreted mode for prototyping, compiled mode for production. 10–40× faster than equivalent Python.
LLVM BACKEND
🛡
Memory Safety
Static typing with full inference. Null safety enforced at compile time. No buffer overflows. No silent failures. Errors are values, not exceptions.
COMPILE-TIME SAFETY
🧠
AI-Native Primitives
Built-in tensor and matrix types. Native async/await for model inference. Pipeline operators for composing model chains without boilerplate.
TENSOR<T, DIMS>
🔗
Unified Syntax
The same file defines schemas, logic, pipelines, and config. No format translation. No context switching between YAML, JSON, Python, and shell.
ONE FORMAT
🔄
Python Interop
Import any Python, C, or Rust library natively. Migrate incrementally. You don't have to abandon your existing ecosystem to adopt VAREK.
FFI SUPPORT
🌐
LLM-Legible Grammar
Deterministic, unambiguous grammar designed for human and AI collaboration. The first language built knowing that LLMs will read and write it.
AI-FIRST DESIGN
// ROADMAP

From spec to stable.

v0.1 - v1.1.1
Core Language & Warden Layer
Compiler backend, LLVM compilation, type inference, standard library, and Warden Orchestration isolation layer.
COMPLETE ✓
v1.2
Policy & Admissibility Control (RFC)
Evaluator engine, fail-closed semantics, simulated syscall interception, and unified YAML policy format.
COMPLETE ✓
v1.2.1
Seccomp-Unotify Integration
Binding the v1.2 policy evaluator directly to the Linux kernel syscall interception layer for hard enforcement.
COMPLETE ✓
v1.3
Contributor Infrastructure & Architecture Scaffolding
RFC process, MIT license unification, contributor agreements, and scaffolding for the production Warden.
COMPLETE ✓
v1.4
Production Warden in C
Production-grade supervisor process with full kernel-level enforcement pipeline: cross-process memory extraction, in-kernel verdict injection, and end-to-end latency measurement.
COMPLETE ✓
v1.5
Sub-Microsecond Policy Decisions
Hybrid fast-path matcher for the common case, with a reserved decision-procedure slow path for richer policy classes.
COMPLETE ✓
v1.6
Semantic Derivation & Whole-Graph Verification
Mapping raw syscall streams to higher-level meaningful actions, with whole-graph admissibility analysis before execution.
IN PROGRESS 🚧