Ongoing: Making a 3B model beat a 20B model in Python.
Research

Applied Research

May 2026

Embedding Models

applied

Geometry inspection, retrieval evaluation, fine-tuning monitoring, and embedding diff for any sentence-transformers-compatible encoder. BGE, E5, GTE, Nomic, Jina, Instructor, MiniLM, and SBERT all get anisotropy scoring, UMAP exploration, layer-wise similarity, OOD detection, and hard-negative gap analysis.

May 2026

Transformers & LLMs

applied

How Aquin supports dense transformer LLMs, Mixture-of-Experts models, and hybrid architectures, from Llama and Mistral to Mixtral, DeepSeek, and Grok. Covers architecture-aware inspection, attribution, training monitoring, and evaluation across the full transformer family.

Apr 2026

Security

applied

Adversarial risk detection across the full ML pipeline: prompt injection and poisoned samples in training data, red teaming and jailbreak taxonomy in model inspection, model robustness scoring, weight trojan detection, and attack surface comparison across model versions in the training monitor.

Apr 2026

Training

applied

Live signal detection across five failure modes, gradient and loss monitoring per step, SAE feature diffs and behavioral model diffs post-training, and an agentic chat that reads from live training state at send time.

Apr 2026

Attribution

applied

Causal mediation analysis, SAE feature extraction, circuit attribution graph, logit lens, feature steering, UMAP exploration, fact verification, bias detection, and censor auditing, all in one pipeline on Llama 3.2 1B Instruct.

Apr 2026

Evals

applied

Consistency, suppression detection, and knowledge boundary probing. Behavioral evals that surface failure modes without requiring a trained SAE and work on any TransformerLens-compatible model.

Apr 2026

Benchmarks

applied

InterpScore, FeaturePurityScore, and MUI for SAE feature evaluation, plus a conversational Benchmark Builder that works across all supported architectures: dense LLMs, MoE, hybrid, and embedding models. Describe what to measure and get a scored inline card, exportable as CSV, JSON, image, or PDF.

Work with us

Interpretability tooling, custom SAE databases, mechanistic audits, circuit reports, and hands-on research, experiments, and studies for teams of all sizes. Reach us at aquin@aquin.app.

Book a call

Not sure if Aquin is right for you?

© 2026 Aquin. All rights reserved.
