by ReexpressAI
Provides state‑of‑the‑art statistical verification for large language model pipelines, delivering confidence scores and uncertainty estimates for generated answers while allowing dynamic updates through verification tools.
The server adds a statistically robust verification layer to LLM interactions, especially for Claude Opus 4 and other supported models. It uses a pre‑trained Similarity‑Distance‑Magnitude (SDM) estimator that ensembles multiple LLM explanations and compares them against a calibration set from the OpenVerification1 dataset, producing a confidence bin (≥ 90 %, < 90 % caution, out‑of‑distribution).
Quick reference:

- Installation: see INSTALL.md (requires Linux/macOS and the ability to run ibm-granite/granite-3.3-8b-instruct locally via HuggingFace Transformers).
- Configuration: edit code/reexpress/mcp_settings.json to set API keys for Anthropic, Azure/OpenAI, or Google, and to define the file-access tools (ReexpressDirectorySet, ReexpressFileSet).
- Verification output: a verified response is marked with <successfully_verified> True </successfully_verified> and <confidence> >= 90% </confidence>.
- Adaptation: call ReexpressAddTrue, ReexpressAddFalse, or reexpress_add_ood to incorporate new true/false or out-of-distribution examples (output format details in OUTPUT_HTML.md).
- Local model: ibm-granite/granite-3.3-8b-instruct via HuggingFace Transformers.
- Retraining: adjust mcp_settings.json and retrain if a different trade-off is needed; see documentation/model_details/... and supply your own calibration set.
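As a rough orientation, the configuration file pairs API credentials with the file-access allowlist. The sketch below is illustrative only: every key name is a guess, not the actual schema, so consult CONFIG.md for the real field names.

```json
{
  "anthropic_api_key": "YOUR_KEY_HERE",
  "azure_openai_api_key": "YOUR_KEY_HERE",
  "google_api_key": "YOUR_KEY_HERE",
  "allowed_directories": ["/home/me/project"],
  "allowed_files": ["/home/me/project/notes.md"]
}
```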

Reexpress MCP Server is a drop-in solution to add state-of-the-art statistical verification to your complex LLM pipelines, as well as your everyday use of LLMs for search and QA for software development and data science settings. It's the first reliable, statistically robust AI second opinion for your AI workflows.
Simply install the MCP server and then add the Reexpress prompt to the end of your chat text. The tool-calling LLM (e.g., Anthropic's Claude Opus 4.1) will then check its response with the provided pre-trained Reexpress Similarity-Distance-Magnitude (SDM) estimator, which ensembles gpt-5-2025-08-07, gemini-2.5-pro, and granite-3.3-8b-instruct (run locally), along with the output from the tool-calling LLM, and calculates a robust estimate of the predictive uncertainty against a database of training and calibration examples from the OpenVerification1 dataset. Unique to the Reexpress method, you can easily adapt the model to your tasks: simply call the ReexpressAddTrue or ReexpressAddFalse tools after a verification has completed, and future calls to the Reexpress tool will dynamically take your updates into consideration when calculating the verification probability. We also include the training scripts for the model, so that you can run a full retraining when more substantive changes are needed or you want to use alternative underlying LLMs.
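To build intuition for the Similarity-Distance-Magnitude idea, here is a deliberately simplified toy sketch, not the Reexpress implementation: it compares a query's ensemble feature vector against labeled calibration examples and maps the result to one of the three confidence bins. The function name, the toy vectors, the neighborhood size, and the thresholds are all illustrative assumptions.

```python
import math

def sdm_confidence_bin(query_vec, calibration, min_similarity=0.5, threshold=0.90):
    """Toy illustration (not the Reexpress SDM estimator): bin a query by
    comparing it against labeled calibration examples."""
    def cos(a, b):
        # Cosine similarity between two vectors.
        dot = sum(x * y for x, y in zip(a, b))
        na = math.sqrt(sum(x * x for x in a))
        nb = math.sqrt(sum(x * x for x in b))
        return dot / (na * nb)

    sims = [(cos(query_vec, ex["vec"]), ex["correct"]) for ex in calibration]
    sims.sort(reverse=True)

    # Out-of-distribution: nothing in the calibration set is close enough.
    if sims[0][0] < min_similarity:
        return "out-of-distribution"

    # Estimate accuracy among the k nearest calibration examples.
    k = min(5, len(sims))
    est = sum(correct for _, correct in sims[:k]) / k
    return ">=90%" if est >= threshold else "<90% (caution)"

calibration = [
    {"vec": [0.9, 0.1], "correct": 1},
    {"vec": [0.8, 0.2], "correct": 1},
    {"vec": [0.1, 0.9], "correct": 0},
]
print(sdm_confidence_bin([0.85, 0.15], calibration))
```

The real estimator additionally accounts for distance and magnitude effects and is calibrated on the OpenVerification1 dataset; see the cited papers for the actual formulation.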
[!NOTE] In addition to providing you (the user) with a principled estimate of confidence in the output given your instructions, the tool-calling LLM itself can use the verification output to progressively refine its answer, determine whether it needs additional outside resources or tools, or recognize that it has reached an impasse and needs to ask you for further clarification or information. That's what we call reasoning with SDM verification: an entirely new capability in the AI toolkit that we think will open up a much broader range of use cases for LLMs and LLM agents, for both individuals and enterprises.
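The draft-verify-refine pattern can be sketched as a small control loop. This is a conceptual sketch only: `call_llm` and `reexpress_verify` are hypothetical stand-ins for the tool-calling LLM and the verification tool, and the verdict dictionary shape is an assumption, not the server's actual output format.

```python
def answer_with_verification(question, call_llm, reexpress_verify, max_rounds=3):
    """Toy refine loop: draft an answer, verify it, and either accept it,
    refine it with the verifier's feedback, or give up and ask the user."""
    answer = call_llm(question)
    for _ in range(max_rounds):
        verdict = reexpress_verify(question, answer)
        if verdict["successfully_verified"] and verdict["confidence"] >= 0.90:
            return answer  # High-confidence bin: accept the answer.
        # Low confidence or unverified: feed the verdict back for revision.
        answer = call_llm(
            f"{question}\n\nPrevious answer: {answer}\n"
            f"Verification result: {verdict}. Please revise."
        )
    return "I could not verify an answer; please provide more context."
```

In practice the MCP client drives this loop through tool calls rather than explicit Python, but the decision structure is the same.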
Data is only sent via standard LLM API calls to Azure/OpenAI and Google; all of the processing for the SDM estimator is done locally on your computer. (Optionally, we recommend providing access to web search via your MCP client, such as Claude Desktop or a web-search MCP server, or, for closed-domain settings, access to domain-specific retrieval.) Reexpress MCP has a simple, conservative, but effective file-access system: you control which additional files (if any) get sent to the LLM APIs by explicitly specifying them via the file-access tools ReexpressDirectorySet() and ReexpressFileSet().
Version 2.0.0 introduces our updated formulation of the SDM estimator. The SDM activation function remains the same, but the calibration method for the SDM estimator is simplified while retaining the desirable properties of the earlier version that had an additional rescaling transform. You can read about this version in our publications below. Moving forward, the convention is to refer to this version as the canonical "SDM estimator".
Separately, we have also refactored and rewritten the code to dramatically improve efficiency, enabling scaling to much larger datasets and training SDM language models, the code for which is also included here (see our paper below and the separate research repo for details).
The MCP server runs on Linux and macOS. The primary requirement is that the machine running the MCP server needs to be able to locally run ibm-granite/granite-3.3-8b-instruct (via the HuggingFace transformers library). This takes as input two short model explanations and one short summary and only needs to generate 1 token, so the compute requirements are relatively modest in practice.
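A single-token call of this shape might look roughly like the following sketch, assuming the `transformers` library is installed. The function name and the prompt template are illustrative assumptions, not the server's actual code; the imports are deliberately placed inside the function so the file can be loaded without `transformers` present.

```python
def classify_one_token(explanations, summary,
                       model_name="ibm-granite/granite-3.3-8b-instruct"):
    """Sketch (not the server's implementation): feed two short explanations
    and a summary to the local granite model and generate exactly one token."""
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name,
                                                torch_dtype=torch.bfloat16)

    # Hypothetical prompt template for illustration only.
    prompt = (f"Explanation 1: {explanations[0]}\n"
              f"Explanation 2: {explanations[1]}\n"
              f"Summary: {summary}\n"
              f"Verified (yes/no):")
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=1, do_sample=False)
    # Decode only the single newly generated token.
    return tokenizer.decode(out[0, inputs["input_ids"].shape[1]:])
```

Because only one new token is generated per call, latency is dominated by a single forward pass over a short prompt, which is what keeps the local compute requirements modest.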
See INSTALL.md.
[!TIP] The Reexpress MCP server is straightforward to set up relative to other MCP servers, but we assume some familiarity with LLMs, MCP, and command-line tools. Our target audience is developers and data scientists. Only add other MCP servers from sources that you trust, and keep in mind that other MCP tools could alter the behavior of our MCP server in unexpected ways.
See CONFIG.md.
See documentation/HOW_TO_USE.md.
See documentation/OUTPUT_HTML.md.
See documentation/GUIDELINES.md.
See documentation/FAQ.md.
If you find this software useful, consider citing the following papers:
@misc{Schmaltz-2025-SimilarityDistanceMagnitudeLanguageModels,
title={Similarity-Distance-Magnitude Language Models},
author={Allen Schmaltz},
year={2025},
eprint={2510.26183},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2510.26183},
}
@misc{Schmaltz-2025-SimilarityDistanceMagnitudeActivations,
title={Similarity-Distance-Magnitude Activations},
author={Allen Schmaltz},
year={2025},
eprint={2509.12760},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2509.12760},
}
The output format has changed since v1.0.0, which is the version used in the video. See "What's new in version 2.0.0".