by intsig-textin
Extracts text and performs OCR on images, PDFs, and Microsoft Office documents, converts documents to Markdown, and retrieves structured key information in JSON format.
Provides three core tools for document handling: recognition_text for raw text extraction, doc_to_markdown for converting documents to Markdown, and general_information_extration for extracting specified key‑value pairs or table data from PDFs, Word/Excel files, and common image formats.
path (local file path or public URL) and optional parameters (key, table_header).MCP_SERVER_REQUEST_TIMEOUT.Q: What file formats are supported? A: JPEG, JPG, PNG, BMP, PDF, Word (DOCX), and Excel (XLSX).
Q: Can I process files behind authentication? A: No. The server only accepts publicly accessible URLs or local file paths.
Q: How do I set the request timeout?
A: Define MCP_SERVER_REQUEST_TIMEOUT (in milliseconds) in the environment variables; the default in the example is 600000 (10 minutes).
Q: Do I need to install any additional dependencies?
A: The server runs via npx and pulls the required package @intsig/server-textin automatically.
Q: Where can I find the license? A: The project is released under the MIT License; see the LICENSE file in the repository.
English | 中文
TextIn MCP Server is a tool for extracting text and performing OCR on documents, including document text recognition, ID recognition, and invoice recognition. It also supports converting documents into Markdown format.
recognition_text
path (string, required): file path or a URL (HTTP/HTTPS) pointing to a documentdoc_to_markdown
path (string, required): file path or a URL (HTTP/HTTPS) pointing to a documentgeneral_information_extration
path (string, required): file path or a URL (HTTP/HTTPS) pointing to a documentkey (string[], optional): The non-tabular text information that the user wants to identify, input format is an array of strings.table_header (string[], optional): The table information that the user wants to identify, input format is an array of strings.When the input is a URL, it does not support handling access to protected resources.
Click here to register for a TextIn account.
Get Textin APP_ID and APP_SECRET by following the instructions here.
{
"mcpServers": {
"textin-ocr": {
"command": "npx",
"args": [
"-y",
"@intsig/server-textin"
],
"env": {
"APP_ID": "<YOUR_APP_ID>",
"APP_SECRET": "<YOUR_APP_SECRET>",
"MCP_SERVER_REQUEST_TIMEOUT": "600000"
},
"timeout": 600
}
}
}
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.
Please log in to share your review and rating for this MCP.
{
"mcpServers": {
"textin-ocr": {
"command": "npx",
"args": [
"-y",
"@intsig/server-textin"
],
"env": {
"APP_ID": "<YOUR_APP_ID>",
"APP_SECRET": "<YOUR_APP_SECRET>",
"MCP_SERVER_REQUEST_TIMEOUT": "600000"
}
}
}
}claude mcp add textin-ocr npx -y @intsig/server-textinExplore related MCPs that share similar capabilities and solve comparable challenges
by zed-industries
A high‑performance, multiplayer code editor designed for speed and collaboration.
by modelcontextprotocol
Model Context Protocol Servers
by modelcontextprotocol
A Model Context Protocol server for Git repository interaction and automation.
by modelcontextprotocol
A Model Context Protocol server that provides time and timezone conversion capabilities.
by cline
An autonomous coding assistant that can create and edit files, execute terminal commands, and interact with a browser directly from your IDE, operating step‑by‑step with explicit user permission.
by continuedev
Enables faster shipping of code by integrating continuous AI agents across IDEs, terminals, and CI pipelines, offering chat, edit, autocomplete, and customizable agent workflows.
by upstash
Provides up-to-date, version‑specific library documentation and code examples directly inside LLM prompts, eliminating outdated information and hallucinated APIs.
by github
Connects AI tools directly to GitHub, enabling natural‑language interactions for repository browsing, issue and pull‑request management, CI/CD monitoring, code‑security analysis, and team collaboration.
by daytonaio
Provides a secure, elastic infrastructure that creates isolated sandboxes for running AI‑generated code with sub‑90 ms startup, unlimited persistence, and OCI/Docker compatibility.