

PDF Vector is a document processing platform that provides an API for parsing and analyzing various file formats. It is designed for developers and companies that need to turn unstructured documents (such as PDFs, Word files, and images) into structured markdown or JSON for use in other applications.
The tool also handles specialized tasks such as invoice extraction and ID parsing, and includes an academic search feature that queries databases such as PubMed and Google Scholar.
Beyond parsing, the platform includes a Q&A capability that allows users to ask questions about the content of their documents. It also supports the Model Context Protocol (MCP) for integration with AI assistants such as Claude and ChatGPT.
Buyers should verify whether the credit-based pricing tiers match their expected document volume and whether the supported file formats cover their specific business needs.
Converts PDF, Word, Excel, and image files into structured markdown.
Turns documents into structured data using user-defined custom fields.
Supports asking questions about uploaded documents and receiving answers in markdown format.
Searches academic papers across databases including PubMed, ArXiv, and Google Scholar, and fetches papers via DOI.
Includes dedicated parsing for invoices, bank statements, and identity documents.
Connects document processing and academic search to AI assistants via Model Context Protocol.
Supports the creation of Retrieval-Augmented Generation pipelines by converting documents into clean text.
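The RAG use case above can be illustrated with a minimal sketch of the step that usually follows parsing: splitting the returned markdown into chunks small enough to embed and index. This does not call PDF Vector's API, and the function name `chunk_markdown` and the `max_chars` parameter are hypothetical names chosen for illustration. It assumes the parsed markdown is already available as a string.

```python
def chunk_markdown(markdown: str, max_chars: int = 1000) -> list[str]:
    """Group blank-line-separated paragraphs into chunks of up to max_chars.

    A single paragraph longer than max_chars is kept whole rather than cut
    mid-sentence, so a chunk can exceed the limit in that one case.
    (Hypothetical helper; not part of any PDF Vector SDK.)
    """
    chunks: list[str] = []
    current = ""
    for para in markdown.split("\n\n"):
        para = para.strip()
        if not para:
            continue  # skip empty paragraphs left by extra blank lines
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars or not current:
            # Paragraph fits, or it is the first paragraph of a new chunk.
            current = candidate
        else:
            # Paragraph would overflow: close the current chunk, start anew.
            chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks


sample = "# Title\n\nFirst paragraph.\n\nSecond paragraph."
for chunk in chunk_markdown(sample, max_chars=30):
    print(repr(chunk))
```

Splitting on paragraph boundaries keeps headings and sentences intact, which tends to matter more for retrieval quality than hitting an exact chunk size. A production pipeline might split on headings or count tokens instead of characters.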
Parsing invoices, expense reports, and bank statements into structured formats.
Extracting key terms and identifying risks within legal contracts.
Searching and analyzing research papers across multiple scientific databases.
Extracting candidate information and skills from resumes.
Pricing is credit-based with a free tier (100 credits). Paid plans include Basic at $23/month, Pro at $89/month, and Enterprise at $457/month.
PDF Vector supports PDFs, Word documents, Excel spreadsheets, and image files.
A free plan provides 100 credits for testing and small projects.
PDF Vector provides integrations for no-code automation platforms such as Zapier, Make, and n8n.
Academic search covers databases such as PubMed, Semantic Scholar, ArXiv, and Google Scholar, and specific papers can be fetched by DOI.
Source category: Software Development
Source subcategory: Document Automation
PDF Vector is a document parsing API for software and enterprise teams that converts PDFs, images, and office documents into structured markdown. It supports workflows such as RAG pipelines, invoice extraction, and academic research. Buyers should consider the credit limits of each subscription tier based on their processing volume.