Skip to main content
POST
/
v1
/
documents
curl https://gateway.bud.studio/v1/documents \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "document-processor",
    "document": "base64_encoded_document_content",
    "task": "extract"
  }'
{
  "id": "doc-abc123",
  "object": "document.processing",
  "created": 1699000000,
  "model": "document-processor",
  "result": {
    "extracted_text": "Document content here...",
    "metadata": {
      "pages": 5,
      "format": "pdf"
    }
  }
}
curl https://gateway.bud.studio/v1/documents \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "document-processor",
    "document": "base64_encoded_document_content",
    "task": "extract"
  }'
{
  "id": "doc-abc123",
  "object": "document.processing",
  "created": 1699000000,
  "model": "document-processor",
  "result": {
    "extracted_text": "Document content here...",
    "metadata": {
      "pages": 5,
      "format": "pdf"
    }
  }
}

Headers

ParameterTypeRequiredDescription
AuthorizationstringYesBearer authentication header

Body

ParameterTypeRequiredDescription
modelstringYesDocument processing model identifier
documentstringYesDocument content (base64 encoded or URL)
taskstringNoProcessing task: extract, analyze, summarize. Default: extract
optionsobjectNoTask-specific processing options

Supported Providers

OpenAI

GPT-4, GPT-3.5 for document understanding

Anthropic

Claude 3.5 Sonnet, Opus, Haiku for document analysis

Azure OpenAI

Enterprise GPT-4 and GPT-3.5 deployments

Google

Gemini Pro and Ultra for document processing

AWS Bedrock

Claude, Llama, and Mistral on AWS

Together AI

Llama 3, Mixtral, and open-source models

Fireworks

Fast inference for document understanding

xAI

Grok models with extended context