What file types does the Claude Files API support?

The Files API supports PDFs, plain text files, and images (JPEG, PNG, GIF, WebP). The file is stored server-side and referenced by ID - Claude receives it as if it were included inline in the request. Maximum file size limits apply per file type. Check the Anthropic documentation for current limits as they may change as the API evolves.

How long does the Files API store uploaded files?

Files persist until you delete them, as described later in this tutorial. Retention rules may change, so check the Anthropic documentation for the current policy. You can delete files explicitly via the API once you no longer need them. For production applications, store file IDs in your database alongside their upload timestamps so you can audit what is stored and re-upload if a file is ever missing.

When should I use the Files API vs inline base64 encoding?

Use the Files API when you analyse the same file across multiple API calls in a session or across sessions - the upload cost is paid once. Use inline base64 encoding for one-off analysis of unique documents where you will not reference the file again. For documents sent with every request in a long workflow, the Files API combined with prompt caching is the most cost-efficient approach.

Claude Files API Tutorial: Upload Once, Use Many Times

Every time you send a document to Claude inline via the standard Messages API, the full file content travels in the request body - the entire base64-encoded image or PDF. If ten different users ask questions about the same 50-page policy document, that document is transmitted ten separate times. At scale, this is wasteful in bandwidth, latency, and cost.

The Files API solves this. You upload a document once and receive a file ID. That ID can be referenced in any number of subsequent API requests without re-uploading the content. This is the right architecture for any application that processes the same documents repeatedly - knowledge bases, document Q&A systems, training material assistants, and compliance review tools.

What is the Claude Files API?

The Claude Files API lets you upload PDFs, images, and text files to Anthropic's servers once and reference them by a persistent file ID in any number of subsequent API requests. Instead of base64-encoding the same document into every request, you store it once and pass the ID - reducing bandwidth, lowering latency for large files, and simplifying your request payload for document-heavy workflows.
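
To make the payload difference concrete, here is a minimal sketch that compares the size of one inline base64 document block with one file-reference block. The zero-filled byte string is a stand-in for a real PDF, and the function names are illustrative, not part of any SDK.

```python
import base64
import json


def inline_payload_bytes(pdf_bytes: bytes) -> int:
    """Size of a JSON document block that embeds the PDF as base64."""
    block = {
        "type": "document",
        "source": {
            "type": "base64",
            "media_type": "application/pdf",
            "data": base64.b64encode(pdf_bytes).decode("ascii"),
        },
    }
    return len(json.dumps(block).encode("utf-8"))


def file_reference_payload_bytes(file_id: str) -> int:
    """Size of a JSON document block that only carries a file ID."""
    block = {"type": "document", "source": {"type": "file", "file_id": file_id}}
    return len(json.dumps(block).encode("utf-8"))


# Stand-in for a 3 MB PDF; real content would come from disk.
fake_pdf = b"\x00" * 3_000_000
inline = inline_payload_bytes(fake_pdf)
by_ref = file_reference_payload_bytes("file_011CNkBGpN7KDGmhXAMxdMpV")
print(f"inline: {inline:,} bytes per request, by reference: {by_ref} bytes per request")
```

Base64 inflates the raw bytes by roughly a third, and that cost repeats on every inline request, whereas the file reference stays a few dozen bytes regardless of document size.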

What the Files API Is For

The Files API lets you:

Upload files once and reference them by ID in future requests
Store PDFs, images, plain text, CSV files, and other document types
List and manage your uploaded files via API
Delete files when they are no longer needed

Files uploaded via the Files API are stored on Anthropic's systems and scoped to the workspace your API key belongs to. They persist until you delete them.

Files API vs Prompt Caching

The Files API and Prompt Caching are different optimisations for different scenarios. The Files API is for reusing the same uploaded file across many separate API calls - ideal when the document itself is the constant and the questions vary. Prompt Caching is for caching a long system prompt or context prefix that repeats across many requests in a session. Both can be used together in advanced architectures.
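
As a rough sketch of using both together, the helper below builds a message whose document block references an uploaded file and also carries a cache_control marker. It assumes that cache_control with type "ephemeral" is accepted on file-sourced document blocks, so confirm this against the prompt caching documentation. The function names are hypothetical.

```python
def cached_document_block(file_id: str, title: str) -> dict:
    """Document block that references an uploaded file and requests caching.

    Assumption: the API accepts cache_control on file-sourced document blocks.
    """
    return {
        "type": "document",
        "source": {"type": "file", "file_id": file_id},
        "title": title,
        "cache_control": {"type": "ephemeral"},
    }


def build_messages(file_id: str, title: str, question: str) -> list:
    """Put the constant document first and the varying question last."""
    return [
        {
            "role": "user",
            "content": [
                cached_document_block(file_id, title),
                {"type": "text", "text": question},
            ],
        }
    ]


messages = build_messages("file_011CNkBGpN7KDGmhXAMxdMpV", "Company Handbook", "What is the leave policy?")
```

Placing the document before the question keeps the repeated prefix identical across requests, which is what a prefix cache needs in order to hit.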

Uploading a File

python

import anthropic

client = anthropic.Anthropic()

# Upload a PDF document
with open("company-handbook.pdf", "rb") as f:
    response = client.beta.files.upload(
        file=("company-handbook.pdf", f, "application/pdf")
    )

file_id = response.id
print(f"File uploaded. ID: {file_id}")
# File uploaded. ID: file_011CNkBGpN7KDGmhXAMxdMpV

The upload request accepts:

PDF files: application/pdf
Plain text files: text/plain
Images: image/jpeg, image/png, image/gif, image/webp

Store the file_id - this is your permanent reference for the uploaded content across any number of future requests.
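
One way to avoid passing the wrong MIME type is to derive it from the file extension before uploading. The sketch below uses only the types listed above. The extension map and helper names are illustrative assumptions, and the returned tuple mirrors the (filename, file object, MIME type) shape used in the upload example.

```python
from pathlib import Path

# Extension map limited to the MIME types listed in this tutorial.
MIME_BY_EXTENSION = {
    ".pdf": "application/pdf",
    ".txt": "text/plain",
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".png": "image/png",
    ".gif": "image/gif",
    ".webp": "image/webp",
}


def mime_type_for(path: str) -> str:
    """Return the MIME type for a supported file, or raise ValueError."""
    suffix = Path(path).suffix.lower()
    if suffix not in MIME_BY_EXTENSION:
        raise ValueError(f"Unsupported file type for upload: {path!r}")
    return MIME_BY_EXTENSION[suffix]


def upload_argument(path: str, handle) -> tuple:
    """Build the (filename, file object, MIME type) tuple for the upload call."""
    return (Path(path).name, handle, mime_type_for(path))
```

Failing early on an unsupported extension is cheaper than discovering the problem from an API error after the bytes have been sent.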

Using a Stored File in a Request

Once a file is uploaded, reference it by ID in a content block of the message. PDFs and plain text files use the document content type with a file source:

python

import anthropic

client = anthropic.Anthropic()

# file_id obtained from previous upload
file_id = "file_011CNkBGpN7KDGmhXAMxdMpV"

response = client.beta.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=2048,
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "document",
                    "source": {
                        "type": "file",
                        "file_id": file_id
                    },
                    "title": "Company Handbook",
                    "context": "This is the company employee handbook."
                },
                {
                    "type": "text",
                    "text": "What is the company's policy on remote work and flexible hours?"
                }
            ]
        }
    ],
    betas=["files-api-2025-04-14"]
)

print(response.content[0].text)

Note the betas=["files-api-2025-04-14"] parameter: it sends the beta header that enables the Files API, and it is why the request goes through the beta client (client.beta.messages.create).
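
The example above covers a PDF. For uploaded images, the block type differs. The following sketch picks the block type from the MIME type and assumes that images are referenced with an image block that takes the same file source shape. Verify this against the current Files API documentation. The helper name is hypothetical.

```python
def content_block_for(file_id: str, mime_type: str) -> dict:
    """Build a content block that references an uploaded file by ID.

    Assumption: images use an "image" block and PDFs or plain text use a
    "document" block, both with a {"type": "file"} source.
    """
    block_type = "image" if mime_type.startswith("image/") else "document"
    return {"type": block_type, "source": {"type": "file", "file_id": file_id}}


pdf_block = content_block_for("file_011CNkBGpN7KDGmhXAMxdMpV", "application/pdf")
image_block = content_block_for("file_011CNkBGpN7KDGmhXAMxdMpV", "image/png")
```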

Listing and Managing Files

List Uploaded Files

python

# List all uploaded files for your API key
files = client.beta.files.list()

for file in files.data:
    print(f"ID: {file.id}")
    print(f"Name: {file.filename}")
    print(f"Size: {file.size_bytes} bytes")
    print(f"Created: {file.created_at}")
    print("---")

Get File Metadata

python

# Retrieve metadata for a specific file
file_info = client.beta.files.retrieve_metadata(file_id)
print(f"File: {file_info.filename}, Size: {file_info.size_bytes}")

Delete a File

python

# Delete a file when it is no longer needed
client.beta.files.delete(file_id)
print(f"File {file_id} deleted.")

Building a Document Q&A System

Here is a practical pattern for a document question-answering system that uses the Files API to serve multiple user questions from a single uploaded document:

python

import os

import anthropic

client = anthropic.Anthropic()

class DocumentQA:
    def __init__(self, pdf_path: str, document_title: str):
        # Upload the document once during initialisation
        with open(pdf_path, "rb") as f:
            upload_response = client.beta.files.upload(
                # Send only the base name so directory parts are not stored as the filename
                file=(os.path.basename(pdf_path), f, "application/pdf")
            )
        
        self.file_id = upload_response.id
        self.document_title = document_title
        print(f"Document uploaded: {self.file_id}")
    
    def ask(self, question: str) -> str:
        """Answer a question about the uploaded document."""
        response = client.beta.messages.create(
            model="claude-sonnet-4-6",
            max_tokens=2048,
            messages=[
                {
                    "role": "user",
                    "content": [
                        {
                            "type": "document",
                            "source": {
                                "type": "file",
                                "file_id": self.file_id
                            },
                            "title": self.document_title
                        },
                        {
                            "type": "text",
                            "text": question
                        }
                    ]
                }
            ],
            betas=["files-api-2025-04-14"]
        )
        
        return response.content[0].text
    
    def cleanup(self):
        """Delete the file when finished."""
        client.beta.files.delete(self.file_id)


# Usage
qa = DocumentQA("annual-report-2025.pdf", "Annual Report 2025")

# Ask multiple questions - document is uploaded only once
print(qa.ask("What was the total revenue for Q3 2025?"))
print(qa.ask("Which regions showed the highest growth?"))
print(qa.ask("What are the key risks mentioned in the report?"))

# Clean up
qa.cleanup()
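
If a question raises an exception, the explicit cleanup call above is never reached and the file stays stored. One possible variation, sketched here, wraps the upload in a context manager so deletion always runs. The client is passed in as a parameter and is assumed to expose the same client.beta.files.upload and client.beta.files.delete calls used in this tutorial. The name uploaded_pdf is hypothetical.

```python
from contextlib import contextmanager
from pathlib import Path


@contextmanager
def uploaded_pdf(client, pdf_path: str):
    """Upload a PDF, yield its file ID, and delete it on exit, even on error."""
    path = Path(pdf_path)
    with path.open("rb") as f:
        uploaded = client.beta.files.upload(file=(path.name, f, "application/pdf"))
    try:
        yield uploaded.id
    finally:
        # Runs whether the body finished normally or raised.
        client.beta.files.delete(uploaded.id)
```

Usage would look like `with uploaded_pdf(client, "annual-report-2025.pdf") as file_id:` followed by the question calls inside the block. This suits single-session documents, and long-lived shared documents should keep their IDs instead.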

Persist file_ids in Your Database

For production applications, store file_ids in your application database alongside metadata about what document they represent, when they were uploaded, and which users have access to them. This lets you avoid redundant uploads when users access the same document, maintain an inventory of your stored files, and implement cleanup routines to delete files that have not been accessed in a set period.
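
A minimal sketch of such a registry, using SQLite and a content hash to detect documents that were already uploaded. The table layout, the function names, and the upload callable (which would wrap client.beta.files.upload and return the new file ID) are all assumptions for illustration.

```python
import hashlib
import sqlite3
import time
from typing import Callable


def open_registry(path: str = ":memory:") -> sqlite3.Connection:
    """Create or open the table that maps document content to file IDs."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS uploaded_files ("
        "sha256 TEXT PRIMARY KEY, file_id TEXT NOT NULL, "
        "filename TEXT NOT NULL, uploaded_at REAL NOT NULL)"
    )
    return conn


def get_or_upload(
    conn: sqlite3.Connection,
    filename: str,
    content: bytes,
    upload: Callable[[str, bytes], str],
) -> str:
    """Return the stored file ID for this content, uploading only if unseen."""
    digest = hashlib.sha256(content).hexdigest()
    row = conn.execute(
        "SELECT file_id FROM uploaded_files WHERE sha256 = ?", (digest,)
    ).fetchone()
    if row is not None:
        return row[0]
    file_id = upload(filename, content)  # hypothetical wrapper around the upload call
    conn.execute(
        "INSERT INTO uploaded_files VALUES (?, ?, ?, ?)",
        (digest, file_id, filename, time.time()),
    )
    conn.commit()
    return file_id
```

Keying on the content hash rather than the filename means a renamed copy of the same document reuses the existing file ID, while an edited document with the same name triggers a fresh upload.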

Multi-Document Workflows

The Files API supports including multiple document files in a single request - useful for comparative analysis or synthesis across multiple sources:

python

response = client.beta.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=4096,
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "document",
                    "source": {"type": "file", "file_id": contract_v1_id},
                    "title": "Contract Version 1 (Current)"
                },
                {
                    "type": "document",
                    "source": {"type": "file", "file_id": contract_v2_id},
                    "title": "Contract Version 2 (Proposed)"
                },
                {
                    "type": "text",
                    "text": "Compare these two versions of the contract. What are the key differences in the liability clauses, payment terms, and termination conditions?"
                }
            ]
        }
    ],
    betas=["files-api-2025-04-14"]
)
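
When the number of documents varies, the content list can be built programmatically. This sketch generates the same block structure as the request above from a list of (file ID, title) pairs. The function name and the placeholder IDs in the usage lines are hypothetical.

```python
def comparison_content(documents: list, question: str) -> list:
    """Build message content: one document block per file, then the question.

    documents is a list of (file_id, title) pairs.
    """
    if not documents:
        raise ValueError("At least one document is required")
    blocks = [
        {
            "type": "document",
            "source": {"type": "file", "file_id": file_id},
            "title": title,
        }
        for file_id, title in documents
    ]
    blocks.append({"type": "text", "text": question})
    return blocks


content = comparison_content(
    [("file_v1_placeholder", "Contract Version 1 (Current)"),
     ("file_v2_placeholder", "Contract Version 2 (Proposed)")],
    "Compare the liability clauses in these two versions.",
)
```

Distinct titles matter here, since they give the question a way to refer to each document unambiguously.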

Pricing and Storage Limits

File storage is associated with your API key and counts against your usage
Storage limits and any storage charges are set by Anthropic - check the Anthropic pricing page and the Files API documentation for current terms
When a file is used in a request, the content contributes to your input token count for that request
The key cost benefit: even if you reference a file ID many times, you avoid the data transfer overhead of re-uploading base64 content on every request

Delete Unused Files Promptly

Files persist until you explicitly delete them. Implement a cleanup routine in your application that deletes files that are no longer needed - for example, files created for a single-session user interaction should be deleted when that session ends. Accumulating thousands of unused files will add unnecessary storage costs and make file management harder.
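
A cleanup routine needs a rule for which files count as unused. The sketch below selects file IDs whose last use is older than a cutoff. It assumes your own database records a last-used timestamp per file ID, and the function name is illustrative. Each returned ID would then be passed to client.beta.files.delete.

```python
from datetime import datetime, timedelta, timezone
from typing import Iterable, List, Optional, Tuple


def stale_file_ids(
    records: Iterable[Tuple[str, datetime]],
    max_age_days: int,
    now: Optional[datetime] = None,
) -> List[str]:
    """Return IDs of files last used more than max_age_days ago.

    records holds (file_id, last_used_at) pairs with timezone-aware datetimes.
    """
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(days=max_age_days)
    return [file_id for file_id, last_used_at in records if last_used_at < cutoff]
```

Keeping selection separate from deletion makes the rule easy to test and lets you log or review the candidates before anything is removed.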

Summary

The Files API is the right architecture for any Claude application that works with documents repeatedly. Upload once, reference by ID, and reuse across as many requests as needed.

Key practices:

Use client.beta.files.upload with the correct MIME type for your file
Include betas=["files-api-2025-04-14"] in all requests that reference uploaded files
Store file_ids in your database to avoid redundant uploads
Implement cleanup routines to delete files that are no longer needed
Use the document content type with a file source in your messages content blocks

With the Files API covered, we have now been through all the major tools in Claude's capability set. Time to test yourself with the Knowledge Check: Claude Tools & Capabilities Quiz.

Files API in Production: Best Practices

Moving beyond the basics, here are practices that matter in production document workflows:

Version control your file IDs. When a document is updated (new policy version, updated contract), upload the new version and store both the old and new file IDs with version metadata. This lets you audit which version of a document was queried at any point in time - important for regulated industries.
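
One possible shape for this version metadata, sketched with dataclasses. The class and method names are hypothetical, and a production system would persist these records in a database rather than in memory.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import List, Optional


@dataclass(frozen=True)
class DocumentVersion:
    version: int
    file_id: str
    uploaded_at: datetime


@dataclass
class VersionedDocument:
    name: str
    versions: List[DocumentVersion] = field(default_factory=list)

    def add_version(self, file_id: str, uploaded_at: Optional[datetime] = None) -> DocumentVersion:
        """Record a newly uploaded file as the next version; old IDs are kept."""
        entry = DocumentVersion(
            version=len(self.versions) + 1,
            file_id=file_id,
            uploaded_at=uploaded_at or datetime.now(timezone.utc),
        )
        self.versions.append(entry)
        return entry

    def version_at(self, moment: datetime) -> Optional[DocumentVersion]:
        """Return the version that was current at the given moment, if any."""
        for entry in reversed(self.versions):
            if entry.uploaded_at <= moment:
                return entry
        return None
```

For an audit, version_at answers which file ID a query on a given date would have used, assuming versions are added in chronological order.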

Combine with the vision API for image-heavy documents. PDFs containing charts, diagrams, or scanned images are processed using Claude's vision capability when referenced via the Files API. For best results with image-heavy PDFs, pair with Claude Vision analysis patterns to extract and describe visual content explicitly.

Use the Files API alongside RAG for large corpora. For very large document collections, the Files API alone is not a substitute for a retrieval system - you cannot send hundreds of files in one request. Upload each source document once, let your RAG pipeline identify which documents the relevant chunks come from, and reference only those documents' file IDs in the request.
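
As a sketch of the hand-off between retrieval and the Files API, the function below turns retrieved chunks into a short, de-duplicated list of file IDs. It assumes each chunk is a dict carrying the file_id of its source document and a relevance score where higher is better. The field names, the function name, and the default cap of five files are illustrative choices.

```python
from typing import Dict, List


def file_ids_for_chunks(chunks: List[dict], max_files: int = 5) -> List[str]:
    """Return unique source file IDs, best-scoring first, capped at max_files."""
    best_score: Dict[str, float] = {}
    for chunk in chunks:
        file_id = chunk["file_id"]
        # Keep the highest score seen for each source document.
        best_score[file_id] = max(best_score.get(file_id, float("-inf")), chunk["score"])
    ranked = sorted(best_score, key=lambda file_id: best_score[file_id], reverse=True)
    return ranked[:max_files]


retrieved = [
    {"file_id": "file_policy", "score": 0.62},
    {"file_id": "file_contract", "score": 0.91},
    {"file_id": "file_policy", "score": 0.88},
]
selected = file_ids_for_chunks(retrieved, max_files=2)
```

The selected IDs can then be turned into document blocks for a single request, which keeps the request small even when the corpus is large.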

The Anthropic Files API documentation covers current size limits, supported MIME types, and beta header requirements. Storage rates and per-file limits are detailed on the Anthropic pricing page.

This post is part of the Anthropic AI Tutorial Series. Previous post: Claude Computer Use: Let Claude Control a Desktop.

Frequently Asked Questions

Q: What is the Claude Files API and what problem does it solve? The Files API lets you upload files (PDFs, images, text documents, code) to Anthropic's servers once and reference them by file ID in multiple API calls, rather than re-encoding and re-sending the full file content with every request. This reduces request payload size, lowers latency for repeated use of the same file, and simplifies multi-turn conversations that reference a large document throughout.

Q: How do you upload a file and use it in a Claude message? Upload with file = client.beta.files.upload(file=("doc.pdf", open("doc.pdf", "rb"), "application/pdf")). Then reference it in a message: {"type": "document", "source": {"type": "file", "file_id": file.id}}. The file ID is stable until you delete it, so you can store IDs in your database and reuse them across many conversations without re-uploading.

Q: What file types does the Files API support and are there size limits? The Files API supports PDFs, plain text files, and images (PNG, JPEG, GIF, WebP). File size limits and supported types may evolve - check the current Anthropic documentation for exact limits. Large PDFs are processed page by page internally. For very large documents, consider splitting them into logical sections and uploading each as a separate file to enable targeted retrieval rather than forcing Claude to process the entire document every time.

Part of the Claude AI Masterclass.