Unified PDF processing with automatic fallback:
Multimodal (Python + Vision) 2. Python pdfplumber 3. R pdftools
Usage
process_pdf_unified(
file_path,
use_multimodal = FALSE,
vision_provider = "ollama",
vision_model = NULL,
api_key = NULL,
describe_images = TRUE
)
Arguments
- file_path
Character string path to PDF file
- use_multimodal
Logical, enable multimodal extraction
- vision_provider
Character, "ollama" or "openai"
- vision_model
Character, model name
- api_key
Character, OpenAI API key (if using OpenAI)
- describe_images
Logical, generate image descriptions
Value
List: success, data, type, method, message