Process PDF File (Unified Entry Point) — process_pdf_unified • TextAnalysisR

Unified entry point for PDF processing. It uses one of two extraction methods:

Multimodal extraction (R-native pdftools plus a vision LLM), if enabled
Text extraction with R pdftools, as the fallback

Usage

process_pdf_unified(
  file_path,
  use_multimodal = FALSE,
  vision_provider = "ollama",
  vision_model = NULL,
  api_key = NULL,
  describe_images = TRUE
)

Arguments

file_path: Character string, path to the PDF file
use_multimodal: Logical, enable multimodal extraction (default: FALSE)
vision_provider: Character, one of "ollama", "openai", or "gemini" (default: "ollama")
vision_model: Character, name of the vision model (default: NULL)
api_key: Character, API key, needed only for the "openai" and "gemini" providers (default: NULL)
describe_images: Logical, generate image descriptions (default: TRUE)
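
As a rough sketch of how these arguments fit together, the helper below collects them into a named list that could be passed to process_pdf_unified() through do.call(). The helper name build_vision_args, the environment variable names, and the placeholder path "report.pdf" are assumptions made for illustration and are not part of TextAnalysisR. It also assumes that "ollama" needs no key, since the argument description only mentions keys for openai/gemini.

```r
# Hypothetical helper, not part of TextAnalysisR: assemble the documented
# multimodal arguments into a named list.
build_vision_args <- function(file_path,
                              vision_provider = c("ollama", "openai", "gemini"),
                              vision_model = NULL,
                              describe_images = TRUE) {
  # Restrict the provider to the three documented values.
  vision_provider <- match.arg(vision_provider)

  # Assumed environment variable names; adjust them to your own setup.
  key_vars <- c(openai = "OPENAI_API_KEY", gemini = "GEMINI_API_KEY")

  api_key <- NULL
  if (vision_provider %in% names(key_vars)) {
    api_key <- Sys.getenv(key_vars[[vision_provider]], unset = NA)
    if (is.na(api_key) || !nzchar(api_key)) {
      stop("Set ", key_vars[[vision_provider]], " before using '",
           vision_provider, "'.", call. = FALSE)
    }
  }

  # list() keeps NULL entries, so every documented argument is present.
  list(
    file_path = file_path,
    use_multimodal = TRUE,
    vision_provider = vision_provider,
    vision_model = vision_model,
    api_key = api_key,
    describe_images = describe_images
  )
}

# "report.pdf" is a placeholder path.
args <- build_vision_args("report.pdf", vision_provider = "ollama")
str(args)

# With the package installed and a real PDF at that path, the call would be:
# result <- do.call(TextAnalysisR::process_pdf_unified, args)
```

Reading the key from the environment keeps it out of scripts. Whether vision_model = NULL selects a provider-specific default is not stated in this page, so pass a model name explicitly if in doubt.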

Value

A list with the elements success, data, type, method, and message.
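
One way to consume this return value is sketched below. The helper name unwrap_pdf_result is hypothetical, and the sketch assumes that success is a single TRUE/FALSE flag and that message holds a human-readable status string; this page does not specify either. The mock list uses invented values purely so the example runs without the package or a PDF.

```r
# Hypothetical helper, not part of TextAnalysisR: return `data` on success,
# otherwise raise an error that carries the reported message.
unwrap_pdf_result <- function(result) {
  expected <- c("success", "data", "type", "method", "message")
  missing_fields <- setdiff(expected, names(result))
  if (length(missing_fields) > 0) {
    stop("Result is missing fields: ",
         paste(missing_fields, collapse = ", "), call. = FALSE)
  }

  # Assumption: `success` is a single TRUE/FALSE flag.
  if (!isTRUE(result$success)) {
    stop("PDF processing failed: ", result$message, call. = FALSE)
  }

  result$data
}

# Mock result for illustration only; these values are invented and do not
# show what process_pdf_unified() really returns.
mock <- list(
  success = TRUE,
  data = data.frame(page = 1L, text = "Hello", stringsAsFactors = FALSE),
  type = "text",
  method = "pdftools",
  message = "ok"
)

extracted <- unwrap_pdf_result(mock)
print(extracted)
```

Failing loudly on success = FALSE avoids passing an empty or partial result to later preprocessing steps.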

See also

Other preprocessing: get_available_dfm(), get_available_tokens(), import_files(), lemmatize_tokens(), prep_texts(), unite_cols()