Unified PDF processing with automatic fallback:
Multimodal (Python + Vision) 2. Python pdfplumber 3. R pdftools
Usage
process_pdf_unified(
file_path,
use_multimodal = FALSE,
vision_provider = "ollama",
vision_model = NULL,
api_key = NULL,
describe_images = TRUE
)See also
Other preprocessing:
get_available_dfm(),
get_available_tokens(),
import_files(),
prep_texts(),
unite_cols()
