Extracts tabular data from PDF using pdfplumber (Python). No Java required - pure Python solution.
Details
Uses pdfplumber Python library through reticulate. Works with complex table layouts without Java dependency.
Examples
if (interactive()) {
setup_python_env()
pdf_path <- "path/to/table_document.pdf"
table_data <- extract_tables_from_pdf_py(pdf_path)
head(table_data)
}
