Using Camelot and Excalibur to automate PDF table extraction and export
Dimiter Naydenov
almost 30 years ago…
This document describes the base technology and ideas behind the project named "Camelot".
[…] a universal way to communicate documents across a wide variety of machine configurations, operating systems and communication networks. […] viewable on any display […] printable on any modern printers.
—The Camelot Project, John Warnock
source:
Evolution of the Digital Document:
Celebrating Adobe Acrobat’s 25th Anniversary
Looks familiar?
Often you need to: select one cell at a time, copy & paste, repeat.
Camelot
https://github.com/camelot-dev/camelot
Excalibur
https://github.com/camelot-dev/excalibur
https://tryexcalibur.com
Started in 2016 by Vinayak Mehta
@vortex_ape
at SocialCops
in Bangalore, India.
matplotlib
Camelot
Using Conda (easiest way)
conda install -c conda-forge camelot-py
Using pip
, after installing prerequisites: tk
and ghostscript
pip install --upgrade pip camelot-py[cv]
Excalibur
Using pip
, after installing prerequisites tk
and ghostscript
pip install --upgrade pip excalibur-py
Ghostscript
with alternatives<your-favourite-feature?>
Questions ?
@dimitern