Converting PDF content into Data for import - XML/PHP/MYSQL?
I wondered if anyone knew of a method to import text and image content into a MySQL database? Or to convert a set of PDFs into word files?
I did search for some tools to do the conversion but results are not brilliant.
As you can imagine, some PDFs are layed out in different columns, causing the conversion to show text in the wrong place!
I had heard though that it may be possible to add XML tags into the PDF in the appropriate places and running a process will import the correct fields?
<title>[I]Article Title Appears Here[I]</title>
<intro>[I]Article Intro Here[I]</intro>
<author> and so on....
Any help will be appreciated