I am looking to grab text from a word document file on the fly. I have seen that this is possible using the DOM package, but unfortunately the software will be on a linux server and this package will not be available.
Is there any other way to read Word (.doc) files into a string?
I have seen most of the solutions talked about on there…
Antiword - problem is this has to be installed server side and could prove to be a problem if we decide to distribute this software further down the line.
phpLiveDocx - problem with this solution is that Zend has to be installed server side and it also seems a very big download (around 140 MB).
It seems there is no easy solution at this time - very strange as I thought it would of been quite a common task to read from Word documents.