i have this php code which read ms word file, the problem is that the .doc file is not plain text, the script read the .doc file but is also get many special char which is not needed i need the word content only i also will run this script on linux server so i must not use the COM object
Greetings!
MS Word files not only contain texts but also objects, images, equations etc… so you can not read it by scanning line by line this way. Because you want your script to run on a Linux server, COM is not a solution. You may search the Internet for functions/class that suit your need.
I found this class, but it is used to create .doc files, not read from them. Anyway, it may be helpful to you:
phpclasses dot o r g slash browse slash package slash 2631 dot html
PS: The forum doesn’t allow members with less than 10 posts (like me) posting links, so I use that format instead.