Please tell me how I can read this format under Unix. I want to read only the text, without pictures (or other Windows controls), if there are any.
Namely, how can I read only the text from a .doc file? As a result I want to get just the list of words from the text (a search engine requires this), so the structure of the text does not matter. Examples are preferred.
Thank you.
I hope this will help:
http://sourceforge.net/project/showfiles.php?group_id=10501&...
It looks very strange. Do you understand how it works? Thank you.
For example, if you want to extract only the text from the file file.doc:
wvWare -x /usr/local/share/wv/wvText.xml file.doc > somefile.txt
By default it produces output in HTML format.
P.S. In my opinion it looks a bit ugly, but I do not know of any other software for processing and converting MS Word documents. Maybe somebody else knows...
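Once the text has been extracted (for example into somefile.txt, as in the command above), the word list asked for in the first post could be built roughly like this. This is only a sketch in Python; the function name word_list and the rule "a word is a run of letters" are assumptions for illustration, not something the search engine in question necessarily expects.

```python
import re


def word_list(text):
    """Return the distinct words of `text`, lower-cased and sorted.

    A "word" here is assumed to be any run of letters; digits and
    punctuation (including apostrophes) act as separators.
    """
    words = re.findall(r"[^\W\d_]+", text.lower())
    return sorted(set(words))
```

Usage would be something like word_list(open("somefile.txt", errors="replace").read()).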
I do not want to convert it that way. I want to use its API, if it has one. Could you show me such an example? Thank you.
Sorry, but I used it only once, to extract some information from a large number of Word documents; perl, awk, and the C function 'popen' were enough for that. So use 'popen' for now, and either look for other software or examine this source code for the future.
Best.
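The 'popen' idea is simply to start the converter as a child process and read its standard output, instead of going through a temporary file. A rough sketch of the same approach in Python (using subprocess rather than the C popen call) is given below. The helper names run_and_capture and doc_to_text are made up for illustration, and the wvWare command line and config path are the ones quoted earlier in this thread, so they may differ on your installation.

```python
import subprocess

# Config path as quoted in the thread; assumed, adjust to your install.
WV_TEXT_CONFIG = "/usr/local/share/wv/wvText.xml"


def run_and_capture(command):
    """Run `command` (a list of arguments) and return its standard output as text."""
    result = subprocess.run(command, stdout=subprocess.PIPE, check=True)
    # Undecodable bytes are replaced rather than raising an error.
    return result.stdout.decode("utf-8", errors="replace")


def doc_to_text(doc_path, config=WV_TEXT_CONFIG):
    """Extract text from a Word file by piping it through wvWare."""
    return run_and_capture(["wvWare", "-x", config, doc_path])
```

Calling doc_to_text("file.doc") would then return the extracted text as a string, provided wvWare is installed and on the PATH.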
man -s1 strings
This "feature" prints the text strings found in any file (binary files too). Test it. Source code for Linux or BSD is easy to find.
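For anyone wondering what strings roughly does: it scans the raw bytes and prints every run of printable characters that is at least a few characters long. A minimal Python sketch of that idea follows. The function name printable_strings is hypothetical, the default of 4 characters is an assumption, and only plain ASCII is handled, so text stored in a 16-bit encoding inside a file would be missed by this version.

```python
import re


def printable_strings(data, min_len=4):
    """Return runs of printable ASCII of at least `min_len` bytes found in `data`."""
    pattern = re.compile(rb"[\x20-\x7e]{%d,}" % min_len)
    return [run.decode("ascii") for run in pattern.findall(data)]
```

Usage would be something like printable_strings(open("file.doc", "rb").read()).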
Depends on the task. In my case it was unacceptable. :-(