How to read an MS Word document in Java without displaying junk
(unnecessary) data?
Answer / nivas
There are a lot of third-party APIs available. Please try
the one below, available on the Jakarta site:
Apache POI - HWPF - Java API to Handle Microsoft Word Files
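Text pulled out of a .doc file can still carry Word's internal control characters, which is one likely source of the "junk" the question mentions. Below is a minimal sketch of filtering them out of an already extracted string. The class name WordTextCleaner and the method clean are hypothetical and not part of Apache POI. The sketch assumes the Word binary convention of 0x13, 0x14 and 0x15 as field begin, separator and end marks, and simply drops every other control character except tab and line breaks.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Hypothetical helper (not a POI class): removes field codes and control characters. */
final class WordTextCleaner {

    // Assumed Word binary format markers: field begin, separator, end.
    private static final char FIELD_BEGIN = '\u0013';
    private static final char FIELD_SEPARATOR = '\u0014';
    private static final char FIELD_END = '\u0015';

    static String clean(String raw) {
        StringBuilder out = new StringBuilder(raw.length());
        // One entry per open field; true while we are still in its instruction part.
        Deque<Boolean> openFields = new ArrayDeque<>();
        int hidden = 0; // number of open fields whose instruction part is being skipped

        for (int i = 0; i < raw.length(); i++) {
            char c = raw.charAt(i);
            switch (c) {
                case FIELD_BEGIN:
                    openFields.push(true);
                    hidden++;
                    break;
                case FIELD_SEPARATOR:
                    // Instruction part is over; the visible field result follows.
                    if (!openFields.isEmpty() && openFields.peek()) {
                        openFields.pop();
                        openFields.push(false);
                        hidden--;
                    }
                    break;
                case FIELD_END:
                    // A field without a separator ends while still hidden.
                    if (!openFields.isEmpty() && openFields.pop()) {
                        hidden--;
                    }
                    break;
                default:
                    boolean printable = c >= ' ' || c == '\t' || c == '\n' || c == '\r';
                    if (hidden == 0 && printable) {
                        out.append(c);
                    }
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String raw = "See \u0013 HYPERLINK \"http://example.com\" \u0014the site\u0015 now\u0007";
        System.out.println(clean(raw)); // prints: See the site now
    }
}
```

Such a filter could be applied to each paragraph string before printing it. Whether it removes all unwanted data depends on the document, so treat it as a starting point.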
Answer / rajdeo
Hi, this is rajdeo.
I am trying to read a Word file using the POI API, but I get
an error on the line import org.apache.poi.hwpf.extractor.*;
I have put the POI jar file in WEB-INF, but I do not know
which jar file is needed (that is, the name of the jar file).
I took the jar from the poi-bin-3.6-20091214.tar archive.
Can anybody please help me?
I also get an error on this line:
WordExtractor we = new WordExtractor(doc);
This is the code I used. Please take a look and reply.
Thanks
import org.apache.poi.poifs.filesystem.*;
import org.apache.poi.hwpf.*;
import org.apache.poi.hwpf.extractor.*;
import java.io.*;

public class readDoc
{
    public static void main( String[] args )
    {
        String filesname = "Hello.doc";
        POIFSFileSystem fs = null;
        try
        {
            // Open the .doc file as an OLE2 (POIFS) file system
            fs = new POIFSFileSystem( new FileInputStream( filesname ) );
            HWPFDocument doc = new HWPFDocument( fs );
            WordExtractor we = new WordExtractor( doc );
            // One array element per paragraph of the document
            String[] paragraphs = we.getParagraphText();
            System.out.println( "Word Document has " +
                paragraphs.length + " paragraphs" );
            for ( int i = 0; i < paragraphs.length; i++ ) {
                System.out.println( "Length:" + paragraphs[ i ].length() );
            }
        }
        catch ( Exception e ) {
            e.printStackTrace();
        }
    }
}
Answer / arumugam
import org.apache.poi.poifs.filesystem.*;
import org.apache.poi.hwpf.*;
import org.apache.poi.hwpf.extractor.*;
import java.io.*;

public class readDoc
{
    public static void main( String[] args )
    {
        String filesname = "Hello.doc";
        POIFSFileSystem fs = null;
        try
        {
            // Open the .doc file as an OLE2 (POIFS) file system
            fs = new POIFSFileSystem( new FileInputStream( filesname ) );
            HWPFDocument doc = new HWPFDocument( fs );
            WordExtractor we = new WordExtractor( doc );
            // One array element per paragraph of the document
            String[] paragraphs = we.getParagraphText();
            System.out.println( "Word Document has " +
                paragraphs.length + " paragraphs" );
            for ( int i = 0; i < paragraphs.length; i++ ) {
                System.out.println( "Length:" + paragraphs[ i ].length() );
            }
        }
        catch ( Exception e ) {
            e.printStackTrace();
        }
    }
}
Note: Before running this program, make sure the supporting
jars are on the classpath. They are available at this link:
http://poi.apache.org/download.html#POI-3.6
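One way to confirm that the supporting jars are really visible at run time is to ask the class loader for the POI classes the program imports. The following is a small sketch using only the standard library. The class name PoiClasspathCheck and the method isOnClasspath are made up for illustration; the three class names checked are the ones used by the program in this thread.

```java
/** Hypothetical diagnostic (not part of POI): reports whether classes can be loaded. */
final class PoiClasspathCheck {

    /** Returns true if the named class can be found by this class's class loader. */
    static boolean isOnClasspath(String className) {
        try {
            // 'false' means: locate the class but do not run its static initializers.
            Class.forName(className, false, PoiClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String[] needed = {
            "org.apache.poi.poifs.filesystem.POIFSFileSystem",
            "org.apache.poi.hwpf.HWPFDocument",
            "org.apache.poi.hwpf.extractor.WordExtractor"
        };
        for (String name : needed) {
            System.out.println((isOnClasspath(name) ? "found    " : "MISSING  ") + name);
        }
    }
}
```

If a class is reported as missing, the jar that contains it is probably not in the place the application loads libraries from. Note that this only checks run-time visibility; a compile error on the import line means the jar is also missing from the compiler's classpath.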
Answer / prabhu
Download the binary zip file of any version from Apache.
poi.jar contains org.apache.poi.poifs.*
poi-scratchpad.jar contains org.apache.poi.hwpf.*
If the jars you downloaded have different names, do not
worry: open each jar with WinZip and look for the classes
that you have used in the program. Then put those jars in
your project's WEB-INF/lib directory.
Answer / tamina
Use one of the Java libraries that are available online. One
such library is Aspose.Words for Java, which lets you perform
many operations on Word documents, including reading an MS
Word document without showing junk data.
http://www.aspose.com/java/word-component.aspx