How to read MS-word document in java without displaying junk
(unnecessary) data?

Answer Posted / arumugam

import org.apache.poi.poifs.filesystem.*;
import org.apache.poi.hwpf.*;
import org.apache.poi.hwpf.extractor.*;
import java.io.*;

public class readDoc
{
public static void main( String[] args )
{
String filesname = "Hello.doc";
POIFSFileSystem fs = null;
try
{
fs = new POIFSFileSystem(new
FileInputStream(filesname;
//Couldn't close the braces at the end as
my site did not allow it to close

HWPFDocument doc = new HWPFDocument(fs);

WordExtractor we = new WordExtractor(doc);

String[] paragraphs = we.getParagraphText();

System.out.println( "Word Document has " +
paragraphs.length + " paragraphs" );
for( int i=0; i<paragraphs .length; i++ ) {
System.out.println( "Length:"+paragraphs[ i
].length());
}
}
catch(Exception e) {
e.printStackTrace();
}
}
}


Note : Make sure before run this program , you should added
supporting jars are presence in this link :
http://poi.apache.org/download.html#POI-3.6

Is This Answer Correct ?    9 Yes 6 No



Post New Answer       View All Answers


Please Help Members By Posting Answers For Below Questions

Write Down Steps Using SAX Parser

2179


an on-line examination application using html jsp servlet and jdbc. including session management and cookies

4458


Do you think about CMM(Capability Maturity Model) process?

580


plz send code for manage group of hotels in j2ee frontend:J2EE Backend: DB2 Express

2402


exception org.apache.jasper.JasperException: java.lang.NullPointerException org.apache.jasper.servlet.JspServletWrapper.handleJs pException(JspServletWrapper.java:491) org.apache.jasper.servlet.JspServletWrapper.service( JspServletWrapper.java:419) org.apache.jasper.servlet.JspServlet.serviceJspFile( JspServlet.java:313) org.apache.jasper.servlet.JspServlet.service(JspServ let.java:260) javax.servlet.http.HttpServlet.service(HttpServlet.j ava:717) root cause java.lang.NullPointerException org.apache.struts.taglib.TagUtils.retrieveMessageRes ources(TagUtils.java:1175) org.apache.struts.taglib.TagUtils.message(TagUtils.j ava:1038) org.apache.struts.taglib.bean.MessageTag.doStartTag( MessageTag.java:224) org.apache.jsp.register_jsp._jspx_meth_bean_005fmess age_005f0(register_jsp.java:138) org.apache.jsp.register_jsp._jspService(register_jsp .java:94) org.apache.jasper.runtime.HttpJspBase.service(HttpJs pBase.java:70) javax.servlet.http.HttpServlet.service(HttpServlet.j ava:717) org.apache.jasper.servlet.JspServletWrapper.service( JspServletWrapper.java:377) org.apache.jasper.servlet.JspServlet.serviceJspFile( JspServlet.java:313) org.apache.jasper.servlet.JspServlet.service(JspServ let.java:260) javax.servlet.http.HttpServlet.service(HttpServlet.j ava:717)

3370






How to run the Result Intemation System project in java for collage student in which result of internal exam marks send on parents mobile using SMS? what software required to run this project? please reply immediately...

2586


i am trying to intigrate ejb and hibernate ,from session facade i am callind dao implemented through hibernate,i am getting a ClassDefNotFoundException for this org/hibernate/Session i ve set the class path at build path and in setEnv in weblogic still .........

1964


How to get one hasmap value in another hashmap ,only value not key

921


Can we change the validator-rules.xml for our own validations in struts??

2330


hai, i want to know how the connectionpool manager work in the java or netbeans.Anybody having the exact code plz give to me i have no idea about that so help me plz

2595


plz send code for feature rich resume builder in j2ee frontend:J2EE Backend: DB2 Express

3174


plz send code for Ecorps in j2ee frontend:J2EE Backend: DB2 Express

2391