Replaing strings in html pages
Basically I'm looking to write something that searches for a specific string in an html document and then replaces it by something i defined in the code already. so no need to take user inputs, the code will decide already what its going to do. The rundown :
-need the code to search html docs, aka i have a var named stringx and i want to see where in the document it is found
-once stringx is found, i need to go back a couple of spaces as in search backwards for another string (to check a number that is the name) once that number is found, i need to add a link to a filename that is constant + the number found.
is this clear enough?
thanks in advance if anyone can help
Basically I'm looking to write something that searches
for a specific string in an html document and then
replaces it by something i defined in the code
already. so no need to take user inputs, the code will
decide already what its going to do. The rundown :
-need the code to search html docs, aka i have a var
named stringx and i want to see where in the document
it is found
-once stringx is found, i need to go back a couple of
spaces as in search backwards for another string (to
check a number that is the name) once that number is
found, i need to add a link to a filename that is
constant + the number found.
is this clear enough?
thanks in advance if anyone can helpIf it is a well formed HTML doc you could basically treat it as an XML doc and use SAX n DOM for the purpose. XPath could be an option along with DOM to use.
Similar Messages
-
Converting a String to HTML page
Hi,
I need to know how can i print a jsp page, throw java application.
1. I have a jsp(html) page whice contains some tables
and data. (this page should be printed)
2. I know how to send a String (which represents the jsp
source code) to a java application(server side)
3. I know how to Print a String.
But it's not enough, because i want to print the page itself
and not it's source code.
Please help.
thanks in advance.Hmmm ... I tried the same long time ago ... and failed. ;-)
But as I see now, they added some new methods to the engine.
Now I would start here:
http://jakarta.apache.org/tomcat/tomcat-4.1-doc/jasper/docs/api/org/apache/jasper/servlet/JasperLoader.html
Maybe it just works when you pass your JSP file to this method:
http://jakarta.apache.org/tomcat/tomcat-4.1-doc/jasper/docs/api/org/apache/jasper/servlet/JasperLoader.html#loadClassDataFromFile(java.lang.String)
and write the byte[] in a file.
The problem is, that you just get an class file. (imho, no html file)
But if you execute this class, you should get the proper HTML code...
Hmm... not easy as I said bevore. :-)
Hope I could help you! Let me know if I did.
Happy Coding! :-) © -
Get HTML page content as string in BPEL
Hi!
I would like to get HTML page content as string in BPEL via partnerLink.
So, I define WSDL file for this partnerLink:
<definitions targetNamespace="urn:GetSummaryContent"
xmlns="http://schemas.xmlsoap.org/wsdl/"
xmlns:tns="urn:GetSummaryContent"
xmlns:mime="http://schemas.xmlsoap.org/wsdl/mime/"
xmlns:soap12="http://schemas.xmlsoap.org/wsdl/soap12/"
xmlns:xsd="http://www.w3.org/2001/XMLSchema"
xmlns:soap="http://schemas.xmlsoap.org/wsdl/soap/"
xmlns:http="http://schemas.xmlsoap.org/wsdl/http/">
<message name="MarkMessageAsReadHttpGetIn">
<part name="webid" type="xsd:string"/>
</message>
<message name="MarkMessageAsReadHttpGetOut">
<part name="Body" type="xsd:anyType"/>
</message>
<portType name="getHtmlPageGet">
<operation name="getHtmlPage">
<input message="tns:MarkMessageAsReadHttpGetIn"/>
<output message="tns:MarkMessageAsReadHttpGetOut"/>
</operation>
</portType>
<binding name="MessagingHttpGet" type="tns:getHtmlPageGet">
<http:binding verb="GET"/>
<operation name="getHtmlPage">
<http:operation location=""/>
<input>
<http:urlEncoded/>
</input>
<output>
<mime:content type="text/html" part="Body"/>
</output>
</operation>
</binding>
<service name="Messaging">
<port name="MessagingHttpGet" binding="tns:MessagingHttpGet">
<http:address location="http://server:port/app-context-root/sss.xsql"/>
</port>
</service>
</definitions>
As a result I got bindingFault: [email protected]9b : Could not find binding output for operation getHtmlPage
Could You help me to solve this trouble?
Have You any solution?
Thank You.What are you trying to accomplish?
-
Cannot get the parameter from a HTML page at the first time
Hi everyone,
I have a html page with <input name="username" ...>, and the submit action is a url to my JSF web application. In the start page of the JSF webapp, I use following code to get value of username.
FacesContext facesContext = javax.faces.context.FacesContext.getCurrentInstance();
String userName = (String)facesContext.getExternalContext().getRequestParameterMap().get("username");
However, I cannot get it at the first time. The strange thing is if i click "refresh" button, the value of username is shown up on my page.
Why? and How to solve it?
Thanks in advance,Hi,
Are you putting this code in the public void preprocess() method?
Check out
http://developers.sun.com/prodtech/javatools/jscreator/reference/techart/2/app_model.html
The assumption is that Creator is being used. -
Problem in Opening HTML Page in Internet Explorer from my Swing Application
Hi,
I am opening a HTML file in Internet Explorer from my swing application.I am using the code given below
private final static String WIN_FLAG = "url.dll,FileProtocolHandler";
private final static String WIN_PATH = "rundll32";
String cmd = WIN_PATH + " " + WIN_FLAG + " " + url;
// url is HTML file Path
Process p = Runtime.getRuntime().exec( cmd );
Here there are two things i need to address ...
1)The HTML file is opening up,but it always opens behind the swing application,that makes me every time to maximize the HTML file after it being opened up.I want to open it in front of the Swing Application.I need to implement "Always On Top" functionality for the html page.
2)Whenever i trigger action to open different HTML page,it opens in new Internet Explorer window.
I need to open it in same IE window.
how to solve out these problems ??? any help would be greatly appreciated .Thanks in advance.
- Manikandanany idea about this ????
-
Open a html page from an applet
Hi,
I would like to open, from an applet, an HTML page in a new window with a specific width an eight.
Who can help me ?Here is an example of applet to javascript with liveconnect:
http://www.narhari.com/java/applet2javascript/
You can use the send(String) method from the sample applet to send something like:
window.open('xyz.html','myPopup','width=200,height=200');in order to open a window with specific dimensions
of course you can also build HTML in your applet and use send() + some javascript to render it. -
How to display a text file in a html page?
I have written a servlet that executes a batch file whose output is saved to a txt file. I am attempting to display content of the text file as a html page from the servlet but am struggling I have attempted to read the file and output it as shown below but it is not displaying the correct data
Appreciate some help
Thanks in advance
Amjad
Runtime r = Runtime.getRuntime();
Process p = r.exec("C:\\Condor\\test\\QueStatus.bat");
PrintWriter out;
String title = "Que Status";
response.setContentType("text/html");
out = response.getWriter();
FileReader fr = new FileReader ("C:\\Condor\\test\\output.txt");
for (int chr = fr.read(); chr != -1; chr = fr.read()){
out.print(chr);
}Couple of things. First you should search the forums for how to properly exec a dos batch file. There are some problems you may run into especially with io. I suspect you may not actually be running that batch file at all.
I would test outputing a text file separately from execing the batch file.
Here is some code so you don't have to output 1 char at a time. Setting the content length and closing the output stream might help.
File downloadFile = new File("c:\\blah\\blah\\blah");
response.setContentLength((int)downloadFile.length());
response.setContentType("text/html");
ServletOutputStream out = response.getOutputStream();
FileInputStream fis = new FileInputStream(downloadFile);
byte[] buffer = new byte[1024];
int bytesRead;
while ((bytesRead = fis.read(buffer)) != -1) {
out.write(buffer, 0, bytesRead);
fis.close();
out.close(); -
How ias integrate with Snacktory for getting main text from an html page
Hi All,
i am new to endeca and ias, i have an requirement, need to get main text from whole html page before ias save text to Endeca_Document_Text property,
as ias save all text in page to endeca_document_text property, it is not ok for reading when show in web page, i use an third party API to filter out the main text from original page,
now i want to save these text to endeca_document_text property,
an another question,
i get zero page when doing the logic of filtering main text from original html text in ParseFilter( HTMLMetatagFilter implements ParseFilter) using Snacktory.
if only do little things, it will work fine, if do more thing, clawer fail to crawl page. any one know how to fix it.
log for clawler.
Successfully set recordstore configuration.
INFO 2013-09-03 00:56:42,743 0 com.endeca.eidi.web.Main [main] Reading seed URLs from: /home/oracle/oracle/endeca/IAS/3.0.0/sample/myfirstcrawl/conf/endeca.lst
INFO 2013-09-03 00:56:42,744 1 com.endeca.eidi.web.Main [main] Seed URLs: [http://www.liferay.com/community/forums/-/message_boards/category/]
INFO 2013-09-03 00:56:43,497 754 com.endeca.eidi.web.db.CrawlDbFactory [main] Initialized crawldb: com.endeca.eidi.web.db.BufferedDerbyCrawlDb
INFO 2013-09-03 00:56:43,498 755 com.endeca.eidi.web.Crawler [main] Using executor settings: numThreads = 100, maxThreadsPerHost=1
INFO 2013-09-03 00:56:44,163 1420 com.endeca.eidi.web.Crawler [main] Fetching seed URLs.
INFO 2013-09-03 00:56:46,519 3776 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] come into EndecaHtmlParser getParse
INFO 2013-09-03 00:56:46,519 3776 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] come into HTMLMetatagFilter
INFO 2013-09-03 00:56:46,519 3776 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] meta tag viewport ==minimum-scale=1.0, width=device-width
INFO 2013-09-03 00:56:52,889 10146 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] come into EndecaHtmlParser getParse
INFO 2013-09-03 00:56:52,889 10146 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] come into HTMLMetatagFilter
INFO 2013-09-03 00:56:52,890 10147 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] meta tag viewport ==minimum-scale=1.0, width=device-width
INFO 2013-09-03 00:56:59,184 16441 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] come into EndecaHtmlParser getParse
INFO 2013-09-03 00:56:59,185 16442 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] come into HTMLMetatagFilter
INFO 2013-09-03 00:56:59,185 16442 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] meta tag viewport ==minimum-scale=1.0, width=device-width
INFO 2013-09-03 00:57:07,057 24314 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] come into EndecaHtmlParser getParse
INFO 2013-09-03 00:57:07,057 24314 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] come into HTMLMetatagFilter
INFO 2013-09-03 00:57:07,057 24314 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] meta tag viewport ==minimum-scale=1.0, width=device-width
INFO 2013-09-03 00:57:07,058 24315 com.endeca.eidi.web.Crawler [main] Seeds complete.
INFO 2013-09-03 00:57:07,090 24347 com.endeca.eidi.web.Crawler [main] Starting crawler shut down
INFO 2013-09-03 00:57:07,095 24352 com.endeca.eidi.web.Crawler [main] Waiting for running threads to complete
INFO 2013-09-03 00:57:07,095 24352 com.endeca.eidi.web.Crawler [main] Progress: Level: Cumulative crawl summary (level)
INFO 2013-09-03 00:57:07,095 24352 com.endeca.eidi.web.Crawler [main] host-summary: www.liferay.com to depth 1
host depth completed total blocks
www.liferay.com 0 0 1 1
www.liferay.com 1 0 0 0
www.liferay.com all 0 1 1
INFO 2013-09-03 00:57:07,096 24353 com.endeca.eidi.web.Crawler [main] host-summary: total crawled: 0 completed. 1 total.
INFO 2013-09-03 00:57:07,096 24353 com.endeca.eidi.web.Crawler [main] Shutting down CrawlDb
INFO 2013-09-03 00:57:07,160 24417 com.endeca.eidi.web.Crawler [main] Progress: Host: Cumulative crawl summary (host)
INFO 2013-09-03 00:57:07,162 24419 com.endeca.eidi.web.Crawler [main] Host: www.liferay.com: 0 fetched. 0.0 mB. 0 records. 0 redirected. 4 retried. 0 gone. 0 filtered.
INFO 2013-09-03 00:57:07,162 24419 com.endeca.eidi.web.Crawler [main] Progress: Perf: All (cumulative) 23.6s. 0.0 Pages/s. 0.0 kB/s. 0 fetched. 0.0 mB. 0 records. 0 redirected. 4 retried. 0 gone. 0 filtered.
INFO 2013-09-03 00:57:07,162 24419 com.endeca.eidi.web.Crawler [main] Crawl complete.
~/oracle/endeca
-======================================
source code for parsefilter
package com.endeca.eidi.web.parse;
import java.util.Map;
import java.util.Properties;
import org.apache.hadoop.conf.Configuration;
import org.apache.log4j.Logger;
import org.apache.nutch.metadata.Metadata;
import org.apache.nutch.parse.HTMLMetaTags;
import org.apache.nutch.parse.Parse;
import org.apache.nutch.parse.ParseData;
import org.apache.nutch.parse.ParseFilter;
import org.apache.nutch.protocol.Content;
import de.jetwick.snacktory.ArticleTextExtractor;
import de.jetwick.snacktory.JResult;
public class HTMLMetatagFilter implements ParseFilter {
public static String METATAG_PROPERTY_NAME_PREFIX = "Endeca.Document.HTML.MetaTag.";
public static String CONTENT_TYPE = "text/html";
private static final Logger logger = Logger.getLogger(HTMLMetatagFilter.class);
public Parse filter(Content content, Parse parse) throws Exception {
logger.info("come into EndecaHtmlParser getParse");
logger.info("come into HTMLMetatagFilter");
//update the content with the main text in html page
//content.setContent(HtmlExtractor.extractMainContent(content));
parse.getData().getParseMeta().add("FILTER-HTMLMETATAG", "ACTIVE");
ParseData parseData = parse.getData();
if (parseData == null) return parse;
extractText(content, parse);
logger.info("update the content with the main text content");
return parse;
private void extractText(Content content, Parse parse){
try {
ParseData parseData = parse.getData();
if (parseData == null) return;
Metadata md = parseData.getParseMeta();
ArticleTextExtractor extractor = new ArticleTextExtractor();
String sourceHtml = new String(content.getContent());
JResult res = extractor.extractContent(sourceHtml);
String text = res.getText();
md.set("Endeca_Document_Text", text);
} catch (Exception e) {
// TODO: handle exception
public static void log(String msg){
System.out.println(msg);
public Configuration getConf() {
return null;
public void setConf(Configuration conf) {but it only extracts URLs from <A> (anchor) tags. I want to be able to extract URLs from <MAP> tags as wellGee, do you think you could modify the code to check for "Map" attributes as well.
Can someone maybe point a page containing info on the HTML toolkit for me?It's called the API. Since you are using the HTMLEditorKit and an ElementIterator and an AttributeSet, I would start there.
There is no such API that says "get me all the links", so you have to do a little work on your own.
Maybe you could use a ParserCallback and every time you get a new tag you check for the "href" attribute. -
After a good bit of reading, I found that printing HTML directly to the printer doesn't work for many printers. I don't want to print the code. I want the actual HTML page. I have some code here that reads the HTML file into a JEditorPane and then is rendered into a graphic for printing.
My code compiles fine, I'm not getting any errors, but what I'm also not getting is any pages printing.
Here is the source of my class:
* PrintReport.java
* @author tristan
* Created on September 27, 2007, 4:06 PM
package fedex;
import java.awt.Color;
import java.awt.Dimension;
import java.awt.Graphics;
import java.awt.Graphics2D;
import java.awt.print.PageFormat;
import java.awt.print.Printable;
import java.awt.print.PrinterException;
import java.awt.print.PrinterJob;
import java.io.*;
import javax.print.Doc;
import javax.print.DocFlavor;
import javax.print.DocPrintJob;
import javax.print.PrintException;
import javax.print.PrintService;
import javax.print.PrintServiceLookup;
import javax.print.ServiceUI;
import javax.print.SimpleDoc;
import javax.print.attribute.*;
import javax.print.attribute.standard.*;
import javax.swing.JEditorPane;
import javax.swing.RepaintManager;
public class PrintReport
private FileInputStream fileIS;
private Doc doc;
private DocAttributeSet das = new HashDocAttributeSet ();
PrintService defaultService;
/** Creates a new instance of PrintReport */
public PrintReport ()
public void printHTMLFile (String file) throws FileNotFoundException, IOException
PrintableEditorPane jEditorPane = new PrintableEditorPane ();
jEditorPane.setContentType ("text/html");
jEditorPane.read (new BufferedInputStream (new FileInputStream (file)), "");
System.out.println (jEditorPane.getText ());
PrinterJob job = PrinterJob.getPrinterJob ();
job.setPrintable (jEditorPane);
if (job.printDialog ())
try
job.print ();
catch (Exception ex)
System.out.println (ex);
public class PrintableEditorPane extends JEditorPane implements Printable, Serializable
public int print (Graphics g, PageFormat pf, int pageIndex) throws PrinterException
Graphics2D g2 = (Graphics2D)g;
g2.setColor (Color.black);
RepaintManager.currentManager (this).setDoubleBufferingEnabled (false);
Dimension d = this.getSize ();
double panelWidth = d.width;
double panelHeight = d.height;
double pageWidth = pf.getImageableWidth ();
double pageHeight = pf.getImageableHeight ();
double scale = pageWidth / panelWidth;
int totalNumPages = (int)Math.ceil (scale * panelHeight / pageHeight);
System.out.println ("Total pages to print are " + totalNumPages);
if (pageIndex >= totalNumPages) return Printable.NO_SUCH_PAGE;
g2.translate (pf.getImageableX (), pf.getImageableY ());
g2.translate (0f, -pageIndex * pageHeight);
g2.scale (scale, scale);
this.paint (g2);
return Printable.PAGE_EXISTS;
}I tried using a complex HTML file as well as a basic one, but everytime Total pages to print are 0 is printed back to the console. I can't figure out why it's unable to create any pages to print.DrClap wrote:
I think you will find thatDimension d = this.getSize ();produces (0, 0) until you actually display your component somewhere. I believe it's possible to "display" it in such a way that it doesn't appear on the user's screen, but I don't know how.
There's a Swing forum here where you get answers from people who are good at Swing. Posting here attracts answers from people like me who are mediocre at it.I think that calling validate() on a Container will lay it out and size it's components, even if it hasn't been made visible, but don't take my word for it. -
How do I change filename for single-image HTML pages in web gallery for LR 3
Hi,
I would like to change the default filename for the HTML pages produced by Lightroom 3 in the web gallery. Specifically, I would like to remove the string "_large" from the HTML filename, so it corresponds exactly to the image filename.
Specifically, the URL by default is like this:
http://www.mysite.com/content/_c_EMT2146_large.html
and I would like it to be like this
http://www.mysite.com/content/_c_EMT2146.html
I assume there is some way to make this change using the template editors, but I can't figure it out.
thanks,
James RiemermannTurns out there's a fairly easy way to change that on Windows, but I can't figure out an equivalent on a Mac, which is where the need is. On windows, I can edit (after backing up) the following two config files:
C:\Program Files\Adobe\Adobe Photoshop Lightroom 3.6\Shared\webengines\default_html.lrwebengine\grid.html
C:\Program Files\Adobe\Adobe Photoshop Lightroom 3.6\Shared\webengines\default_html.lrwebengine\manifest.lrweb
In which the "_large" string is appended and can be removed, following which the gallery works exactly as I would hope.
But I can't find the equivalent files on the Mac, though I would assume they exist. -
How to display an image on the HTML page by using applet
Dear friends,
I am now writing an java applet, I want to display an image on the HTML page, and tried the following commands:
{color:#ff00ff}{color:#000000}Image map; //put in the class definition
map=getImage(getCodeBase(),"hhh.gif"); //put in the function of init()
g.drawImage(map,0,300,this); //put in the function of paint(){color}
{color}
However, when I run it, the image wasn't displayed at all.
I hope who guys ever come across this problem could help me to solve it. Thank you in advance!
HawaiiHi,
I am no expert on Images
but
from personal exp.
are you sure that map actually contains the image?
try using
ImageIcon ii = ImageIcon(String name);
map = ii.getImage();I saw a tutorial on images i think on sun
where they use ImageIcon to load the image -
Replace HTML page with an applet
Hello to everybody!
I created a servlet who can manage file upload from an HTML page.
Now I want to replace that HTML page with an applet.
I know that I need to use multipart/form-data , but I have no experience with applet.
HTML page code is:
<html>
<head><title>Upload</title></head>
<body>
<FORM ACTION="http://localhost:8080/examples/servlets/servlet/UploadTest" ENCTYPE="multipart/form-data" METHOD=POST>
What is your name? <INPUT TYPE=TEXT NAME=submitter> <BR>
Which file do you want to upload? <INPUT TYPE=FILE NAME=file> <BR>
<INPUT TYPE=SUBMIT>
</FORM>
</body>
</html>
Any help for "translating" this html page into an applet?
Applet will send, like html page, a string and a file.
Thanks in advance, LiukIf you use something like Apache Http Client you dont have to do any low level implementations.
Also if you are the one who is writing both servlet and applet you dot have to stick to standerd formats. you can develop your own format and write both servlet and applet to handle it. that kind of thing can be done easily even with URLConnection. But if you plan to send large files then URLConnection may not be the right option becouse I have seen many complaining that URLConnection gives OutOfMemoryErrors when uploading large amounts of data. -
Can I use multiple p class="logos" tag with the same name within the same html page?
I was told not to use <div class> tags too many times. I was using them for text, images, to clear floats, I basically built my website using multiple <div class> tags. So if I can't use multiple <div class> tags could I use <p class> tags multiple times in the same html page?
I have a string of logos at the bottom of my webpage which will all be using the same css characteristics for all logos. Would this be the proper way to write the code:
HTML
<p class="logos">Logo1<a href="...></a></p>
<p class="logos">Logo2<a href="...></a></p>
<p class="logos">Logo3<a href="...></a></p>
<p class="logos">Logo4<a href="...></a></p>
<p class="logos">Logo5<a href="...></a></p>
<p class="logos">Logo6<a href="...></a></p>
CSS
.logos {
margin-left:10px;
Here's my website: http://www.darbymanufacturing.com/test_website/index.html - this is the website built with all div class tags
I restarted the website in order to write the code properly so that I don't come to errors when uploading on the server like I am having with the website link above.Instead of writing something like this -
<p class="logos">Logo1<a href="...></a></p>
<p class="logos">Logo2<a href="...></a></p>
<p class="logos">Logo3<a href="...></a></p>
<p class="logos">Logo4<a href="...></a></p>
<p class="logos">Logo5<a href="...></a></p>
<p class="logos">Logo6<a href="...></a></p>
Why not have something like this -
<div id="logodiv">
<p>Logo1<a href="...></a></p>
<p>Logo2<a href="...></a></p>
<p>Logo3<a href="...></a></p>
<p>Logo4<a href="...></a></p>
<p>Logo5<a href="...></a></p>
<p>Logo6<a href="...></a></p>
</div>
with CSS like this -
#logodiv p { ... } -
Opening a HTML page on clicking of button .
i created a button under menuItem help , i want to open a HTML page in other window (that is save inside my project folder ) when i click that button.
for this i created jeditorpane and passing the URL in to its constructor but i m getting exception when i m using URL like "D:/x/y.html" exception is protocol not find and in the case of "http://localhost/d:/x/y.html" exception is java.net.ConnectException: Connection refused: connect
at java.net.PlainSocketImpl.socketConnect(Native Method
my code is
import javax.swing.JEditorPane;
import javax.swing.JFrame;
import javax.swing.JLabel;
import javax.swing.JScrollPane;
import javax.swing.SwingUtilities;
import java.awt.Color;
import java.awt.Container;
import java.io.IOException;
public class HelpDialog {
* height of frame in pixels
private static final int height = 1000;
* width of frame in pixels
private static final int width = 1000;
private static final String TITLE_STRING = "Help";
* URL of page we want to display
private static final String URL = "http://localhost/D:/SVGcode/javaApplet/src/com/Documentation.html";
// --------------------------- main() method ---------
public HelpDialog()
SwingUtilities.invokeLater( new Runnable()
public void run()
System.out.println( "Starting" );
final JFrame jframe =
new JFrame( TITLE_STRING );
Container contentPane = jframe.getContentPane();
jframe.setSize( width, height );
contentPane.setBackground( Color.WHITE );
contentPane.setForeground( Color.BLUE );
jframe.setDefaultCloseOperation( JFrame.DISPOSE_ON_CLOSE);
try
System.out.println( "acquiring URL" );
JEditorPane jep = new JEditorPane(URL);
System.out.println( "URL acquired" );
JScrollPane jsp =
new JScrollPane( jep,
JScrollPane.VERTICAL_SCROLLBAR_AS_NEEDED,
JScrollPane.HORIZONTAL_SCROLLBAR_AS_NEEDED );
contentPane.add( jsp );
catch ( IOException e )
System.err.println( "can't find URL" );
e.printStackTrace();
contentPane.add( new JLabel( "can't find URL" ) );
jframe.validate();
jframe.setVisible( true );
plz help meHi Sandy,
Instead of giving url in the constructor. Paste the code below the JEditorPane. Remove the url from the JEditorPane constructor.
try{
/** Files from system */
java.net.URL helpURL = new URL ("file:///" + "C:\\thread.jspa.htm");
/** Files from inter net */
helpURL = new URL ("http://forums.sun.com/thread.jspa?threadID=5421146&tstart=0");
if(helpURL!=null)
jHelp.setPage(helpURL);
else
System.out.println("Couldnt find file");
}catch(MalformedURLException exp){
System.out.println("Exception in page loading---"+exp);
}catch(IOException ioexp){
System.out.println("Exception in page loading"+ioexp);
} -
Read Text from HTML-Pages and want to solve "ChangedCharSetException"
Hello,
I have an app that connect via threads with pages and parse them an gives me only the Text-version of a HTML-page. Works fine, but if it found a page, where the text is within images, than the whole app stopps and gave me the message:
javax.swing.text.ChangedCharSetException
at javax.swing.text.html.parser.DocumentParser.handleEmptyTag(DocumentParser.java:169)
at javax.swing.text.html.parser.Parser.startTag(Parser.java:372)
at javax.swing.text.html.parser.Parser.parseTag(Parser.java:1846)
at javax.swing.text.html.parser.Parser.parseContent(Parser.java:1881)
at javax.swing.text.html.parser.Parser.parse(Parser.java:2047)
at javax.swing.text.html.parser.DocumentParser.parse(DocumentParser.java:106)
at javax.swing.text.html.parser.ParserDelegator.parse(ParserDelegator.java:78)
at aufruf.main(aufruf.java:33)So I tried to catch them with "getCharSetSpec()" and "keyEqualsCharSet( )" from the class "javax.swing.text.ChangedCharSetException" and hoped that this solved the problem. But still doesen't work...
Then I looked at the web and found, that I have to add the line:
doc.putProperty("IgnoreCharsetDirective", new Boolean(true));"doc." is a new HTML Dokument, created with the HTMLEditorKit. I do not have much knowledge about that and so I hope, that someone can explain me, how I can solve that problem, within my code.
Here we go:
import javax.swing.text.*;
import java.lang.*;
import java.util.*;
import java.net.*;
import java.io.*;
import javax.swing.text.html.*;
import javax.swing.text.html.parser.*;
public class myParser extends Thread
private String name;
public void run()
try
URL viele = new URL(name); // "name" ia a variable with a lot of links
URLConnection hs = viele.openConnection();
hs.connect();
if (hs.getContentType().startsWith("text/html"))
InputStream is = hs.getInputStream();
InputStreamReader isr = new InputStreamReader(is);
BufferedReader br = new BufferedReader(isr);
Lesen los = new Lesen();
ParserDelegator parser = new ParserDelegator();
parser.parse(br,los, false);
catch (MalformedURLException e)
System.err.print("Doesn't work");
catch (ChangedCharSetException e)
e.getCharSetSpec();
e.keyEqualsCharSet();
e.printStackTrace();
catch (Exception o)
public void vowi(String n)
name = n;
}and for the case that it is important here is the class "Lesen"
import java.net.*;
import java.io.*;
import javax.swing.text.*;
import javax.swing.text.html.*;
import javax.swing.text.html.parser.*;
class Lesen extends HTMLEditorKit.ParserCallback
public void handleStartTag(HTML.Tag t, MutableAttributeSet a, int pos)
try
if ((t==HTML.Tag.P) || (t==HTML.Tag.H1) || (t==HTML.Tag.H2) || (t==HTML.Tag.H3) || (t==HTML.Tag.H4) || (t==HTML.Tag.H5) || (t==HTML.Tag.H6))
System.out.println();
catch (Exception q)
System.out.println(q.getMessage());
public void handleSimpleTag(HTML.Tag t,MutableAttributeSet a, int pos)
try
if (t==HTML.Tag.BR)
System.out.println(); // Neue Zeile
System.out.println();
catch (Exception qw)
System.out.println(qw.getMessage());
public void handleText(char[] data, int pos)
try
System.out.print(data); // prints the text from HTML-pages
catch (Exception ab)
System.out.println(ab.getMessage());
}Thanks a lot for helping...
Stephanparser.parse(br,los, false);
parser.parse(br,los, true);
Maybe you are looking for
-
How do i connect my ipad to WLAN internet? it says its loading but never comes up with a tick
I have recently moved to Madrid in Spain from England where i bought my iPad. neither my iPad nor iPhone will connect to the WLAN internet in the house that I am living in. When i type in the password it saying it is loading however the tick never co
-
I've been having this problem for a long time so I thought I'd post again and maybe get a hit this time. We run off windows cfmx 7.x now fully patched, prior to that we were on cfmx 6.x. We use session management only. Many times on a new page hit th
-
Font sizes between Photoshop and Edge - Any Accurate conversion table?
I have been trying to make sense of the font sizes between Photoshop and Edge. It seems that the conversion tables I have found do not actually give the right numbers and I am wondering if there is a table from Adobe that would make it easy to have
-
how can i find out which version is on my machine? I need help recovering over 600 photos that were in my library and now I cannot see them. I rec'd a message the file is unreadable. I believe the version i have is the one that came with the emac in
-
Hello, I'm developing a WDP application which runs outside the EP. I have also an EP installation with a customized theme running. I can use the delivered SAP themes for my application running outside the EP. Can somebody tell me how I can reference