Trim filter for iPlanet - whitespace removing from html
Hi,
I've just published my Trim Filter NSAPI plugin for Sun ONE Web Server/iPlanet.
It is a plugin/filter which removes whitespace from HTML code. It's LGPL and it can be found here: http://www.thrull.com/iplanet/
Feel free to test it :-)
BR,
Igor
Cool, thanks for sharing it!
I took a quick look, and I think I spotted a bug in the write filter method:
    /* workaround for a bug? in iPlanet, when returning empty content */
    if (!out_size && amount) {
        rv = net_write(layer->lower, NON_EMPTY_STRING, 1);
        out_size = 1;
    } else {
        rv = net_write(layer->lower, (const char *)buffer, out_size);
    }
    return rv;
According to the NSAPI Programmer's Guide (http://docs.sun.com/source/817-6252/npgnsapi.html#wp1004627), the write filter method should return the number of bytes consumed on success. It looks like your write filter method is returning the number of bytes written instead. Perhaps that's the reason you needed that workaround?
Similar Messages
-
ASDocs - Preventing style and tags removing from HTML
Hi,
I am using Flex 4 ASDoc
When I write in my comments:
<span class="myClass">whatever</span>
entire tag including text is removed (myClass is present in style.css)
When I write:
<span style="color: #FF0000;">whatever</span>
style attribute is removed.
I have a feeling this has something to do with the XSLTs and other settings. Can someone point me to the files where changes can be made so that custom styles and classes are not processed/removed?
Thank you.
bump it up.
-
I have searched Finder, Trash and the applications folder but i can't find where it is kept. I want to put the icon for Dashboard back in the dock - how can i do this?
The combo update may replace anything you have inadvertently deleted.
Mac OS X 10.6.8 Update Combo v1.1
You can reapply it over the top of any previously downloaded version. I see that your profile says 10.6.8.
Then run Software Update to pick up any new security and iTunes updates. -
Sub removed from my country - refund question
Has anyone ever come across a problem with subscriptions and refunds where the magazine was removed from the store before your sub was up?
I’m a yearly subscriber to the Newsstand app “Popular Science+”. Recently I noticed that the monthly subscription was late downloading. While trying to fix it I removed the app and all my downloaded issues, and tried to reinstall.
It’s only at this point that I noticed that the app was no longer available in the Australian iTunes store.
I contacted the publisher who confirmed that the magazine had been removed from the Australian store and wouldn’t be returning.
As I still have about 6 months of my subscription left they said I would need to contact Apple to get a refund.
My question is: what kind of refund should I be seeking? I’ve still got 6 months left, so should it be 50% of my yearly sub? But I’ve now lost access to the back issues that I had already paid for, so should I be seeking a 100% refund? I was under the impression that even if I ended my sub those back issues would always be available for me to download?
Yes it is annoying. I didn't mention in my original post, but it took some time to get into contact with US PopSci. The US website redirects to the Australian site, which is a different entity altogether. I ended up having to use an overseas proxy to get to their support pages. Then there was the usual confusion while it was passed around, forgotten, repassed etc. before they worked out what was wrong.
There was no attempt from anyone (US PopSci, Aus PopSci or Apple) to let me know the Newsstand magazine I had paid for had been removed from my country. It was up to me to notice the lack of updates and chase it up.
US PopSci did try to make things right. They offered to transfer my sub to zinio. But it's not the same so I elected to get a refund from Apple. They gave me a percentage of the sub back based on how many months I had left, but I still lost access to the back issues I had already paid for. Zinio may be an option for you though.
It's a shame because I enjoyed the mag. Aus PopSci isn't as good and is really expensive. -
Deploying Oracle Password filter for Microsoft Active Directory.
We created the Oracle Wallet with a self-signed certificate. When we set up the Oracle Password Filter for Microsoft Active Directory from the LDAP server, we get the error "Certificate is not valid".
Is this OK, or should we be getting a success message?
You can't ignore it. A valid certificate is important to make it work.
--olaf -
How to integrate IAS with Snacktory to get the main text from an HTML page
Hi All,
I am new to Endeca and IAS. I have a requirement to extract the main text from the whole HTML page before IAS saves the text to the Endeca_Document_Text property.
Because IAS saves all the text on the page to Endeca_Document_Text, the result is not readable when shown on a web page, so I use a third-party API to filter the main text out of the original page.
Now I want to save this text to the Endeca_Document_Text property.
Another question:
I get zero pages when I do the main-text filtering in the ParseFilter (HTMLMetatagFilter implements ParseFilter) using Snacktory.
If the filter does only a little work it runs fine; if it does more, the crawler fails to crawl the page. Does anyone know how to fix this?
Log for the crawler:
Successfully set recordstore configuration.
INFO 2013-09-03 00:56:42,743 0 com.endeca.eidi.web.Main [main] Reading seed URLs from: /home/oracle/oracle/endeca/IAS/3.0.0/sample/myfirstcrawl/conf/endeca.lst
INFO 2013-09-03 00:56:42,744 1 com.endeca.eidi.web.Main [main] Seed URLs: [http://www.liferay.com/community/forums/-/message_boards/category/]
INFO 2013-09-03 00:56:43,497 754 com.endeca.eidi.web.db.CrawlDbFactory [main] Initialized crawldb: com.endeca.eidi.web.db.BufferedDerbyCrawlDb
INFO 2013-09-03 00:56:43,498 755 com.endeca.eidi.web.Crawler [main] Using executor settings: numThreads = 100, maxThreadsPerHost=1
INFO 2013-09-03 00:56:44,163 1420 com.endeca.eidi.web.Crawler [main] Fetching seed URLs.
INFO 2013-09-03 00:56:46,519 3776 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] come into EndecaHtmlParser getParse
INFO 2013-09-03 00:56:46,519 3776 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] come into HTMLMetatagFilter
INFO 2013-09-03 00:56:46,519 3776 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] meta tag viewport ==minimum-scale=1.0, width=device-width
INFO 2013-09-03 00:56:52,889 10146 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] come into EndecaHtmlParser getParse
INFO 2013-09-03 00:56:52,889 10146 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] come into HTMLMetatagFilter
INFO 2013-09-03 00:56:52,890 10147 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-1] meta tag viewport ==minimum-scale=1.0, width=device-width
INFO 2013-09-03 00:56:59,184 16441 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] come into EndecaHtmlParser getParse
INFO 2013-09-03 00:56:59,185 16442 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] come into HTMLMetatagFilter
INFO 2013-09-03 00:56:59,185 16442 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] meta tag viewport ==minimum-scale=1.0, width=device-width
INFO 2013-09-03 00:57:07,057 24314 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] come into EndecaHtmlParser getParse
INFO 2013-09-03 00:57:07,057 24314 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] come into HTMLMetatagFilter
INFO 2013-09-03 00:57:07,057 24314 com.endeca.eidi.web.parse.HTMLMetatagFilter [pool-1-thread-2] meta tag viewport ==minimum-scale=1.0, width=device-width
INFO 2013-09-03 00:57:07,058 24315 com.endeca.eidi.web.Crawler [main] Seeds complete.
INFO 2013-09-03 00:57:07,090 24347 com.endeca.eidi.web.Crawler [main] Starting crawler shut down
INFO 2013-09-03 00:57:07,095 24352 com.endeca.eidi.web.Crawler [main] Waiting for running threads to complete
INFO 2013-09-03 00:57:07,095 24352 com.endeca.eidi.web.Crawler [main] Progress: Level: Cumulative crawl summary (level)
INFO 2013-09-03 00:57:07,095 24352 com.endeca.eidi.web.Crawler [main] host-summary: www.liferay.com to depth 1
host             depth  completed  total  blocks
www.liferay.com  0      0          1      1
www.liferay.com  1      0          0      0
www.liferay.com  all    0          1      1
INFO 2013-09-03 00:57:07,096 24353 com.endeca.eidi.web.Crawler [main] host-summary: total crawled: 0 completed. 1 total.
INFO 2013-09-03 00:57:07,096 24353 com.endeca.eidi.web.Crawler [main] Shutting down CrawlDb
INFO 2013-09-03 00:57:07,160 24417 com.endeca.eidi.web.Crawler [main] Progress: Host: Cumulative crawl summary (host)
INFO 2013-09-03 00:57:07,162 24419 com.endeca.eidi.web.Crawler [main] Host: www.liferay.com: 0 fetched. 0.0 mB. 0 records. 0 redirected. 4 retried. 0 gone. 0 filtered.
INFO 2013-09-03 00:57:07,162 24419 com.endeca.eidi.web.Crawler [main] Progress: Perf: All (cumulative) 23.6s. 0.0 Pages/s. 0.0 kB/s. 0 fetched. 0.0 mB. 0 records. 0 redirected. 4 retried. 0 gone. 0 filtered.
INFO 2013-09-03 00:57:07,162 24419 com.endeca.eidi.web.Crawler [main] Crawl complete.
~/oracle/endeca
-======================================
source code for parsefilter
package com.endeca.eidi.web.parse;
import java.util.Map;
import java.util.Properties;
import org.apache.hadoop.conf.Configuration;
import org.apache.log4j.Logger;
import org.apache.nutch.metadata.Metadata;
import org.apache.nutch.parse.HTMLMetaTags;
import org.apache.nutch.parse.Parse;
import org.apache.nutch.parse.ParseData;
import org.apache.nutch.parse.ParseFilter;
import org.apache.nutch.protocol.Content;
import de.jetwick.snacktory.ArticleTextExtractor;
import de.jetwick.snacktory.JResult;
public class HTMLMetatagFilter implements ParseFilter {
    public static String METATAG_PROPERTY_NAME_PREFIX = "Endeca.Document.HTML.MetaTag.";
    public static String CONTENT_TYPE = "text/html";
    private static final Logger logger = Logger.getLogger(HTMLMetatagFilter.class);

    public Parse filter(Content content, Parse parse) throws Exception {
        logger.info("come into EndecaHtmlParser getParse");
        logger.info("come into HTMLMetatagFilter");
        //update the content with the main text in html page
        //content.setContent(HtmlExtractor.extractMainContent(content));
        // check for null before touching the parse data
        ParseData parseData = parse.getData();
        if (parseData == null) return parse;
        parseData.getParseMeta().add("FILTER-HTMLMETATAG", "ACTIVE");
        extractText(content, parse);
        logger.info("update the content with the main text content");
        return parse;
    }

    private void extractText(Content content, Parse parse) {
        try {
            ParseData parseData = parse.getData();
            if (parseData == null) return;
            Metadata md = parseData.getParseMeta();
            ArticleTextExtractor extractor = new ArticleTextExtractor();
            String sourceHtml = new String(content.getContent());
            JResult res = extractor.extractContent(sourceHtml);
            String text = res.getText();
            md.set("Endeca_Document_Text", text);
        } catch (Exception e) {
            // log the failure instead of swallowing it silently
            logger.error("main text extraction failed", e);
        }
    }

    public static void log(String msg) {
        System.out.println(msg);
    }

    public Configuration getConf() {
        return null;
    }

    public void setConf(Configuration conf) {
        // body not shown in the original post
    }
}
but it only extracts URLs from <A> (anchor) tags. I want to be able to extract URLs from <MAP> tags as well
Gee, do you think you could modify the code to check for "Map" attributes as well.
Can someone maybe point me to a page containing info on the HTML toolkit?
It's called the API. Since you are using the HTMLEditorKit and an ElementIterator and an AttributeSet, I would start there.
There is no such API that says "get me all the links", so you have to do a little work on your own.
Maybe you could use a ParserCallback and every time you get a new tag you check for the "href" attribute. -
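To make the ParserCallback suggestion a bit more concrete, here is a minimal sketch of how it might look. The class name LinkExtractor and the method extractLinks are made up for illustration, and the sketch assumes the image-map links live in the href attribute of <AREA> tags nested inside <MAP>. Because <AREA> has no closing tag, the parser reports it through handleSimpleTag rather than handleStartTag, so both callbacks are overridden.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper, not part of any toolkit: collects link targets
// from <a> tags and from <area> tags (the children of <map>).
public class LinkExtractor {

    /** Returns the href values of all <a> and <area> tags, in document order. */
    public static List<String> extractLinks(String html) throws IOException {
        final List<String> links = new ArrayList<>();

        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                collect(tag, attrs); // tags with a closing counterpart, e.g. <a>
            }

            @Override
            public void handleSimpleTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                collect(tag, attrs); // empty tags, e.g. <area>
            }

            private void collect(HTML.Tag tag, MutableAttributeSet attrs) {
                if (tag == HTML.Tag.A || tag == HTML.Tag.AREA) {
                    Object href = attrs.getAttribute(HTML.Attribute.HREF);
                    if (href != null) {
                        links.add(href.toString());
                    }
                }
            }
        };

        // 'true' tells the parser to ignore any charset declared in the markup
        new ParserDelegator().parse(new StringReader(html), callback, true);
        return links;
    }

    public static void main(String[] args) throws IOException {
        String html = "<html><body><a href=\"page1.html\">one</a>"
                + "<map name=\"m\"><area shape=\"rect\" coords=\"0,0,10,10\" href=\"region.html\"></map>"
                + "</body></html>";
        System.out.println(extractLinks(html));
    }
}
```

Dropping the tag check in collect would pick up every tag that carries an href (for example <link> and <base>), which may or may not be what you want.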
Hi there!
I'm searching for a method to remove Tags from HTML (using HTMLEditorKit, HTMLDocument ...).
My current code is as follows:
// first get the whole paragraph
int iCaretPos = tpMyTextPane.getCaretPosition();
Object oAttrib;
HTMLDocument.BlockElement oElem = (HTMLDocument.BlockElement)oMyDocument.getParagraphElement(iCaretPos);
AttributeSet oAttribs;
SimpleAttributeSet oNewAttribs;
int iParaStart = oElem.getStartOffset();
int iParaEnd = oElem.getEndOffset();
tpMyTextPane.select(iParaStart, iParaEnd);
// the following only fetches the Tags that are valid for the whole paragraph!!!!!
oAttribs = tpMyTextPane.getCharacterAttributes();
oNewAttribs = new SimpleAttributeSet(oAttribs);
if(iParaEnd - iParaStart > 0)
{
    // now analyse the attributes (remove all paragraph-tags)
    for(int iIndex = 0; iIndex < oaOurFormatTags.length; iIndex++)
        oNewAttribs.removeAttribute(oaOurFormatTags[iIndex]);
    if(iParaEnd - iParaStart > 0)
        oMyDocument.setCharacterAttributes(iParaStart, iParaEnd - iParaStart, oNewAttribs, true);
    tpMyTextPane.setCaretPosition(iCaretPos);
    tpMyTextPane.requestFocus();
    tpMyTextPane.repaint();
}
This code works for me, but all tags of the selected paragraph are removed. That means:
<P><H1>This is a <B>test</B> text</H1></P>
will be converted to:
<P>This is a test text</P>
but I want it to be converted to:
<P>This is a <B>test</B> text</P>
Is there any other method to remove specific tags (<H1>, ... <H6>) without touching other tags?
In February I wrote a feature request about this. Today it has been accepted to the bug database. Please make your vote:
http://developer.java.sun.com/developer/bugParade/bugs/4760082.html -
Help removing encoded html from string
Hello,
I am trying to remove all html from a user input string.
something like this:
<cfset string ="sample text <a>">
I have tried:
Replace(string , "<[^>]*>", "", "ALL")
Replace(string , "<br &##x2f;>>", "", "ALL")
Neither removes the html, can anyone help?
The related functions for replacing strings using regular expressions are REReplace and REReplaceNoCase. For example,
<cfset string ="sample text <a>">
<cfoutput>#REReplace(string , "<[^>]*>", "", "ALL")#</cfoutput> -
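The same literal-versus-regex distinction shows up in other languages too. As a rough illustration only (this is Java, not ColdFusion, and the class name StripTags is made up), String.replace treats the pattern as literal text, much like Replace, while String.replaceAll interprets it as a regular expression, much like REReplace:

```java
// Hypothetical demo class: contrasts literal replacement with regex replacement.
public class StripTags {

    /** Removes anything that looks like a tag: '<', any run of non-'>' characters, then '>'. */
    public static String stripTags(String input) {
        return input.replaceAll("<[^>]*>", "");
    }

    public static void main(String[] args) {
        String s = "sample text <a>";

        // Literal replacement: looks for the exact characters "<[^>]*>",
        // which do not occur in the string, so nothing is removed.
        System.out.println("[" + s.replace("<[^>]*>", "") + "]");

        // Regex replacement: the pattern matches "<a>" and removes it.
        System.out.println("[" + stripTags(s) + "]");
    }
}
```

Note that a pattern like this only strips literal angle-bracket tags. If the input arrives HTML-encoded, it would presumably have to be decoded first before the pattern can match anything.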
I have a tax exempt form and I need to get taxes removed from my creative cloud bill for two differe
I have a tax exempt form and I need to get taxes removed from my creative cloud bill for two different accounts. How do I do this?
Here you go: http://helpx.adobe.com/x-productkb/policy-pricing/place-tax-exempt-order.html#main_Tax_exempt_orders_in_the_U_S__and_Canada
Contact support. -
MAC address gets removed from the CAM 4.8 Filter List??
Hi,
We have WLC and NAC integration going. I have a smartphone Wi-Fi client that I would like to exclude from logon and posture assessment, so I added its MAC to the CAM's Filter List. It works fine for a short while, and then the MAC gets deleted from the list for some reason. Here’s what the CAM log shows (roughly an hour passes between the MAC being manually added and it being deleted):
Any ideas why it's happening and how to fix this?
Faisal,
I looked through the NAC log again, and I did miss another piece of info, so here's how the log looks after adding the MAC in the filter list:
Administration 2010-11-01 17:53:31 38:E7:D8:0B:42:A3 added to MAC list
Administration 2010-11-01 18:51:10 38:E7:D8:0B:42:A3 removed from the MAC list
Administration 2010-11-01 18:51:10 (API removemac): Succeeded!
So, it looks like there is some process running that removes the MAC from the list roughly an hour after every time it's added?
Dmitri. -
I have an iPad 2 and receive e-mail through a regular internet connection. Lately, when I delete e-mails and clear my Trash folder, they re-appear, downloaded anew from the server and marked as unread. My e-mail Advanced Settings are set to "Remove from server when moved from Inbox." Even when I access the e-mail account from my desktop and delete the e-mails, they re-appear on my iPad. Does anyone have any idea what is causing this and what I can do to actually delete them?
Sounds like you are looking in the wrong Administrative Group container which is why you are seeing your Exchange 2010 servers in there.
When you install Exchange 2003 only you will see a container named by default as "CN=First Administrative Group" container. But this could be named anything if you changed the Organization Name on the installation when you installed the first Exchange 2003 server into the domain/forest.
You will notice that when you install Exchange 2010 part of the AD setup is to create a new configuration container and is named by default "CN=First Administrative Group (FYDIBOHF23SPDLT)".
So it sounds like you are not looking in the right location within ADSIEdit.
You may find the following article also helpful for this issue which is the same resolution:
http://blogs.technet.com/b/sbs/archive/2012/05/17/empty-cn-servers-container-causing-issues-with-public-folders-on-small-business-server-2011.aspx
I recommend, though, that you ensure your Exchange 2003 servers are fully uninstalled or no longer present in your environment before you go deleting the Servers container. The following Microsoft article will help with this:
http://technet.microsoft.com/en-gb/library/gg576862(v=exchg.141).aspx -
Revision: 3120
Author: [email protected]
Date: 2008-09-05 10:44:10 -0700 (Fri, 05 Sep 2008)
Log Message:
When you point Flex Builder at a local sandbox trunk build, it couldn't generate the html-templates folder correctly for new projects so we moved all the html templates up one level and removed the html-templates directory and adjusted build.xml's to accommodate the directory change
Modified Paths:
flex/sdk/trunk/build.xml
flex/sdk/trunk/webapps/webtier/build.xml
Added Paths:
flex/sdk/trunk/templates/client-side-detection/
flex/sdk/trunk/templates/client-side-detection/AC_OETags.js
flex/sdk/trunk/templates/client-side-detection/index.template.html
flex/sdk/trunk/templates/client-side-detection-with-history/
flex/sdk/trunk/templates/client-side-detection-with-history/AC_OETags.js
flex/sdk/trunk/templates/client-side-detection-with-history/history/
flex/sdk/trunk/templates/client-side-detection-with-history/history/history.css
flex/sdk/trunk/templates/client-side-detection-with-history/history/history.js
flex/sdk/trunk/templates/client-side-detection-with-history/history/historyFrame.html
flex/sdk/trunk/templates/client-side-detection-with-history/index.template.html
flex/sdk/trunk/templates/express-installation/
flex/sdk/trunk/templates/express-installation/AC_OETags.js
flex/sdk/trunk/templates/express-installation/index.template.html
flex/sdk/trunk/templates/express-installation/playerProductInstall.swf
flex/sdk/trunk/templates/express-installation-with-history/
flex/sdk/trunk/templates/express-installation-with-history/AC_OETags.js
flex/sdk/trunk/templates/express-installation-with-history/history/
flex/sdk/trunk/templates/express-installation-with-history/history/history.css
flex/sdk/trunk/templates/express-installation-with-history/history/history.js
flex/sdk/trunk/templates/express-installation-with-history/history/historyFrame.html
flex/sdk/trunk/templates/express-installation-with-history/index.template.html
flex/sdk/trunk/templates/express-installation-with-history/playerProductInstall.swf
flex/sdk/trunk/templates/metadata/
flex/sdk/trunk/templates/metadata/AC_OETags.js
flex/sdk/trunk/templates/metadata/readme.txt
flex/sdk/trunk/templates/no-player-detection/
flex/sdk/trunk/templates/no-player-detection/AC_OETags.js
flex/sdk/trunk/templates/no-player-detection/index.template.html
flex/sdk/trunk/templates/no-player-detection-with-history/
flex/sdk/trunk/templates/no-player-detection-with-history/AC_OETags.js
flex/sdk/trunk/templates/no-player-detection-with-history/history/
flex/sdk/trunk/templates/no-player-detection-with-history/history/history.css
flex/sdk/trunk/templates/no-player-detection-with-history/history/history.js
flex/sdk/trunk/templates/no-player-detection-with-history/history/historyFrame.html
flex/sdk/trunk/templates/no-player-detection-with-history/index.template.html
Removed Paths:
flex/sdk/trunk/templates/html-templates/
Remember that Arch Arm is a different distribution, but we try to bend the rules and provide limited support for them. This may or may not be unique to Arch Arm, so you might try asking on their forums as well.
-
Would like to know why I have twice removed from the account for $ 1
Would like to know why I have twice removed from the account for $ 1
Your post makes no sense in English.
-
How the heck do I get my music back onto my Windows 8 computer from my iPad or iPhone? All of a sudden all my music (except for purchased songs from Apple) disappeared. When I try to sync I get a message that if I continue all the other songs (~1250) will be removed from my devices and I will be left with the ~50 purchased songs. There is no information on the help list on how to re-create the library using Windows 8.
See Empty/corrupt library after upgrade/crash.
tt2 -
The Elements 12 Mac disk works fine - I also have a Dell, and the Windows disk will not run at all, sounds like it's grinding, and is almost impossible to remove from the hard drive. I cannot find any place to tell me who to CALL to ask for a replacement!
Hi Samcat,
You can use a download instead of the disk as an alternative (Adobe - Download free trial version Adobe Photoshop Elements 12 | Adobe); you can license it with your serial.
Please test the disk in another windows machine to check if it's caused by the drive or the disk. If you tested with 2 drives please start a support case, i assume Adobe will only provide downloads at this point. They are the same as the trials. If you still need a disk you can burn the trial to a disk as a backup disk. Good luck!