You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Some of our sites are accessible via both http and https. In order to avoid browsers complaining about mixed content, references within pages miss off the http/https bit in order to let the browser figure that out by itself.
However, fess then doesn't cope:
2017-05-23 00:20:44,314 [Crawler-20170523000000-8-3] INFO Crawling URL: https://www.linaro.org/downloads/
2017-05-23 00:20:44,949 [Crawler-20170523000000-8-3] WARN Could not parse anchor tags.
java.net.MalformedURLException: no protocol: //www.linaro.org/downloads/
at java.net.URL.<init>(URL.java:593)
at java.net.URL.<init>(URL.java:490)
at java.net.URL.<init>(URL.java:439)
at org.codelibs.fess.crawler.transformer.FessXpathTransformer.getAnchorList(FessXpathTransformer.java:584)
at org.codelibs.fess.crawler.transformer.FessXpathTransformer.putAdditionalData(FessXpathTransformer.java:320)
at org.codelibs.fess.crawler.transformer.FessXpathTransformer.storeData(FessXpathTransformer.java:171)
at org.codelibs.fess.crawler.transformer.impl.HtmlTransformer.transform(HtmlTransformer.java:120)
at org.codelibs.fess.crawler.processor.impl.DefaultResponseProcessor.process(DefaultResponseProcessor.java:77)
at org.codelibs.fess.crawler.CrawlerThread.processResponse(CrawlerThread.java:330)
at org.codelibs.fess.crawler.CrawlerThread.run(CrawlerThread.java:176)
at java.lang.Thread.run(Thread.java:745)
The text was updated successfully, but these errors were encountered:
Some of our sites are accessible via both http and https. In order to avoid browsers complaining about mixed content, references within pages miss off the http/https bit in order to let the browser figure that out by itself.
However, fess then doesn't cope:
The text was updated successfully, but these errors were encountered: