css4j 1.3.1 API
This project is an implementation of W3C's CSS Object Model API in the Java™ language, and also adds CSS support to the DOM4J package. It targets several different use cases, with functionalities from style sheet error detection to style computation.
Overview
This implementation can be used in several ways: with stand-alone style sheets, with its own DOM implementation, combined with DOM4J, or by wrapping a pre-existing DOM tree.
You can play with independent style sheets created with the createStyleSheet("title", "media")
method of the
CSSStyleSheetFactory
interface.
There are three implementations of that interface:
CSSDOMImplementation
, the native DOM implementation.DOMCSSStyleSheetFactory
for the DOM Wrapper.- If you use DOM4J, get it through
XHTMLDocumentFactory.getStyleSheetFactory()
.
The document back-end is only important if you plan to use the sheets inside a document.
The resulting style sheets are empty, but you can load a style sheet with the AbstractCSSStyleSheet.parseStyleSheet(source)
method (see Parsing a Style Sheet).
One of the most important functionalities in the library is the ability to compute styles for a given element. In practice, to obtain the 'computed' or 'used' values required for actual rendering a box model implementation is needed, and also device information. The library provides a simple box model that could be used, but the details of the rendering device can be more difficult.
Depending on the use case, the target device may not be the same one where the library is running (and some exact details hence not available).
To help in the computation, the library defines the DeviceFactory
interface to supply device and media-specific data (also provides
the objects required by media queries to work).
Prerequisite: Choose a SAC Parser
This library depends on a SAC parser to work. There are three active projects offering a SAC parser:
- Batik's SAC parser. Class name:
org.apache.batik.css.parser.Parser
. - Steadystate's cssparser (
com.steadystate.css.parser.SACParserCSS3
). - This library's own parser (
io.sf.carte.doc.style.css.parser.CSSParser
).
Batik's parser used to be this library's default parser, but its functionality is now lagging behind and cannot be recommended to handle level 3 or 4 CSS.
The Steadystate parser does better and mostly implements CSS3, but does not process calc()
values nor supports SAC comments. Both handle some
selectors in their own, undocumented way (example: pseudo-element support) that is not always easy to support.
Finally, this project's own parser supports level 4 values and selectors; an extension to SAC called NSAC was developed for that. This is the recommended parser.
Once you have chosen your parser, set the org.w3c.css.sac.parser
system property to its qualified class name, e.g.
com.steadystate.css.parser.SACParserCSS3
. If you decide to use this project's internal parser you do not need to set that property, as it is
the default parser.
Once you have the SAC parser and the rest of the dependencies (see the Release Notes) available in the classpath, and set the appropriate system property (if not using this library's own parser), you can start using the library. If you build it with Maven, beware that one of the dependencies (JCLF) is not available at any central repository and has to be manually downloaded and installed to your local repository.
Using css4j's native DOM implementation
You can create a DOM document from scratch and use the related DOM methods to programmatically build a document: just use the provided DOM implementation
(io.sf.carte.doc.dom.CSSDOMImplementation
). If you want to parse an existing document, the procedure depends on the type of document: to parse
an XML document (including XHTML documents), you can use this library's XMLDocumentBuilder
, while to parse an HTML one (or an
XHTML document that does not use namespace prefixes) you can use the validator.nu
HTML5 parser.
An example with that parser follows:
import java.io.Reader; import org.xml.sax.InputSource; import org.xml.sax.SAXException; import org.w3c.dom.css.CSSPrimitiveValue; import io.sf.carte.doc.dom.CSSDOMImplementation; import io.sf.carte.doc.dom.DOMElement; import io.sf.carte.doc.dom.HTMLDocument; import io.sf.carte.doc.style.css.CSSComputedProperties; import io.sf.carte.doc.style.css.CSSPrimitiveValue2; import io.sf.carte.doc.style.css.RGBAColor; import io.sf.carte.doc.xml.dtd.DefaultEntityResolver; import nu.validator.htmlparser.dom.HtmlDocumentBuilder; [...] // Instantiate DOM implementation (with default settings: no IE hacks accepted) // and configure it CSSDOMImplementation impl = new CSSDOMImplementation(); // Alternatively, impl = new CSSDOMImplementation(flags); // Now load default HTML user agent sheets impl.setDefaultHTMLUserAgentSheet(); // Prepare builder HtmlDocumentBuilder builder = new HtmlDocumentBuilder(impl); // Read the document to parse, and prepare source object Reader re = ... [reader for HTML document] InputSource source = new InputSource(re); // Parse. If the document is not HTML, you want to use DOMDocument instead HTMLDocument document = (HTMLDocument) builder.parse(source); re.close(); // Set document URI document.setDocumentURI("http://www.example.com/mydocument.html");
Then you have a CSS-enabled document. To compute styles, use getComputedStyle
:
DOMElement element = document.getElementById("someId"); CSSComputedProperties style = element.getComputedStyle(null); // Next line could be 'String display = style.getDisplay();' String display = style.getPropertyValue("display"); // If you use a factory that has been set to setLenientSystemValues(false), next // line may throw an exception if the 'color' property was not specified. // The default value for lenientSystemValues is TRUE. RGBAColor color = ((CSSPrimitiveValue2) style.getPropertyCSSValue("color")).getRGBColorValue(); // CSSPrimitiveValue2 is an extension to W3C's CSSPrimitiveValue // Suppose that the linked style sheet located at 'css/sheet.css' declares: // background-image: url('foo.png'); String image_css = style.getPropertyValue("background-image"); String image_uri = ((CSSPrimitiveValue) style.getPropertyCSSValue("background-image")).getStringValue(); // Then, because we already set the document URI to "http://www.example.com/mydocument.html", // image_css will be set to "url('http://www.example.com/css/foo.png')", // and image_uri to "http://www.example.com/css/foo.png"
The CSSComputedProperties
interface extends from W3C's CSSStyleDeclaration
,
adding methods like getComputedFontSize()
or getComputedLineHeight()
.
If you are computing styles for a specific medium, tell the document about it (see "Media Handling"):
document.setTargetMedium("print");
Conformance with the DOM specification
This library's native DOM implementation has some minor behavior differences with what is written in the DOM
Level 3 Core Specification. For example, on elements and attributes the Node.getLocalName()
method returns the tag name instead of
null
when the node was created with a DOM Level 1 method such as Document.createElement(). Read the io.sf.carte.doc.dom
package
description for additional information.
Usage with the DOM Wrapper
If you choose to build your document with your favorite DOM implementation instead of the CSS4J one or the DOM4J back-end, you can use the DOMCSSStyleSheetFactory.createCSSDocument(document)
method to wrap a pre-existing DOM Document. Example:
DOMCSSStyleSheetFactory factory = new DOMCSSStyleSheetFactory(); CSSDocument document = factory.createCSSDocument(otherDOMdocument);
Unlike the native DOM or the DOM4J back-end, the DOM resulting from the DOM wrapper is read-only, although you can change the values of some nodes.
Consistency of different DOM implementations
Beware that the computed styles found by each method (native DOM, DOM4J back-end or DOM wrapper) may not be completely identical, due to differences in the underlying document DOM implementations. For example, XML-oriented DOM implementations do not implement the case sensitivity subtleties of HTML. If you find a difference in styles computed from different back-ends that you believe to be a bug, please report.
Media Handling
By default, computed styles only take into account generic styles that are common to all media. If you want to target a more specific medium,
you have to use the CSSDocument.setTargetMedium("medium")
method. For example, if your document has the following style sheets linked:
<link href="http://www.example.com/css/sheet.css" rel="stylesheet" type="text/css" /> <link href="http://www.example.com/css/sheet_for_print.css" rel="stylesheet" media="print" type="text/css" />
Computed styles will initially take into account only the "sheet.css" style sheet. However, if you execute the following method:
document.setTargetMedium("print");
Then all subsequently computed styles will account for the merged style sheet from "sheet.css" and "sheet_for_print.css".
This way to tie a document with a medium is not totally standard, as the W3C APIs would probably expect a DeviceFactory
-related object implementing the
ViewCSS
interface and referencing the document,
but this approach allows to isolate DOM logic inside DOM objects and keep the DeviceFactory
for media-specific information only.
Style Sheet Sets
The library supports alternative style sheets. Use CSSDocument's methods enableStyleSheetsForSet, getStyleSheetSets, getSelectedStyleSheetSet and setSelectedStyleSheetSet. For example, if you have a document with these linked sheets:
<link href="http://www.example.com/commonsheet.css" rel="stylesheet" type="text/css" /> <link href="http://www.example.com/alter1.css" rel="alternate stylesheet" type="text/css" title="Alter 1" /> <link href="http://www.example.com/alter2.css" rel="alternate stylesheet" type="text/css" title="Alter 2" /> <link href="http://www.example.com/default.css" rel="stylesheet" type="text/css" title="Default" />
Initially, sheets 'alter1.css' and 'alter2.css' will not be used to compute styles. But then you can write code like the following:
String defset = document.getSelectedStyleSheetSet(); // Sets 'defset' to "Default" document.setSelectedStyleSheetSet("Alter 1"); // Selects the set with title "Alter 1" document.setSelectedStyleSheetSet("Alter 2"); // Selects the set with title "Alter 2"
These methods have been removed from the DOM standard unfortunately, but you can still read the specification at the CSSOM 5 December 2013 Working Draft.
Style Sheet Error Checking
You can check for errors and warnings in the document's sheets using the non-standard getErrorHandler
method. Example:
if (document.getStyleSheets().item(0).getErrorHandler().hasSacErrors()) ... error processing / reporting
The merged style sheet obtained from the getStyleSheet
method has the merged error/warning state from the document's active sheets.
A typical source of errors are the non-compliant IE hacks, like prefixing property names with an asterisk (you may want to use the proper
NSAC flags when creating the factory, see Compatibility with legacy browsers),
or charset rules found in the wrong place. On the other hand, if you aren't using this library's own SAC parser some errors may mean that the parser
did not recognize a valid syntax (e.g. calc()
values), rather than an error in the style sheet.
Override Styles
Override styles, that come after the author style sheet in the cascade algorithm, are supported. For example:
document.getOverrideStyle(element, null).setCssText("padding: 6pt;");
The getOverrideStyle
method is defined at the DocumentCSS interface.
Parsing a Style Sheet
Although document's style sheet fetching and parsing is automatic with this library, it is possible to manually parse rules from a source stream with the ExtendedCSSStyleSheet.parseStyleSheet(InputSource, boolean)
method:
Reader re = ... org.w3c.css.sac.InputSource source = new org.w3c.css.sac.InputSource(re); sheet.parseStyleSheet(source, true);
The new css rules found in the source stream are added to the already present ones, i.e. the sheet is not reset by this method (although the error handler is).
When the second argument is true
, the comments in the source stream are ignored when parsing.
Compatibility with legacy Browsers (NSAC-only)
This section only applies if you are using a NSAC parser, like this library's default one.
Today's style sheets often contain non-conformant styles that target specific versions of old web browsers, like Internet Explorer. Several web sites contain information about that, including:
Although it could be argued whether those hacks should be used or not, the point is that actual style sheets do contain them, so this library supports them.
By default, a factory is configured to use a flagless NSAC parser which would produce an error on any of those non-standard constructs, but a set of compatibility flags can be specified in the constructors for the factory implementations. The different flags are documented in the NSAC javadocs.
The flags available at the time of this guide's update are the following:
STARHACK
. When set, the parser will handle asterisk-prefixed property names as accepted, normal names.IEVALUES
supports values ending with\9
or\0
, as well as progid filters and IE expressions.IEPRIO
allows values ending with the'!ie'
priority hack.IEPRIOCHAR
accepts values with an'!important!'
priority hack (note the'!'
at the end).
The object model manages these compatibility values in parallel to standard ones. For example, after parsing this declaration
with IEVALUES
set:
width: 900px; width: 890px\9;
its serialization would be identical (if the flag was set correctly):
width: 900px; width: 890px\9;
but the declaration's length shall be only 1
. And computed styles only use the standard values unless there are
no alternatives (no standard value was set). The workings are similar for IEPRIO
and IEPRIOCHAR
:
width: 890px !ie; width: 890px !important!;
with the last one being handled as of important
priority. Values created by IEPRIOCHAR
are never used
in computed styles.
Instead, declarations including asterisk-prefixed property names (created by STARHACK
) always increase the declaration's
length. For example, the length of the following declaration would be 2
:
width: 890px; *width: 890px;
If you want to use these flags at the NSAC level (instead of the Object Model), you may want to read the 'Parser Flags' section in the
NSAC package description, as well as the documentation for the individual flags in
Parser2.Flag
.
CSS Style Formatting
The serialization of the cssText
attribute in rules and
style declarations can be customized with an implementation
of the StyleFormattingContext
interface.
You can set your StyleFormattingFactory
(which produces your customized formatting context) to the sheet factory with the CSSStyleSheetFactory.setStyleFormattingFactory
method, or subclass your base factory and override the createDefaultStyleFormattingFactory
method.
Look at the DefaultStyleFormattingContext
class for an example of a formatting context implementation.
There is also the possibility to customize the default serialization of string values, with the CSSStyleSheetFactory.setFactoryFlag(byte)
method. You can set two flags that govern which quotation you prefer, or keep the default behaviour:
- Default: Try to keep the original quotation (single or double quotes), unless the alternative is more efficient.
STRING_DOUBLE_QUOTE
: Use double quotes unless single quotes are more efficient (when the string contains more double quotes than single).STRING_SINGLE_QUOTE
: Use single quotes unless double quotes are more efficient (when the string contains more single quotes than double).
Accessing Style Sheet Comments
There is no standard CSSOM API for accessing comments in style sheets, but the AbstractCSSRule.getPrecedingComments()
method is provided for that:
List<String> comments = document.getStyleSheets().item(0).getCssRules().item(3).getPrecedingComments();
Other API models were considered for object-model comments, like having a special type of comment rule, but stylesheet-level comments are often related to a rule and if that rule is moved or deleted, those stand-alone comments would have the potential to create a lot of confusion.
The comments preceding any rule will be included in the text returned by the sheet's AbstractCSSStyleSheet.toString()
and toStyleString()
methods,
while other comments (located at places that cannot be easily related to a rule) are lost.
BUG: comments that are intended to apply to the previous rule may be assigned to the next one. This is intended to be addressed in version 2.0, when SAC-based parsing is going to be partly replaced.
Comments in the default HTML style sheet are not available, as the parser is instructed to ignore them when parsing.
CSS level 3 and 4 Support
CSS level 3/4 is only partially supported by the library; the following table summarizes the basic support for setting/retrieving the main CSS specifications (as of 2019):
CSS Spec Name | Support | Observations |
---|---|---|
Background / Border | Yes | |
Color | Partial | Some level 4 functions are not implemented. |
Media Queries | Partial | Event handling with addListener/removeListener is not supported. Relies on CSSCanvas implementations to match/unmatch media features. |
Selectors | Yes | Only this library's own SAC/NSAC parser supports level 4 selectors. |
Transitions | Yes | |
Values | Yes | Only this library's own SAC/NSAC parser supports calc() values. |
Grid / Template / Alignment | Partial | Legacy gap properties (grid-row-gap ,
grid-column-gap , and grid-gap ) are not supported, although the longhands can be used if declared explicitly). |
Pseudo-classes
The following pseudo-classes are supported by the library (although they may not be supported by the SAC parser that you are using):
first-child
, last-child
, only-child
, nth-child()
, nth-last-child()
, first-of-type
,
last-of-type
, only-of-type
, nth-of-type()
, nth-last-of-type()
, link
, visited
,
target
, root
, empty
, blank
, disabled
, enabled
, read-write
,
read-only
, is
, has
, not
, placeholder-shown
, default
, checked
and indeterminate
.
State pseudo-classes like hover
are supported through the CSSCanvas.isActivePseudoClass(CSSElement, String)
method.
Rendering-oriented interfaces
To help with the determination of 'used' values and the actual rendering, this library provides a few helper interfaces. The most important are:
DeviceFactory
. It is not a "factory of devices", but instead the object that delivers the relevant abstractions for a requested medium:StyleDatabase
andCSSCanvas
.
StyleDatabase
. Provides medium-specific information like available fonts and colors.
CSSCanvas
. Has knowledge of medium-specific information that depends (or may depend) on a specific viewport, like supported media features. It is linked to a viewport, if there is any. This interface is used to determine the state of active pseudo-classes.
Viewport
. It represents a viewport defined as per the CSS specifications.
The differences between style databases and canvases can be subtle, and for some media features it could be argued that they belong to one or the other. The basic idea is that style databases should be relatively easy to implement for a given medium, while canvases are probably only going to exist if there is an actual rendering engine implemented.
Java™ Runtime Environment Requirements
The classes in the binary packages have been compiled with a Java SE compiler with 1.7 compiler compliance level,
except for the module-info
file which targets Java 11.