/*
 * Licensed to the Apache Software Foundation (ASF) under one or more
 * contributor license agreements.  See the NOTICE file distributed with
 * this work for additional information regarding copyright ownership.
 * The ASF licenses this file to You under the Apache License, Version 2.0
 * (the "License"); you may not use this file except in compliance with
 * the License.  You may obtain a copy of the License at
 *
 *     http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

// $Id: DocumentBuilderFactory.java 884950 2009-11-27 18:46:18Z mrglavas $

package javax.xml.parsers;

import javax.xml.validation.Schema;

Defines a factory API that enables applications to obtain a parser that produces DOM object trees from XML documents.
Author:Jeff Suttor
Version:$Revision: 884950 $, $Date: 2009-11-27 13:46:18 -0500 (Fri, 27 Nov 2009) $
/** * Defines a factory API that enables applications to obtain a * parser that produces DOM object trees from XML documents. * * @author <a href="Jeff.Suttor@Sun.com">Jeff Suttor</a> * @version $Revision: 884950 $, $Date: 2009-11-27 13:46:18 -0500 (Fri, 27 Nov 2009) $ */
public abstract class DocumentBuilderFactory { private boolean validating = false; private boolean namespaceAware = false; private boolean whitespace = false; private boolean expandEntityRef = true; private boolean ignoreComments = false; private boolean coalescing = false; protected DocumentBuilderFactory () { }
Obtain a new instance of a DocumentBuilderFactory. This static method creates a new factory instance. This method uses the following ordered lookup procedure to determine the DocumentBuilderFactory implementation class to load:
  • Use the javax.xml.parsers.DocumentBuilderFactory system property.
  • Use the properties file "lib/jaxp.properties" in the JRE directory. This configuration file is in standard java.util.Properties format and contains the fully qualified name of the implementation class with the key being the system property defined above. The jaxp.properties file is read only once by the JAXP implementation and it's values are then cached for future use. If the file does not exist when the first attempt is made to read from it, no further attempts are made to check for its existence. It is not possible to change the value of any property in jaxp.properties after it has been read for the first time.
  • Use the Services API (as detailed in the JAR specification), if available, to determine the classname. The Services API will look for a classname in the file META-INF/services/javax.xml.parsers.DocumentBuilderFactory in jars available to the runtime.
  • Platform default DocumentBuilderFactory instance.
Once an application has obtained a reference to a DocumentBuilderFactory it can use the factory to configure and obtain parser instances.

Tip for Trouble-shooting

Setting the jaxp.debug system property will cause this method to print a lot of debug messages to System.err about what it is doing and where it is looking at.

If you have problems loading DocumentBuilders, try:

java -Djaxp.debug=1 YourProgram ....
Throws:
Returns:New instance of a DocumentBuilderFactory
/** * Obtain a new instance of a * <code>DocumentBuilderFactory</code>. This static method creates * a new factory instance. * This method uses the following ordered lookup procedure to determine * the <code>DocumentBuilderFactory</code> implementation class to * load: * <ul> * <li> * Use the <code>javax.xml.parsers.DocumentBuilderFactory</code> system * property. * </li> * <li> * Use the properties file "lib/jaxp.properties" in the JRE directory. * This configuration file is in standard <code>java.util.Properties * </code> format and contains the fully qualified name of the * implementation class with the key being the system property defined * above. * * The jaxp.properties file is read only once by the JAXP implementation * and it's values are then cached for future use. If the file does not exist * when the first attempt is made to read from it, no further attempts are * made to check for its existence. It is not possible to change the value * of any property in jaxp.properties after it has been read for the first time. * </li> * <li> * Use the Services API (as detailed in the JAR specification), if * available, to determine the classname. The Services API will look * for a classname in the file * <code>META-INF/services/javax.xml.parsers.DocumentBuilderFactory</code> * in jars available to the runtime. * </li> * <li> * Platform default <code>DocumentBuilderFactory</code> instance. * </li> * </ul> * * Once an application has obtained a reference to a * <code>DocumentBuilderFactory</code> it can use the factory to * configure and obtain parser instances. * * * <h2>Tip for Trouble-shooting</h2> * <p>Setting the <code>jaxp.debug</code> system property will cause * this method to print a lot of debug messages * to <tt>System.err</tt> about what it is doing and where it is looking at.</p> * * <p> If you have problems loading {@link DocumentBuilder}s, try:</p> * <pre> * java -Djaxp.debug=1 YourProgram .... * </pre> * * @return New instance of a <code>DocumentBuilderFactory</code> * * @exception FactoryConfigurationError if the implementation is not * available or cannot be instantiated. */
public static DocumentBuilderFactory newInstance() { try { return (DocumentBuilderFactory) FactoryFinder.find( /* The default property name according to the JAXP spec */ "javax.xml.parsers.DocumentBuilderFactory", /* The fallback implementation class name */ "org.apache.xerces.jaxp.DocumentBuilderFactoryImpl"); } catch (FactoryFinder.ConfigurationError e) { throw new FactoryConfigurationError(e.getException(), e.getMessage()); } }
Throws:
Returns:New instance of a DocumentBuilderFactory
/** * @return New instance of a <code>DocumentBuilderFactory</code> * * @exception FactoryConfigurationError if the implementation is not * available or cannot be instantiated. */
public static DocumentBuilderFactory newInstance(String factoryClassName, ClassLoader classLoader) { if (factoryClassName == null) { throw new FactoryConfigurationError("factoryClassName cannot be null."); } if (classLoader == null) { classLoader = SecuritySupport.getContextClassLoader(); } try { return (DocumentBuilderFactory) FactoryFinder.newInstance(factoryClassName, classLoader, false); } catch (FactoryFinder.ConfigurationError e) { throw new FactoryConfigurationError(e.getException(), e.getMessage()); } }
Creates a new instance of a DocumentBuilder using the currently configured parameters.
Throws:
Returns:A new instance of a DocumentBuilder.
/** * Creates a new instance of a {@link javax.xml.parsers.DocumentBuilder} * using the currently configured parameters. * * @exception ParserConfigurationException if a DocumentBuilder * cannot be created which satisfies the configuration requested. * @return A new instance of a DocumentBuilder. */
public abstract DocumentBuilder newDocumentBuilder() throws ParserConfigurationException;
Specifies that the parser produced by this code will provide support for XML namespaces. By default the value of this is set to false
Params:
  • awareness – true if the parser produced will provide support for XML namespaces; false otherwise.
/** * Specifies that the parser produced by this code will * provide support for XML namespaces. By default the value of this is set * to <code>false</code> * * @param awareness true if the parser produced will provide support * for XML namespaces; false otherwise. */
public void setNamespaceAware(boolean awareness) { this.namespaceAware = awareness; }
Specifies that the parser produced by this code will validate documents as they are parsed. By default the value of this is set to false.

Note that "the validation" here means a validating parser as defined in the XML recommendation. In other words, it essentially just controls the DTD validation. (except the legacy two properties defined in JAXP 1.2. See here for more details.)

To use modern schema languages such as W3C XML Schema or RELAX NG instead of DTD, you can configure your parser to be a non-validating parser by leaving the setValidating(boolean) method false, then use the setSchema(Schema) method to associate a schema to a parser.

Params:
  • validating – true if the parser produced will validate documents as they are parsed; false otherwise.
/** * Specifies that the parser produced by this code will * validate documents as they are parsed. By default the value of this * is set to <code>false</code>. * * <p> * Note that "the validation" here means * <a href="http://www.w3.org/TR/REC-xml#proc-types">a validating * parser</a> as defined in the XML recommendation. * In other words, it essentially just controls the DTD validation. * (except the legacy two properties defined in JAXP 1.2. * See <a href="#validationCompatibility">here</a> for more details.) * </p> * * <p> * To use modern schema languages such as W3C XML Schema or * RELAX NG instead of DTD, you can configure your parser to be * a non-validating parser by leaving the {@link #setValidating(boolean)} * method <tt>false</tt>, then use the {@link #setSchema(Schema)} * method to associate a schema to a parser. * </p> * * @param validating true if the parser produced will validate documents * as they are parsed; false otherwise. */
public void setValidating(boolean validating) { this.validating = validating; }
Specifies that the parsers created by this factory must eliminate whitespace in element content (sometimes known loosely as 'ignorable whitespace') when parsing XML documents (see XML Rec 2.10). Note that only whitespace which is directly contained within element content that has an element only content model (see XML Rec 3.2.1) will be eliminated. Due to reliance on the content model this setting requires the parser to be in validating mode. By default the value of this is set to false.
Params:
  • whitespace – true if the parser created must eliminate whitespace in the element content when parsing XML documents; false otherwise.
/** * Specifies that the parsers created by this factory must eliminate * whitespace in element content (sometimes known loosely as * 'ignorable whitespace') when parsing XML documents (see XML Rec * 2.10). Note that only whitespace which is directly contained within * element content that has an element only content model (see XML * Rec 3.2.1) will be eliminated. Due to reliance on the content model * this setting requires the parser to be in validating mode. By default * the value of this is set to <code>false</code>. * * @param whitespace true if the parser created must eliminate whitespace * in the element content when parsing XML documents; * false otherwise. */
public void setIgnoringElementContentWhitespace(boolean whitespace) { this.whitespace = whitespace; }
Specifies that the parser produced by this code will expand entity reference nodes. By default the value of this is set to true
Params:
  • expandEntityRef – true if the parser produced will expand entity reference nodes; false otherwise.
/** * Specifies that the parser produced by this code will * expand entity reference nodes. By default the value of this is set to * <code>true</code> * * @param expandEntityRef true if the parser produced will expand entity * reference nodes; false otherwise. */
public void setExpandEntityReferences(boolean expandEntityRef) { this.expandEntityRef = expandEntityRef; }

Specifies that the parser produced by this code will ignore comments. By default the value of this is set to false .

Params:
  • ignoreComments – boolean value to ignore comments during processing
/** * <p>Specifies that the parser produced by this code will * ignore comments. By default the value of this is set to <code>false * </code>.</p> * * @param ignoreComments <code>boolean</code> value to ignore comments during processing */
public void setIgnoringComments(boolean ignoreComments) { this.ignoreComments = ignoreComments; }
Specifies that the parser produced by this code will convert CDATA nodes to Text nodes and append it to the adjacent (if any) text node. By default the value of this is set to false
Params:
  • coalescing – true if the parser produced will convert CDATA nodes to Text nodes and append it to the adjacent (if any) text node; false otherwise.
/** * Specifies that the parser produced by this code will * convert CDATA nodes to Text nodes and append it to the * adjacent (if any) text node. By default the value of this is set to * <code>false</code> * * @param coalescing true if the parser produced will convert CDATA nodes * to Text nodes and append it to the adjacent (if any) * text node; false otherwise. */
public void setCoalescing(boolean coalescing) { this.coalescing = coalescing; }
Indicates whether or not the factory is configured to produce parsers which are namespace aware.
Returns: true if the factory is configured to produce parsers which are namespace aware; false otherwise.
/** * Indicates whether or not the factory is configured to produce * parsers which are namespace aware. * * @return true if the factory is configured to produce parsers which * are namespace aware; false otherwise. */
public boolean isNamespaceAware() { return namespaceAware; }
Indicates whether or not the factory is configured to produce parsers which validate the XML content during parse.
Returns: true if the factory is configured to produce parsers which validate the XML content during parse; false otherwise.
/** * Indicates whether or not the factory is configured to produce * parsers which validate the XML content during parse. * * @return true if the factory is configured to produce parsers * which validate the XML content during parse; false otherwise. */
public boolean isValidating() { return validating; }
Indicates whether or not the factory is configured to produce parsers which ignore ignorable whitespace in element content.
Returns: true if the factory is configured to produce parsers which ignore ignorable whitespace in element content; false otherwise.
/** * Indicates whether or not the factory is configured to produce * parsers which ignore ignorable whitespace in element content. * * @return true if the factory is configured to produce parsers * which ignore ignorable whitespace in element content; * false otherwise. */
public boolean isIgnoringElementContentWhitespace() { return whitespace; }
Indicates whether or not the factory is configured to produce parsers which expand entity reference nodes.
Returns: true if the factory is configured to produce parsers which expand entity reference nodes; false otherwise.
/** * Indicates whether or not the factory is configured to produce * parsers which expand entity reference nodes. * * @return true if the factory is configured to produce parsers * which expand entity reference nodes; false otherwise. */
public boolean isExpandEntityReferences() { return expandEntityRef; }
Indicates whether or not the factory is configured to produce parsers which ignores comments.
Returns: true if the factory is configured to produce parsers which ignores comments; false otherwise.
/** * Indicates whether or not the factory is configured to produce * parsers which ignores comments. * * @return true if the factory is configured to produce parsers * which ignores comments; false otherwise. */
public boolean isIgnoringComments() { return ignoreComments; }
Indicates whether or not the factory is configured to produce parsers which converts CDATA nodes to Text nodes and appends it to the adjacent (if any) Text node.
Returns: true if the factory is configured to produce parsers which converts CDATA nodes to Text nodes and appends it to the adjacent (if any) Text node; false otherwise.
/** * Indicates whether or not the factory is configured to produce * parsers which converts CDATA nodes to Text nodes and appends it to * the adjacent (if any) Text node. * * @return true if the factory is configured to produce parsers * which converts CDATA nodes to Text nodes and appends it to * the adjacent (if any) Text node; false otherwise. */
public boolean isCoalescing() { return coalescing; }
Allows the user to set specific attributes on the underlying implementation.
Params:
  • name – The name of the attribute.
  • value – The value of the attribute.
Throws:
/** * Allows the user to set specific attributes on the underlying * implementation. * @param name The name of the attribute. * @param value The value of the attribute. * @exception IllegalArgumentException thrown if the underlying * implementation doesn't recognize the attribute. */
public abstract void setAttribute(String name, Object value) throws IllegalArgumentException;
Allows the user to retrieve specific attributes on the underlying implementation.
Params:
  • name – The name of the attribute.
Throws:
Returns:value The value of the attribute.
/** * Allows the user to retrieve specific attributes on the underlying * implementation. * @param name The name of the attribute. * @return value The value of the attribute. * @exception IllegalArgumentException thrown if the underlying * implementation doesn't recognize the attribute. */
public abstract Object getAttribute(String name) throws IllegalArgumentException;

Set a feature for this DocumentBuilderFactory and DocumentBuilders created by this factory.

Feature names are fully qualified URIs. Implementations may define their own features. An ParserConfigurationException is thrown if this DocumentBuilderFactory or the DocumentBuilders it creates cannot support the feature. It is possible for an DocumentBuilderFactory to expose a feature value but be unable to change its state.

All implementations are required to support the XMLConstants.FEATURE_SECURE_PROCESSING feature. When the feature is:

  • true: the implementation will limit XML processing to conform to implementation limits. Examples include entity expansion limits and XML Schema constructs that would consume large amounts of resources. If XML processing is limited for security reasons, it will be reported via a call to the registered ErrorHandler.fatalError(SAXParseException exception). See DocumentBuilder.setErrorHandler(ErrorHandler errorHandler).
  • false: the implementation will processing XML according to the XML specifications without regard to possible implementation limits.
Params:
  • name – Feature name.
  • value – Is feature state true or false.
Throws:
/** * <p>Set a feature for this <code>DocumentBuilderFactory</code> and <code>DocumentBuilder</code>s created by this factory.</p> * * <p> * Feature names are fully qualified {@link java.net.URI}s. * Implementations may define their own features. * An {@link ParserConfigurationException} is thrown if this <code>DocumentBuilderFactory</code> or the * <code>DocumentBuilder</code>s it creates cannot support the feature. * It is possible for an <code>DocumentBuilderFactory</code> to expose a feature value but be unable to change its state. * </p> * * <p> * All implementations are required to support the {@link javax.xml.XMLConstants#FEATURE_SECURE_PROCESSING} feature. * When the feature is:</p> * <ul> * <li> * <code>true</code>: the implementation will limit XML processing to conform to implementation limits. * Examples include entity expansion limits and XML Schema constructs that would consume large amounts of resources. * If XML processing is limited for security reasons, it will be reported via a call to the registered * {@link org.xml.sax.ErrorHandler#fatalError(SAXParseException exception)}. * See {@link DocumentBuilder#setErrorHandler(org.xml.sax.ErrorHandler errorHandler)}. * </li> * <li> * <code>false</code>: the implementation will processing XML according to the XML specifications without * regard to possible implementation limits. * </li> * </ul> * * @param name Feature name. * @param value Is feature state <code>true</code> or <code>false</code>. * * @throws ParserConfigurationException if this <code>DocumentBuilderFactory</code> or the <code>DocumentBuilder</code>s * it creates cannot support this feature. * @throws NullPointerException If the <code>name</code> parameter is null. */
public abstract void setFeature(String name, boolean value) throws ParserConfigurationException;

Get the state of the named feature.

Feature names are fully qualified URIs. Implementations may define their own features. An ParserConfigurationException is thrown if this DocumentBuilderFactory or the DocumentBuilders it creates cannot support the feature. It is possible for an DocumentBuilderFactory to expose a feature value but be unable to change its state.

Params:
  • name – Feature name.
Throws:
Returns:State of the named feature.
/** * <p>Get the state of the named feature.</p> * * <p> * Feature names are fully qualified {@link java.net.URI}s. * Implementations may define their own features. * An {@link ParserConfigurationException} is thrown if this <code>DocumentBuilderFactory</code> or the * <code>DocumentBuilder</code>s it creates cannot support the feature. * It is possible for an <code>DocumentBuilderFactory</code> to expose a feature value but be unable to change its state. * </p> * * @param name Feature name. * * @return State of the named feature. * * @throws ParserConfigurationException if this <code>DocumentBuilderFactory</code> * or the <code>DocumentBuilder</code>s it creates cannot support this feature. */
public abstract boolean getFeature(String name) throws ParserConfigurationException;
Gets the Schema object specified through the setSchema(Schema schema) method.
Throws:
  • UnsupportedOperationException – For backward compatibility, when implementations for earlier versions of JAXP is used, this exception will be thrown.
Returns: the Schema object that was last set through the setSchema(Schema) method, or null if the method was not invoked since a DocumentBuilderFactory is created.
Since:1.5
/** * Gets the {@link Schema} object specified through * the {@link #setSchema(Schema schema)} method. * * * @throws UnsupportedOperationException * For backward compatibility, when implementations for * earlier versions of JAXP is used, this exception will be * thrown. * * @return * the {@link Schema} object that was last set through * the {@link #setSchema(Schema)} method, or null * if the method was not invoked since a {@link DocumentBuilderFactory} * is created. * * @since 1.5 */
public Schema getSchema() { throw new UnsupportedOperationException( "This parser does not support specification \"" + this.getClass().getPackage().getSpecificationTitle() + "\" version \"" + this.getClass().getPackage().getSpecificationVersion() + "\"" ); }

Set the Schema to be used by parsers created from this factory.

When a Schema is non-null, a parser will use a validator created from it to validate documents before it passes information down to the application.

When errors are found by the validator, the parser is responsible to report them to the user-specified ErrorHandler (or if the error handler is not set, ignore them or throw them), just like any other errors found by the parser itself. In other words, if the user-specified ErrorHandler is set, it must receive those errors, and if not, they must be treated according to the implementation specific default error handling rules.

A validator may modify the outcome of a parse (for example by adding default values that were missing in documents), and a parser is responsible to make sure that the application will receive modified DOM trees.

Initially, null is set as the Schema.

This processing will take effect even if the isValidating() method returns false.

It is an error to use the http://java.sun.com/xml/jaxp/properties/schemaSource property and/or the http://java.sun.com/xml/jaxp/properties/schemaLanguage property in conjunction with a Schema object. Such configuration will cause a ParserConfigurationException exception when the newDocumentBuilder() is invoked.

Note for implementors

A parser must be able to work with any Schema implementation. However, parsers and schemas are allowed to use implementation-specific custom mechanisms as long as they yield the result described in the specification.

Params:
  • schema – Schema to use or null to remove a schema.
Throws:
  • UnsupportedOperationException – For backward compatibility, when implementations for earlier versions of JAXP is used, this exception will be thrown.
Since:1.5
/** * <p>Set the {@link Schema} to be used by parsers created * from this factory. * * <p> * When a {@link Schema} is non-null, a parser will use a validator * created from it to validate documents before it passes information * down to the application. * * <p>When errors are found by the validator, the parser is responsible * to report them to the user-specified {@link org.xml.sax.ErrorHandler} * (or if the error handler is not set, ignore them or throw them), just * like any other errors found by the parser itself. * In other words, if the user-specified {@link org.xml.sax.ErrorHandler} * is set, it must receive those errors, and if not, they must be * treated according to the implementation specific * default error handling rules. * * <p> * A validator may modify the outcome of a parse (for example by * adding default values that were missing in documents), and a parser * is responsible to make sure that the application will receive * modified DOM trees. * * <p> * Initially, null is set as the {@link Schema}. * * <p> * This processing will take effect even if * the {@link #isValidating()} method returns <tt>false</tt>. * * <p>It is an error to use * the <code>http://java.sun.com/xml/jaxp/properties/schemaSource</code> * property and/or the <code>http://java.sun.com/xml/jaxp/properties/schemaLanguage</code> * property in conjunction with a {@link Schema} object. * Such configuration will cause a {@link ParserConfigurationException} * exception when the {@link #newDocumentBuilder()} is invoked.</p> * * * <h4>Note for implementors</h4> * <p> * A parser must be able to work with any {@link Schema} * implementation. However, parsers and schemas are allowed * to use implementation-specific custom mechanisms * as long as they yield the result described in the specification. * * @param schema <code>Schema</code> to use or <code>null</code> to remove a schema. * * @throws UnsupportedOperationException * For backward compatibility, when implementations for * earlier versions of JAXP is used, this exception will be * thrown. * * @since 1.5 */
public void setSchema(Schema schema) { throw new UnsupportedOperationException( "This parser does not support specification \"" + this.getClass().getPackage().getSpecificationTitle() + "\" version \"" + this.getClass().getPackage().getSpecificationVersion() + "\"" ); }

Set state of XInclude processing.

If XInclude markup is found in the document instance, should it be processed as specified in XML Inclusions (XInclude) Version 1.0.

XInclude processing defaults to false.

Params:
  • state – Set XInclude processing to true or false
Throws:
  • UnsupportedOperationException – For backward compatibility, when implementations for earlier versions of JAXP is used, this exception will be thrown.
Since:1.5
/** * <p>Set state of XInclude processing.</p> * * <p>If XInclude markup is found in the document instance, should it be * processed as specified in <a href="http://www.w3.org/TR/xinclude/"> * XML Inclusions (XInclude) Version 1.0</a>.</p> * * <p>XInclude processing defaults to <code>false</code>.</p> * * @param state Set XInclude processing to <code>true</code> or * <code>false</code> * * @throws UnsupportedOperationException * For backward compatibility, when implementations for * earlier versions of JAXP is used, this exception will be * thrown. * * @since 1.5 */
public void setXIncludeAware(final boolean state) { throw new UnsupportedOperationException( "This parser does not support specification \"" + this.getClass().getPackage().getSpecificationTitle() + "\" version \"" + this.getClass().getPackage().getSpecificationVersion() + "\"" ); }

Get state of XInclude processing.

Throws:
  • UnsupportedOperationException – For backward compatibility, when implementations for earlier versions of JAXP is used, this exception will be thrown.
Returns:current state of XInclude processing
Since:1.5
/** * <p>Get state of XInclude processing.</p> * * @return current state of XInclude processing * * @throws UnsupportedOperationException * For backward compatibility, when implementations for * earlier versions of JAXP is used, this exception will be * thrown. * * @since 1.5 */
public boolean isXIncludeAware() { throw new UnsupportedOperationException( "This parser does not support specification \"" + this.getClass().getPackage().getSpecificationTitle() + "\" version \"" + this.getClass().getPackage().getSpecificationVersion() + "\"" ); } }