Ajax and Web Service Data Formats Part 1: XML, SOAP, and HTML

Key Takeaways

AJAX, originally standing for Asynchronous JavaScript and XML, has evolved to represent any technique where data is sent between the browser and the server without requiring a full page reload. It’s not essential to use asynchronous methods, JavaScript, or XML for this process.
XML, SOAP, and HTML are all viable data formats for AJAX. XML, while verbose and sometimes ambiguous, offers good security and is well-supported by most languages. SOAP, a standardized format for web service data exchange, is often used for server-to-server communications, while HTML is easy to use if the AJAX response is to be inserted directly onto the page.
Despite their benefits, each format has its drawbacks. XML can be verbose with a low data-to-structure ratio, SOAP is even more verbose than XML and difficult to parse in JavaScript, and HTML can risk breaking the current layout and pose security issues.

When the AJAX acronym was devised by Jesse James Garrett, its original meaning was “Asynchronous JavaScript and XML.” In essence, you follow this process:

Create a web service, e.g. a PHP ‘page’ which is passed HTTP GET/POST arguments and returns a response in XML.
Write client-side JavaScript code to consume the web service, i.e. pass arguments and retrieve the XML response. The call is handled asynchronously so the browser isn’t locked while it waits for data to arrive.
Parse the XML and update the HTML document accordingly.

The AJAX name stuck and the term was used and abused by developers and marketing-types alike. Today, the uppercase acronym has evolved into the term “Ajax” — a name for any technique where data is sent between the browser and the server without requiring a full page reload. The reason:

It’s not essential to use asynchronous methods (although it’s usually desirable).
You don’t necessarily require JavaScript.
You certainly don’t need XML.

Ultimately, whatever technology or technique you’re using, you must still pass data between two devices. This is the first part in a series of three articles which discusses various formatting options and their pros and cons.

XML

In the beginning, XML was the logical choice. Few other data exchange formats had been formalized and most languages provided libraries for creating, validating and parsing XML data. Even if your language didn’t directly support XML, it’s essentially plain text:


<?xml version="1.0"?>
<products>
	<book>
		<title>The Principles of Beautiful Web Design, 2nd Edition</title>
		<url>https://www.sitepoint.com/books/design2/</url>
		<author>Jason Beaird</author>
		<publisher>SitePoint</publisher>
		<price currency="USD">39.95</price>
	</book>
	<book>
		<title>jQuery: Novice to Ninja</title>
		<url>https://www.sitepoint.com/books/jquery1/</url>
		<author>Earle Castledine & Craig Sharkie</author>
		<publisher>SitePoint</publisher>
		<price currency="USD">29.95</price>
	</book>
	<book>
		<title>Build Your Own Database Driven Website</title>
		<url>https://www.sitepoint.com/books/phpmysql4/</url>
		<author>Kevin Yank</author>
		<publisher>SitePoint</publisher>
		<price currency="USD">39.95</price>
	</book>
</products>

The benefits of XML include:

XML can be read by humans and is easier to understand than some other formats (assuming you use understandable tags). In my previous series of articles, How to Create Your Own Twitter Widget in PHP, I used the XML feed for reference even though the application didn’t use it.
Most languages provide excellent support for XML including, crucially, JavaScript.
XML offers reasonable security. Data must be extracted and parsed so it’s not easy to send a malicious payload.

Unfortunately, there are several disadvantages when using XML:

There won’t always be an industry-approved XML format (schema) for the data you’re publishing. You may be able to adapt a format such as RSS but, even then, the JavaScript client must be programmed to understand it.
XML can be verbose with a low data to structure ratio. Ideally, an Ajax response should be small in order to minimize bandwidth and lessen the burden on the browser.
XML can be a little ambiguous. Should an item of data be a new element or an attribute for an existing one? You can reduce an XML document size by choosing attributes, but that’s not necessarily a good reason to adopt them.
XML parsing in JavaScript is tedious. XPath support is patchy at best, so it’s necessary to extract data and convert the string to a real value before it can be used, e.g.


// grab value in first <data> element
var xml = xhr.responseXML;
var nodes = xml.getElementsByTagName("data");
var data = (nodes.length > 0 ? nodes[0].firstChild.nodeValue : null);

Many developers consider XML to be all but dead. I disagree. It may not be the best choice for Ajax clients, but you won’t always know how a web service will be consumed. XML’s ubiquity makes it a great choice — don’t ignore it.

SOAP

SOAP is a standardized format for web service data exchange. The full technical details run to hundreds of pages but, ultimately, SOAP relies on well-defined XML schemas.

Few developers use SOAP directly (the smell gives them away!) The beauty of SOAP is that client libraries automatically parse the XML response into native objects. For example, .NET developers can create SOAP-based web services and clients with very little effort. As far as the developer is concerned, they’re simply instantiating a C# object — it doesn’t matter that it was created on a remote machine.

Unfortunately, SOAP’s XML roots show:

SOAP is even more verbose than a typical XML response.
Parsing SOAP messages in JavaScript remains difficult. It is possible and SOAP libraries can help, but it’s a lot of effort for the coder and the browser.

SOAP remains a viable choice for server-to-server communications — especially if they’re within the same network. However, it’s too unwieldy if the majority of calls are from Ajax requests.

HTML

HTML is an easy format to use if you want to insert the Ajax response directly on to the page without further analysis. For example, assume you have a small shopping cart widget which appears on every page. You already have server-side code which creates that HTML — it could be adapted to return the same HTML as an Ajax response when an item is purchased.

The benefits include:

It’s easy to reuse existing code and create a web service.
There’s no need for complex data parsing on the client.
The HTML can be quickly added to the page using innerHTML.

But there are disadvantages:

It may be difficult to extract useful data. For example, if you wanted to show the cart total elsewhere, it may not be easy to identify that value within the HTML.
The message is more verbose than necessary and will probably be larger than an equivalent XML message.
Injecting HTML into a page risks breaking the current layout.
Security could be an issue — the response could contain a malicious script.

In my next post, we’ll discuss a couple of data formats which are far more efficient for Ajax use: JSON and JSONP.

Frequently Asked Questions (FAQs) about AJAX Data Formats: XML, SOAP, HTML

What is the difference between XML, SOAP, and HTML in AJAX data formats?

XML, SOAP, and HTML are all data formats used in AJAX. XML (eXtensible Markup Language) is a markup language that defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. SOAP (Simple Object Access Protocol) is a messaging protocol that allows programs running on disparate operating systems to communicate with each other. HTML (HyperText Markup Language) is the standard markup language for documents designed to be displayed in a web browser. While XML and SOAP are used for data exchange, HTML is used for data presentation.

How can I get a response as XML from a web service response?

To get a response as XML from a web service response, you need to set the ‘Accept’ header of your HTTP request to ‘application/xml’. This tells the server that you want the response to be in XML format. Once the server sends the response, you can parse it using an XML parser.

How can I use AJAX to load an XML file?

To load an XML file using AJAX, you can use the ‘XMLHttpRequest’ object. This object allows you to send HTTP requests and receive HTTP responses. You can use its ‘open’ method to specify the request method and URL, and its ‘send’ method to send the request. Once the response is received, you can use the ‘responseXML’ property to get the response as an XML document.

How can I use Spring Boot to create a RESTful web service that returns XML?

To create a RESTful web service that returns XML using Spring Boot, you need to use the ‘@RestController’ annotation to define a controller and the ‘@RequestMapping’ annotation to map HTTP requests to handler methods. You also need to set the ‘produces’ attribute of the ‘@RequestMapping’ annotation to ‘application/xml’ to specify that the response should be in XML format.

What is AJAX and how does it work?

AJAX (Asynchronous JavaScript and XML) is a set of web development techniques that allows a web page to update parts of its content without reloading the whole page. It works by sending HTTP requests to the server and receiving HTTP responses, all in the background. This allows the web page to remain interactive while the data is being loaded.

How can I use Spring to create a RESTful web service that returns XML?

To create a RESTful web service that returns XML using Spring, you need to use the ‘@RestController’ annotation to define a controller and the ‘@RequestMapping’ annotation to map HTTP requests to handler methods. You also need to set the ‘produces’ attribute of the ‘@RequestMapping’ annotation to ‘application/xml’ to specify that the response should be in XML format.

How can I parse an XML response in AJAX?

To parse an XML response in AJAX, you can use the ‘DOMParser’ object. This object allows you to parse XML strings into XML documents. You can use its ‘parseFromString’ method to parse the XML response.

How can I handle errors in AJAX?

To handle errors in AJAX, you can use the ‘onerror’ event handler of the ‘XMLHttpRequest’ object. This event handler is called when an error occurs during the request.

How can I send a SOAP request using AJAX?

To send a SOAP request using AJAX, you can use the ‘XMLHttpRequest’ object. You need to set the ‘Content-Type’ header of your HTTP request to ‘text/xml’ and the ‘SOAPAction’ header to the URI of the SOAP action. Then, you can send the SOAP envelope as the body of the request.

How can I use AJAX to load an HTML file?

To load an HTML file using AJAX, you can use the ‘XMLHttpRequest’ object. You can use its ‘open’ method to specify the request method and URL, and its ‘send’ method to send the request. Once the response is received, you can use the ‘responseText’ property to get the response as a string of HTML.