Title: Using XML as a data storage format
Question: Does it make sense to use XML in local applications?
Answer:
Introduction
Evolution of the web industry leads developers through sophisticated technologies waving and balancing between client-oriented and server-oriented approaches. New standards for browsers, protocols, and scripting languages replace each other. The next step is XML - extended markup language, which suppose to replace HTML and dominate in web applications as main communication format. XML has been designed to transfer data to clients separately from data's format. Web server or Internet client should combine XML data with a style sheet and generate well-known HTML code to display the data.
On the other hand you may use XML language to extend functionality of your programs. Consider XML as a universal data storage format, which as a matter of fact much more flexible than DBF or any other relational database format.
Why should you use XML as the only advanced way to create web pages? XML is more universal and more flexible than just only markup language. You may even create your own data format or network protocol by inventing your own tags and interface.
Below you will find one example of such non-standard XML usage, designed for use in Borland Delphi applications.
Analysis
Your program needs to store some data in a file or transfer the data to another computer. You can save your data in text or any proprietary format. You will need to write code to support such format. Should you change data structure, you will face necessity to modify the code. In any case you should not consider this format as a standard one, and other developers probably will not support the format.
Use XML as base for your data storage and transfer and you will gain the following advantages:
* Established and supported syntax standards;
* Common program interface for reading and writing XML text;
* Flexible data structure;
* Web enabled technology;
Using XML as a data format has some disadvantages too:
* Unlike usual database, XML text does not have fixed structure.
In most cases whole XML text stored in memory, so it should
be relatively small files;
* There are no built-in security in XML, so you have to encrypt
and decrypt XML files;
* XML is text format so you have to convert all fields to text
type before storing in text;
The most optimal and convenient way to implement this protocol in Delphi is to design an interface between Pascal objects and XML code. The goal is to store instances of Pascal classes in XML format with ability to restore the instances from the XML text.
It is a good idea to split "Pascal Object - XML" interface into two interfaces: "Pascal Object - Data object" and "Data object - XML" (see figure 1).
I do not see a dainty solution to extend any Pascal class with XML interface. Instead, let's create a new class to store and manage properties and data fields of an object. A developer will need to write additional code to move data between Pascal object and data object.
Figure 1. XML data interfaces
Design
Our task is to implement two independent interfaces: "Pascal Object - Data Object" and "Data Object - XML".
First interface will enable a developer to replicate data structure of an instance of any class and instance of the SPO class. "SPO" stands for Standard Pascal Object. SPO class will provide data container and set of methods to store and retrieve data.
Second interface will be implemented as a set of routines to convert SPO object to XML text and visa versa.
Design of SPO interface
The main purpose of the SPO class is to store and provide access to a data fields and properties. Because of that, it is much more important to create a convenient data access interface, than to optimize data storage method. After all, we will be able to change data storage method later without affecting the interface.
First of all let's define the data types, supported by our interface. We will consider only five data types in our model:
* String
* Numeric
* Date
* Boolean
* Object
The data type "object" allows us to build complex data structure by nesting one object to another. On figure 2 you may see an example of data object structure:
Figure 2 Data object model
To access data in our object we can use traditional method - to get property of main project, retrieve object from the property, then get the property of the retrieved object, etc:
var
O: TspoObject;
P: TspoProperty;
S: string;
begin
P := AnObject.PropByName['Career position'];
O := P.TheObject;
S := P.PropByName['Title'].Value;
end;
The same long sequence will be necessary to add new property to an object. To simplify access to nested objects and properties we will use path-style resource locator string. For example to get value of the "Start date" field you will use the following syntax:
AnObject['Career position/Start date'] := Now;
To create the object on figure 2 we can use only six lines of code:
with AnObject do begin
AddItem('First name','John');
AddItem('Last name','Smith');
AddItem('Date of birth',EncodeDate(1964,7,23));
AddItem('Career position',NULL,xtObject);
AddItem('Career position/Start date',EncodeDate(1998,9,1));
AddItem('Career position/Title','Director');
end;
Using collections
Sometimes you need to store several objects inside one property. You may index this collection of objects by a number or by keyword. To create a collection in object property you have to create an object and then add as many properties as you wish:
with AnObject do begin
AddItem('List',NULL,xtObject);
for i:=1 to 10 do begin
AddItem('List/'+intToStr(i),'Item'+IntToStr(i));
end;
end;
Design of XML interface
Now we are ready to design an interface to store our SPO in XML text (string). Because XML code is just a sequence of tags, we will design separate routine to format XML string as a XML text with indents. The XML dictionary will be quite simple. To store an object in XML text let's use keyword "object":
Object will contain only properties and methods:
Each property must contain a name, type and value. It can also include default value, and scope:
string
First name
Smith
The full list of used tags will looks like:
* Object
* Property
* Scope
* Type
* Name
* DefaultValue
* Value
* Param
* Method
Construction
There are two units. First unit - CPOM.PAS contains classes to create in-memory data objects: TspoObject, TspoProperty, and TspoMethod. Second unit - CxmlInterface.pas contains routines to convert TspoObject to XML text and vice versa.
You may download these units here.
The development is still in process and I will be happy to hear any comments.