USQ Australia
Home | Dept. M & C | USQ  
USQ Home Page Banner






Concurrency Control for Semi-structured Data and XML

Introduction

With the advent of native and XML-enabled databases, a new look at traditional tasks provided by DBMSs has become necessary. In particular, several papers discussed on this page have shown that classic concurrency control mechanisms developed for relational, hierarchical, and object oriented databases are no longer adequate for semi-structured databases.

The aim of this page is to present our own research in this emerging field, and to provide links to other papers on the subject.

Instance-dependent CC Methods

This section is based mostly on research done by Jan Hidders (UA) and Stijn Dekeyser (USQ). Click here for a list of publications.

In our Kluwer WWW Journal article A Transaction Model for XML Databases, we present the following concepts to solve the concurrency control problem for XML data:

  • A data model of documents and query and update languages as seen and used by the user transactions and by the scheduler subsystem of the document server. So far, the model does not take order among sibling elements into account, and both the query (based on XPath) and update language are rather limited in expressive power.
  • Two alternative and equivalent Path Locking schemes: PL-Prop (for propagation) and PL-Sat (for satisfiability). There is a trade-off between space and time complexity between both schemes, in regards to checking for conflicts between read and write locks. The locks are associated to nodes in the document, hence the instance-dependent nature of this work.
  • Two alternative Schedulers that can use both PL-Prop and PL-Sat locking schemes. The Commit scheduler does not allow conflicts between read and write locks to arise, whereas the Conflict scheduler does, but keeps a dependency graph. Importantly, we show that both schedulers guarantee serializability of the schedules they allow.
Other results include a local necessity proof which shows that the conflict rules presented for path locks are indeed necessary.

Many issues remain, so our research in this field is ongoing. We also hope to work on an implementation soon.

Instance-Independent CC Methods

This section is based mostly on research done by Jan Hidders, Jan Paredaens, Roel Vercammen (all at UA), and Stijn Dekeyser (USQ). Click here for a list of publications.

In contrast to the research mentioned above, in this section we assume that we have no knowledge of the document instance, or (for reasons of efficiency) do not want to use that knowledge. As a result, our approach here is more fundamental and general in nature compared to the more applied nature of the research above.
We present a framework that is document-independent in the sense that two schedules of semistructured transactions are equivalent if they are equivalent on all possible documents. We prove that it is decidable in polynomial time and space whether two given schedules in this framework are equivalent. This also solves the serializability issue for semistructured schedules, polynomially in the size of the schedule and exponentially in the number of transactions.

Other Research Papers

To date (August '04), a few other papers have appeared which try to solve the issues described above.

 
Suggestions?   Do you know of other papers related to concurrency control for XML? Please let us know. Thank you.

 


  CRICOS: QLD 00244B | NSW 02225M | VIC 02387D Updated 2 August 2004 | Stijn Dekeyser