Marking scheme ( TBD—see moodle )

 

 

 

 

 

 

 

 

 

References

       Principles of distributed Database Systems, 3rd edition

            by T. Ozsu and P. Valduriez
            Springer

          

       Selected Papers (TBD)

 

Plagiarism Policy

Note that although cooperation and peer-to-peer learning are encouraged, plagiarism is not permitted.

 

For further information please consult:

   

    Guidelines about how to avoid plagiarism in Computer Science

                   CSCI 6407 Management of Data in Distributed Systems

CSCI 6407  Management of Data in Distributed Systems

Management of data in a distributed environment is not only one of the fundamental problems that most organizations have, are, and will be facing, but it also has significant implications on various research areas that deal with distributed data.  Thus, it is important for graduate students to understand the fundamental issues, theory, and approaches to solutions in managing data in distributed environment.  Furthermore, different context, stemming from different architecture of the various distributed computing approaches, such as Grid or Cloud computing, affect algorithms and mechanisms that use distributed data.  This course will provide the student with background, on management of data in distributed environment, that will be useful in their future professional career and development and also in their further research.  Other goals of this course is to hone the students’ critical thinking through analysis of the application of the theory to the various distributed environments and also to practice and improve their communication skills. 

 

 

 

 

 

 

 

 

Topics

Data Management in Distributed Databases –  Underpinning theory and methods developed for management of data in Databases and Distributed Databases will be overviewed.  This is required as the management of data in the cloud and in distributed, peer-to-peer, and Web systems includes providing solutions to the same, or similar, problems as occur in Distributed Databases, but in an environment consisting of a collection of heterogeneous systems with differing architectures that are used to manage the data, which makes the problems more complex. 

Service Oriented Architecture (SOA) and Web services – SOA will be introduced and selected research topics in web services will be overviewed.  Industry has moved towards SOA in general, and web services in particular, as middleware of choice for integration of software and for services that access data – which has implications on data management. 

Peer-to-Peer Data Management – Research topics for peer-to-peer (P2P) data management systems will be overviewed.  P2P systems contain heterogeneous and autonomous systems without clients and servers.  Infrastructure architecture may also vary and the number of systems participating is massive.

Web Data Management – Research topics discussed will include modeling of the web as a graph, web querying, and distributed XML management. 

Cloud Computing – The current approaches and solutions to data management in the cloud will be overviewed. 

Data Privacy – Privacy regulations and selected privacy frameworks and mechanisms for managing data privacy will be overviewed.

 

Tests

 

Assignments

 

Project

 

Literature Review (Group)

 

Literature Review (Individual)

 

Class Participation