Research
This category contains articles about my main research topic.

Database in the Cloud PDF Print E-mail
Written by Carlo A. Curino   
Tuesday, 17 November 2009 14:25
The advent of cloud computing and hosted software as a service is creating a novel market for data management. Cloud-based DB services are starting to appear, and have the potential to attract customers from very diverse sectors of the market, from small businesses aiming at reducing the total cost of ownership (perfectly suited for multi-tenancy solutions), to very large enterprises (seeking high-profile solutions spanning on potentially thousands of machines and capable of absorbing unexpected burst of traffic). At the same time, the database community is producing new architectures and systems tackling in novel ways various classes of typical DB workloads. These novel, dedicated approaches often outshine the traditional DBMS trying to provide an unreachable "one-size-fit-all" dream.  Such increase in the number of available data management products is exacerbating the problem of selecting, deploying and tuning in-house solutions for data management.  

In this context we are building a cloud-based database architecture, enabling the implementation of DB-as-a-service.  We envision a database provider that (i) provides the illusion of infinite resources, by continuously meeting user expectations under evolving workloads, and (ii) minimizes the operational costs associated to each user by amortizing administrative costs across users and developing software techniques to automate the management of many databases running inside of a single data center.  Thus, the traditional provisioning problem (what resources to allocate for a single database) becomes an optimization issue, where a large user base, multiple DBMS engines, and a very large data center provide an unprecedented opportunity to exploit economy of scale, smart load balancing, higher power efficiency, and principled overselling.

The "relational cloud" infrastructure has several strong points for database users:  (i) predictable and lower costs, proportional to the quality of service and actual workloads, (ii) reduced technical complexity, thanks to a unified access interface and the delegation of DB tuning and administration, and (iii) elasticity and scalability, providing the perception of virtually infinite resources ready at hand. In order to achieve this, we are working on harnessing many recent technological advances in data management by efficiently exploiting multiple DBMS engines (targeting different types of workloads) in a self-balancing solution that optimizes the assignment of resources of very large data centers to serve potentially thousands of users with very diverse needs.  Among the critical research challenges to be faced we have identified: (i) automatic database partitioning, (ii) multi-db, multi-engine workload balancing, (iii) scalable multi-tenancy strategies, (iv) high profile distributed transaction support, (v) automated replication and backup.   These and many others are the key ingredients of the RelationalCloud (http://relationalcloud.com) prototype currently under development.
 
Data Integration PDF Print E-mail
Written by Carlo A. Curino   
Thursday, 11 September 2008 09:39

The current business reality induces frequent acquisitions and merges of companies and organizations, and more and more tight interaction between information systems of cooperating companies. We present our contributions to Context-ADDICT, a framework supporting on-the-fly (semi-) automatic data integration, context modeling and context-aware data filtering. Data Integration is achieved in Context-ADDICT by means of automatic ontology-based data source wrapping and integration. While this approach lays its foundations in a solid theoretical background, it also provides heuristics to solve the practical aspects of data integration in a dynamic context. 

More details on this research effort can be found at: http://kid.dei.polimi.it/

 

 
Java for HW reconfigurable architectures PDF Print E-mail
Written by Carlo A. Curino   
Thursday, 11 September 2008 09:37

Another research topic I have worked on is the exploitation of the Java language as an Hardware Description Language for IP-Core definition in the context of the DRESD project. The main advantage will be the leveraging of the familiarity of the developer with an existing language, and the exploitation of existing IDE for the Java language, while maintaining the performance typical of the Caronte architecture, developed in the DRESD project.

More details can be found at: http://www.dresd.org/

Last Updated on Saturday, 03 January 2009 11:00
 
Wireless Sensor Networks: the TinyLime experience! PDF Print E-mail
Written by Carlo A. Curino   
Friday, 08 February 2008 22:20
In the the past i spent some time working on Wireless Sensor Networks. The most complete result of this research has been the design and development of TinyLIME, in collaboration with GianPietro Picco and Amy Murphy, the original designer and developer of LIME.

TinyLIME is a middleware for wireless sensor networks (WSN) that departs from the traditional WSN setting where sensor data is collected by a central monitoring station, and enables instead multiple mobile monitoring stations to access the sensors in their proximity and share the collected data through wireless links. This intrinsically context-aware setting is demanded by applications where the sensors are sparse and possibly isolated, and where on-site, location-dependent data collection is required. An extension of LIME, TinyLIME makes sensor data available through a tuple space interface, providing the illusion of shared memory between applications and sensors. Data aggregation capabilities and a power-savvy architecture complete the middleware features.

 Please refer to the official webpage for more details and to download the system. 

Last Updated on Friday, 08 February 2008 22:20
 
Schema Evolution PDF Print E-mail
Written by Carlo A. Curino   
Sunday, 13 January 2008 20:28

Under the guidance of Professor Carlo Zaniolo I'm currently work on schema  evolution and Temporal Databases. Every Information System (IS) is subject to an  uninterrupted evolution process aimed at adapting the system to changing requirements. One of the most critical portions (of the IS) to evolve is the data management core.

Often based on relational database technologies, the data  management core of a system needs to evolve whenever the revision process requires modifications in the set of data or in the way they are stored or maintained, e.g., to increase performance. Given its fundamental role, the evolution of the database underlying an IS has a very strong impact on (all) the application(s) accessing the data, and support for a graceful evolution is of paramount importance nowadays. The complexity of database and software maintenance, clearly, grows with the size and complexity of the system.

Furthermore,when moving from intra-company systems, typically managed by rather small and stable teams of developers/administrators, to collaboratively-developed and -maintained public systems, the need for a well-behaved evolution becomes indispensable. In this web-scale scenario, due to its collaborative nature and fast rate of growth, the forces driving a system to change become “wilder”, while stability, shared agreement, and evolution documentation become crucial. This part of the research is thus devoted to the effort of developing methodologies and tools to support seamless database evolution by means of query rewriting and migration support.

Within this context we developed two systems: PRISM, a system to support schema evolution in relational snapshot databases, and PRIMA:a system to support data archival and querying under schema evolution.

 

Last Updated on Friday, 08 August 2008 14:14
 
«StartPrev12NextEnd»

Page 1 of 2
Copyright © 2010 . All Rights Reserved.
Joomla! is Free Software released under the GNU/GPL License.