Home | Search | Help  
Home Page Università di Genova

Seminar Details


Date 5-12-2011
Time 15:15
Room/Location DISI-Sala Conferenze III piano
Title Effective Data Density Estimation in Ring-based P2P Networks
Speaker Minqi Zhou
Affiliation Institute of Massive Computing, School of Software Engineering, East China Normal Unive
Link https://www.disi.unige.it/index.php?eventsandseminars/seminars
Abstract Effective Data Density Estimation in Ring-based P2P Networks Derived from file sharing applications, Peer-to-Peer (P2P) systems have been matured and deployed in Cloud Computing systems for scalable, fault-tolerant data management recently, such as Cassandra and, Dynamo. Estimating the global data distribution in such systems becomes even more important. It can benefit many applications, such as load balancing analysis, query processing, and data mining. Inspired by the inversion method for random variate generation, we propose a novel model named distribution-free data density estimation for dynamic ring-based P2P networks to achieve high estimation accuracy with low estimation cost regardless of distribution models of the underlying data. It generates random samples for any arbitrary distribution by sampling the global cumulative distribution function and is free from sampling bias. In P2P networks, the key idea for distribution-free estimation is to sample a small subset of peers for estimating the global data distribution over the data domain, and consequently resulting a accurate estimation with low cost. SPEAKER: Minqi Zhou is currently Associate Professor of Institute of Massive Computing, School of Software Engineering, East China Normal University, China. His is interested in researching and developing effective and efficient distributed data storage, placement, indexing, processing, analysis techniques for data intensive applications. Particularly, he is currently building two systems, one for real-time processing based on Cassandra, and the other for off-line analysis based on Hadoop. He finds new applications for his research in computational advertising recently, and starts a few research work on it, such as advertising on Sina micro blogs.
Back to Seminars