ACM SIGMOD Athens, Greece, 2011
sigmod pods logo

SIGMOD - Accepted Research Papers

No Free Lunch in Data Privacy
Daniel Kifer*, Penn State; Ashwin Machanavajjhala, Yahoo

A New Approach for Processing Ranked Subsequence Matching Based on Ranked Union
Wook-Shin Han*, Kyungpook National University; Jinsoo Lee, Kyungpook National University; Yang-Sae Moon, ; Seung-won Hwang, ; Hwanjo Yu, POSTECH

Automatic Discovery of Attributes in Relational Databases
Meihui Zhang, National University of Singapore; Marios Hadjieleftheriou*, AT&T Labs - Research; Beng Chin Ooi, National University of Singapore; Cecilia Procopiuc, AT&T Labs - Research; Divesh Srivastava, AT&T Labs - Research

Exact Indexing for Support Vector Machines
Hwanjo Yu*, POSTECH; Ilhwan Ko, POSTECH; Youngdae Kim, POSTECH; Seung-won Hwang, ; Wook-Shin Han, Kyungpook National University

SkimpyStash: RAM Space Skimpy Key-Value Store on Flash-based Storage
Biplob Debnath*, EMC Corporation; Sudipta Sengupta, Microsoft Research, Redmond, USA; Jin Li, Microsoft Research, Redmond, USA

Local Graph Sparsification for Scalable Clustering
Venu Satuluri*, The Ohio State Univeristy; Srinivasan Parthasarathy, The Ohio State University; Yiye Ruan, The Ohio State University

BE-Tree: An Index Structure to Efficiently Match Boolean Expressions over High-dimensional Discrete Space
Mohammad Sadoghi*, University of Toronto; Hans-Arno Jacobsen, University of Toronto

Efficient Parallel Skyline Processing using Hyperplane Projections
Henning Koehler*, University of Queensland; Jing Yang, ; Xiaofang Zhou, University of Queensland

Latent OLAP: Data Cubes over Latent Variables
Bee-Chung Chen*, Yahoo! Research; Deepak Agarwal, Yahoo! Research

How Soccer Players Would Do Stream Joins
Jens Teubner*, ETH Zurich; Rene Mueller, IBM Almaden

Ranking with Uncertain Scoring Functions: Semantics and Sensitivity Measures
Mohamed Soliman*, University of Waterloo; Ihab Ilyas, University of Waterloo ; Davide Martinenghi, Politecnico di Milano; Marco Tagliasacchi, Politecnico di Milano

Querying Uncertain Data with Aggregate Constraints
Mohan Yang*,; Haixun Wang, ; Haiquan Chen, Auburn University; Wei-Shinn Ku,

Collaborative Tagging for Effective Storage and Retrieval of Web 2.0 Data
Meiyu Lu, NUS; Bing Tian Dai*, National Univ of Singapore; Anthny Tung, ; Divyakant Agrawal, UC Santa Barbara

TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets
Chun Chen, College of Computer Science, Zhejiang University; Feng Li, National University of Singapo; Beng Chin Ooi, National University of Singapore; Sai Wu*, National Univ. of Singapore

E-Cube: Multi-Dimensional Event Sequence Analysis Using Hierarchical Pattern Query Sharing
Mo Liu*, Worcester Polytechnic Institut; Elke Rundensteiner, Worcester Polytechnic Institute; Kara Greenfield, Worcester Polytechnic Institute; Chetan Gupta, HP; Song Wang, HP; Abhay Mehta, HP; Ismail Ari, Ozyegin University

Fast Personalized PageRank on MapReduce
Bahman Bahmani, Stanford University; Kaushik Chakrabarti, Microsoft Research; Dong Xin*, Google

Efficient Diversity-Aware Search
Albert Angel*, University of Toronto; Nick Koudas, University of Toronto

Keyword Search over Relational Databases: a Metadata Approach
sonia Bergamaschi, ; Elton Domnori, Universita di Modena e RE; Francesco Guerra, Universita di Modena e RE; Raquel Trillo Lado, University of Zaragoza; Yannis Velegrakis*, University of Trento

MaSM: Efficient Online Updates in Data Warehouses
Manos Athanassoulis*, EPFL; Shimin Chen, Intel Labs Pittsburgh; Phillip Gibbons, Intel; Anastasia Ailamaki, EPFL; Radu Stoica, EPFL

Changing Flights in Mid-air: A Model for Safely Modifying Continuous Queries
Kyumars Sheykh Esmaili, ETH Zurich; Tahmineh Sanamrad, ETH Zurich; Peter Fischer*, ETH Zurich; Nesime Tatbul, ETH Zurich

Entangled queries: enabling declarative data-driven coordination
Nitin Gupta, ; Lucja Kot*, Cornell University; Sudip Roy, ; Gabriel Bender, Cornell University; Johannes Gehrke, ; Christoph Koch,

Labeling Recursive Workflow Executions On-the-Fly
Zhuowei Bao*, University of Pennsylvania; Susan Davidson, University of Pennsylvania; Tova Milo, Tel Aviv University

Processing Theta-Joins using MapReduce
Alper Okcan, Northeastern University; Mirek Riedewald*, Northeastern University

Efficient Query Answering in Probabilistic RDF Graphs
Xiang Lian*, HKUST; Lei Chen, Hong Kong University of Science and Technology

Querying contract databases based on temporal behavior
Elio Damaggio*, UCSD; Alin Deutsch, UCSD; Dayou Zhou, UCSD

Score-Consistent Algebraic Optimization of Full-Text Search Queries with GRAFT
Nathan Bales*, UC San Diego; Alin Deutsch, UCSD; Vasilis Vassalos, AUEB

Mining a Search Engine’s Corpus: Efficient Yet Unbiased Sampling and Aggregate Estimation
Mingyang Zhang*, The George Washington University; Nan Zhang, The George Washington University; Gautam Das, University of Texas at Arlington

Zephyr: Live Migration in Shared Nothing Databases for Elastic Cloud Platforms
Aaron Elmore, UC Santa Barbara; Sudipto Das*, UC Santa Barbara; Divyakant Agrawal, UC Santa Barbara; Amr El Abbadi, UC Santa Barbara

Schedule Optimization for Data Processing Flows on the Cloud
Herald Kllapi*, University Of Athens; Eva Sitaridi, University of Athens; Manolis Tsangaris, University of Athens; Yannis Ioannidis, University of Athens

Operation-Aware Buffer Management in Flash-based Systems
Yanfei Lv*, Peking University; Bin Cui, Pku; Bingsheng He, Nanyang Technological Universi; Xuexuan Chen, Peking University

LazyFTL: A Page-level Flash Translation Layer Optimized for NAND Flash Memory
Dongzhe Ma*, Tsinghua University; Jianhua Feng, Tsinghua University; Guoliang Li, Tsinghua University

Reverse Spatial and Textual k Nearest Neighbors Search
Jiaheng Lu, DEKE, Renmin University of China; Ying Lu*, Renmin University of China; Gao Cong, Nanyang Technological University

We Challenge You to Certify Your Updates
Su Chen, NUS; Xin Luna Dong, AT&T Labs - Research; Laks VS Lakshmanan, ; Divesh Srivastava*, AT&T Labs - Research

Neighborhood Based Fast Graph Search in Large Networks
Arijit Khan*, UCSB; Xifeng Yan, UCSB; Ziyu Guan, UCSB; Nan Li, UCSB; Supriyo Chakraborty, UCLA; Shu Tao, IBM

CrowdDB: Answering Queries with Crowdsourcing
Michael Franklin, UC Berkeley; Donald Kossmann, ETH Zürich; Tim Kraska*, UC Berkeley; Sukriti Ramesh, ETH Zurich; Reynold Xin, UC Berkeley

Assessing and Ranking Structural Correlations in Graphs
Ziyu Guan*, UCSB; Jian Wu, Zhejiang University; Zheng Yun,; Ambuj Singh, UC - Santa Barbara; Xifeng Yan, UCSB

Llama: Leveraging Columnar Storage for Scalable Join Processing in the MapReduce Framework
Yuting Lin*, NUS; Divyakant Agrawal, UC Santa Barbara; Chun Chen, College of Computer Science, Zhejiang University; Beng Chin Ooi, National University of Singapore; Sai Wu, National Univ. of Singapore

Efficient Auditing For Complex SQL Queries
Raghav Kaushik, Microsoft Research; Ravishankar Ramamurthy*, Microsoft Research

Faerie: Efficient Filtering Algorithms for Approximate Dictionary-based Entity Extraction
Guoliang Li*, Tsinghua University; Dong Deng, Tsinghua University; Jianhua Feng, Tsinghua University

Interaction between Record Matching and Data Repairing
Wenfei Fan, University of Edinburgh; Jianzhong Li, Harbin Institute of Technology; Shuai Ma, Beihang University, China; Nan Tang, University of Edinburgh; Wenyuan Yu*, University of Edinburgh

A memory efficient reachability data structure through bit vector compression
Sebastiaan van Schaik*, University of Oxford; Oege de Moor, University of Oxford Computing Laboratory

Incremental Graph Pattern Matching
Wenfei Fan, University of Edinburgh; Jianzhong Li, Harbin Institute of Technology; Jizhou Luo, Harbin Institute of Technology; Zijing Tan, ; Xin Wang, University of Edinburgh; Yinghui Wu*, University of Edinburgh

WHAM: A High-throughput Sequence Alignment Method
Yinan Li*, University of Wisconsin; Allison Terrell, University of Wisconsin; Jignesh Patel, University of Wisconsin

Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs
Spyros Blanas, University of Wisconsin; Yinan Li*, University of Wisconsin; Jignesh Patel, University of Wisconsin

Performance prediction for concurrent database workloads
Jennie Duggan*, Brown University; Ugur Cetintemel, Brown University; Olga Papaemmanouil, Brandeis University; Eli Upfal, Brown University

Query Optimization Techniques for Partitioned Tables
Herodotos Herodotou*, Duke University; Nedyalko Borisov, Duke University; Shivnath Babu, Duke University

TrustedDB: A Trusted Hardware Based Database with Privacy and Data Confidentiality
Sumeet Bajaj*, Stony Brook University; Radu Sion, Stony Brook University

On k-skip shortest paths
Yufei Tao*, Chinese Univ. of Hong Kong; Cheng Sheng, The Chinese University of Hong Kong; JIan Pei, Simon Fraser Univ.

Apples and Oranges: A Comparison of RDF Benchmarks and Real RDF Datasets
Songyun Duan, IBM T. J. Watson; Anastasios Kementsietsidis*, IBM Research - Thomas J. Watson Research Ctr.; Kavitha Srinivas, IBM T. J. Watson; Octavian Udrea, IBM T.J. Watson

Location-Aware Type Ahead Search on Spatial Databases: Semantics and Efficiency
Senjuti Basu Roy*, UT Arlington; Kaushik Chakrabarti, Microsoft Research

Data Generation using Declarative Constraints
Arvind Arasu*, Microsoft Research; Raghav Kaushik, Microsoft Research; Jian Li, University of Maryland, College Park

Jigsaw: Efficient optimization over uncertain enterprise data
Oliver Kennedy*, Cornell University; Suman Nath, Microsoft

Fast Checkpoint Recovery Algorithms for Frequently Consistent Applications
Tuan Cao*, Cornell University; Marcos Vaz Salles, Cornell University; Benjamin Sowell, Cornell University; Yao Yue, Cornell University; Johannes Gehrke, ; Alan Demers , Cornell University; Walker White, Cornell University

Tracing Data Errors with View-Conditioned Causality
Alexandra Meliou*, University of Washington; Wolfgang Gatterbauer, University of Washington; Suman Nath, Microsoft; Dan Suciu, University of Washington

Sharing work in Keyword Search over Databases
Marie Jacob*, University Of Pennsylvania; Zack Ives, University of Pennsylvania

Efficient Exact Edit Similarity Query Processing with Asymmetric Signature Schemes
Jianbin Qin*, University of New South Wales; Wei Wang, UNSW; Yifei Lu, UNSW; Chuan Xiao, UNSW; Xuemin Lin, UNSW

Effective Data Co-Reduction for Multimedia Similarity Search
Zi Huang, University of Queensland; Heng Tao Shen*, University of Queensland; Jiajun Liu, University of Queensland; Xiaofang Zhou, University of Queensland

Designing and Refining Schema Mappings via Data Examples
Bogdan Alexe, UC Santa Cruz; Balder ten Cate, UC Santa Cruz; Phokion Kolaitis, UC Santa Cruz; Wang-Chiew Tan*, IBM Research - Almaden and UC Santa Cruz

Warding off the Dangers of Data Corruption with Amulet
Nedyalko Borisov*, Duke University; Shivnath Babu, Duke University; Nagapramod Mandagere, IBM Almaden Research Center; Sandeep Uttamchandani,

Facet Discovery for Structured Web Search: A Query-log Mining Approach
Jeffrey Pound*, University of Waterloo; Stelios Paparizos, Microsoft Research; Panayiotis Tsaparas,

Hybrid In-Database Inference for Declarative Information Extraction
Daisy Zhe Wang*, UC Berkeley; Michael Franklin, UC Berkeley; Joseph Hellerstein, ; Minos Garofalakis, ; Michael Wick,

ATLAS: A Probabilistic Algorithm for High Dimensional Similarity Search
Jiaqi Zhai*, Cornell University; Yin Lou, Cornell University; Johannes Gehrke,

ArrayStore: A Storage Manager for Complex Parallel Array Processing
Emad Soroush*, University of Washington; Magdalena Balazinska, University of Washington; Daniel Wang, SLAC

Flexible Aggregate Similarity Search
Yang Li, Shanghai Jiao Tong University; Feifei Li*, FSU; Ke Yi, HKUST; Bin Yao, Florida State University; Min Wang, HP Labs China

A Latency and Fault-Tolerance Optimizer for Online Parallel Query Processing
Prasang Upadhyaya*, University of Washington; YongChul Kwon, University of Washington; Magdalena Balazinska, University of Washington

Neighborhood-Privacy Protected Shortest Distance Computing in Cloud
Jun Gao*, Peking University; Jeffrey Xu Yu, The Chinese University of Hong Kong; Ruoming Jin, Kent State University; Jiashuai Zhou, ; Tengjiao Wang, ; dongqing Yang,

Context-sensitive Ranking for Document Retrieval
Liang Chen*, UCSD; Yannis Papakonstantinou,

Efficient and Generic Evaluation of Ranked Queries
Wen Jin*, ; Jignesh Patel, University of Wisconsin

Graph Cube: On Warehousing and OLAP Multidimensional Networks
Peixiang Zhao*, UIUC; Xiaolei Li, Microsoft Cooperation; Dong Xin, Google; Jiawei Han, UIUC

Sampling Based Algorithms for Quantile Computation in Sensor Networks
Zengfeng Huang, ; Lu Wang, ; Ke Yi*, HKUST; Yunhao Liu,

Leveraging Query Logs for Schema Mapping Generation in U-MAP
Hazem Elmeleegy*, Purdue University; Ahmed Elmagarmid, Qatar Computing Research Institute; Jaewoo Lee, Purdue University

Differentially Private Data Cubes: Optimizing Noise Sources and Consistency
Bolin Ding*, UIUC; Marianne Winslett, University of Illinois; Jiawei Han, UIUC; Zhenhui Li, UIUC

Finding Shortest Path on Land Surface
Lian Liu, HKUST; Raymond Chi-Wing Wong*, The Hong Kong University of Sc

Skyline Query Processing over Joins
Akrivi Vlachou*, NTNU; Christos Doulkeridis, NTNU; Neoklis Polyzotis, University of California, Santa Cruz

iReduct: Differential Privacy with Reduced Relative Errors
Xiaokui Xiao*, Nanyang Technological Univ; Gabriel Bender, Cornell University; Michael Hay, Cornell University; Johannes Gehrke,

Workload-Aware Database Monitoring and Consolidation
Carlo Curino*, Mit; Evan Jones, MIT; Sam Madden, MIT; Hari Balakrishnan, MIT

Finding Semantics in Time Series
Peng Wang*, Fudan University; Haixun Wang, ; Wei Wang, Fudan University

Attribute Domain Discovery for Hidden Web Databases
Xin Jin, The George Washington University; Nan Zhang*, The George Washington University; Gautam Das, University of Texas at Arlington

Collective Spatial Keyword Querying
Xin Cao*, Singapore NTU; Gao Cong, Nanyang Technological University; Christian Jensen, Aarhus University; Beng Chin Ooi, National University of Singapore

Nearest Keyword Search in XML Documents
Yufei Tao*, Chinese Univ. of Hong Kong; Stavros Papadopoulos, Chinese Univ. of Hong Kong; Cheng Sheng, The Chinese University of Hong Kong; Kostas Stefanidis, Chinese Univ. of Hong Kong

Sensitivity Analysis and Explanations for Robust Query Evaluation in Probabilistic Databases
Bhargav Kanagal*, University of Maryland; Jian Li, University of Maryland, College Park; Amol Deshpande, "University of Maryland, College Park"

Joint Unsupervised Structure Discovery and Information Extraction
Eli Custodio Vilarinho*, Federal University of Amazonas; Altigran da Silva, Federal University of Amazonas; Daniel Oliveira, Federal University of Amazonas; Edleno de Moura, Federal University of Amazonas; Alberto Laender, Federal University of Minas Gerais

Advancing Data Clustering via Projective Clustering Ensembles
Carlotta Domeniconi, George Mason University; Francesco Gullo*, DEIS, University of Calabria; Andrea Tagarelli, DEIS, University of Calabria

A Platform for Scalable One-pass Analytics using MapReduce
Boduo Li*, UMass Amherst; Edward Mazur, UMass Amherst; Yanlei Diao, "University of Massachusetts, Amherst"; Andrew McGregor, UMass Amherst; Prashant Shenoy, UMass Amherst

Scalable Query Rewriting: A Graph-Based Approach
George Konstantinidis*, ISI/USC; José Luis Ambite, ISI/USC

Predicting cost amortization for query services
Verena Kantere*, Cyprus University of Technology; Debabrata Dash, ArcSight, an HP Company; Georgios Gratsias, ELCA Informatique SA; Anastasia Ailamaki, EPFL

More efficient Datalog queries: Subsumptive tabling beats magic sets
K. Tuncay Tekle*, SUNY Stony Brook; Yanhong A. Liu, SUNY Stony Brook