ACM International Conference on Management of Data, SIGMOD 2014


Research artifacts: A research artifact is any by-product of a research project that is not directly included in the published research paper. In computer science research this is often source code and data sets, but it could also be media, documentation, inputs to proof assistants, shell scripts to run experiments, etc.

A formal approach to finding explanations for database queries

Sudeepa Roy, Dan Suciu

Discussion Comments: 0
Verification: Authors have not verified information

Parallel data analysis directly on scientific file formats

Spyros Blanas, Kesheng Wu, Surendra Byna, Bin Dong, Arie Shoshani

Discussion Comments: 0
Verification: Authors have not verified information

Multi-dimensional data statistics for columnar in-memory databases

Curtis Kroetsch

Discussion Comments: 0
Verification: Author has not verified information

iCheck: computationally combating "lies, d-ned lies, and statistics"

You Wu, Brett Walenz, Peggy Li, Andrew Shim, Emre Sonmez, Pankaj K. Agarwal, Chengkai Li, Jun Yang, Cong Yu

Discussion Comments: 0
Verification: Authors have not verified information

Exploiting ordered dictionaries to efficiently construct histograms with q-error guarantees in SAP HANA

Guido Moerkotte, David DeHaan, Norman May, Anisoara Nica, Alexander Böhm

Discussion Comments: 0
Verification: Authors have not verified information

Incremental elasticity for array databases

Jennie Duggan, Michael Stonebraker

Discussion Comments: 0
Verification: Authors have not verified information

Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation

Qi Li, Yaliang Li, Jing Gao, Bo Zhao, Wei Fan, Jiawei Han

Discussion Comments: 0
Verification: Authors have not verified information

TriAD: a distributed shared-nothing RDF engine based on asynchronous message passing

Sairam Gurajada, Stephan Seufert, Iris Miliaraki, Martin Theobald

Discussion Comments: 0
Verification: Authors have not verified information

Efficient cohesive subgraphs detection in parallel

Yingxia Shao, Lei Chen, Bin Cui

Discussion Comments: 0
Verification: Authors have not verified information

Parallel subgraph listing in a large-scale graph

Yingxia Shao, Bin Cui, Lei Chen, Lin Ma, Junjie Yao, Ning Xu

Discussion Comments: 0
Verification: Authors have not verified information

Mining statistically significant connected subgraphs in vertex labeled graphs

Akhil Arora, Mayank Sachan, Arnab Bhattacharya

Discussion Comments: 0
Verification: Authors have not verified information

PrivBayes: private data release via bayesian networks

Jun Zhang, Graham Cormode, Cecilia M. Procopiuc, Divesh Srivastava, Xiaokui Xiao

Discussion Comments: 0
Verification: Authors have not verified information

Orca: a modular query optimizer architecture for big data

Mohamed A. Soliman, Lyublena Antova, Venkatesh Raghavan, Amr El-Helw, Zhongxian Gu, Entong Shen, George C. Caragea, Carlos Garcia-Alvarado, Foyzur Rahman, Michalis Petropoulos, Florian Waas, Sivaramakrishnan Narayanan, Konstantinos Krikellas, Rhonda Baldwin

Discussion Comments: 0
Verification: Authors have not verified information

PLANET: making progress with commit processing in unpredictable environments

Gene Pang, Tim Kraska, Michael J. Franklin, Alan Fekete

Discussion Comments: 0
Verification: Authors have not verified information

DataSift: a crowd-powered search toolkit

Aditya G. Parameswaran, Ming Han Teh, Hector Garcia-Molina, Jennifer Widom

Discussion Comments: 0
Verification: Authors have not verified information

Tripartite graph clustering for dynamic sentiment analysis on social media

Linhong Zhu, Aram Galstyan, James Cheng, Kristina Lerman

Discussion Comments: 0
Verification: Authors have not verified information

Parallel in-situ data processing with speculative loading

Yu Cheng, Florin Rusu

Discussion Comments: 0
Verification: Authors have not verified information

A pivotal prefix based filtering algorithm for string similarity search

Dong Deng, Guoliang Li, Jianhua Feng

Discussion Comments: 0
Verification: Authors have not verified information

Navigating the maze of graph analytics frameworks using massive graph datasets

Nadathur Satish, Narayanan Sundaram, Md. Mostofa Ali Patwary, Jiwon Seo, Jongsoo Park, M. Amber Hassaan, Shubho Sengupta, Zhaoming Yin, Pradeep Dubey

Discussion Comments: 0
Verification: Authors have not verified information

Query shredding: efficient relational evaluation of queries over nested multisets

James Cheney, Sam Lindley, Philip Wadler

Author Comments: The "shredding" branch was merged into the main implementation of Links and the functionality is now available (optionally) in the main Links distribution, which can also be installed using the OPAM package manager.
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

EDS: a segment-based distance measure for sub-trajectory similarity search

Min Xie

Discussion Comments: 0
Verification: Author has not verified information

Answering top-k representative queries on graph databases

Sayan Ranu, Minh X. Hoang, Ambuj K. Singh

Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

OASSIS: query driven crowd mining

Yael Amsterdamer, Susan B. Davidson, Tova Milo, Slava Novgorodov, Amit Somech

Discussion Comments: 0
Verification: Authors have not verified information

PriView: practical differentially private release of marginal contingency tables

Wahbeh H. Qardaji, Weining Yang, Ninghui Li

Discussion Comments: 0
Verification: Authors have not verified information

InsightNotes: summary-based annotation management in relational databases

Dongqing Xiao, Mohamed Y. Eltabakh

Discussion Comments: 0
Verification: Authors have not verified information

Querying big graphs within bounded resources

Wenfei Fan, Xin Wang, Yinghui Wu

Discussion Comments: 0
Verification: Authors have not verified information

A software-defined networking based approach for performance management of analytical queries on distributed data stores

PengCheng Xiong, Hakan Hacigümüs, Jeffrey F. Naughton

Discussion Comments: 0
Verification: Authors have not verified information

HAWQ: a massively parallel processing SQL engine in hadoop

Lei Chang, Zhanwei Wang, Tao Ma, Lirong Jian, Lili Ma, Alon Goldshuv, Luke Lonergan, Jeffrey Cohen, Caleb Welton, Gavin Sherry, Milind Bhandarkar

Discussion Comments: 0
Verification: Authors have not verified information

Towards unified ad-hoc data processing

Xiaogang Shi, Bin Cui, Gillian Dobbie, Beng Chin Ooi

Discussion Comments: 0
Verification: Authors have not verified information

Schema-free SQL

Fei Li, Tianyin Pan, Hosagrahar Visvesvaraya Jagadish

Discussion Comments: 0
Verification: Authors have not verified information

Durable write cache in flash memory SSD for relational and NoSQL databases

Woon-Hak Kang, Sang-Won Lee, Bongki Moon, Yang-Suk Kee, Moonwook Oh

Discussion Comments: 0
Verification: Authors have not verified information

H2RDF+: an efficient data management system for big RDF graphs

Nikolaos Papailiou, Dimitrios Tsoumakos, Ioannis Konstantinou, Panagiotis Karras, Nectarios Koziris

Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Overlap interval partition join

Anton Dignös, Michael H. Böhlen, Johann Gamper

Discussion Comments: 0
Verification: Authors have not verified information

Which concepts are worth extracting?

Arash Termehchy, Ali Vakilian, Yodsawalai Chodpathumwan, Marianne Winslett

Discussion Comments: 0
Verification: Authors have not verified information

Interactive redescription mining

Esther Galbrun, Pauli Miettinen

Discussion Comments: 0
Verification: Authors have not verified information

NaLIR: an interactive natural language interface for querying relational databases

Fei Li, Hosagrahar Visvesvaraya Jagadish

Discussion Comments: 0
Verification: Authors have not verified information

Sloth: being lazy is a virtue (when issuing database queries)

Alvin Cheung, Samuel Madden, Armando Solar-Lezama

Discussion Comments: 0
Verification: Authors have not verified information

Storm@twitter

Ankit Toshniwal, Siddarth Taneja, Amit Shukla, Karthikeyan Ramasamy, Jignesh M. Patel, Sanjeev Kulkarni, Jason Jackson, Krishna Gade, Maosong Fu, Jake Donham, Nikunj Bhagat, Sailesh Mittal, Dmitriy V. Ryaboy

Discussion Comments: 0
Verification: Authors have not verified information

OPT: a new framework for overlapped and parallel triangulation in large-scale graphs

Jinha Kim, Wook-Shin Han, Sangyeon Lee, Kyungyeol Park, Hwanjo Yu

Discussion Comments: 0
Verification: Authors have not verified information

Discovering queries based on example tuples

Yanyan Shen, Kaushik Chakrabarti, Surajit Chaudhuri, Bolin Ding, Lev Novik

Discussion Comments: 0
Verification: Authors have not verified information

Complex event analytics: online aggregation of stream sequence patterns

Yingmei Qi, Lei Cao, Medhabi Ray, Elke A. Rundensteiner

Discussion Comments: 0
Sharing: Research produced no artifacts
Verification: Authors have verified information

Major technical advancements in apache hive

Yin Huai, Ashutosh Chauhan, Alan Gates, Günther Hagleitner, Eric N. Hanson, Owen O'Malley, Jitendra Pandey, Yuan Yuan, Rubao Lee, Xiaodong Zhang

Discussion Comments: 0
Verification: Authors have not verified information

Scalable atomic visibility with RAMP transactions

Peter Bailis, Alan Fekete, Joseph M. Hellerstein, Ali Ghodsi, Ion Stoica

Discussion Comments: 0
Verification: Authors have not verified information

Versatile optimization of UDF-heavy data flows with sofa

Astrid Rheinländer, Martin Beckmann, Anja Kunkel, Arvid Heise, Thomas Stoltmann, Ulf Leser

Discussion Comments: 0
Verification: Authors have not verified information

Matching heterogeneous event data

Xiaochen Zhu, Shaoxu Song, Xiang Lian, Jianmin Wang, Lei Zou

Discussion Comments: 0
Verification: Authors have not verified information

Demonstrating efficient query processing in heterogeneous environments

Tomas Karnagel, Matthias Hille, Mario Ludwig, Dirk Habich, Wolfgang Lehner, Max Heimel, Volker Markl

Discussion Comments: 0
Verification: Authors have not verified information

CrowdMatcher: crowd-assisted schema matching

Chen Jason Zhang, Ziyuan Zhao, Lei Chen, H. V. Jagadish, Caleb Chen Cao

Discussion Comments: 0
Verification: Authors have not verified information

Optimizing queries over partitioned tables in MPP systems

Lyublena Antova, Amr El-Helw, Mohamed A. Soliman, Zhongxian Gu, Michalis Petropoulos, Florian Waas

Discussion Comments: 0
Verification: Authors have not verified information

One DBMS for all: the brawny few and the wimpy crowd

Tobias Mühlbauer, Wolf Rödiger, Robert Seilbeck, Angelika Reiser, Alfons Kemper, Thomas Neumann

Discussion Comments: 0
Verification: Authors have not verified information

Spatio-temporal visual analysis for event-specific tweets

Mashaal Musleh

Discussion Comments: 0
Verification: Author has not verified information

On complexity and optimization of expensive queries in complex event processing

Haopeng Zhang, Yanlei Diao, Neil Immerman

Discussion Comments: 0
Verification: Authors have not verified information

A user interaction based community detection algorithm for online social networks

Himel Dev

Discussion Comments: 0
Verification: Author has not verified information

Local search of communities in large graphs

Wanyun Cui, Yanghua Xiao, Haixun Wang, Wei Wang

Discussion Comments: 0
Verification: Authors have not verified information

Palette: enabling scalable analytics for big-memory, multicore machines

Fei Chen, Tere Gonzalez, Jun Li, Manish Marwah, Jim Pruyne, Krishnamurthy Viswanathan, Mijung Kim

Discussion Comments: 0
Verification: Authors have not verified information

NewsNetExplorer: automatic construction and exploration of news information networks

Fangbo Tao, George Brova, Jiawei Han, Heng Ji, Chi Wang, Brandon Norick, Ahmed El-Kishky, Jialu Liu, Xiang Ren, Yizhou Sun

Discussion Comments: 0
Verification: Authors have not verified information

Druid: a real-time analytical data store

Fangjin Yang, Eric Tschetter, Xavier Léauté, Nelson Ray, Gian Merlino, Deep Ganguli

Discussion Comments: 0
Verification: Authors have not verified information

OceanRT: real-time analytics over large temporal data

Shiming Zhang, Yin Yang, Wei Fan, Liang Lan, Mingxuan Yuan

Discussion Comments: 0
Verification: Authors have not verified information

Approximation schemes for many-objective query optimization

Immanuel Trummer, Christoph Koch

Discussion Comments: 0
Verification: Authors have not verified information

Resource-oriented approximation for frequent itemset mining from bursty data streams

Yoshitaka Yamamoto, Koji Iwanuma, Shoshi Fukuda

Discussion Comments: 0
Verification: Authors have not verified information

On-the-fly token similarity joins in relational databases

Nikolaus Augsten, Armando Miraglia, Thomas Neumann, Alfons Kemper

Discussion Comments: 0
Verification: Authors have not verified information

An application-specific instruction set for accelerating set-oriented database primitives

Oliver Arnold, Sebastian Haas, Gerhard P. Fettweis, Benjamin Schlegel, Thomas Kissinger, Wolfgang Lehner

Discussion Comments: 0
Sharing: Not able to share produced artifacts
Verification: Authors have verified information

DSH: data sensitive hashing for high-dimensional k-NN search

Jinyang Gao, Hosagrahar Visvesvaraya Jagadish, Wei Lu, Beng Chin Ooi

Discussion Comments: 0
Verification: Authors have not verified information

ERIS live: a NUMA-aware in-memory storage engine for tera-scale multiprocessor systems

Tim Kiefer, Thomas Kissinger, Benjamin Schlegel, Dirk Habich, Daniel Molka, Wolfgang Lehner

Discussion Comments: 0
Verification: Authors have not verified information

Demonstration of the Myria big data management service

Daniel Halperin, Victor Teixeira de Almeida, Lee Lee Choo, Shumo Chu, Paraschos Koutris, Dominik Moritz, Jennifer Ortiz, Vaspol Ruamviboonsuk, Jingjing Wang, Andrew Whitaker, Shengliang Xu, Magdalena Balazinska, Bill Howe, Dan Suciu

Discussion Comments: 0
Verification: Authors have not verified information

VQA: vertica query analyzer

Alkis Simitsis, Kevin Wilkinson, Jason Blais, Joe Walsh

Discussion Comments: 0
Verification: Authors have not verified information

Efficient algorithms for optimal location queries in road networks

Zitong Chen, Yubao Liu, Raymond Chi-Wing Wong, Jiamin Xiong, Ganglin Mai, Cheng Long

Discussion Comments: 0
Verification: Authors have not verified information

SerpentTI: flexible analytics of users, boards and domains for pinterest

Alex Cheng, Mary Malit, Chuanxi Zhang, Nick Koudas

Discussion Comments: 0
Verification: Authors have not verified information

LINVIEW: incremental view maintenance for complex analytical queries

Milos Nikolic, Mohammed Elseidy, Christoph Koch

Discussion Comments: 0
Verification: Authors have not verified information

Patience is a virtue: revisiting merge and sort on modern processors

Badrish Chandramouli, Jonathan Goldstein

Discussion Comments: 0
Verification: Authors have not verified information

Fine-grained partitioning for aggressive data skipping

Liwen Sun, Michael J. Franklin, Sanjay Krishnan, Reynold S. Xin

Discussion Comments: 0
Verification: Authors have not verified information

ONTOCUBO: cube-based ontology construction and exploration

Carlos Garcia-Alvarado, Carlos Ordonez

Discussion Comments: 0
Verification: Authors have not verified information

SLQ: a user-friendly graph querying system

Shengqi Yang, Yanan Xie, Yinghui Wu, Tianyi Wu, Huan Sun, Jian Wu, Xifeng Yan

Discussion Comments: 0
Verification: Authors have not verified information

The pursuit of a good possible world: extracting representative instances of uncertain graphs

Panos Parchas, Francesco Gullo, Dimitris Papadias, Francesco Bonchi

Discussion Comments: 0
Verification: Authors have not verified information

In search of influential event organizers in online social networks

Kaiyu Feng, Gao Cong, Sourav S. Bhowmick, Shuai Ma

Discussion Comments: 0
Verification: Authors have not verified information

An extendable framework for managing uncertain spatio-temporal data

Tobias Emrich, Maximilian Franzke, Hans-Peter Kriegel, Johannes Niedermayer, Matthias Renz, Andreas Züfle

Discussion Comments: 0
Verification: Authors have not verified information

AutoPlait: automatic mining of co-evolving time sequences

Yasuko Matsubara, Yasushi Sakurai, Christos Faloutsos

Discussion Comments: 0
Verification: Authors have not verified information

Reactive and proactive sharing across concurrent analytical queries

Iraklis Psaroudakis, Manos Athanassoulis, Matthaios Olma, Anastasia Ailamaki

Discussion Comments: 0
Verification: Authors have not verified information

Modeling entity evolution for temporal record matching

Yueh-Hsuan Chiang, AnHai Doan, Jeffrey F. Naughton

Discussion Comments: 0
Verification: Authors have not verified information

Blowfish privacy: tuning privacy-utility trade-offs using policies

Xi He, Ashwin Machanavajjhala, Bolin Ding

Discussion Comments: 0
Verification: Authors have not verified information

MISO: souping up big data query processing with a multistore system

Jeff LeFevre, Jagan Sankaranarayanan, Hakan Hacigümüs, Jun'ichi Tatemura, Neoklis Polyzotis, Michael J. Carey

Discussion Comments: 0
Verification: Authors have not verified information

ABS: a system for scalable approximate queries with accuracy guarantees

Kai Zeng, Shi Gao, Jiaqi Gu, Barzan Mozafari, Carlo Zaniolo

Discussion Comments: 0
Verification: Authors have not verified information

A sample-and-clean framework for fast and accurate query processing on dirty data

Jiannan Wang, Sanjay Krishnan, Michael J. Franklin, Ken Goldberg, Tim Kraska, Tova Milo

Discussion Comments: 0
Verification: Authors have not verified information

Scalable big graph processing in MapReduce

Lu Qin, Jeffrey Xu Yu, Lijun Chang, Hong Cheng, Chengqi Zhang, Xuemin Lin

Discussion Comments: 0
Verification: Authors have not verified information

Cloud-based RDF data management

Zoi Kaoudi, Ioana Manolescu

Discussion Comments: 0
Verification: Authors have not verified information

The PH-tree: a space-efficient storage structure and multi-dimensional index

Tilmann Zäschke, Christoph Zimmerli, Moira C. Norrie

Discussion Comments: 0
Verification: Authors have not verified information

Density-based place clustering in geo-social networks

Jieming Shi, Nikos Mamoulis, Dingming Wu, David W. Cheung

Discussion Comments: 0
Verification: Authors have not verified information

EAGr: supporting continuous ego-centric aggregate queries over large dynamic graphs

Jayanta Mondal, Amol Deshpande

Discussion Comments: 0
Verification: Authors have not verified information

Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age

Viktor Leis, Peter A. Boncz, Alfons Kemper, Thomas Neumann

Discussion Comments: 0
Verification: Authors have not verified information

Scalable similarity search for SimRank

Mitsuru Kusumoto, Takanori Maehara, Ken-ichi Kawarabayashi

Discussion Comments: 0
Verification: Authors have not verified information

Privacy preserving social graphs for high precision community detection

Himel Dev

Discussion Comments: 0
Verification: Author has not verified information

Re-evaluating designs for multi-tenant OLTP workloads on SSD-based I/O subsystems

Ning Zhang, Jun'ichi Tatemura, Jignesh M. Patel, Hakan Hacigümüs

Discussion Comments: 0
Verification: Authors have not verified information

A comprehensive study of main-memory partitioning and its application to large-scale comparison- and radix-sort

Orestis Polychroniou, Kenneth A. Ross

Discussion Comments: 0
Verification: Authors have not verified information

Mining latent entity structures from massive unstructured and interconnected data

Jiawei Han, Chi Wang

Discussion Comments: 0
Verification: Authors have not verified information

A comparison of platforms for implementing and running very large scale machine learning algorithms

Zhuhua Cai, Zekai J. Gao, Shangyu Luo, Luis Leopoldo Perez, Zografoula Vagena, Christopher M. Jermaine

Discussion Comments: 0
Verification: Authors have not verified information

Similarity joins for uncertain strings

Manish Patil, Rahul Shah

Discussion Comments: 0
Verification: Authors have not verified information

GenBase: a complex analytics genomics benchmark

Rebecca Taft, Manasi Vartak, Nadathur Rajagopalan Satish, Narayanan Sundaram, Samuel Madden, Michael Stonebraker

Discussion Comments: 0
Verification: Authors have not verified information

Knowing when you're wrong: building fast and reliable approximate query processing systems

Sameer Agarwal, Henry Milner, Ariel Kleiner, Ameet Talwalkar, Michael I. Jordan, Samuel Madden, Barzan Mozafari, Ion Stoica

Discussion Comments: 0
Verification: Authors have not verified information

NADEEF/ER: generic and interactive entity resolution

Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin

Discussion Comments: 0
Verification: Authors have not verified information

Opportunistic physical design for big data analytics

Jeff LeFevre, Jagan Sankaranarayanan, Hakan Hacigümüs, Jun'ichi Tatemura, Neoklis Polyzotis, Michael J. Carey

Discussion Comments: 0
Verification: Authors have not verified information

SpongeFiles: mitigating data skew in mapreduce using distributed memory

Khaled Elmeleegy, Christopher Olston, Benjamin Reed

Discussion Comments: 0
Verification: Authors have not verified information

Towards indexing functions: answering scalar product queries

Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann

Discussion Comments: 0
Verification: Authors have not verified information

Complete yet practical search for minimal query reformulations under constraints

Ioana Ileana, Bogdan Cautis, Alin Deutsch, Yannis Katsis

Discussion Comments: 0
Verification: Authors have not verified information

CrowdFill: collecting structured data from the crowd

Hyunjung Park, Jennifer Widom

Discussion Comments: 0
Verification: Authors have not verified information

Interactive data exploration using semantic windows

Alexander Kalinin, Ugur Çetintemel, Stanley B. Zdonik

Discussion Comments: 0
Verification: Authors have not verified information

Histograms as a side effect of data movement for big data

Zsolt István, Louis Woods, Gustavo Alonso

Discussion Comments: 0
Verification: Authors have not verified information

IQR: an interactive query relaxation system for the empty-answer problem

Davide Mottin, Alice Marascu, Senjuti Basu Roy, Gautam Das, Themis Palpanas, Yannis Velegrakis

Discussion Comments: 0
Verification: Authors have not verified information

Influence maximization: near-optimal time complexity meets practical efficiency

Youze Tang, Xiaokui Xiao, Yanchen Shi

Discussion Comments: 0
Verification: Authors have not verified information

PackageBuilder: querying for packages of tuples

Kevin Fernandes, Matteo Brucato, Rahul Ramakrishna, Azza Abouzied, Alexandra Meliou

Discussion Comments: 0
Verification: Authors have not verified information

Materialization optimizations for feature selection workloads

Ce Zhang, Arun Kumar, Christopher Ré

Discussion Comments: 0
Verification: Authors have not verified information

Efficient location-aware influence maximization

Guoliang Li, Shuo Chen, Jianhua Feng, Kian-Lee Tan, Wen-Syan Li

Discussion Comments: 0
Verification: Authors have not verified information

MeanKS: meaningful keyword search in relational databases with complex schema

Mehdi Kargar, Aijun An, Nick Cercone, Parke Godfrey, Jaroslaw Szlichta, Xiaohui Yu

MeanKS: meaningful keyword search in relational databases with complex schema

Details
Discussion Comments: 0
Verification: Authors have not verified information

How to stop under-utilization and love multicores

Anastasia Ailamaki, Erietta Liarou, Pinar Tözün, Danica Porobic, Iraklis Psaroudakis

How to stop under-utilization and love multicores

Details
Discussion Comments: 0
Verification: Authors have not verified information

Fun with hardware transactional memory

Maurice Herlihy

Fun with hardware transactional memory

Details
Discussion Comments: 0
Verification: Author has not verified information

H2O: a hands-free adaptive store

Ioannis Alagiannis, Stratos Idreos, Anastasia Ailamaki

H2O: a hands-free adaptive store

Details
Discussion Comments: 0
Verification: Authors have not verified information

Track join: distributed joins with minimal network traffic

Orestis Polychroniou, Rajkumar Sen, Kenneth A. Ross

Track join: distributed joins with minimal network traffic

Details
Discussion Comments: 0
Verification: Authors have not verified information

Stratified-sampling over social networks using MapReduce

Roy Levin, Yaron Kanza

Stratified-sampling over social networks using MapReduce

Details
Discussion Comments: 0
Verification: Authors have not verified information

Plan bouquets: query processing without selectivity estimation

Anshuman Dutt, Jayant R. Haritsa

Plan bouquets: query processing without selectivity estimation

Details
Discussion Comments: 0
Verification: Authors have not verified information

The next generation operational data historian for IoT based on Informix

Sheng Huang, Yaoliang Chen, Xiaoyan Chen, Kai Liu, Xiaomin Xu, Chen Wang, Kevin Brown, Inge Halilovic

The next generation operational data historian for IoT based on Informix

Details
Discussion Comments: 0
Verification: Authors have not verified information

A temporal context-aware model for user behavior modeling in social media systems

Hongzhi Yin, Bin Cui, Ling Chen, Zhiting Hu, Zi Huang

A temporal context-aware model for user behavior modeling in social media systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Leveraging compression in the Tableau data engine

Richard Michael Grantham Wesley, Pawel Terlecki

Leveraging compression in the Tableau data engine

Details
Discussion Comments: 0
Verification: Authors have not verified information

Efficient summarization framework for multi-attribute uncertain data

Jie Xu, Dmitri V. Kalashnikov, Sharad Mehrotra

Efficient summarization framework for multi-attribute uncertain data

Details
Discussion Comments: 0
Verification: Authors have not verified information

Reachability queries on large dynamic graphs: a total order approach

Andy Diwen Zhu, Wenqing Lin, Sibo Wang, Xiaokui Xiao

Reachability queries on large dynamic graphs: a total order approach

Details
Discussion Comments: 0
Verification: Authors have not verified information

Indexing on modern hardware: Hekaton and beyond

Justin J. Levandoski, David B. Lomet, Sudipta Sengupta, Adrian Birka, Cristian Diaconu

Indexing on modern hardware: Hekaton and beyond

Details
Discussion Comments: 0
Verification: Authors have not verified information

HYDRA: large-scale social identity linkage via heterogeneous behavior modeling

Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, Ramayya Krishnan

HYDRA: large-scale social identity linkage via heterogeneous behavior modeling

Details
Discussion Comments: 0
Verification: Authors have not verified information

Dynamically optimizing queries over large scale data platforms

Konstantinos Karanasos, Andrey Balmin, Marcel Kutsch, Fatma Özcan, Vuk Ercegovac, Chunyang Xia, Jesse Jackson

Dynamically optimizing queries over large scale data platforms

Details
Discussion Comments: 0
Verification: Authors have not verified information

Explore-by-example: an automatic query steering framework for interactive data exploration

Kyriaki Dimitriadou, Olga Papaemmanouil, Yanlei Diao

Explore-by-example: an automatic query steering framework for interactive data exploration

Details
Discussion Comments: 0
Verification: Authors have not verified information

Fast database restarts at Facebook

Aakash Goel, Bhuwan Chopra, Ciprian Gerea, Dhruv Mátáni, Josh Metzler, Fahim Ul Haq, Janet L. Wiener

Fast database restarts at Facebook

Details
Discussion Comments: 0
Verification: Authors have not verified information

Corleone: hands-off crowdsourcing for entity matching

Chaitanya Gokhale, Sanjib Das, AnHai Doan, Jeffrey F. Naughton, Narasimhan Rampalli, Jude W. Shavlik, Xiaojin Zhu

Corleone: hands-off crowdsourcing for entity matching

Details
Discussion Comments: 0
Verification: Authors have not verified information

Secure query processing with data interoperability in a cloud database environment

Wai Kit Wong, Ben Kao, David Wai-Lok Cheung, Rongbin Li, Siu-Ming Yiu

Secure query processing with data interoperability in a cloud database environment

Details
Discussion Comments: 0
Verification: Authors have not verified information

Fast and unified local search for random walk based k-nearest-neighbor query in large graphs

Yubao Wu, Ruoming Jin, Xiang Zhang

Fast and unified local search for random walk based k-nearest-neighbor query in large graphs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Parallel I/O aware query optimization

Pedram Ghodsnia, Ivan T. Bowman, Anisoara Nica

Parallel I/O aware query optimization

Details
Discussion Comments: 0
Verification: Authors have not verified information

Characterizing and selecting fresh data sources

Theodoros Rekatsinas, Xin Luna Dong, Divesh Srivastava

Characterizing and selecting fresh data sources

Details
Discussion Comments: 0
Verification: Authors have not verified information

Lazy evaluation of transactions in database systems

Jose M. Faleiro, Alexander Thomson, Daniel J. Abadi

Lazy evaluation of transactions in database systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

BabbleFlow: a translator for analytic data flow programs

Petar Jovanovic, Alkis Simitsis, Kevin Wilkinson

BabbleFlow: a translator for analytic data flow programs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Aggregate estimation over a microblog platform

Saravanan Thirumuruganathan, Nan Zhang, Vagelis Hristidis, Gautam Das

Aggregate estimation over a microblog platform

Details
Discussion Comments: 0
Verification: Authors have not verified information

Global immutable region computation

Jilian Zhang, Kyriakos Mouratidis, HweeHwa Pang

Global immutable region computation

Details
Discussion Comments: 0
Verification: Authors have not verified information

DoomDB: kill the query

Carsten Binnig, Abdallah Salama, Erfan Zamanian

DoomDB: kill the query

Details
Discussion Comments: 0
Verification: Authors have not verified information

Natural language question answering over RDF: a graph data driven approach

Lei Zou, Ruizhe Huang, Haixun Wang, Jeffrey Xu Yu, Wenqiang He, Dongyan Zhao

Natural language question answering over RDF: a graph data driven approach

Details
Discussion Comments: 0
Verification: Authors have not verified information

A probabilistic model for linking named entities in web text with heterogeneous information networks

Wei Shen, Jiawei Han, Jianyong Wang

A probabilistic model for linking named entities in web text with heterogeneous information networks

Details
Discussion Comments: 0
Verification: Authors have not verified information

Are we experiencing a big data bubble?

Fatma Özcan, Nesime Tatbul, Daniel J. Abadi, Marcel Kornacker, C. Mohan, Karthik Ramasamy, Janet L. Wiener

Are we experiencing a big data bubble?

Details
Discussion Comments: 0
Verification: Authors have not verified information

Querying k-truss community in large and dynamic graphs

Xin Huang, Hong Cheng, Lu Qin, Wentao Tian, Jeffrey Xu Yu

Querying k-truss community in large and dynamic graphs

Details
Discussion Comments: 0
Verification: Authors have not verified information

How I learned to stop worrying and love compilers

Eric Sedlar

How I learned to stop worrying and love compilers

Details
Discussion Comments: 0
Verification: Author has not verified information

Querying virtual hierarchies using virtual prefix-based numbers

Curtis E. Dyreson, Sourav S. Bhowmick, Ryan Grapp

Querying virtual hierarchies using virtual prefix-based numbers

Details
Discussion Comments: 0
Verification: Authors have not verified information

JSON data management: supporting schema-less development in RDBMS

Zhen Hua Liu, Beda Christoph Hammerschmidt, Doug McMahon

JSON data management: supporting schema-less development in RDBMS

Details
Author Comments:
Discussion Comments: 0
Sharing: Other
Verification: Authors have verified information

Fusing data with correlations

Ravali Pochampally, Anish Das Sarma, Xin Luna Dong, Alexandra Meliou, Divesh Srivastava

Fusing data with correlations

Details
Discussion Comments: 0
Verification: Authors have not verified information

Descriptive and prescriptive data cleaning

Anup Chalamalla, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti

Descriptive and prescriptive data cleaning

Details
Discussion Comments: 0
Verification: Authors have not verified information

TAREEG: a MapReduce-based web service for extracting spatial data from OpenStreetMap

Louai Alarabi, Ahmed Eldawy, Rami Alghamdi, Mohamed F. Mokbel

TAREEG: a MapReduce-based web service for extracting spatial data from OpenStreetMap

Details
Discussion Comments: 0
Verification: Authors have not verified information

Should we all be teaching "intro to data science" instead of "intro to databases"?

Bill Howe, Michael J. Franklin, Juliana Freire, James Frew, Tim Kraska, Raghu Ramakrishnan

Should we all be teaching "intro to data science" instead of "intro to databases"?

Details
Discussion Comments: 0
Verification: Authors have not verified information

Robust set reconciliation

Di Chen, Christian Konrad, Ke Yi, Wei Yu, Qin Zhang

Robust set reconciliation

Details
Discussion Comments: 0
Verification: Authors have not verified information

Efficient top-K SimRank-based similarity join

Wenbo Tao, Guoliang Li

Efficient top-K SimRank-based similarity join

Details
Discussion Comments: 0
Verification: Authors have not verified information

Partial results in database systems

Willis Lang, Rimma V. Nehme, Eric Robinson, Jeffrey F. Naughton

Partial results in database systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Towards dependable data repairing with fixing rules

Jiannan Wang, Nan Tang

Towards dependable data repairing with fixing rules

Details
Discussion Comments: 0
Verification: Authors have not verified information

Anti-combining for MapReduce

Alper Okcan, Mirek Riedewald

Anti-combining for MapReduce

Details
Discussion Comments: 0
Verification: Authors have not verified information

JECB: a join-extension, code-based approach to OLTP data partitioning

Khai Q. Tran, Jeffrey F. Naughton, Bruhathi Sundarmurthy, Dimitris Tsirogiannis

JECB: a join-extension, code-based approach to OLTP data partitioning

Details
Discussion Comments: 0
Verification: Authors have not verified information

Sinew: a SQL system for multi-structured data

Daniel Tahara, Thaddeus Diamond, Daniel J. Abadi

Sinew: a SQL system for multi-structured data

Details
Discussion Comments: 0
Verification: Authors have not verified information

Querying encrypted data

Arvind Arasu, Ken Eguro, Raghav Kaushik, Ravishankar Ramamurthy

Querying encrypted data

Details
Discussion Comments: 0
Verification: Authors have not verified information

Indexing for interactive exploration of big data series

Kostas Zoumpatianos, Stratos Idreos, Themis Palpanas

Indexing for interactive exploration of big data series

Details
Discussion Comments: 0
Verification: Authors have not verified information

Knowledge expansion over probabilistic knowledge bases

Yang Chen, Daisy Zhe Wang

Knowledge expansion over probabilistic knowledge bases

Details
Discussion Comments: 0
Verification: Authors have not verified information

Explainable security for relational databases

Gabriel Bender, Lucja Kot, Johannes Gehrke

Explainable security for relational databases

Details
Discussion Comments: 0
Verification: Authors have not verified information

Tracking set correlations at large scale

Foteini Alvanaki, Sebastian Michel

Tracking set correlations at large scale

Details
Discussion Comments: 0
Verification: Authors have not verified information

NLyze: interactive programming by natural language for spreadsheet data analysis and manipulation

Sumit Gulwani, Mark Marron

NLyze: interactive programming by natural language for spreadsheet data analysis and manipulation

Details
Discussion Comments: 0
Verification: Authors have not verified information

Searching with XQ: the exemplar query search engine

Davide Mottin, Matteo Lissandrini, Yannis Velegrakis, Themis Palpanas

Searching with XQ: the exemplar query search engine

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Localizing anomalous changes in time-evolving graphs

Kumar Sricharan, Kamalika Das

Localizing anomalous changes in time-evolving graphs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Hypersphere dominance: an optimal approach

Cheng Long, Raymond Chi-Wing Wong, Bin Zhang, Min Xie

Hypersphere dominance: an optimal approach

Details
Discussion Comments: 0
Verification: Authors have not verified information

Online optimization and fair costing for dynamic data sharing in a cloud data market

Ziyang Liu, Hakan Hacigümüs

Online optimization and fair costing for dynamic data sharing in a cloud data market

Details
Discussion Comments: 0
Verification: Authors have not verified information

The analytical bootstrap: a new method for fast error estimation in approximate query processing

Kai Zeng, Shi Gao, Barzan Mozafari, Carlo Zaniolo

The analytical bootstrap: a new method for fast error estimation in approximate query processing

Details
Discussion Comments: 0
Verification: Authors have not verified information