What's New

      Call for Paper  


      Online Submission


      Important Dates



      Program Committee

      Program Schedule


      Keynote Speeches

      Accepted Papers



      Student Travel Award

      Visa to USA


      About San Francisco 



IEEE BigData 2013 Main Conference

1. Big Data Foundations

Regular Papers:

BigD244 "On-Line Learning Gossip Algorithm in Multi-Agent Systems with Local Decision Rules"
Stephan Clemencon, Pascal Bianchi, Gemma Morral, and Jeremie Jakubowicz

BigD342 "Labeled $N$-gram Topic Model"
Noriaki Kawamae

BigD358 "Communication Efficient Algorithms for Fundamental Big Data Problems"
Peter Sanders, Ingo Müller, and Sebastian Schlag

BigD399 "Map-Based Graph Analysis on MapReduce"
Upa Gupta and Leonidas Fegaras

Short Papers:

BigD216 "P-DOT: A Model of Computation for Big Data"
Tao Luo, Yin Liao, Yunquan Zhang, and Guoliang Chen

BigD229 "Transparent Composite Model For Large Scale Image/Video Processing"
Enhui Yang and Xiang Yu

BigD319 "Elastic Algorithms for Guaranteeing Quality Monotonicity in Big Data Mining"
Rui Han, Lei Nie, Moustafa M. Ghanem, and Yike Guo

2. Big Data Infrastructure

Regular Papers:

BigD279 "HFSP: Size-based Scheduling for Hadoop"
Mario Pastorelli, Antonio Barbuzzi, Damiano Carra, Matteo Dell'Amico, and Pietro Michiardi

BigD314 "An Evaluation Study of BigData Frameworks for Graph Processing"
Benedikt Elser and Alberto Montresor

BigD331 "Storing and manipulating environmental big data with JASMIN"
Bryan Lawrence, Victoria Bennett, Jonathan Churchill, Martin Juckes, Philip Kershaw, Stephen Pascoe, Sam Pepler, Matt Pritchard, and Ag Stephens

BigD345 "Efficient Gear-shifting for a Power-proportional Distributed Data-placement Method"
Hieu Hanh Le, Satoshi Hikida, and Haruo Yokota

BigD354 "Agrios: A Hybrid Approach to Big Array Analytics"
Patrick Leyshock, David Maier, and Kristin Tufte

BigD413 "Building a Generic Platform for Big Sensor Data Application"
Chun-Hsiang Lee, David Birch, Chao Wu, Dilshan Silva, Orestis Tsinalis, Yang Li, Shulin Yan, Moustafa Ghanem, and Yike Guo
BigD455 "Locality-driven High-level I/O Aggregation for Processing Scientific Datasets"
Jialin Liu, Bradly Crysler, and Yong Chen

Short Papers:

BigD215 "clusiVAT: A Mixed Visual/Numerical Clustering Algorithm for Big Data"
Dheeraj Kumar, James Bezdek, Sutharshan Rajasegarar, Marimuthu Palaniswami, Christopher Leckie, and Timothy Havens

BigD225 "Hardware acceleration of Hadoop MapReduce"
Toshimori Honjo and Kazuki Oikawa

BigD285 "Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor"
Mian Lu, Lei Zhang, Huynh Phung Huynh, Zhongliang Ong, Yun Liang, Bingsheng He, Rick Siow Mong Goh, and Richard Huynh

BigD287 "On the Performance and Energy Efficiency of Hadoop Deployment Models"
Eugen Feller, Lavanya Ramakrishnan, and Christine Morin

BigD289 "Optimizing Throughput on Guaranteed-Bandwidth WAN Networks for the Large Synoptic Survey Telescope (LSST)"
Mike Freemon

BigD298 "Feliss: Flexible distributed computing framework with light-weight checkpointing"
Takuya Araki, Kazuyo Narita, and Hiroshi Tamano

BigD339 "Algebraic Dataflows for Big Data Analysis"
Jonas Dias, Eduardo Ogasawara, Daniel de Oliveira, Fabio Porto, Patrick Valduriez, and Marta Mattoso

BigD355 "Scalable and Robust Key Group Size Estimation For Reducer Load Balancing in MapReduce"
Wei Yan, Yuan Xue, and Bradley Malin

BigD359 "Robot: An Efficient Model For Big Data Storage Systems Based On Erasure Coding"
Chao Yin, Jianzong Wang, Changsheng Xie, Jiguang Wan, and Changlin Long

BigD360 "Multilevel Active Storage for Big Data Applications in High Performance Computing"
Chao Chen and Yong Chen

BigD363 "GPU Accelerated Item-Based Collaborative Filtering for Big-Data Applications"
Chandima HewaNadungodage, Yuni Xia, John Lee, Myungcheol Lee, and Choon Seo Park

BigD390 "GPU-Accelerated Adaptive Compression Framework for Genomics Data"

3. Big Data Management

Regular Papers:

BigD217 "Iteration Aware Prefetching For Unstructured Grids"
Oyindamola Akande and Philip Rhodes

BigD245 "Measuring Inter-Site Engagement"
Elad Yom-Tov, Mounia Lalmas, Ricardo Baeza-Yates, Georges Dupret, Janette Lehmann, and Pinar Donmez

BigD249 "A Selective Checkpointing Mechanism for Query Plans in a Parallel Database System"
Ting Chen and Kenjiro Taura

BigD253 "CORE: Cross-Object Redundancy for Efficient Data Repair in Storage Systems"
Kyumars Sheykh Esmaili, Lluis PamiesJuarez, and Anwitaman Datta

BigD270 "H2RDF+: High-performance Distributed Joins over Large-scale RDF Graphs"
Nikolaos Papailiou, Ioannis Konstantinou, Dimitrios Tsoumakos, Panagiotis Karras, and Nectarios Koziris

BigD312 "Direct QR factorizations for tall-and-skinny matrices in MapReduce architectures"
Austin Benson, David Gleich, and James Demmel

BigD338 "Adaptive File Management for Scientific Workflows on the Azure Cloud"
Radu Tudoran, Alexandru Costan, Ramin Rad Rezai, Goetz Brasche, and Gabriel Antoniu

BigD407 "Model-View Sensor Data Management in the Cloud"
Tian Guo, Thanasis G. Papaioannou, and Karl Aberer

BigD423 "Spatio-temporal Indexing in Non-relational Distributed Databases"
Anthony Fox, Chris Eichelberger, James Hughes, and Skylar Lyon

Short Papers:

BigD243 "Scientific Discovery through Weighted Sampling"
Lefteris Sidirourgos, Martin Kersten, and Peter Boncz

BigD278 "Scalable Data Citation in Dynamic, Large Databases: Model and Reference Implementation"
Stefan Pröll and Andreas Rauber

BigD294 "On the Use of Shared Storage in Shared-Nothing Environments"
Krishnaraj Ravindranathan, Aleksander Khasymski, Guanying Wang, Ali Butt, and Gaurav Makkar

BigD344 "Self-Adaptive Event Recognition for Intelligent Transport Management"
Alexander Artikis, Matthias Weidlich, Avigdor Gal, Vana Kalogeraki, and Dimitrios Gunopoulos

BigD365 "Improving Floating Point Compression through Binary Masks"
Leonardo Bautista Gomez and Franck Cappello

BigD373 "Using Pattern-Models to Guide SSD Deployment for Big Data in HPC systems"
Junjie Chen, Yong Chen, and Philip C. Roth

BigD375 "Robust Crowdsourced Learning"
Zhiquan Liu, Luo Luo, and Wu-Jun Li

BigD384 "imGraph: A distributed in-memory graph database"
Salim Jouili and Aldemar Reynaga

BigD445 "Segmented Analysis for Reducing Data Movement"
Jialin Liu, Surendra Byna, and Yong Chen

BigD447 "Knowledge Cubes - A Proposal for Scalable and Semantically-Guided Management of Big Data"
Amgad Madkour, Walid Aref, and Saleh Basalamah

4. Big Data Search and Mining

Regular Papers:

BigD211 "Continuous Hyperparameter Optimization for Large-scale Recommender Systems"
Simon Chan, Philip Treleaven, and Licia Capra

BigD220 "4S: Scalable Subspace Search Scheme"
Hoang Vu Nguyen, Emmanuel Müller, and Klemens Böhm

BigD254 "Computing Betweenness Centrality in External Memory"
Lars Arge, Michael Goodrich, and Freek van Walderveen

BigD260 "A Parallel Computing Platform for Training Large Scale Neural Networks"
Rong Gu, Furao Shen, and Yihua Huang

BigD267 "Self-Tuned Kernel Spectral Clustering for Large Scale Networks"
Raghvendra Mall, Rocco Langone, and Johan Suykens

BigD282 "NUMA-optimized Parallel Breadth-first Search on Multicore Single-node System"
Yuichiro Yasui, Katsuki Fujisawa, and Kazushige Goto

BigD315 "A Distributed Vertex-Centric Approach for Pattern Matching in Massive Graphs"
Arash Fard, M. Usman Nisar, Lakshmish Ramaswamy, John A. Miller, and Matthew Saltz

BigD318 "Fast Scalable Selection Algorithms for Large Scale Data"
Lee Thompson, Weijia Xu, and Daniel Miranker

BigD330 "An NML-based Model Selection Criterion for General Relational Data Modeling"
Yoshiki Sakai and Kenji Yamanishi

BigD334 "Parallel Matrix Factorization for Binary Response"
Rajiv Khanna, Liang Zhang, Deepak Agarwal, and Bee-Chung Chen

BigD400 "CallCab: A Unified Recommendation System for Carpooling and Regular Taxicab Services"
Desheng Zhang and Tian He

BigD403 "Top-K aggregation over a Large Graph Using Shared-Nothing Systems"
Abhirup Chakraborty

BigD410 "Distributed Confidence-Weighted Classification on MapReduce"
Nemanja Djuric, Mihajlo Grbovic, and Slobodan Vucetic

BigD411 "Scalable Context-Aware Role Mining with MapReduce"
Zhiwei Yu, Raymond Wong, and Chi-Hung Chi

Short Papers:

BigD212 "Elver: Recommending Facebook Pages in Cold Start Situation Without Content Features"
Yusheng Xie, Alok Choudhary, Zhengzhang Chen, and Ankit Agrawal

BigD233 "Massively Scalable Near Duplicate Detection in Streams of Documents using MDSH"
Paul Bogen, Christopher Symons, Amber McKenzie, Robert Patton, and Rob Gillen

BigD241 "Incremental Algorithms for Network Management and Analysis based on Closeness Centrality"
Ahmet Erdem Sariyuce, Kamer Kaya, Erik Saule, and Umit V. Catalyurek

BigD247 "Classification of Big Velocity Data via Cross-Domain Canonical Correlation Analysis"
Bo Zhang and Zhongzhi Shi

BigD248 "A Distributed Tree Data Structure For Real-Time OLAP On Cloud Architectures"
Frank Dehne, Quan Kong, Andrew Rau-Chaplin, Hamidreza Zaboli, and Rebecca Zhou

BigD284 "DL-MPI: Enabling Data Locality Computation for MPI-based Data-Intensive Applications"
Jiangling Yin, Andrew Foran, and Jun Wang

BigD297 "Sparse Poisson Coding for High Dimensional Document Clustering"
Chenxia Wu, Haiqin Yang, Jianke Zhu, Jiemi Zhang, Irwin King, and Michael R. Lyu

BigD308 "Fast OLAP Query Execution in Main Memory on Large Data in a Cluster"
Martin Weidner, Jonathan Dees, and Peter Sanders

BigD323 "Group-Scheme: A Universal SIMD-based Compression Scheme"
Xudong Zhang, Xin Zhao, Dongdong Shan, and Hongfei Yan

BigD335 "Efficient Large Graph Pattern Mining for Big Data in the Cloud"
Chun-Chieh Chen, Kuan-Wei Lee, Chih-Chieh Chang, De-Nian Yang, and Ming-Syan Chen

BigD350 "A Streaming Partitioning Approach to Processing Large Scale Distributed Graph Datasets"
Rui Wang and Kenneth Chiu

BigD361 "Scalable Distributed Event Detection for Twitter"
Richard McCreadie, Craig Macdonald, Iadh Ounis, Miles Osborne, and Sasa Petrovic

BigD366 "Analysis of GSM calls data for understanding user mobility behavior"
Chiara Renso, Barbara Furletti, Lorenzo Gabrielli, and Salvatore Rinzivillo

BigD402 "Personalizing Search: A Case for Scaling Concurrency in Multi-Tenant Semantic Web Search Systems over Large RDF Datasets"
HAIZHOU FU, Hyeongsik Kim, and Kemafor Anyanwu

BigD417 "A Hypergraph-Partitioned Vertex Programming Approach for Large-scale Consensus Optimization"
Hui Miao, Xiangyang Liu, Bert Huang, and Lise Getoor

BigD448 "A Higher-Order Data Flow Model for Heterogeneous Big Data"
Simon Price and Peter Flach

BigD465 "Parallel Subgroup Discovery on Computing Clusters -- First Results"
Daniel Trabold and Henrik Grosskreutz

5. Big Data Security & Privacy

Regular Papers:

BigD269 "DP-WHERE: Differentially Private Modeling of Human Mobility"
Darakhshan Mir, Sibren Isaacman, Ramón Cáceres, Margaret Martonosi, and Rebecca Wright

BigD305 "Malicious URLs Filtering - A Big Data Application"
Min-Sheng Lin, Chien-Yi Chiu, Yuh-Jye Lee, and Hsing-Kuo Pao

BigD328 "Zero-Knowledge Private Graph Summarization"
Maryam Shoaran, Alex Thomo, and Jens Weber

Short Papers:

BigD230 "Scalable Network Traffic Visualization Using Compressed Graphs"
Lei Shi, Qi Liao, and Xiaohua Sun

BigD391 "Breaking the Arc: RIsk Control for Big Data"
Duncan Hodges and Sadie Creese

6. Big Data Applications

Regular Papers:

BigD288 "The BTWorld Use Case for Big Data Analytics: Description, MapReduce Logical Workflow, and Empirical Evaluation"
Tim Hegeman, Bogdan Ghi?, Mihai Capota, Jan Hidders, Dick Epema, and Alexandru Iosup

BigD311 "Modeling Heterogeneous Time Series Dynamics to Profile Big Sensor Data in Complex Physical Systems"
Bin Liu, Haifeng Chen, Abhishek Sharma, Guofei Jiang, and Hui Xiong

BigD332 "Efficiently Extracting Frequent Subgraphs using MapReduce"
Wei Lu, Gang Chen, Anthony Tung, and Feng Zhao

BigD341 "Explaining the Product Range Effect in Purchase Data"
Diego Pennacchioli, Michele Coscia, Salvatore Rinzivillo, Dino Pedreschi, and Fosca Giannotti

BigD353 "Large-scale Predictive Analytics for Real Time Energy Management"
Natasha Balac, Tamara Sipes, Nicole Wolter, Kenneth Nunes, Robert Sinkovits, and Homa Karimabadi

BigD372 "Parallel Deterministic Annealing Clustering and its Application to LC-MS Data Analysis"
Geoffrey Fox, D. R. Mani, and Saumyadipta Pyne

BigD378 "Terabyte-scale image similarity search: experience and best practice"
Diana Moise, Denis Shestakov, Gylfi Gudmundsson, and Laurent Amsaleg

BigD437 "Demand Response Targeting Using Big Data Analytics"
Jungsuk Kwac and Ram Rajagopal

Short Papers:

BigD238 "Predicting flight arrival times with a multistage model"
Gabor Takacs

BigD252 "HIG – An In-memory Database Platform Enabling Real-time Analyses of Genome Data"
Matthieu-P. Schapranow and Hasso Plattner

BigD266 "Real-time streaming mobility analytics"
Andras Garzo, Csaba Sidlo, Daniel Tahara, Erik Wyatt, and Andras Bencur

BigD320 "QuPARA: Query-Driven Large-Scale Portfolio Aggregate Risk Analysis on MapReduce"
Andrew Rau-Chaplin, Blesson Varghese, Duane Wilson, Zhimin Yao, and Norbert Zeh

BigD405 "Constructing User Profiles from Social Media Data"
Mauricio Hernandez, Kirsten Hildrum, Prateek Jain, Chitra Venkatramani, Rohit Wagle, Bogdan Alexe, and Ioana Roxana Stanoi

BigD431 "CloudRS: An Error Correction Algorithm of High-Throughput Sequencing Data based on Scalable Framework"
Chien-Chih Chen, Yu-Jung Chang, Wei-Chun Chung, Der-Tsai Lee, and Jan-Ming Ho

BigD444 "Building dynamic thermal profiles of energy consumption for individuals and neighborhoods"
Adrian Albert and Ram Rajagopal

Industry and Government Program

N203  Peter Bajcsy, Antoine Vandecreme, Julien Amelot, Phuong Nguyen, Joe Chalfoun, and Mary Brady, Terabyte-sized Image Computations on Hadoop Cluster Platforms
N206 Ron Begleiter, Yuval Elovici, Yona Hollander, Ori Mendelson, Lior Rokach, and Roi Saltzman, A Fast and Scalable Method for Threat Detection in Large-scale DNS Logs
N207  Matthew Hayes and Sam Shah, Hourglass: a Library for Incremental Processing on Hadoop
N209 Qi Guo, Yan Li, Tao Liu, Kun Wang, Guancheng Chen, Xiaoming Bao, and Wentao Tang, Correlation-based Performance Analysis for Full-System MapReduce Optimization
N217 Mihajlo Grbovic, Jon Malkin, and Hirakendu Das, Large Scale Ad Latency Analysis
N218 Alessandro Morari, Vito Giovanni Castellana, Oreste Villa, David Haglin, John Feo, Jesse Weaver, and Antonino Tumeo, Accelerating semantic graph databases on commodity clusters
N219 Peter Lubell-Doughtie and Jon Sondag, Practical Distributed Classification using the Alternating Direction Method of Multipliers Algorithm
N225  Varun Sharma, Scaling Deep Social Feeds at Pinterest
N226  Thibaud Chardonnens, Philippe Cudre-Mauroux, Martin Grund, and Benoit Perroud, Big Data Analytics on High Velocity Streams: A Case Study


Last update: 12 Aug 2012