Description: Description: Description: D:\Academic\TsG\Conferences\conference pre works\bigdata2014\BigData 2014 map network drive\whitehouse.png


      What's New

      Important Dates

      Online Submission


      Special Session



      Program Committee

      Program Schedule

      Keynote Speeches

      Panel with Program Directors  


      Doctoral Symposium


      Accepted Papers


      Student Travel Award

      Visa to USA

      Travel Information

      About Washington DC


Description: Description: Description: Description: Description: Description: Description: Description: D:\Academic\TsG\Conferences\conference pre works\bigdata2014\BigData 2014 map network drive\ieee_mb_blue.jpg          

Description: Description: Description: Description: Description: Description: Description: Description: D:\Academic\TsG\Conferences\conference pre works\bigdata2014\BigData 2014 map network drive\image_gallery.gif

































Accepted Papers                                                                                    


Regular Papers:

BigD210 Maria Christoforaki and Torsten Suel, Learning to Estimate Pairwise Distances in Large Graphs

BigD215 Dongfang Zhao, Virtual Chunks: On Supporting Random Accesses to Scientific Data in Compressible Storage Systems

BigD216 Dongfang Zhao, Towards Supporting Data-Intensive Scientific Applications on Extreme-Scale High Performance Computing Systems

BigD234 Anh Thu Vu, Gianmarco De Francisci Morales, Joao Gama, and Albert Bifet, Distributed Adaptive Model Rules for Mining Big Data Streams

BigD244 Zhichuan Huang, Hongyao Luo, David Skoda, Ting Zhu, and Yu Gu, E-Sketch: Gathering Large-scale Energy Consumption Data Based on Consumption Patterns

BigD253 Dorit S. Hochbaum and Philipp Baumann, Sparse computation for large-scale data mining

BigD258 Arun Maiya, Topic Similarity Networks: Visual Analytics for Large Document Sets

BigD260 Lee Kellogg, Brian Ruttenberg, Alison O'Connor, Michael Howard, and Avi Pfeffer, Hierarchical Management of Large-Scale Malware Data

BigD271 Teng Wang, Sarp Oral, Yandong Wang, Brad Settlemyer, Scott Atchley, and Weikuan Yu, BurstMem: A High-Performance Burst Buffer System for Scientific Applications

BigD277 Sotiris Tasoulis, Lu Cheng, Niko Välimäki, Nicholas Croucher, Simon Harris, William Hanage, Teemu Roos, and Jukka Corander, Random Projection Based Clustering for Population Genomics

BigD283 Mohammed Nazim Feroz and Susan Mengel, Examination of Data, Rule Generation and Detection of Phishing URLs using Online Logistic Regression

BigD294 Yun Shen and Olivier Thonnard, MR-TRIAGE: Scalable Multi-Criteria Clustering for Big Data Security Intelligence Applications

BigD301 Mayank Daga, Mark Nutter, and Mitesh Meswani, Efficient Breadth-First Search on a Heterogeneous Processor

BigD303 Chad Steed, Katherine Evans, John Harney, Brian Jewell, Galen Shipman, Brian Smith, Peter Thornton, and Dean Williams, Web-based Visual Analytics for Extreme Scale Climate Science

BigD304 Ryan Compton, David Jurgens, and David Allen, Geotagging One Hundred Million Twitter Accounts with Total Variation Minimization

BigD306 Li-Yan Yuan, Lengdong Wu, and Jia-Huai You, BASIC: an Alternative to BASE for Large-Scale Data Management System

BigD313 Ruben Mayer, Boris Koldehofe, and Kurt Rothermel, Meeting Predictable Buffer Limits in the Parallel Execution of Event Processing Operators

BigD316 Xiaomeng Zhao, Huadong Ma, Haitao Zhang, Yi Tang, and Guangping Fu, Metadata Extraction and Correction for Large-Scale Traffic Surveillance Videos

BigD318 Junwhan Kim, Partial Rollback-based Scheduling on In-memory Transactional Data Grids

BigD336 Ke Tao, Claudia Hauff, Geert-Jan Houben, Fabian Abel, and Guido Wachsmuth, Facilitating Twitter Data Analytics: Platform, Language, and Functionality

BigD337 Mohan Yang and Carlo Zaniolo, Main Memory Evaluation of Recursive Queries on Multicore Machines

BigD338 Masahiko Itoh, Daisaku Yokoyama, Masashi Toyoda, Yoshimitsu Tomita, Satoshi Kawamura, and Masaru Kitsuregawa, Visual Fusion of Mega-City Big Data: An Application to Traffic and Tweets Data Analysis of Metro Passengers

BigD357 Alekh Jindal and Samuel Madden, GRAPHiQL: A Graph Intuitive Query Language for Relational Databases

BigD360 Daniela Ushizima, Talita Perciano, Harinarayan Krishnan, Burlen Loring, Hrishikesh Bale, Dilworth Parkinson, and James Sethian, Structure Recognition from High Resolution Images of Ceramic Composites

BigD379 Ronak Etemadpour, Paul Murray, and Angus Forbes, Evaluating Density-based Motion for Big Data Visual Analytics

BigD380 sufeng Niu, guangyu yang, nilim sarma, Melissa Smith, Pradip Srimani, and Feng Luo, Combining Hadoop and GPU to Preprocess Large Affymetrix Microarray Data

BigD382 Ulf Johansson, Cecilia Sönströd, and Henrik Linusson, Interpretable Streaming Regression Models with Local Performance Guarantees

BigD383 Wenrong Zeng, Yuhao Yang, and Bo Luo, Using Data Content to Assist Access Control for Large-Scale Content-Centric Databases

BigD391 Ming-Syan Chen, Pei-Ling Chen, and Chung-Kuang Chou, Distributed Algorithms for k-truss Decomposition

BigD395 George Slota, Siva Rajamanickam, and Kamesh Madduri, PULP: Scalable Multi-Objective Multi-Constraint Partitioning for Small-World Networks

BigD398 Arash Fard, Satya Manda, Lakshmish Ramaswamy, and John Miller, Effective Caching Techniques for Accelerating Pattern Matching Queries

BigD402 Shigeru Maya, Kai Morino, and Kenji Yamanishi, Predicting Glaucoma Progression using Multi-task Learning with Heterogeneous Features

BigD407 Dong Dai, Yong Chen, Dries Kimpe, and Rob Ross, Provenance-Based Object Storage Prediction Scheme for Scientific Big Data Applications

BigD419 Diana Palsetia, Mostofa Patwary, William Hendrix, Ankit Agrawal, and Alok Choudhary, Clique Guided Community Detection

BigD421 Yu Zhang, Stephen Wistar, Jose A. Piedra-Fernández, Jia Li, Michael Steinberg, and James Z. Wang, Locating Visual Storm Signatures from Satellite Images

BigD423 Hao Chen, Sastry Duri, Vasanth Bala, Nilton Bila, Canturk Isci, and Ayse Coskun, Detecting and Identifying System Changes in the Cloud via Discovery by Example

BigD426 Kyungho Jeon, Sharath Chandrashekhara, Feng Shen, Shikhar Mehra, Oliver Kennedy, and Steven Ko, PigOut: Making Multiple Hadoop Clusters to Work Together

BigD432 Marc Frincu, Charalampos Chelmis, Muhammad Noor, and Viktor Prasanna, Accurate and Efficient Selection of the Best Consumption Prediction Method in Smart Grids

BigD434 Zhisong Fu, Harish Dasari, Martin Berzins, and Bryan Thompson, Parallel Breadth First Search on GPU Clusters

BigD436 Songchang Jin, Jiawei Zhang, Philip S. Yu, Shuqiang Yang, and Aiping Li, Synergistic Partitioning in Multiple Large Scale Social Networks

BigD441 Todd Bodnar, Conrad Tucker, Kenneth Hopkinson, and Sven Bilén, Increasing the Veracity of Event Detection on Social Media Networks Through User Trust Modeling

BigD444 Hideyuki Shamoto, Koichi Shirahata, Aleksandr Drozd, Hitoshi Sato, and Satoshi Matsuoka, Large-scale Distributed Sorting for GPU-based Heterogeneous Supercomputers

BigD445 Alice Marascu, Pascal Pompey, Eric Bouillet, Michael Wurst, Olivier Verscheure, Martin Grund, and Philippe Cudre-Mauroux, TRISTAN: Real-Time Analytics on Massive Time Series Using Sparse Dictionary Compression

BigD451 Hao Li, Di Yu, Anand Kumar, and Yicheng Tu, Performance Modeling in CUDA Streams - A Means for High-Throughput Data Processing

BigD454 Chieh-Yen Lin, Cheng-Hao Tsai, Ching-Pei Lee, and Chih-Jen Lin, Large-scale Logistic Regression and Linear Support Vector Machines Using Spark

BigD455 Keita Iwabuchi, Hitoshi Sato, Yuichiro Yasui, Katsuki Fujisawa, and Satoshi Matsuoka, NVM-based Hybrid BFS with Memory Efficient Data Structure

BigD460 Can Altinigneli, Bettina Konte, Dan Rujescu, Christian Boehm, and Claudia Plant, Identification of SNP Interactions Using Data-Parallel Primitives on GPUs

BigD465 Sushovan De, Yuheng Hu, Yi Chen, and Subbarao Kambhampati, BayesWipe: A Multimodal System for Data Cleaning and Consistent Query Answering on Structured Data

BigD471 Ke Wang, Xiaobing Zhou, Tonglin Li, michael lang, and Ioan Raicu, Optimizing Load Balancing and Data-Locality with Data-aware Scheduling



Short Papers:

BigD204 Mark Paddrik, Richard Haynes, Andrew Todd, William Scherer, and Peter Beling, The Role of Visual Analysis in the Regulation of Electronic Order Book Markets

BigD217 noriaki kawamae, Preferences over Time

BigD225 Shan Jiang and Chengxiang Zhai, Random Walks on Adjacency Graphs for Mining Lexical Relations from Big Text Data

BigD227 Magnus Almgren, Olaf Landsiedel, Marina Papatriantafilou, and Zhang Fu, Online Temporal-Spatial Analysis for Detection of Critical Events in Cyber-Physical Systems

BigD230 Xuejie Xiao, Jian Tang, Zhenhua Chen, Jielong Xu, and Chonggang Wang, A Cross-job Framework for MapReduce Scheduling

BigD232 Mathias Johanson, Stanislav Belenki, Jonas Jalminger, Magnus Fant, and Mats Gjertz, Big Automotive Data - Leveraging large volumes of data for knowledge-driven product development

BigD233 Yufei Han, Xiaolan Sha, Etta Grover-Silva, and Pietro Michiardi, On the Impact of Socio-economic Factors on Power Load Forecasting

BigD238 Jonathan Mugan, Ranga Chari, Laura Hitt, Eric McDermid, Marsha Sowell, Yuan Qu, and Thayne Coffman, Entity Resolution Using Inferred Relationships and Behavior

BigD239 JONG HOON AHNN, Toward Personalized and Scalable Voice-Enabled Services Powered by Big Data

BigD242 Jia-Chun Lin, Ming-Chang Lee, and Ramin Yahyapour, Scheduling MapReduce Tasks on Virtual MapReduce Clusters from a Tenant’s Perspective

BigD247 Rong Gu, Yihua Huang, and Wei Hu, Rainbow: A Distributed and Hierarchical RDF Triple Store with Dynamic Scalability

BigD252 Hong Yi, Michel Rasquin, Jun Fang, and Igor Bolotnov, In-Situ Visualization and Computational Steering for Large-Scale Simulation of Turbulent Flows in Complex Geometries

BigD259 Guo-Qiang Zhang, Wei Zhu, Mengmeng Sun, Shiqiang Tao, Olivier Bodenreider, and Licong Cui, MaPLE: A MapReduce Pipeline for Lattice-based Evaluation and Its Application to SNOMED CT

BigD264 Takatsugu Ono, Yotaro Konishi, Teruo Tanimoto, Noboru Iwamatsu, Takashi Miyoshi, and Jun Tanaka, FlexDAS: A Flexible Direct Attached Storage for I/O Intensive Applications

BigD270 Lena Mashayekhy, Mahyar Movahed Nejad, and Daniel Grosu, A Two-Sided Market Mechanism for Trading Big Data Computing Commodities

BigD284 Zhiyuan Lin, Minsuk Kahng, Kaeser Md. Sabrin, Duen Horng Chau, Ho Lee, and U Kang, MMap: Fast Billion-Scale Graph Computation on a PC via Memory Mapping

BigD287 Thibault Debatty, Pietro Michiardi, Olivier Thonnard, and Wim Mees, Building k-nn graphs from large text data

BigD288 Arian Bär, Alessandro Finamore, Pedro Casas, Lukasz Golab, and Marco Mellia, Large-Scale Network Traffic Monitoring with DBStream, a System for Rolling Big Data Analysis

BigD291 Bun Theang Ong, Komei Sugiura, and Koji Zettsu, Dynamic Pre-training of Deep Recurrent Neural Networks for Predicting Environmental Monitoring Data

BigD293 Stéphan Clémençon, Bertail Patrice, and Emilie Chautru, Scaling up M-estimation via sampling designs: the Horvitz-Thompson stochastic gradient descent

BigD296 Jose M. Abuin, Juan C. Pichel, Tomas F. Pena, Pablo Gamallo, and Marcos Garcia, Perldoop: Efficient Execution of Perl Scripts on Hadoop Clusters

BigD310 Dean Williams, Giri Palanisamy, Galen Shipman, Thomas Boden, and Jimmy Voyles, Department of Energy Strategic Roadmap for Earth System Science Data Integration

BigD311 Patrick Leyshock, David Maier, and Kristin Tufte, Minimizing Data Movement through Query Transformation

BigD312 Jason Anderson, Ken Kennedy, Linh Ngo, Andre Luckow, and Amy Apon, Synthetic Data Generation for the Internet of Things

BigD315 Diana Gudu, Marcus Hardt, and Achim Streit, Evaluating the Performance and Scalability of the Ceph Distributed Storage System

BigD324 Raju Balakrishnan and Rajesh Parech, Learning to Predict Subject-Line Opens for Large-Scale Email Marketing

BigD327 Jane Greenberg, Adrian Ogletree, Angela Murillo, Thomas Caruso, and Herbie Huang, Metadata Capital: Simulating the Predictive Value of Self-Generated Heatlh Information (SGHI)

BigD331 Vladimir Estivill-Castro, Md Zahidul Islam, and Peter Hough, Empowering users of social networks to assess their privacy risks

BigD333 Robert Pienta, Acar Tamersoy, Hanghang Tong, and Duen Horng Chau, Matching Approximate Patterns in Richly-Attributed Graphs

BigD339 Jungkyu Han and Min Luo, Bootstrapping K-means for Big data analysis

BigD343 Raghvendra Mall, Vilen Jumutc, Rocco Langone, and Johan Suykens, Representative Subsets For Big Data Learning using k-NN graphs

BigD346 Tara Babaie, Sanjay Chawla, and Sebastien Ardon, A Unified Approach to Network Anomaly Detection

BigD347 Jiang Li, Hideyuki Kawashima, and Osamu Tatebe, Incremental Window Aggregates over Array Database

BigD350 Daniel Fried, Mihai Surdeanu, Stephen Kobourov, Melanie Hingle, and Dane Bell, Analyzing the Language of Food on Social Media

BigD356 Rubing Duan, Towards Building and Evaluating a Personalized Location-Based Recommender System

BigD361 Ahsanul Haque, Swarup Chandra, Latifur Khan, and Charu Aggarwal, Distributed Adaptive Importance Sampling on Graphical Models using MapReduce

BigD362 Michel Roger, Yiqi Xu, and Ming Zhao, BigCache for Big-data Systems

BigD364 Evie Kassela, Christina Boumpouka, Ioannis Konstantinou, and Nectarios Koziris, Automated Workload-aware Elasticity of NoSQL Clusters in the Cloud

BigD365 Mark Lycett and Asmat Monaghan, Big Data: Myths, Misconceptions and Opportunities

BigD366 Wei-Chun Chung, Yu-Jung Chang, D. T. Lee, and Jan-Ming Ho, Using Geometric Structures to Improve the Error Correction Algorithm of High-Throughput Sequencing Data on MapReduce Framework

BigD376 Bo Liu, Erico Souza, Stan Matwin, and Marcin Sydow, Knowledge-based Clustering of Ship Trajectories Using Density-based Approach

BigD384 Oyindamola Akande and Philip Rhodes, Multilevel Partitioning of Large Unstructured Grids

BigD387 Ciro Donalek, S.G. Djorgovski, Scott Davidoff, Alex Cioc, Anwell Wang, Giuseppe Longo, Jeffrey S. Norris, Jerry Zhang, Elizabeth Lawler, and Stacy Yeh, Immerive and collaborative data visualization using virtual reality platforms

BigD392 Sarker Ahmed and Dmitri Loguinov, On the Performance of MapReduce: A Stochastic Approach

BigD401 Khalifeh Aljadda, Mohammed Korayem, Camilo Ortiz, Trey Grainger, John Miller, and William York, PGMHD: A Scalable Probabilistic Graphical Model for Massive Hierarchical Data Problems

BigD409 Maryam Panahiazar, Vahid Taslimi, Ashutosh Jadhav, Amit Sheth, and Jyotishman Pathak, Empowering Personalized Medicine with Big Data and Semantic Web Technology: Promises, Challenges, Pitfalls, and Use Cases

BigD410 Khoa Luu, Chenchen Zhu, and Marios Savvides, Distributed Class Dependent Feature Analysis - A Big Data Approach

BigD411 Amit Gupta, Weijia Xu, Kenneth Perrine, Dennis Bell, and Natalia Ruiz-Juri, On Scaling Time Dependent Shortest Path Computations for Dynamic Traffic Assignment

BigD413 Tao Zhong, Kshitij Doshi, Gang Deng, Xiaoming Yang, and Hegao Zhang, High Volume Geospatial Mapping for Internet-of-Vehicle Solutions with In-Memory Map-Reduce Processing

BigD428 Krish K.R., M. Safdar Iqbal, and Ali Butt, VENU: Orchestrating SSDs in Hadoop Storage

BigD431 Lee Thompson, Weijia Xu, and Daniel Miranker, The Adaptive Projection Forest: Using Adjustable Exclusion and Parallelism in Metric Space Indexes

BigD438 Nusrat Islam, Xiaoyi Lu, Md. Rahman, Raghunath Rajachandrasekar, and Dhabaleswar Panda, In-Memory I/O and Replication for HDFS with Memcached: Early Experiences

BigD440 Dongeun Lee and Jaesik Choi, Low Complexity Sensing for Big Spatio-Temporal Data

BigD448 Tony Worm and Kenneth Chiu, Scaling Up Prioritized Grammar Enumeration for Scientific Discovery in the Cloud

BigD469 Jialin Liu, Yin Lu, and Yong Chen, In-advance Data Analytics for Reducing Time to Discovery

BigD475 Douglas Ottscott, Noah Evans, Latchesar Ionkov, ming zhou, and michael lang, Enabling Composite Applications through an Asynchronous Shared Memory Interface

BigD476 Silu Huang and Ada Fu, k-balanced sorting and skew join in MPI and MapReduce



Industry and Government Program:

N201 Praveen Bommannavar, Alek Kolcz, and Anand Rajaraman, Recall Estimation for Rare Topic Retrieval from Large Corpuses

N202 Celeste Lyn Paul, Chris Argent, William Elm, and Alex Endert, Future Directions of Humans in Big Data Research

N203 Jenny Weisenberg Williams, Kareem Aggour, John Interrante, Justin McHugh, and Eric Pool, Bridging High Velocity and High Volume Industrial Big Data Through Distributed In-Memory Storage & Analytics

N207 Renu Tewari, Dean Hildebrand, and Rui Zhang, In Unity There is Strength: Showcasing a Unified Big Data Platform with MapReduce Over both Object and File Storage

N209 Rohan Malcolm, Cherrelle Morrison, Tyrone Grandison, Sean Thorpe, Kimron Christie, Akim Wallace, Damian Green, Julian Jarrett, and Arnett Campbell, Increasing the Accessibility to Big Data Systems via a Common Services API

N211 Peter Bajcsy, Phuong Nguyen, Antoine Vandecreme, and Mary Brady, Spatial Computations over Terabyte-Sized Images on Hadoop Platforms

N213 Dhaval C. Lunagariya, Somayajulu D.V.L.N., and Radha Krishna P., SE-CDA: A Scalable and Efficient Community Detection Algorithm

N216 Khalifeh Aljadda, Mohammed Korayem, Trey Grainger, and Chris Russell, Crowdsourced Query Augmentation through Semantic Discovery of Domain-specific Jargon

N217 Vinay Deolalikar and Kave Eshghi, Lightweight Approximate Top-k for Distributed Settings

N218 Vinay Deolalikar, Query Revision During Cluster Based Search on Large Unstructured Corpora

N219 Eric Huang, Andres Quiroz, and Luca Ceriani, Automating Data Integration with HiperFuse

N222 Nicolas Poggi, David Carrera, Aaron Call, Rob Reinauer, Nikola Vujic, Daron Green, José Blakeley, Sergio Mendoza, Yolanda Becerra, Jordi Torres, Eduard Ayguadé, and Jesús Labarta, ALOJA: a Systematic Study of Hadoop Deployment Variables to Enable Automated Characterization of Cost-Effectiveness

N223 Chaitali Gupta, Mayank Bansal, Tzu-Cheng Chuang, Ranjan Sinha, and Sami Ben-romdhane, Astro: A Predictive Model for Anomaly Detection and Feedback-based Scheduling on Hadoop

N224 Francois Schnitzler, Thomas Liebig, Shie Mannor, Gustavo Souto, Sebastian Bothe, and Hendrik Stange, Heterogeneous Stream Processing for Disaster Detection and Alarming

N228 Jiang Zheng and Aldo Dagnino, An Initial Study of Predictive Machine Learning Analytics on Large Volumes of Historical Data for Power System Applications

N230 Jayasimha Reddy Katukuri, Tolga Konik, Rajyashree Mukherjee, and Santanu Kolay, Recommending Similar Items in Large-scale Online Marketplaces

N232 Sathyan Munirathinam, Big Data Predictive Analytics for Proactive Semiconductor Equipment Maintenance

N236 Yongli Tang, Tingting He, Bo Li, and Xiaohua Hu, Identifying top Chinese network buzzwords from social media big data set based on time-distribution features



Last update: 31 August 2014