The Community for Technology Leaders
2018 30th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) (2018)
Lyon, France
Sep 24, 2018 to Sep 27, 2018
ISBN: 978-1-5386-7769-8
TABLE OF CONTENTS

Copyright (PDF)

pp. 1

Program Committees (PDF)

pp. xvi-xvii

External Reviewers (PDF)

pp. xviii

MLNoC: A Machine Learning Based Approach to NoC Design (Abstract)

Nishant Rao , NetSpeed Systems
Akshay Ramachandran , NetSpeed Systems
Amish Shah , NetSpeed Systems
pp. 1-8

ADeLe: Rapid Architectural Simulation for Approximate Hardware (Abstract)

Isaias B. Felzmann , University of Campinas, Institute of Computing
Matheus M. Susin , University of Campinas, Institute of Computing
Liana Duenha , Federal University of Mato Grosso do Sul, Faculty of Computing - FACOM
Rodolfo Azevedo , University of Campinas, Institute of Computing
Lucas F. Wanner , University of Campinas, Institute of Computing
pp. 9-16

From Java to FPGA: An Experience with the Intel HARP System (Abstract)

Pedro Caldeira , UFMG, Campus Pampulham, Belo Horizonte, 31270-901, Brazil
Jeronimo C. Penha , Campus Universitário, UFV, Viçosa, 36570-900, Brazil
Lucas Braganca , Campus Universitário, UFV, Viçosa, 36570-900, Brazil
Ricardo Ferreira , Campus Universitário, UFV, Viçosa, 36570-900, Brazil
Jose Augusto M. Nacif , Campus Universitário, UFV, Viçosa, 36570-900, Brazil
Renato Ferreira , UFMG, Campus Pampulha, Belo Horizonte, 31270-901, Brazil
Fernando M. Q. Pereira , UFMG, Campus Pampulha, Belo Horizonte, 31270-901, Brazil
pp. 17-24

Online Detection of Spectre Attacks Using Microarchitectural Traces from Performance Counters (Abstract)

Congmiao Li , University of California Irvine, Electrical Engineering and Computer Science, Irvine, USA
Jean-Luc Gaudiot , University of California Irvine, Electrical Engineering and Computer Science, Irvine, USA
pp. 25-28

DOACROSS Parallelization Based on Component Annotation and Loop-Carried Probability (Abstract)

Luis Mattos , UNICAMP, Institute of Computing, Campinas, Brazil
Divino Cesar , UNICAMP, Institute of Computing, Campinas, Brazil
Juan Salamanca , UNICAMP, Institute of Computing, Campinas, Brazil
Joao P. L. de Carvalho , UNICAMP, Institute of Computing, Campinas, Brazil
Marcio Pereira , UNICAMP, Institute of Computing, Campinas, Brazil
Guido Araujo , UNICAMP, Institute of Computing, Campinas, Brazil
pp. 29-32

Scheduling Independent Stochastic Tasks Under Deadline and Budget Constraints (Abstract)

Louis-Claude Canon , CNRS ENS de Lyon Inria, Université Claude-Bernard Lyon 1, LIP UMR5668, Univ Lyon, LYON Cedex 07, F-69342, France
Aurelie Kong Win Chang , CNRS ENS de Lyon Inria, Université Claude-Bernard Lyon 1, LIP UMR5668, Univ Lyon, LYON Cedex 07, F-69342, France
Yves Robert , CNRS ENS de Lyon Inria, Université Claude-Bernard Lyon 1, LIP UMR5668, Univ Lyon, LYON Cedex 07, F-69342, France
Frederic Vivien , CNRS ENS de Lyon Inria, Université Claude-Bernard Lyon 1, LIP UMR5668, Univ Lyon, LYON Cedex 07, F-69342, France
pp. 33-40

Adaptive Scheduling of Collocated Applications Using a Task-Based Runtime System (Abstract)

Jiri Dokulil , University of Vienna, Faculty of Computer Science, Vienna, Austria
Siegfried Benkner , University of Vienna, Faculty of Computer Science, Vienna, Austria
pp. 41-48

A Batch Task Migration Approach for Decentralized Global Rescheduling (Abstract)

Vinicius Freitas , Federal University of Santa Catarina (UFSC), Florianópolis, Brazil
Alexandre de L. Santana , Federal University of Santa Catarina (UFSC), Florianópolis, Brazil
Marcio Castro , Federal University of Santa Catarina (UFSC), Florianópolis, Brazil
Laercio L. Pilla , Federal University of Santa Catarina (UFSC), Florianópolis, Brazil
pp. 49-56

Exploring Power Budget Scheduling Opportunities and Tradeoffs for AMR-Based Applications (Abstract)

Yubo Qin , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, New Jersey, USA
Ivan Rodero , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, New Jersey, USA
Pradeep Subedi , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, New Jersey, USA
Manish Parashar , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, New Jersey, USA
Sandro Rigo , Universidade Estadual de Campinas, Campinas, Sao Paulo, Brazil
pp. 57-64

EASE: Energy Efficiency and Proportionality Aware Virtual Machine Scheduling (Abstract)

Congfeng Jiang , Hangzhou Dianzi University, Hangzhou, Zhejiang, China
Yumei Wang , Hangzhou Dianzi University, Hangzhou, Zhejiang, China
Dongyang Ou , Hangzhou Dianzi University, Hangzhou, Zhejiang, China
Yeliang Qiu , Hangzhou Dianzi University, Hangzhou, Zhejiang, China
Youhuizi Li , Hangzhou Dianzi University, Hangzhou, Zhejiang, China
Jian Wan , Zhejiang University of Science and Technology Hangzhou, Zhejiang, China
Bing Luo , Systems Laboratory, Dept. of Computer Science, Wayne State University, Mobile and Internet, Detroit, MI, USA
Weisong Shi , Systems Laboratory, Dept. of Computer Science, Wayne State University, Mobile and Internet, Detroit, MI, USA
Christophe Cerin , Sorbonne Paris Cite’ LIPN/CNRS UMR, University of Paris 13, 7030
pp. 65-68

Energy - Efficient IaaS-PaaS Co-Design for Flexible Cloud Deployment of Scientific Applications (Abstract)

David Guyon , Inria, CNRS, IRISA, Univ Rennes, Rennes, France
Anne-Cecile Orgerie , Inria, CNRS, IRISA, Univ Rennes, Rennes, France
Christine Morin , Inria, CNRS, IRISA, Univ Rennes, Rennes, France
pp. 69-76

Frequency Selection Approach for Energy Aware Cloud Database (Abstract)

Chaopeng Guo , University of Toulouse, Institut de Recherche en Informatique de Toulousez, France
Jean-Marc Pierson , University of Toulouse, Institut de Recherche en Informatique de Toulousez, France
pp. 77-84

Network-Aware Energy-Efficient Virtual Machine Management in Distributed Cloud Infrastructures with On-Site Photovoltaic Production (Abstract)

Benjamin Camus , Inria, CNRS, IRISA, Univ. Rennes, Rennes, France
Fanny Dufosse , LIG, Inria, Grenoble, France
Anne Blavette , CNRS, SATIE, Univ. Rennes, Rennes, France
Martin Quinson , Inria, CNRS, IRISA, Univ. Rennes, Rennes, France
Anne-Cecile Orgerie , Inria, CNRS, IRISA, Univ. Rennes, Rennes, France
pp. 86-92

A Novel Broker-Based Hierarchical Authentication Scheme in Proxy Mobile IPv6 Networks (Abstract)

Su-Hwan Jang , Sungkyunkwan University, Department of Smart Factory Convergence, Gyeonggi-do, 16419, Republic of Korea
Jongpil Jeong , Sungkyunkwan University, Department of Smart Factory Convergence, Gyeonggi-do, 16419, Republic of Korea
Byungjun Park , Sungkyunkwan University, College of Sport Science, Gyeonggi-do, 16419, Republic of Korea
pp. 93-96

Designing a Parallel Memory-Aware Lattice Boltzmann Algorithm on Manycore Systems (Abstract)

Yuankun Fu , Purdue University, Indianapolis, Indiana
Feng Li , Purdue University, Indianapolis, Indiana
Fengguang Song , Indiana University-Purdue University, Indianapolis, Indiana
Luoding Zhu , Indiana University-Purdue University, Indianapolis, Indiana
pp. 97-106

A New Efficient Parallel Algorithm for Minimum Spanning Tree (Abstract)

Jucele Franca de Alencar Vasconcellos , Federal University of Mato Grosso do Sul, College of Computing, Campo Grande, MS, Brazil
Edson Norberto Caceres , Federal University of Mato Grosso do Sul, College of Computing, Campo Grande, MS, Brazil
Henrique Mongelli , Federal University of Mato Grosso do Sul, College of Computing, Campo Grande, MS, Brazil
Siang Wun Song , University of são Paulo, Institute of Mathematics and Statistics, são Paulo, Sp, Brazil
pp. 107-114

Exploring Self-Adaptivity Towards Performance and Energy for Time-Stepping Methods (Abstract)

Natalia Kalinnik , University Bayreuth
Robert Kiesel , Chemnitz University of Technology
Thomas Hauber , University Bayreuth
Marcel Richter , Chemnitz University of Technology
Gudula Runger , Chemnitz University of Technology
pp. 115-123

Predicting the Reliability Behavior of HPC Applications (Abstract)

Daniel Oliveira , UFRGS, Institute of Informatics, Porto Alegre, Brazil
Francis Birck Moreira , UFRGS, Institute of Informatics, Porto Alegre, Brazil
Paolo Rech , UFRGS, Institute of Informatics, Porto Alegre, Brazil
Philippe Navaux , UFRGS, Institute of Informatics, Porto Alegre, Brazil
pp. 124-131

Variable-Size Batched Condition Number Calculation on GPUs (Abstract)

Hartwig Anzt , University of Tennessee, Karlsruhe Institute of Technology Germany, USA
Jack Dongarra , Oak Ridge National Laboratory USA, University of Manchester, University of Tennessee USA, UK
Goran Flegar , Universidad Jaume I, Castellon, Spain
Thomas Grutzmacher , Karlsruhe Institute of Technology, Germany
pp. 132-139

Towards a Single-Host Many-GPU System (Abstract)

Ming-Hung Chen , IBM Research, Yorktown Heights, NY, USA
I.-Hsin Chung , IBM Research, Yorktown Heights, NY, USA
Bulent Abali , IBM Research, Yorktown Heights, NY, USA
Paul Crumley , IBM Research, Yorktown Heights, NY, USA
pp. 140-147

Exploiting Limited Access Distance for Kernel Fusion Across the Stages of Explicit One-Step Methods on GPUs (Abstract)

Matthias Korch , University of Bayreuth, Department of Computer Science, Bayreuth, Germany
Tim Werner , University of Bayreuth, Department of Computer Science, Bayreuth, Germany
pp. 148-157

Balancing Load of GPU Subsystems to Accelerate Image Reconstruction in Parallel Beam Tomography (Abstract)

Suren Chilingaryan , Karlsruhe Institute of Technology
Evelina Ametova , KU Leuven
Andreas Kopmann , Karlsruhe Institute of Technology
pp. 158-166

Performance Prediction of GPU-Based Deep Learning Applications (Abstract)

Eugenio Gianniti , Politecnico di Milano, Milan, Italy
Li Zhang , IBM T. J. Watson Research Center, Yorktown Heights, NY, Unites States
Danilo Ardagna , Politecnico di Milano, Milan, Italy
pp. 167-170

Polyhedral Dataflow Programming: A Case Study (Abstract)

Romain Fontaine , INSA Lyon, CITI, Univ Lyon, Villeurbanne, F-69621, Frances
Laure Gonnord , Université Claude Bernard Lyon 1, CNRS, ENS de Lyon Inria, LIP, Univ Lyon, LYON Cedex 07, F-69342, France
Lionel Morel , CEA List, Univ Grenoble Alpes, Grenoble, F-38000, France
pp. 171-179

Enabling Efficient Job Dispatching in Accelerator-Extended Heterogeneous Systems with Unified Address Space (Abstract)

Georgios Kornaros , Technological Educational Institute of Crete, Department of Informatics Engineering, Heraklion, Crete, 71500, Greece
Marcello Coppola , STMicroelectronics, Grenoble, France
pp. 180-188

Phase-Based Data Placement Scheme for Heterogeneous Memory Systems (Abstract)

Mohammad Laghari , Koç University, Computer Science and Engineering, Istanbul, Turkey
Najeeb Ahmad , Koç University, Computer Science and Engineering, Istanbul, Turkey
Didem Unat , Koç University, Computer Science and Engineering, Istanbul, Turkey
pp. 189-196

Exploiting Compute Caches for Memory Bound Vector Operations (Abstract)

Joao Vieira , Instituto Superior Tecnico, Universidade de Lisboa, INESC-ID, Portugal
Nuno Roma , Instituto Superior Tecnico, Universidade de Lisboa, INESC-ID, Portugal
Pedro Tomas , Instituto Superior Tecnico, Universidade de Lisboa, INESC-ID, Portugal
Paolo Ienne , Ecole Polytechnique - Fédérale de Lausanne, Lausanne, Switzerland
Gabriel Falcao , Universidade de Coimbra, Instituto de Telecomunicaçoos, Portugal
pp. 197-200

Exploring the Potential of Next Generation Software-Defined in Memory Frameworks (Abstract)

Shouwei Chen , Rutgers University, Rutgers Discovery Informatics Institute (RDI2)
Ivan Rodero , Rutgers University, Rutgers Discovery Informatics Institute (RDI2)
pp. 201-208

Towards Green Scientific Data Compression Through High-Level I/O Interfaces (Abstract)

Yevhen Alforov , Deutsches Klimarechenzentrum GmbH, Hamburg, Germany
Thomas Ludwig , Deutsches Klimarechenzentrum GmbH, Hamburg, Germany
Anastasiia Novikova , Universität Hamburg, Hamburg, Germany
Michael Kuhn , Universität Hamburg, Hamburg, Germany
Julian Kunkel , University of Reading, Reading, United Kingdom
pp. 209-216

Improving the Performance of Fog Computing Through the Use of Data Locality (Abstract)

Luiz Angelo Steffenel , CReSTIC Laboratory - CASH Team, Université de Reims Champagne-Ardenne, Reims, France
pp. 217-224

ECHOFS: A Scheduler-Guided Temporary Filesystem to Leverage Node-Local NVMS (Abstract)

Alberto Miranda , Barcelona Supercomputing Center (BSC)
Ramon Nou , Barcelona Supercomputing Center (BSC)
Toni Cortes , Universitat Politecnica de Catalunya, Barcelona Supercomputing Center (BSC)
pp. 225-228

A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs (Abstract)

Hartwig Anzt , University of Tennessee, Karlsruhe Institute of Technology Germany, USA
Jack Dongarra , University of Manchester, University of Tennessee USA Oak Ridge National Laboratory USA, UK
pp. 229-232

Multicore Performance Engineering of Sparse Triangular Solves Using a Modified Roofline Model (Abstract)

Markus Wittmann , Friedrich-Alexander University, Erlangen Regional Computing Center, Erlangen-Nürnberg, Germany
Georg Hager , Friedrich-Alexander University, Erlangen Regional Computing Center, Erlangen-Nürnberg, Germany
Radim Janalik , Faculty of Informatics, Universitá della Svizzera italiana, Institute of Computational Science, Switzerland
Martin Lanser , University of Cologne, Faculty of Mathematics and Natural Sciences, Germany
Axel Klawonn , University of Cologne, Faculty of Mathematics and Natural Sciences, Germany
Oliver Rheinbach , TU Bergakademie Freiberg, Faculty of Mathematics and Computer Science, Germany
Olaf Schenk , Faculty of Informatics, Universitá della Svizzera italiana, Institute of Computational Science, Switzerland
Gerhard Wellein , Friedrich-Alexander University, Department of Computer Science, Erlangen-Nürnberg, Germany
pp. 233-241

Predicting the Performance Impact of Increasing Memory Bandwidth for Scientific Workflows (Abstract)

Nelson Mimura Gonzalez , IBM T. J. Watson Research Center, New York, USA
Jose Brunheroto , IBM T. J. Watson Research Center, New York, USA
Fausto Artico , IBM T. J. Watson Research Center, New York, USA
Yoonho Park , IBM T. J. Watson Research Center, New York, USA
Tereza Carvalho , University of São Paulo, Escola Poliéecnica, Brazil
Charles Christian Miers , Santa Catarina State University, Joinville, Brazil
Mauricio Aronne Pillon , Santa Catarina State University, Joinville, Brazil
Guilherme Piegas Koslovski , Santa Catarina State University, Joinville, Brazil
pp. 242-249

Mainstream vs. Emerging HPC: Metrics, Trade-Offs and Lessons Learned (Abstract)

Milan Radulovic , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Kazi Asifuzzaman , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Darko Zivanovic , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Nikola Rajovic , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Guillaume Colin de Verdiere , DAM, DIF, CEA, Arpajon, F-91297, France
Dirk Pleiter , Universität Regensburg, Institut für Theoretische Physik, Regensburg, 93040, Germany
Manolis Marazakisl , Foundation For Research & Technology - Hellas (FORTH), Heraklion, Greece
Nikolaos Kallimanis , Foundation For Research & Technology - Hellas (FORTH), Heraklion, Greece
Paul Carpenter , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Petar Radojkovic , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Eduard Ayguade , Barcelona Supercomputing Center (BSC), Barcelona, Spain
pp. 250-257

Assessing Time Predictability Features of ARM Big. LITTLE Multicores (Abstract)

Gabriel Fernandez , Barcelona Supercomputing Center (BSC), spain
Francisco J. Cazorla , Barcelona Supercomputing Center (BSC), spain
Jaume Abella , Barcelona Supercomputing Center (BSC), spain
Sylvain Girbal , THALES Research and Technology (TRT), France
pp. 258-261

Adaptive Partitioning for Iterated Sequences of Irregular OpenCL Kernels (Abstract)

Pierre Huchant , Bordeaux INP, U. of Bordeaux, Inria / LaBRI, Talence, France
Denis Barthou , Bordeaux INP, U. of Bordeaux, Inria / LaBRI, Talence, France
Marie-Christine Counilh , Bordeaux INP, U. of Bordeaux, Inria / LaBRI, Talence, France
pp. 262-265

Partitioning Convolutional Neural Networks for Inference on Constrained Internet-of-Things Devices (Abstract)

Fabiola Martins Campos de Oliveira , University of Campinas, Institute of Computing, Campinas, Brazil
Edson Borin , University of Campinas, Institute of Computing, Campinas, Brazil
pp. 266-273

Runtime Management of Data Quality for Scientific Observatories Using Edge and In-Transit Resources (Abstract)

Ali Reza Zamani , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, NJ, 08854, USA
Daniel Balouek-Thomert , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, NJ, 08854, USA
J. J. Villalobos , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, NJ, 08854, USA
Ivan Rodero , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, NJ, 08854, USA
Manish Parashar , Rutgers University, Rutgers Discovery Informatics Institute (RDI2), Piscataway, NJ, 08854, USA
pp. 274-281

A Fault-Tolerant Agent-Based Architecture for Transient Servers in Fog Computing (Abstract)

Jose Pergentino de Araujo Neto , Department of Computer Science (CIC), University of Brasilia (UnB), Brasilia, Brazil
Donald M. Pianto , Department of Statistics (EST), University of Brasilia (UnB), Brasilia, Brazil
Celia G. Ralha , Department of Computer Science (CIC), University of Brasilia (UnB), Brasilia, Brazil
pp. 282-289

Accelerating Deep Neural Network Training for Action Recognition on a Cluster of GPUs (Abstract)

Guojing Cong , IBM TJ Watson Research Center, 1101 Kitchawan Road, Yorktown Heights, NY, 10598
Giacomo Domeniconi , IBM TJ Watson Research Center, 1101 Kitchawan Road, Yorktown Heights, NY, 10598
Joshua Shapiro , IBM TJ Watson Research Center, 1101 Kitchawan Road, Yorktown Heights, NY, 10598
Fan Zhou , Georgia Tech, Atlanta, GA
Barry Chen , Lawrence Livermore National Laboratory, Livermore, CA
pp. 298-305

Deep Learning on Large-Scale Muticore Clusters (Abstract)

Kazumasa Sakivama , The University of Tokyo
Shinpei Kato , The University of Tokyo
Atsushi Hori , RIKEN
Abraham Monrroy , Nagoya University
pp. 314-321

On the Resilience of RTL NN Accelerators: Fault Characterization and Mitigation (Abstract)

Behzad Salami , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Osman S. Unsal , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Adrian Cristal Kestelman , Barcelona Supercomputing Center (BSC), Barcelona, Spain
pp. 322-329

T-SNE-CUDA: GPU-Accelerated T-SNE and its Applications to Modern Data (Abstract)

David M. Chan , University of California Berkeley, EECS Department, Berkeley, CA, USA
Roshan Rao , University of California Berkeley, EECS Department, Berkeley, CA, USA
Forrest Huang , University of California Berkeley, EECS Department, Berkeley, CA, USA
John F. Canny , University of California Berkeley, EECS Department, Berkeley, CA, USA
pp. 330-338

HyperSpace: Distributed Bayesian Hyperparameter Optimization (Abstract)

M. Todd Young , Oak Ridge National Laboratory, Computational Science & Engineering Division, Oak Ridge, USA
Jacob Hinkle , Oak Ridge National Laboratory, Computational Science & Engineering Division, Oak Ridge, USA
Arvind Ramanathan , Oak Ridge National Laboratory, Computational Science & Engineering Division, Oak Ridge, USA
Ramakrishnan Kannan , Oak Ridge National Laboratory, Computer Science & Mathematics Division, Oak Ridge, USA
pp. 339-347

A Machine Learning Approach for Parameter Screening in Earthquake Simulation (Abstract)

Marisol Monterrubio-Velasco , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Jose Carlos Carrasco-Jimenez , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Octavio Castillo-Reyes , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Fernando Cucchietti , Barcelona Supercomputing Center (BSC), Barcelona, Spain
Josep De la Puente , Barcelona Supercomputing Center (BSC), Barcelona, Spain
pp. 348-355

A Case Study on Optimizing Accurate Half Precision Average (Abstract)

Kenny Peou , Université Paris-Saclay, LRI UMR CNRS 8623
Alan Kelly , NUMSCALE
Joel Falcou , Université Paris-Saclay, LRI UMR CNRS 8623
Cecile Germain , Université Paris-Saclay, LRI UMR CNRS 8623
pp. 356-363

Optimization of a Sparse Grid-Based Data Mining Kernel for Architectures Using AVX-512 (Abstract)

Paul-Cristian Sarbu , Dept. of Informatics, Technical University of Munich, Chair of Scientific Computing, Garching, Germany
Hans-Joachim Bungartz , Dept. of Informatics, Technical University of Munich, Chair of Scientific Computing, Garching, Germany
pp. 364-371

Energy Efficient Parallel K-Means Clustering for an Intel® Hybrid Multi-Chip Package (Abstract)

Matheus A. Souza , Pontifical Catholic University of Minas Gerais (PUC Minas), Belo Horizonte, Brazil
Lucas A. Maciel , Pontifical Catholic University of Minas Gerais (PUC Minas), Belo Horizonte, Brazil
Pedro Henrique Penna , Pontifical Catholic University of Minas Gerais (PUC Minas), Belo Horizonte, Brazil
Henrique C. Freitas , Pontifical Catholic University of Minas Gerais (PUC Minas), Belo Horizonte, Brazil
pp. 372-379

Performance Comparison of a Parallel Recommender Algorithm Across Three Hadoop-Based Frameworks (Abstract)

Christina Diedhiou , University of Portsmouth, School of Computing, Portsmouth, UK
Bryan Carpenter , University of Portsmouth, School of Computing, Portsmouth, UK
Aamir Shafi , College of Computer Science and Information Technology, Department of Computer Science, Dammam, Saudi Arabia
Soumabha Sarkar , University of Portsmouth, School of Computing, Portsmouth, UK
Ramazan Esmeli , University of Portsmouth, School of Computing, Portsmouth, UK
Ryan Gadsdon , University of Portsmouth, School of Computing, Portsmouth, UK
pp. 380-387

Effect of Network Topology on the Performance of ADMM-Based SVMs (Abstract)

Shirin Tavara , University of Borås, Information Technology, Borås, Sweden
Alexander Schliep , University of Gothenburg, Computer Science and Engineering, Gothenburg, Sweden
pp. 388-393

High-Performance Ensembles of Online Sequential Extreme Learning Machine for Regression and Time Series Forecasting (Abstract)

Luis Fernando L. Grim , University of Campinas (FT/UNICAMP), School of Technology, Limeira-SP, Brazil
Andre Leon S. Gradvohl , University of Campinas (FT/UNICAMP), School of Technology, Limeira-SP, Brazil
pp. 394-401

Design Space Exploration of Energy Efficient NoC-and Cache-Based Many-Core Architecture (Abstract)

Matheus A. Souza , Pontifical Catholic University of Minas Gerais, Belo Horizonte, Brazil
Henrique C. Freitas , Pontifical Catholic University of Minas Gerais, Belo Horizonte, Brazil
Jean-Francois Mehaut , Université Grenoble Alpes, Grenoble, France
pp. 402-409

Highly Scalable Stencil-Based Matrix-Free Stochastic Estimator for the Diagonal of the Inverse (Abstract)

Fabio Verbosio , Universita della Svizzera italiana, Institute of Computational Science, Lugano, Switzerland
Jurai Kardos , Universita della Svizzera italiana, Institute of Computational Science, Lugano, Switzerland
Mauro Bianco , Swiss National Supercomputing Center, Lugano, Switzerland
Olaf Schenk , Universita della Svizzera italiana, Institute of Computational Science, Lugano, Switzerland
pp. 410-419

A Scalability and Sensitivity Study of Parallel Geometric Algorithms for Graph Partitioning (Abstract)

Shad Kirmani , eBay Inc., Brisbane, CA, USA
Hongyang Sun , Vanderbilt University, Nashville, TN, USA
Padma Raghavan , Vanderbilt University, Nashville, TN, USA
pp. 420-427

Automatic Ray-Tracer Cloud Offloading in OPENMP (Abstract)

Matheus Mortatti , University of Campinas (UNICAMP), Institute of Computing, Campinas, Brazil
Herve Yviquel , University of Campinas (UNICAMP), Institute of Computing, Campinas, Brazil
Guido Araujo , University of Campinas (UNICAMP), Institute of Computing, Campinas, Brazil
pp. 428-435

Evaluation of an OPENMP Parallelization of Lucas-Kanade on a NUMA-Manycore (Abstract)

Olfa Haggui , Mines ParisTech - PSL Research University, Centre de Recherche en Informatique (CRI), Paris, 60 boulevard Saint-Michel, 75006, France
Claude Tadonki , Mines ParisTech - PSL Research University, Centre de Recherche en Informatique (CRI), Paris, 60 boulevard Saint-Michel, 75006, France
Fatma Sayadi , Faculty of Sciences, University of Monastir, Electronics and Microelectronics Laboratory, Monastir, 5000, Tunisia
Bouraoui Ouni , Sousse National School of Engineering, Networked Objects Control and Communications Systems (NOCCS), BP 264 Sousse, Erriadh, 4023, Tunisia
pp. 436-441

Automated GPU Grid Geometry Selection for OPENMP Kernels (Abstract)

Taylor Lloyd , University of Alberta, Edmonton, Canada
Artem Chikin , University of Alberta, Edmonton, Canada
Sanket Kedia , IIT Kharagpur, Kharagpur, India
Dhruv Jain , IIT Kharagpur, Kharagpur, India
Jose Nelson Amaral , University of Alberta, Edmonton, Canada
pp. 442-449

Hybrid MPI+openMP Implementation of eXtended Discrete Element Method (Abstract)

Abdoul Wahid Mainassara Checkaraou , Computational Engineering Department, LuXDEM Research Centre
Alban Rousset , Computational Engineering Department, LuXDEM Research Centre
Xavier Besseron , Computational Engineering Department, LuXDEM Research Centre
Sebastien Varrette , Computer Science and Communications Department, University of Luxembourg, PCOG, Luxembourg
Bernhard Peters , Computational Engineering Department, LuXDEM Research Centre
pp. 450-457

Impacts of Three Soft-Fault Models on Hybrid Parallel Asynchronous Iterative Methods (Abstract)

Evan Coleman , Dahlgren Division, Naval Surface Warfare Center, Dahlgren, VA
Erik J. Jensen , Old Dominion University, Modeling Simulation and Visualization Engineering Department, Norfolk, VA
Masha Sosonkina , Old Dominion University, Modeling Simulation and Visualization Engineering Department, Norfolk, VA
pp. 458-465

Scaling and Optimizing the Gysela Code on a Cluster of Many-Core Processors (Abstract)

Guillaume Latu , IRFM, CEA, St-Paul-lez-Durance, FR-13108
Yuuichi Asahi , QST Rokkasho Fusion Institute, Aomori, Japan
Julien Bigot , Maison de la Simulation, CEA, Gif-sur-Yvette, FR-91191
Tamas Feher , Max Planck Institute for Plasma Physics, Garching, Germany
Virginie Grandgirard , IRFM, CEA, St-Paul-lez-Durance, FR-13108
pp. 466-473

Author Index (PDF)

pp. 475-478
93 ms
(Ver 3.3 (11022016))