HGPU group

HGPU group

HGPU group

HGPU group

OpenMP ARB

29 May 2025

📢 IWOMP 2025 – Deadline Extended to June 6th! 📢 Good news! The submission deadline for the International Workshop on OpenMP has been extended to 🗓️ June 6, 2025 (AoE)! 📍 Charlotte, NC | 🗓️ October 1–3, 2025 | 🏛️ Hosted by UNC Charlotte 📚 Accepted papers will be published in Springer’s LNCS series 🔍 2025 Theme: OpenMP – Balancing Productivity and Performance Portability Explore how OpenMP enables scalable, portable performance across diverse architectures. 🎯 Submit your latest research on: ✅ Offloading & Accelerated Computing ✅ HPC Applications & Scientific Computing ✅ Tasking, Runtime, and Tools ✅ OpenMP Extensions, ML & Data Analysis 📝 Up to 12 pages (excl. refs) for submissions Final version: up to 15 pages (incl. refs) 👉 Don’t miss this opportunity! iwomp.org/call-for-papers/ #IWOMP2025 #OpenMP #HPC #ParallelProgramming #PerformancePortability #CFP #DeadlineExtended

The 18th International Workshop on OpenMP

The premier forum to present and discuss issues, trends, recent research, and results related to parallel programming with OpenMP.

iwomp.org

176

Enhancing Transformer Performance and Portability through Auto-tuning Frameworks

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM...

Acceleration as a Service (XaaS) Source Containers

The 18th International Workshop on OpenMP

Exploring SYCL for batched kernels with memory allocations

Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems

Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with Protein...

Performance portability via C PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case study

On a Simplified Approach to Achieve Parallel Performance and Portability Across CPU and GPU...

Experiences with implementing Kokkos’ SYCL backend

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Retargeting and Respecializing GPU Workloads for Performance Portability

pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C Parallel STL Implementations

An Evaluative Comparison of Performance Portability across GPU Programming Models

AFOCL: Portable OpenCL Programming of FPGAs via Automated Built-in Kernel Management

Open SYCL on heterogeneous GPU systems: A case of study