#performanceportability

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

#CUDA #LLM #Compilers #AI #PerformancePortability #Package

https://hgpu.org/?p=29940

🧪Curious about high performance across GPUs? Our new paper benchmarks a parallel FSI code on CUDA, SYCL & OpenMP across top systems. See Aristotle Martin present it at #ISC2025 on June 11, 10:45 in Hamburg! #HPC #GPUcomputing #PerformancePortability

Thesis: Acceleration as a Service (XaaS) Source Containers

#HPC #MPI #PerformancePortability #LLM #Package

https://hgpu.org/?p=29925

Exploring SYCL for batched kernels with memory allocations

#SYCL #CUDA #PerformancePortability #Package

https://hgpu.org/?p=29911

Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems

#SYCL #TaskScheduling #PerformancePortability #HPC #Package

https://hgpu.org/?p=29823

Leveraging LLVM OpenMP GPU Offload Optimizations for Kokkos Applications

#Kokkos #CUDA #HIP #OpenMP #PerformancePortability #Package

https://hgpu.org/?p=29747

CPU-GPU co-execution through the exploitation of hybrid technologies via SYCL

#SYCL #OpenCL #CUDA #LLVM #PerformancePortability #LoadBalancing #HybridComputing

https://hgpu.org/?p=29717

We're used to leaning on children's books in Computer Science, as with Gulliver's big-endian vs. little-endian. Back at Supercomputing #SC24, I spoke at the #Intel booth about open standards, performance portability, and the journey up the Yellow Brick Road to see the Wizard of Oz. Check out the video of the talk on YouTube:
https://youtu.be/xO8FGAOScpo?si=_BnVilvTBa0Ns6dX
#performanceportability #OpenMP #SYCL

Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with Protein Database Search

#SYCL #oneAPI #Bioinformatics #Databases #HPC #PerformancePortability #Package

https://hgpu.org/?p=29596

Performance portability via C++ PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case study

#HIP #SYCL #OpenMP #CUDA #PerformancePortability #HPC #Astrophysics #Package

https://hgpu.org/?p=29555

Kokkidio: Fast, expressive, portable code, based on Kokkos and Eigen

#Kokkos #PerformancePortability #Package

https://hgpu.org/?p=29541

On a Simplified Approach to Achieve Parallel Performance and Portability Across CPU and GPU Architectures

#CUDA #HIP #Pthreads #Fortran #PerformancePortability #Package

https://hgpu.org/?p=29517

Thesis: Collection skeletons: declarative abstractions for data collections

#SYCL #OpenCL #PerformancePortability

https://hgpu.org/?p=29421

Challenging Portability Paradigms: FPGA Acceleration Using SYCL and OpenCL

#SYCL #OpenCL #FPGA #PerformancePortability #Package

https://hgpu.org/?p=29420

Thesis: Enhancing Code Portability, Problem Scale, and Storage Efficiency in Exascale Applications

#CUDA #OpenMP #MPI #HPC #PIC #ParticleInCell #PerformancePortability

https://hgpu.org/?p=29406

Evaluating Operators in Deep Neural Networks for Improving Performance Portability of SYCL

#SYCL #HIP #CUDA #oneAPI #PerformancePortability

https://hgpu.org/?p=29339

Thesis: Automatic Code Rewriting for Performance Portability

#OpenCL #SYCL #HIP #Compilers #PerformancePortability

https://hgpu.org/?p=29277

Experiences with implementing Kokkos’ SYCL backend

#SYCL #Kokkos #PerformancePortability #HPC #IWOCL #Package

https://hgpu.org/?p=29201

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

#CUDA #OpenMP #MonteCarlo #PerformancePortability #Intel #AMD #NVIDIA #Package

https://hgpu.org/?p=29161

Retargeting and Respecializing GPU Workloads for Performance Portability

#HIP #CUDA #PerformancePortability #Package

https://hgpu.org/?p=29160
