Here are
28 public repositories
matching this topic...
Performance-portable, length-agnostic SIMD with runtime dispatch
Expressive Vector Engine - SIMD in C++ Goes Brrrr
Pelemay is a native compiler for Elixir, which generates SIMD instructions. It has a plan to generate for GPU code.
Updated
Feb 15, 2021
Elixir
PyTurboJPEG is a highly optimized Python wrapper of libjpeg-turbo (TurboJPEG API) which supports x86 and ARM architecture.
Updated
Mar 6, 2022
Python
SIMD-based linear algebra and statistics for data science with dart
Updated
Apr 14, 2022
Dart
DSL for SIMD Sorting on AVX2 & AVX512
"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Corium is a modern scripting language which combines simple, safe and efficient programming.
Two-dimensional flow solver with GUI using vortex particle and boundary element methods
n-body-simulation performance test suite
Updated
Feb 13, 2019
Python
GPU-accelerated 3D vortex methods solver with easy GUI
Vectroized String Helper Functions
A portable modern C++ primitive performance library for 3D Vision & Photo-Mechanics.
SIMD discrete Fourier transform tests and discussion
(experiments with) pragma-based SIMD C++ types
System benchmarks over JVM with JMH - SIMD (superscalar processing), Branch prediction, False sharing.
Updated
Sep 11, 2018
Java
This repository lists 4 problems solved using C. Each problem has its own serial and parallel implementations. For the latter, the OpenMP API was utilized.
EinsteinDB is a Hybrid memory system consisting of DRAM and Non-Volatile Memory configured to persist data fast.
SIMD-accelerated Vector math lib
Updated
Nov 14, 2021
Assembly
8x speedup of 1D Haar-Transform using intel SIMD intrinsics
Optimizing convolution function using ARM's NEON Intrinsics
AVX SIMD accelerated Julia fractal explorer, 7 beautiful sets
Updated
Apr 29, 2018
Assembly
A fast and simple c# hex-decode function using AVX2 and SSSE3 Intel intrinsics.
Image filters using SSE Instructions (Streaming SIMD Extensions) of Intel® x86-64 Architecture.
CMap2 Top Coder Data Science Marathon Match
In this project we change the code of the SmithWaterman algorithm to achive parallel computing with different ways. University project for the course "Parallel Processing". Course Code: CEID_NY408
deep learning convolutional neural network implemented with SIMD acceleration (auto-vectorization)
Examples of Distributed-Memory Programming with MPI
Improve this page
Add a description, image, and links to the
simd-parallelism
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
simd-parallelism
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.