Gpgpu-loadbalancerx Alternatives

Similar projects and alternatives to gpgpu-loadbalancerx

osmanip

62 212 7.4 C++ gpgpu-loadbalancerx VS osmanip

A cross-platform library for output stream manipulation using ANSI escape sequences.
SHA256-Implementation

3 5 0.0 C++ gpgpu-loadbalancerx VS SHA256-Implementation

A program that implements the SHA256 algorithm and generates the binary+hexdigest of a string input.
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
libletlib

1 11 0.0 C++ gpgpu-loadbalancerx VS libletlib

C++ framework for the impatient.
tser

4 131 0.0 C++ gpgpu-loadbalancerx VS tser

tser - tiny serialization for C++
emocrypt

4 37 0.0 C++ gpgpu-loadbalancerx VS emocrypt

Encrypts data into emojipastas
Blackjack_V1.02

2 1 0.0 C++ gpgpu-loadbalancerx VS Blackjack_V1.02

Extension of my old Blackjack game with Qt for C++
binary_io

3 26 6.6 C++ gpgpu-loadbalancerx VS binary_io

A binary i/o library for C++, without the agonizing pain
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
dmpower

1 12 0.6 C++ gpgpu-loadbalancerx VS dmpower

Discontinued Interactive terminal D&D helper toolbox program for Dungeon Masters, players, and worldbuilders.
stu

1 37 6.7 C++ gpgpu-loadbalancerx VS stu

Build automation
ftl

1 6 7.2 C++ gpgpu-loadbalancerx VS ftl

Discontinued Freestanding template library (by ronchaine)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better gpgpu-loadbalancerx alternative or higher similarity.

Suggest an alternative to gpgpu-loadbalancerx

gpgpu-loadbalancerx reviews and mentions

Posts with mentions or reviews of gpgpu-loadbalancerx. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-02-14.

vectorAdd.cu sample load-balanced on 3 GPUs
1 project | /r/CUDA | 25 Feb 2022

/** * Copyright 1993-2015 NVIDIA Corporation. All rights reserved. * * Please refer to the NVIDIA end user license agreement (EULA) associated * with this source code for terms and conditions that govern your use of * this software. Any use, reproduction, disclosure, or distribution of * this software and related documentation outside the terms of the EULA * is strictly prohibited. * */ /** * Vector addition: C = A + B. * * This sample is a very basic sample that implements element by element * vector addition. It is the same as the sample illustrating Chapter 2 * of the programming guide with some additions like error checking. */ #include // For the CUDA runtime routines (prefixed with "cuda_") #include #include // for load balancing between 3 different GPUs // https://github.com/tugrul512bit/gpgpu-loadbalancerx/blob/main/LoadBalancerX.h #include "LoadBalancerX.h" /** * CUDA Kernel Device code * * Computes the vector addition of A and B into C. The 3 vectors have the same * number of elements numElements. */ __global__ void vectorAdd(const float *A, const float *B, float *C, int numElements) { int i = blockDim.x * blockIdx.x + threadIdx.x; if (i < numElements) { C[i] = A[i] + B[i]; } } #include #include int main(void) { int numElements = 15000000; int numElementsPerGrain = 500000; size_t size = numElements * sizeof(float); float *h_A = (float *)malloc(size); float *h_B = (float *)malloc(size); float *h_C = (float *)malloc(size); for (int i = 0; i < numElements; ++i) { h_A[i] = rand()/(float)RAND_MAX; h_B[i] = rand()/(float)RAND_MAX; } /* * default tutorial vecAdd logic cudaMemcpy(d_A, h_A, size, cudaMemcpyHostToDevice); cudaMemcpy(d_B, h_B, size, cudaMemcpyHostToDevice); int threadsPerBlock = 256; int blocksPerGrid =(numElements + threadsPerBlock - 1) / threadsPerBlock; vectorAdd<<>>(d_A, d_B, d_C, numElements); cudaGetLastError(); cudaMemcpy(h_C, d_C, size, cudaMemcpyDeviceToHost); */ /* load-balanced 3-GPU version setup */ class GrainState { public: int offset; int range; std::map d_A; std::map d_B; std::map d_C; ~GrainState(){ for(auto a:d_A) cudaFree(a.second); for(auto b:d_B) cudaFree(b.second); for(auto c:d_C) cudaFree(c.second); } }; class DeviceState { public: int gpuId; int amIgpu; }; LoadBalanceLib::LoadBalancerX lb; lb.addDevice(LoadBalanceLib::ComputeDevice({0,1})); // 1st cuda gpu in computer lb.addDevice(LoadBalanceLib::ComputeDevice({1,1})); // 2nd cuda gpu in computer lb.addDevice(LoadBalanceLib::ComputeDevice({2,1})); // 3rd cuda gpu in computer // lb.addDevice(LoadBalanceLib::ComputeDevice({3,0})); // CPU single core for(int i=0;i( [&,i](DeviceState gpu, GrainState& grain){ if(gpu.amIgpu) { cudaSetDevice(gpu.gpuId); cudaMalloc((void **)&grain.d_A[gpu.gpuId], numElementsPerGrain*sizeof(float)); cudaMalloc((void **)&grain.d_B[gpu.gpuId], numElementsPerGrain*sizeof(float)); cudaMalloc((void **)&grain.d_C[gpu.gpuId], numElementsPerGrain*sizeof(float)); } }, [&,i](DeviceState gpu, GrainState& grain){ if(gpu.amIgpu) { cudaSetDevice(gpu.gpuId); cudaMemcpyAsync(grain.d_A[gpu.gpuId], h_A+i, numElementsPerGrain*sizeof(float), cudaMemcpyHostToDevice); cudaMemcpyAsync(grain.d_B[gpu.gpuId], h_B+i, numElementsPerGrain*sizeof(float), cudaMemcpyHostToDevice); } }, [&,i](DeviceState gpu, GrainState& grain){ if(gpu.amIgpu) { int threadsPerBlock = 1000; int blocksPerGrid =numElementsPerGrain/1000; vectorAdd<<>>(grain.d_A[gpu.gpuId], grain.d_B[gpu.gpuId], grain.d_C[gpu.gpuId], numElements-i); } else { for(int j=0;j de(3); for(int i=0;i<100;i++) { nanoseconds += lb.run(); } for(auto v:de) std::cout<
I created a load-balancer for multi-gpu projects.
1 project | /r/gpgpu | 23 Feb 2022
C++ Show and Tell - Experiment
12 projects | /r/cpp | 14 Feb 2022

Here is Nvidia's vectorAdd example modified for 3-GPU load balancing.
A note from our sponsor - InfluxDB
www.influxdata.com | 4 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Stats

Basic gpgpu-loadbalancerx repo stats

Mentions

Stars

Activity

2.6

Last Commit

about 2 years ago

tugrul512bit/gpgpu-loadbalancerx is an open source project licensed under GNU General Public License v3.0 only which is an OSI approved license.

The primary programming language of gpgpu-loadbalancerx is C++.

Popular Comparisons