RahulSChand

Follow

🤔

Rahul C RahulSChand

🤔

Follow

Working on the current thing

34 followers · 1 following

Achievements

Achievements

Pinned Loading

gpu_poor gpu_poor Public

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1.2k 60
llama2.c-for-dummies llama2.c-for-dummies Public

Step by step explanation/tutorial of llama2.c

C 210 19
Multi-Granularity-Hierarchical-Attention-Fusion-Networks-for-Question-Answering---TensorFlow Multi-Granularity-Hierarchical-Attention-Fusion-Networks-for-Question-Answering---TensorFlow Public

TensorFlow implementation of the "Multi-Granularity Hierarchical Attention Fusion Networks for Reading Comprehension and Question Answering" paper by Alibaba.

Python 8 3
monosemanticity-quantization monosemanticity-quantization Public

Affect of Quantization on Monosemanticity of 1 layered Networks

Python 1
batched-pytorch-ksvd batched-pytorch-ksvd Public

Batched version of KSVD written in PyTorch that runs on GPU

Python 2
Weighted-low-rank-factorization-Pytorch Weighted-low-rank-factorization-Pytorch Public

PyTorch implementation of Language model compression with weighted low-rank factorization

Python 8 2