Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
OpenTransformer
/
llama.cpp-prismml
like
0
arxiv:
2302.13971
arxiv:
2005.14165
arxiv:
2203.02155
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
llama.cpp-prismml
/
ggml
/
src
/
ggml-cpu
3.36 MB
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
OpenTransformer
perf: maddubs kernel + nrc=4 multi-row for Q1_0_g128 (3.5-3.75 t/s)
570ff77
verified
2 months ago
amx
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
arch
perf: maddubs kernel + nrc=4 multi-row for Q1_0_g128 (3.5-3.75 t/s)
2 months ago
cmake
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
kleidiai
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
llamafile
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
spacemit
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
CMakeLists.txt
32.8 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
arch-fallback.h
21 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
binary-ops.cpp
6.71 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
binary-ops.h
518 Bytes
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
common.h
2.33 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
ggml-cpu-impl.h
13.2 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
ggml-cpu.c
129 kB
perf: maddubs kernel + nrc=4 multi-row for Q1_0_g128 (3.5-3.75 t/s)
2 months ago
ggml-cpu.cpp
24 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
hbm.cpp
2 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
hbm.h
155 Bytes
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
ops.cpp
372 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
ops.h
9.19 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
quants.c
43.2 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
quants.h
10.4 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
repack.cpp
151 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
repack.h
14.9 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
simd-gemm.h
3.77 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
simd-mappings.h
52.3 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
traits.cpp
1.23 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
traits.h
1.16 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
unary-ops.cpp
11.6 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
unary-ops.h
2.44 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
vec.cpp
25.3 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago
vec.h
66.6 kB
Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)
2 months ago