Issues: SJTU-IPADS/PowerInfer

Issues list

about the use of OPT model question Further information is requested
#228 opened Oct 21, 2024 by bobzhang208
add new model in power-infer2 question Further information is requested
#227 opened Oct 21, 2024 by Francis235
Qualcomm chips support question Further information is requested
#226 opened Oct 21, 2024 by Francis235
Question about the perplexity question Further information is requested
#225 opened Oct 13, 2024 by eljrte
How are the attention block weights allocated? question Further information is requested
#224 opened Oct 4, 2024 by Yues007
How can I obtain the weight files for the OPT model? question Further information is requested
#223 opened Sep 23, 2024 by a1bc2def6g
Measuring the predictor's overhead question Further information is requested
#220 opened Sep 16, 2024 by guanchenl
3 tasks done
Help! Want a toy example to run matmul with q40 weight by cuda kernel question Further information is requested
#219 opened Sep 11, 2024 by Eutenacity
CUDA toolkit version? question Further information is requested
#218 opened Sep 6, 2024 by shujiehan
Am i doing something wrong? question Further information is requested
#216 opened Aug 28, 2024 by RealMrCactus
3 tasks done
Some question about Fig4. question Further information is requested
#213 opened Jul 23, 2024 by rhmaaa
How do I get the prediction files? question Further information is requested
#211 opened Jul 15, 2024 by LDLINGLINGLING
3 tasks
Feature request : Support for PHI3 mini enhancement New feature or request
#210 opened Jul 14, 2024 by raymond-infinitecode
3 tasks
Is PowerInfer compatible with llama.cpp models? question Further information is requested
#209 opened Jul 5, 2024 by mailonghua
About powerinfer-2 enhancement New feature or request
#207 opened Jul 2, 2024 by Ther-nullptr
3 tasks done
Where is the TurboSparse-Mixtral mlp_predictor? question Further information is requested
#203 opened Jun 27, 2024 by MatthewCroughan
Can it be used together with vLLM? question Further information is requested
#202 opened Jun 26, 2024 by yadandan
How to convert ProSparse-LLaMA-2-13B model to .gguf? question Further information is requested
#201 opened Jun 23, 2024 by Graysonicc
3 tasks done
CMake build fails on Windows
#199 opened Jun 19, 2024 by codetown
Source for v2 (mobile inference engine) question Further information is requested
#194 opened Jun 12, 2024 by peeteeman
Need quite a long time to load the model question Further information is requested
#188 opened May 21, 2024 by meicale