linzm14

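February 13, 2026

Summary: Nano-vLLM-Ascend. Project link: https://github.com/linzm1007/nano-vllm-ascend nano-vllm is an open-source GPU inference project on GitHub; this is a small Ascend NPU inference demo built on that open-source version, intended to help beginners understand the overall inference flow. Unlike vll ... Read more

posted @ 2026-02-13 11:38 linzm14 Views (118) Comments (0) Recommendations (0)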

December 10, 2025

sglang v0.5.5.post3 framework diagram

Summary: References: https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial/blob/main/sglang/code-walk-through/readme-CN.md https://github.com/sgl-project/sglang/tre ... Read more

posted @ 2025-12-10 14:38 linzm14 Views (78) Comments (0) Recommendations (0)

December 8, 2025

omniinfer vllm v0.9.0 overall framework diagram and pangu7b model diagram

Summary: References: https://shen-shanshan.github.io/articles/vllm-v1-整体流程从请求到算子执行/ https://gitee.com/omniai/omniinfer/tree/release_v0.6.0/ https://github.com/vllm-proj ... Read more

posted @ 2025-12-08 22:16 linzm14 Views (833) Comments (0) Recommendations (0)

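December 7, 2025

Nano-vLLM-Ascend

Summary: Reference: https://github.com/linzm1007/nano-vllm-ascend Nano-vLLM-Ascend. nano-vllm is an open-source GPU inference project; this is a small Ascend NPU inference demo built on that open-source version, intended to help beginners understand the overall inference flow. Unlike vllm, nano-v ... Read more

posted @ 2025-12-07 21:09 linzm14 Views (1058) Comments (0) Recommendations (0)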

August 19, 2025

failed to bind port 0.0.0.0:6667/tcp: fork/exec /usr/bin/docker-proxy: exec format error.

Summary: 1 Error message: docker: Error response from daemon: driver failed programming external connectivity on endpoint xx_ssh (429495664dec9d44f6958a4380124df6381a789 ... Read more

posted @ 2025-08-19 09:55 linzm14 Views (28) Comments (0) Recommendations (0)

April 15, 2025

gcc : Depends: cpp (= 4:9.3.0-1ubuntu2) but it is not going to be installed g++ : Depends: cpp (= 4:9.3.0-1ubuntu2) but it is not going to be installed

Summary: Problem: apt install build-essential Reading package lists... Done Building dependency tree... Done Reading state information... Done Some packages could no ... Read more

posted @ 2025-04-15 21:36 linzm14 Views (144) Comments (0) Recommendations (0)

August 6, 2024

How to validate nested properties (List objects) with the javax.validation package

Summary: 1 Maven dependency: <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-validation</art ... Read more

posted @ 2024-08-06 11:14 linzm14 Views (484) Comments (0) Recommendations (0)

June 19, 2024

Java LLM proxy for chat completions

Summary: 1 controller: @SneakyThrows @PostMapping(value = "/v1/chat/completions", produces = {TEXT_EVENT_STREAM_VALUE, APPLICATION_JSON_VALUE}) @Operation(summa ... Read more

posted @ 2024-06-19 11:09 linzm14 Views (176) Comments (0) Recommendations (0)

May 9, 2024

Fixing errors when running inference on Qwen1.5-7B-Chat with text-generation-webui

Summary: Inference code: text-generation-webui. Inference model: Qwen1.5-7B-Chat. sys info: gpu: Tesla V100-PCIE-32GB; python: 3.10; model: Qwen1.5-7B-Chat. docker: docker run -it --rm --gpus='" ... Read more

posted @ 2024-05-09 11:23 linzm14 Views (2338) Comments (0) Recommendations (0)

Fixing errors when training Llama3-Chinese-8B-Instruct with LLaMA-Factory

Summary: Model path: the uploader is the Llama Chinese community. Model URL: https://www.modelscope.cn/models/FlagAlpha/Llama3-Chinese-8B-Instruct/summary sys info: gpu: Tesla V100-PCIE-32GB; python: 3.10 ... Read more

posted @ 2024-05-09 11:19 linzm14 Views (2151) Comments (0) Recommendations (0)
