当前位置: 首页 > news >正文

CANN/asc-devkit数据搬运API样例

Data Movement API Sample Introduction

【免费下载链接】asc-devkit本项目是CANN 推出的昇腾AI处理器专用的算子程序开发语言,原生支持C和C++标准规范,主要由类库和语言扩展层构成,提供多层级API,满足多维场景算子开发诉求。项目地址: https://gitcode.com/cann/asc-devkit

Overview

This directory contains samples for multiple APIs related to data movement. Each sample is based on Ascend C's <<<>>> direct call method, supporting implementation of main function and kernel function in the same cpp file.

Sample List

Directory NameFunction Description
broadcast_ub2l0cThis sample implements data broadcast movement based on BroadCastVecToMM, broadcasting data located on UB (Unified Buffer) and moving it to CO1 (L0C Buffer)
copy_ub2ubThis sample implements data movement based on Copy, applicable for data movement between VECIN and VECOUT, supporting mask continuous mode and counter mode
data_copy_gm2ub_sliceThis sample implements data slice movement based on DataCopy, extracting subsets of multi-dimensional Tensor data for movement between GM (Global Memory) and UB (Unified Buffer) pathways
data_copy_gm2ub_nddmaThis sample introduces how to use multi-dimensional data movement interface to implement data movement from GM (Global Memory) to UB (Unified Buffer) pathway. By freely configuring dimension information and corresponding Stride, it can be used for Padding, Transpose, BroadCast, Slice and other data transformation operations
data_copy_l0c2gmThis sample implements data inline quantization activation movement based on DataCopy in convolution scenarios
data_copy_pad_gm2ub_ub2gmThis sample implements non-32-byte aligned data movement based on DataCopyPad, with data padding
data_copy_ub2l1This sample implements data movement from UB (Unified Buffer) to L1 (L1 Buffer) based on DataCopy in Mmad matrix multiplication scenarios
ld_st_reg_maskThis sample implements UB (Unified Buffer) load/store operations to MaskReg (mask register) using Reg programming interface, and mask-based masked store operations
ld_st_reg_alignThis sample implements aligned data movement operations (continuous and non-continuous) from UB (Unified Buffer) to RegTensor (Reg vector computation basic unit) using Reg programming interface
ld_st_reg_unalignThis sample implements unaligned data movement operations from UB (Unified Buffer) to RegTensor (Reg vector computation basic unit) using Reg programming interface
gather_ld_regThis sample demonstrates using Gather interface to implement discrete data load, including high-dimensional Gather (source is LocalTensor) and Reg::GatherB (collect by DataBlock) scenarios
scatter_st_regThis sample demonstrates using Reg::Scatter interface to implement discrete data store (scatter elements to UB)
auxscalar_regThis sample demonstrates using AuxScalar method to read multiple scalar data from UB for computation
move_regThis sample implements data load/store operations from UB (Unified Buffer) to RegTensor using Reg programming interface

【免费下载链接】asc-devkit本项目是CANN 推出的昇腾AI处理器专用的算子程序开发语言,原生支持C和C++标准规范,主要由类库和语言扩展层构成,提供多层级API,满足多维场景算子开发诉求。项目地址: https://gitcode.com/cann/asc-devkit

创作声明:本文部分内容由AI辅助生成(AIGC),仅供参考

http://www.gsyq.cn/news/1342740.html

相关文章:

  • 2026最权威一键生成论文工具榜单:这些被高校和导师偷偷推荐的软件你用了吗
  • DeepEval终极指南:如何用开源框架轻松评估AI模型质量
  • 终极指南:如何快速上手MobileNetV3预训练模型实现高效图像分类
  • ubuntu24 主题经验
  • 2026年无添加微辣萝卜干深度厂家推荐 - 行业平台推荐
  • mlir 编译器学习笔记之六 -- 经典实现
  • 全面掌握GemPy三维地质建模:从核心概念到实战应用
  • RedisBloom Cuckoo过滤器终极指南:为什么它比布隆过滤器更强大
  • CANN/asc-devkit张量形状定义
  • 2026年评价高的榨菜芯/去皮榨菜优质厂家推荐榜 - 品牌宣传支持者
  • CANN Ascend C向量计算API
  • 《Sysinternals实战指南》进程和诊断工具学习笔记(8.24):Handle——谁占着不放?句柄泄漏排查、强制解锁与检索技巧
  • Go语言并发编程:sync包深度解析与实践
  • FICO创凭证标准错误:在折旧范围 01 中的业务与帐面净值规则冲突
  • 如何将普通桌面实时转换为3D立体视频?nunif iw3-desktop完全指南
  • ElevenLabs高棉文语音私有化部署终极方案(含Docker+Khmer IPA音素映射表),仅限前200位技术负责人获取
  • 3分钟掌握gmpublisher:Garry‘s Mod工坊发布的终极解决方案
  • 深入拆解 MySQL InnoDB 隔离级别:从 MVCC 到临键锁
  • rk3588/rk3576使用rkllm推理大模型,提供OpenAI服务
  • AI论文软件的实战手册:什么程度算学术不端?
  • C语言06(操作符)
  • 080.领域自适应:当你的YOLO在新车间“水土不服”时
  • VxWorks网络通信模块:网络协议栈解析(第五部分)
  • ComfyUI全面掌握-知识点详解——ComfyUI 开发与扩展基础(开发指南+环境搭建)
  • linuxcnc开发环境搭建
  • Win11 右键菜单缺少“新建文本文档“win11 某些软件中文乱码
  • 谷歌关键词优化seo需要怎么做?避开这4个最掏钱的布词误区
  • 2026年整箱榨菜厂家精选合集 - 行业平台推荐
  • Allen-Bradley 280D-F12Z-10B-CR启动控制模块
  • HTTP状态码与请求方式全解析【个人八股】