当前位置: 首页 > news >正文

cann/hccl集合通信AlltoAllVC示例

Collective Communication - AlltoAllVC

【免费下载链接】hccl集合通信库(Huawei Collective Communication Library,简称HCCL)是基于昇腾AI处理器的高性能集合通信库,为计算集群提供高性能、高可靠的通信方案项目地址: https://gitcode.com/cann/hccl

Sample Description

This sample demonstrates how to use theHcclAlltoAllVC()API to perform the AlltoAllVC operation. It covers the following functions:

  • CallaclrtGetDeviceCount()to detect devices and query the number of available devices.

  • CallHcclGetRootInfo()and userank 0as the root rank to generate the rootinfo identifier.

    The rootinfo identifier contains the device IP address and device ID. This information must be broadcast to all ranks in the cluster to initialize the communicator.

  • In each thread, callHcclCommInitRootInfo()to initialize the communicator based on the rootinfo identifier.

  • CallHcclAlltoAllVC()to split the input data into a specific number of blocks along a given dimension, send the blocks sequentially to other ranks, receive data from other ranks, concatenate the received data along the specific dimension in order, and display the result.

Directory Structure

├── main.cc # Sample source file ├── Makefile # Compilation and build configuration file └── alltoallvc # Compiled executable file

Environment Preparation

Environment Requirements

This sample supports the following products in a single-server N-card configuration (N >= 2):

  • Ascend 950PR / Ascend 950DT
  • Atlas A3 Training Series Products / Atlas A3 Inference Series Products
  • Atlas A2 Training Series Products

Setting Environment Variables

# Set CANN environment variables. The following uses the root user default installation path as an example. source /usr/local/Ascend/cann/set_env.sh

Compiling and Running the Sample

Run the following commands in the sample code directory:

make make test

Note: You can set theHCCL_OP_EXPANSION_MODEenvironment variable to configure the expansion mode of communication operators. For the supported ranges for different product models, see the usage instructions for this environment variable in the Environment Variable List.

# Set the communication operator expansion mode to the AI CPU communication engine export HCCL_OP_EXPANSION_MODE=AI_CPU

Sample Output

The data of each rank is initialized to the corresponding rank ID. After the AlltoAllVC operation, the content of each node is the concatenation of the input data of all nodes.

Found 8 NPU device(s) available rankId: 0, output: [ 0 1 2 3 4 5 6 7 ] rankId: 1, output: [ 0 1 2 3 4 5 6 7 ] rankId: 2, output: [ 0 1 2 3 4 5 6 7 ] rankId: 3, output: [ 0 1 2 3 4 5 6 7 ] rankId: 4, output: [ 0 1 2 3 4 5 6 7 ] rankId: 5, output: [ 0 1 2 3 4 5 6 7 ] rankId: 6, output: [ 0 1 2 3 4 5 6 7 ] rankId: 7, output: [ 0 1 2 3 4 5 6 7 ]

【免费下载链接】hccl集合通信库(Huawei Collective Communication Library,简称HCCL)是基于昇腾AI处理器的高性能集合通信库,为计算集群提供高性能、高可靠的通信方案项目地址: https://gitcode.com/cann/hccl

创作声明:本文部分内容由AI辅助生成(AIGC),仅供参考

http://www.gsyq.cn/news/1631707.html

相关文章:

  • CSS Subgrid 实践:对齐不是每个组件自己算一遍
  • Runbook最佳实践:10个高效自动化运维场景案例
  • BiliScope开发者指南:深入解析插件架构与API调用
  • E-Hentai Downloader终极使用指南:零基础快速上手漫画下载神器
  • 跨平台漫画神器:JHenTai的5大颠覆体验与专家级使用指南
  • E-Hentai Viewer:重新定义iOS漫画阅读体验的移动神器
  • gearmand完全指南:从安装到部署,打造高效分布式任务队列系统
  • SSH密钥生成与完整性保护:从Ed25519算法到Git签名实战
  • OSX-KVM音频延迟问题深度解析:三种高效解决方案对比
  • E-Hentai批量图片下载工具:2025年最全配置与使用手册
  • 含图解与实例)乐观锁、悲观锁和分布式锁,做项目时到底该怎么选?
  • AI生成代码真的可靠吗?3类致命缺陷+4步验证法,92%的团队还在忽略第3步
  • 3步完成跨平台文献管理:WPS-Zotero插件让你的科研写作效率倍增
  • E-Hentai批量下载工具终极指南:一键打包图库为ZIP文件
  • 【dnd-kit】react前端做一个可以垂直拖动的无序列表
  • 计算机毕业设计之基于jsp考研在线复习平台
  • Gemini Advanced订阅价值评估与合规使用指南
  • 从零到一:raylib游戏开发库终极入门指南
  • 终极指南:如何用yuzu模拟器在PC上流畅玩转任天堂Switch游戏
  • 5步打造专属漫画浏览体验:E-Viewer高效使用指南
  • 基建配套预制构件怎么选?2026年7月预制检查井厂家推荐参考
  • 百度文库文档净化脚本:让PDF保存变得简单纯粹
  • 解决方案:如何5分钟构建企业级国标视频监控平台
  • 如何免费提升BT下载速度300%:trackerslist终极指南
  • Qwen3实测全解析:4B到32B模型在多平台部署与中文任务表现
  • 特征融合技术提升小目标检测性能:原理、实现与工程实践
  • STM32F103C8T6的USB—CDC虚拟端口组件(HAL)
  • 《大模型实战指南》—— 面向软件开发者的系统性入门2
  • AI开发环境本地化:Codex与DeepSeek的协议转换与代理部署实战
  • 普通人也能入场的3个高薪AI岗位,平均月薪超3万!AI时代的机会在这里!