my_utils

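A utility toolkit for PyTorch training and inference workflows. It covers:

Performance collection and analysis (NSYS / NCU / unified metrics)
Runtime tracing and hooks (NVTX / module hooks)
Distributed helpers (clock synchronization / etcd barrier / sequence parallel padding)
Memory diagnostics (snapshot / OOM flag / GPU tracker)
Artifact persistence and offline analysis (dump / CSV)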

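Find what you need in 30 seconds

flowchart TD
    A[Start: what I want to do] --> B{Goal}
    B -->|End-to-end training performance| C[profiling/templates + nsys]
    B -->|Single-kernel bottleneck| D[profiling/ncu]
    B -->|In-code instrumentation and tracing| E[tracing + hooks]
    B -->|Distributed helpers| F[distributed]
    B -->|Memory diagnostics| G[memory]
    B -->|Basic utilities| H[core]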

Installation

cd my_utils
pip install -e .

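Optional dependencies (install as needed; the extras are quoted so that shells such as zsh do not expand the brackets):

pip install -e ".[profiling,tensordict,etcd,nvml,nvtx,system,megatron]"

Common combinations:

# Install only my_utils, leaving your existing torch/cuDNN environment untouched
pip install -e .

# Install all optional dependencies (without torch)
pip install -e ".[all]"

# Install all optional dependencies (including torch)
pip install -e ".[all_with_torch]"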

Ready-to-use commands (most common)

NSYS quick collection:

bash my_utils/profiling/templates/run_nsys_quick.sh -- python train.py --config cfg.yaml

NSYS offline analysis:

myutils-profile nsys-analyze --sqlite ./train_rank0.sqlite --output ./nsys_analyze.json
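
This README does not document what nsys-analyze computes internally. As a rough illustration of the kind of offline analysis an nsys SQLite export allows, the sketch below ranks kernels by total GPU time. It assumes the export contains the CUPTI_ACTIVITY_KIND_KERNEL and StringIds tables that Nsight Systems normally writes. The function name top_kernels and the in-memory demo data are hypothetical and are not part of my_utils.

```python
import sqlite3


def top_kernels(conn, k=5):
    """Return (kernel_name, call_count, total_ns) rows, largest total first."""
    # "start" and "end" are quoted because END is an SQL keyword.
    query = """
        SELECT s.value AS name,
               COUNT(*) AS calls,
               SUM(kern."end" - kern."start") AS total_ns
        FROM CUPTI_ACTIVITY_KIND_KERNEL AS kern
        JOIN StringIds AS s ON s.id = kern.shortName
        GROUP BY s.value
        ORDER BY total_ns DESC
        LIMIT ?
    """
    return conn.execute(query, (k,)).fetchall()


# Demo on a tiny in-memory database that mimics the assumed schema.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE StringIds (id INTEGER PRIMARY KEY, value TEXT);
    CREATE TABLE CUPTI_ACTIVITY_KIND_KERNEL (
        "start" INTEGER, "end" INTEGER, shortName INTEGER
    );
    INSERT INTO StringIds VALUES (1, 'gemm'), (2, 'softmax');
    INSERT INTO CUPTI_ACTIVITY_KIND_KERNEL VALUES
        (0, 100, 1), (200, 350, 1), (400, 450, 2);
    """
)
rows = top_kernels(conn, k=2)
for name, calls, total_ns in rows:
    print(f"{name}: {calls} calls, {total_ns} ns")
```

To try this on a real export, replace the in-memory connection with sqlite3.connect("./train_rank0.sqlite"), provided the file actually contains those tables.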

NCU full collection:

python my_utils/profiling/ncu/run_ncu_quick_yaml.py \
  --config my_utils/profiling/ncu/ncu_full_collection.yaml

NCU report analysis:

myutils-profile ncu-report-analyze --report ./run.ncu-rep --top-k 20 --pretty

Minimal Python examples

1) core: timing + logging

from my_utils.core import setup_logging_and_timer

logger, timer = setup_logging_and_timer(
    logger_name="train",
    log_file="train.log",
    use_cuda=True,
    rank=0,
)

timer.start("forward")
# ... your forward ...
timer.stop("forward")
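
For readers without my_utils installed, the start/stop pattern above can be approximated with the standard library. This is only a sketch of the idea, not the my_utils timer. The class name SectionTimer and its sync parameter are hypothetical. The optional sync callable stands in for whatever use_cuda=True may do, for example calling torch.cuda.synchronize before reading the clock.

```python
import time


class SectionTimer:
    """Hypothetical stand-in: accumulates wall-clock time per named section."""

    def __init__(self, sync=None):
        # sync: optional zero-argument callable run before each clock read,
        # e.g. torch.cuda.synchronize, so pending GPU work is included.
        self._sync = sync
        self._starts = {}
        self.totals = {}

    def start(self, name):
        if self._sync is not None:
            self._sync()
        self._starts[name] = time.perf_counter()

    def stop(self, name):
        if self._sync is not None:
            self._sync()
        elapsed = time.perf_counter() - self._starts.pop(name)
        self.totals[name] = self.totals.get(name, 0.0) + elapsed
        return elapsed


timer = SectionTimer()
timer.start("forward")
sum(range(10_000))  # placeholder for real work
elapsed = timer.stop("forward")
print(f"forward took {elapsed:.6f} s")
```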

2) tracing: NVTX with automatic fallback

from my_utils.tracing import create_labeler

labeler = create_labeler(preferred="auto")
with labeler.range("train_step"):
    # ... your step ...
    pass
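
How create_labeler chooses a backend is not described here. One common way to get this kind of automatic fallback is to try importing the nvtx package and otherwise return a no-op context manager. The sketch below shows that pattern under this assumption. The class name FallbackLabeler is hypothetical and is not the my_utils implementation.

```python
import contextlib

try:
    import nvtx  # optional dependency; provides nvtx.annotate
except ImportError:
    nvtx = None


class FallbackLabeler:
    """Hypothetical labeler: NVTX ranges when available, no-ops otherwise."""

    @property
    def backend(self):
        return "nvtx" if nvtx is not None else "noop"

    def range(self, name):
        if nvtx is not None:
            # nvtx.annotate can be used as a context manager.
            return nvtx.annotate(name)
        # Without nvtx, the with-block still runs but records nothing.
        return contextlib.nullcontext()


labeler = FallbackLabeler()
with labeler.range("train_step"):
    step_ran = True
print(f"backend={labeler.backend}")
```

The calling code stays identical on machines with and without NVTX, which is the point of the fallback.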

Package structure (by purpose)

my_utils/profiling: unified profiling entry point (NSYS/NCU/metrics)
my_utils/core: logger, timer, and general-purpose utilities
my_utils/tracing: NVTX labeler and trace helpers
my_utils/hooks: forward hook / module trace / module profiler
my_utils/distributed: clock sync / etcd barrier / pad helpers
my_utils/memory: snapshot / OOM / GPU memory tracker
my_utils/artifacts: dump and offline CSV analysis
my_utils/legacy_profilers: compatibility layer for older profilers

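Compatibility notes

Legacy import paths (for example, from my_utils.utils import MyTimer) still work.
New code should use the layered paths (for example, from my_utils.core import MyTimer).
my_utils/__init__.py ships built-in legacy module aliases so that older projects can migrate smoothly.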
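
The exact aliasing mechanism in my_utils/__init__.py is not shown in this README. One common way to keep an old import path working is to register the new module under the old name in sys.modules. The sketch below demonstrates that technique on a throwaway package. The name demo_pkg and its contents are hypothetical and exist only for illustration.

```python
import sys
import types

# Build a hypothetical package with the new layered layout: demo_pkg.core
core = types.ModuleType("demo_pkg.core")


class MyTimer:
    """Placeholder class living at the new path."""


core.MyTimer = MyTimer

pkg = types.ModuleType("demo_pkg")
pkg.core = core
sys.modules["demo_pkg"] = pkg
sys.modules["demo_pkg.core"] = core

# Legacy alias: make the old path demo_pkg.utils resolve to the same module.
sys.modules["demo_pkg.utils"] = core
pkg.utils = core

# Both import styles now return the very same class object.
from demo_pkg.core import MyTimer as NewTimer
from demo_pkg.utils import MyTimer as LegacyTimer

same_class = LegacyTimer is NewTimer
print(same_class)
```

Because both names point at one module object, state and isinstance checks stay consistent across old and new call sites.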
