Skill

這裡收錄 repo 裡找得到的完整 SKILL.md、awesome-agent-skills 上游索引，以及 skill repo 的技能商店與本地落地版。頁面上的標題、用途、說明與 Skill 內容都會整理成台灣慣用正體中文；來源連結會保留下來。

沒有符合條件的 Skill。

Multi NODE Slurm Skill

Convert single-node scripts to multi-node Slurm sbatch jobs 與 debug common multi-node failures。

awesome-agent-skills NVIDIA productivity

NEMO RL E2E Testing Skill

External NeMo-RL 端到端 validation 工作流用於 Megatron-Bridge model/provider changes, including downstream compatibility checks, external RL lifecycle behavior, Megatron poli..。

awesome-agent-skills NVIDIA testing

Parity Testing Skill

Structured framework 用於 verifying numerical parity of HF<->MCore weight conversions。

awesome-agent-skills NVIDIA testing

PERF Activation Recompute Skill

Validate 與 use selective 與 full activation recompute in Megatron Bridge to reduce GPU memory usage at the cost of extra compute。

awesome-agent-skills NVIDIA productivity

PERF CPU Offloading Skill

Validate 與 use CPU offloading in Megatron Bridge, including layer-level activation offloading 與 fractional optimizer state offloading 搭配 HybridDeviceOptimizer。

awesome-agent-skills NVIDIA productivity

PERF CUDA Graphs Skill

Validate 與 use CUDA graph capture in Megatron Bridge, including local full-iteration graphs 與 Transformer Engine scoped graphs 用於 attention, MLP, 與 MoE modules。

awesome-agent-skills NVIDIA productivity

PERF Expert Parallel Overlap Skill

Validate 與 use MoE expert-parallel communication overlap in Megatron-Bridge, including overlap_moe_expert_parallel_comm, delay_wgrad_compute, 與 flex dispatcher backends such as..。

awesome-agent-skills NVIDIA web

PERF Hierarchical Context Parallel 設計 Skill

Operational 指南用於 enabling hierarchical context parallelism in Megatron-Bridge, including config knobs, code anchors, pitfalls, 與 verification。

awesome-agent-skills NVIDIA design

PERF Megatron FSDP 設計 Skill

Operational 指南用於 enabling Megatron FSDP in Megatron-Bridge, including config knobs, code anchors, pitfalls, 與 verification。

awesome-agent-skills NVIDIA design

PERF Memory Tuning Skill

Techniques 用於 reducing peak GPU memory in Megatron Bridge — expandable segments, parallelism resizing, activation recompute, CPU offloading constraints, 與 common OOM fixes。

awesome-agent-skills NVIDIA productivity

PERF MOE COMM Overlap Skill

協助處理 PERF MOE COMM Overlap 相關工作，並依原始 Skill 說明完成設定與執行。

awesome-agent-skills NVIDIA productivity

PERF MOE Dispatcher Selection Skill

Choose the right MoE token dispatcher (`alltoall`, DeepEP, 或 HybridEP) 用於 the hardware, EP degree, 與 optimization stage。

awesome-agent-skills NVIDIA productivity

PERF MOE Hardware Configs Skill

Representative MoE training playbooks by hardware platform 與 model family。

awesome-agent-skills NVIDIA productivity

PERF MOE LONG Context 設計 Skill

Long-context MoE training guidance 用於 Megatron Bridge。

awesome-agent-skills NVIDIA design

PERF MOE Optimization Workflow Skill

Systematic 工作流用於 MoE training optimization in Megatron Bridge, based on the Megatron-Core MoE paper。

awesome-agent-skills NVIDIA agent

PERF MOE VLM Training 設計 Skill

Practical guidance 用於 training MoE VLMs in Megatron Bridge。

awesome-agent-skills NVIDIA design

PERF Parallelism Strategies 設計 Skill

Operational 指南用於 choosing 與 combining parallelism strategies in Megatron Bridge, including sizing rules, hardware topology mapping, 與 combined parallelism configuration。

awesome-agent-skills NVIDIA design

PERF Sequence Packing 設計 Skill

Validate 與 use packed sequences 與 long-context training in Megatron-Bridge, distinguishing offline packed SFT 用於 LLMs from in-batch packing 用於 VLMs, 與 applying the right CP..。

awesome-agent-skills NVIDIA design

PERF TP DP COMM Overlap 設計 Skill

Operational 指南用於 enabling TP, DP, 與 PP communication overlap in Megatron-Bridge, including config knobs, code anchors, pitfalls, 與 verification。

awesome-agent-skills NVIDIA design

Recipe Recommender Skill

Recommend 與 customize Megatron Bridge recipes 用於 a user's model, GPU count, 與 training goal。

awesome-agent-skills NVIDIA productivity

Resiliency Skill

Resiliency features in Megatron Bridge including fault tolerance, straggler detection, in-process restart, preemption, 與 re-run state machine。

awesome-agent-skills NVIDIA productivity

Testing Skill

Testing 參考資料用於 Megatron Bridge — unit 與 functional test layout, tier semantics (L0/L1/L2/flaky), script conventions, running 測試 locally, adding/moving/disabling 測試,..。

awesome-agent-skills NVIDIA testing

VERL E2E Testing Skill

External verl 端到端 validation 工作流用於 Megatron-Bridge model/provider changes。

awesome-agent-skills NVIDIA testing

Build AND Dependency Skill

Container-based dev environment setup 與 dependency management 用於 Megatron-LM。

awesome-agent-skills NVIDIA design

BUMP BASE Image 設計 Skill

協助處理 BUMP BASE Image 設計相關工作，並依原始 Skill 說明完成設定與執行。

awesome-agent-skills NVIDIA design

CI/CD 雲端部署 Skill

CI/CD 參考資料用於 Megatron-LM。

awesome-agent-skills NVIDIA cloud

Create Issue 雲端部署 Skill

Investigate a failing GitHub Actions run 或 job 與 create a GitHub issue 用於 the failure。

awesome-agent-skills NVIDIA cloud

Linting AND Formatting Skill

Linting 與 formatting 用於 Megatron-LM。

awesome-agent-skills NVIDIA productivity

Nightly SYNC Skill

Domain knowledge 用於 the nightly main-to-dev sync 工作流。

awesome-agent-skills NVIDIA agent

Onboard Gb200 1node Tests Skill

Onboard 1-node GitHub MR functional 測試用於 GB200 from existing mr-scoped 2-node 測試。

awesome-agent-skills NVIDIA testing

Respond TO Issue Skill

Research 與 draft a response to a GitHub issue 或 question from an external contributor。

awesome-agent-skills NVIDIA productivity

RUN ON Slurm Skill

協助處理 RUN ON Slurm 相關工作，並依原始 Skill 說明完成設定與執行。

awesome-agent-skills NVIDIA productivity

Split PR 設計 Skill

協助處理 Split PR 設計相關工作，並依原始 Skill 說明完成設定與執行。

awesome-agent-skills NVIDIA design

Testing Skill

測試 system 用於 Megatron-LM。

awesome-agent-skills NVIDIA testing

Update Golden Values 雲端部署 Skill

Refresh golden values from a GitHub Actions 工作流 run (failing-only 或 all jobs), score the change 搭配 average normalized relative differences, 與 produce a PR-ready summary。

awesome-agent-skills NVIDIA cloud

Accessing Mlflow 資料庫 Skill

Query 與 browse evaluation results stored in MLflow。

awesome-agent-skills NVIDIA database

Debug 雲端部署 Skill

協助處理 Debug 雲端部署相關工作，並依原始 Skill 說明完成設定與執行。

awesome-agent-skills NVIDIA cloud

Deployment 雲端部署 Skill

Serve a quantized 或 unquantized LLM checkpoint as an OpenAI-compatible API endpoint 使用 vLLM, SGLang, 或 TRT-LLM。

awesome-agent-skills NVIDIA cloud

Evaluation Skill

Evaluates accuracy of quantized 或 unquantized LLMs 使用 NeMo Evaluator Launcher (NEL)。

awesome-agent-skills NVIDIA productivity

Launching Evals Skill

Run, monitor, analyze, 與 debug LLM evaluations via nemo-evaluator-launcher。

awesome-agent-skills NVIDIA productivity

Monitor 雲端部署 Skill

Monitor submitted jobs (PTQ, evaluation, 部署) on SLURM clusters。

awesome-agent-skills NVIDIA cloud

PTQ Skill

協助處理 PTQ 相關工作，並依原始 Skill 說明完成設定與執行。

awesome-agent-skills NVIDIA productivity

Release Cherry PICK Skill

Cherry-pick merged PRs labeled 用於 a release branch into that branch, then open a PR 與 apply the cherry-pick-done label。

awesome-agent-skills NVIDIA productivity

BYOB Skill

建立 custom LLM evaluation benchmarks 使用 the BYOB decorator framework。

awesome-agent-skills NVIDIA productivity

Accessing Mlflow 資料庫 Skill

Query 與 browse evaluation results stored in MLflow。

awesome-agent-skills NVIDIA database

Launching Evals Skill

Run, monitor, analyze, 與 debug LLM evaluations via nemo-evaluator-launcher。

awesome-agent-skills NVIDIA productivity

NEL Assistant Skill

Interactive config wizard 用於 NeMo Evaluator Launcher (NEL)。

awesome-agent-skills NVIDIA productivity

ADD Benchmark 設計 Skill

> 指南用於 adding a new benchmark 或 training environment to NeMo-Gym。

awesome-agent-skills NVIDIA design

NEMO GYM Debugging Skill

>- Use when debugging a Nemo Gym run 或 reward profiling job。

awesome-agent-skills NVIDIA productivity

NEMO GYM DOCS Skill

> Maintain the NeMo Gym Fern docs site — add, update, move, 或 remove pages under fern/。

awesome-agent-skills NVIDIA docs

NEMO GYM Pivot Datasets Skill

>- Use when creating, validating, 或 documenting Nemo Gym pivot datasets from rollout, trajectory, chat-completion, Responses API, 或 tool-call artifacts。

awesome-agent-skills NVIDIA data

NEMO GYM Reward Profiling Skill

>- Use to help users get started 搭配 Nemo Gym reward profiling。

awesome-agent-skills NVIDIA productivity

AUTO Research 測試 Skill

Autonomous NeMo-RL research agent 工作流用於 directed hypothesis testing 與 open-ended discovery。

awesome-agent-skills NVIDIA testing

BREV Etiquette Skill

Brev instance operating guidance 用於 NeMo-RL agents working in /home/ubuntu/RL 搭配 limited workspace disk, a larger /ephemeral volume, 與 optional /home/ubuntu/RL/.env secrets。

awesome-agent-skills NVIDIA security

Build AND Dependency Skill

建置 and dependency management 用於 NeMo-RL。

awesome-agent-skills NVIDIA design

CI/CD 雲端部署 Skill

CI/CD 參考資料用於 NeMo-RL。

awesome-agent-skills NVIDIA cloud

Config Conventions Skill

Configuration conventions 用於 NeMo-RL。

awesome-agent-skills NVIDIA productivity

Contributing Skill

Contribution conventions 用於 NeMo-RL。

awesome-agent-skills NVIDIA productivity

Copyright 設計 Skill

NVIDIA copyright header requirements 用於 NeMo-RL。

awesome-agent-skills NVIDIA design

DOCS Skill

檔案 conventions 用於 NeMo-RL。

awesome-agent-skills NVIDIA docs