{"id":19891,"date":"2025-08-07T11:53:24","date_gmt":"2025-08-07T03:53:24","guid":{"rendered":"https:\/\/aicats.wiki\/sites\/19891.html"},"modified":"2025-08-07T11:53:24","modified_gmt":"2025-08-07T03:53:24","slug":"deepspeed","status":"publish","type":"sites","link":"https:\/\/aicats.wiki\/en\/sites\/19891.html","title":{"rendered":"DeepSpeed"},"content":{"rendered":"<p>As of June 2024, large language models have entered the era of scale. DeepSpeed, developed and open-sourced by Microsoft, has become an important tool for AI developers worldwide thanks to its optimization techniques. <strong>DeepSpeed<\/strong> provides efficient multi-GPU, multi-node distributed training, making large-scale <a href=\"https:\/\/aicats.wiki\/en\/sites\/20475-html\/\" title=\"LOBE\">AI model training<\/a> faster and cheaper. Its core technology rests on four pillars: DeepSpeed-Training, DeepSpeed-Inference, DeepSpeed-Compression, and DeepSpeed4Science. Users can deploy it locally or on Azure and train AI models with a free, high-performance toolkit.<\/p>\n\n\n\n<p>In June 2024, large language models are advancing rapidly. From GPT-4 and BLOOM to the latest enterprise applications, <strong>large-scale AI model training<\/strong> has become contested high ground for the industry. As parameter counts reach the billions and even hundreds of billions, the cost and difficulty of training and inference rise sharply. <strong>DeepSpeed<\/strong>, developed and open-sourced by Microsoft, addresses this with aggressive optimization and has become essential \u201cinfrastructure\u201d for AI developers worldwide.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>Visit the official DeepSpeed site: <a href=\"https:\/\/www.deepspeed.ai\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >https:\/\/www.deepspeed.ai\/<\/a><\/p>\n<\/blockquote>\n\n\n\n<figure class=\"wp-block-image size-full\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1595\" height=\"918\" src=\"https:\/\/aicats.wiki\/wp-content\/uploads\/2025\/08\/image-211.png\" alt=\"Screenshot of the DeepSpeed website\" class=\"wp-image-23623\"\/><figcaption class=\"wp-element-caption\">Photo\/<a href=\"https:\/\/www.deepspeed.ai\/getting-started\/\" title=\"\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" 
>Screenshot of the DeepSpeed website<\/a><\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">What Is DeepSpeed?<\/h2>\n\n\n\n<p><strong>DeepSpeed<\/strong> is a deep learning optimization software suite built for training and serving very large models. It has been used to train leading large models such as MT-NLG, BLOOM, and Jurassic-1. DeepSpeed targets multi-GPU, multi-node distributed environments, where it aims to maximize training speed, make the best use of resources, and cut cost substantially.<br>\nIt is not only for large models: it also lets small and mid-sized teams complete, on more affordable hardware, tasks that once required massive server clusters.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Main Features of DeepSpeed<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\">The Four Innovation Pillars<\/h3>\n\n\n\n<p>DeepSpeed's core innovations fall into four pillars, each aimed at a pain point in AI training or inference:<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Pillar<\/th><th>Main 
functions<\/th><\/tr><\/thead><tbody><tr><td><strong>DeepSpeed-Training<\/strong><\/td><td>Large-scale parallel training techniques (ZeRO, 3D-Parallelism, Mixture-of-Experts, ZeRO-Infinity, and others) that significantly improve training efficiency and scale.<\/td><\/tr><tr><td><strong>DeepSpeed-Inference<\/strong><\/td><td>Efficient, low-latency inference for very large models, combining tensor, pipeline, expert, and ZeRO parallelism with optimized kernels.<\/td><\/tr><tr><td><strong>DeepSpeed-Compression<\/strong><\/td><td>Easy-to-use, flexible model compression (ZeroQuant, XTC, and others) that shrinks models, speeds up inference, and lowers cost with minimal impact on accuracy.<\/td><\/tr><tr><td><strong>DeepSpeed4Science<\/strong><\/td><td>Combines system optimization with scientific computing to support frontier fields such as life sciences and physics, improving training efficiency for scientific AI models.<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>For a detailed technical introduction, see the <a href=\"https:\/\/www.deepspeed.ai\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >DeepSpeed technology pillars page<\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"1595\" height=\"918\" src=\"https:\/\/aicats.wiki\/wp-content\/uploads\/2025\/08\/image-212.png\" alt=\"DeepSpeed technology pillars page\" class=\"wp-image-23628\"\/><figcaption class=\"wp-element-caption\">Photo\/ <a href=\"https:\/\/www.deepspeed.ai\/\" 
target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >DeepSpeed technology pillars page<\/a><\/figcaption><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\">DeepSpeed-Training<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Distributed training optimizations<\/strong>, such as the ZeRO family of optimizers, that scale close to linearly across thousands of GPUs on many nodes and support training models with hundreds of billions of parameters.<\/li>\n\n\n\n<li><strong>3D-Parallelism<\/strong>, which combines tensor, pipeline, and data parallelism to make full use of compute and memory bandwidth.<\/li>\n\n\n\n<li><strong>MoE (Mixture-of-Experts) training optimizations<\/strong>, which manage sparsely activated large models automatically and train sparse parameters more efficiently.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">DeepSpeed-Inference<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Low-latency inference for very large models<\/strong>, combining custom high-performance inference kernels with communication optimizations so that even models with hundreds of billions of parameters can be served quickly and at high concurrency.<\/li>\n\n\n\n<li><strong>Heterogeneous memory scheduling<\/strong>, with support for mixed CPU + GPU\/NVMe storage that greatly reduces GPU memory requirements.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">DeepSpeed-Compression<\/h3>\n\n\n\n<ul 
class=\"wp-block-list\">\n<li><strong>Integrated compression and quantization<\/strong>, with built-in techniques such as ZeroQuant and XTC and support for fully automatic compression, which simplifies inference deployment.<\/li>\n\n\n\n<li><strong>A flexible, composable compression API<\/strong> that serves both research and industrial needs.<\/li>\n<\/ul>\n\n\n\n<p>For more details, please see the <a href=\"https:\/\/www.deepspeed.ai\/technology\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >DeepSpeed feature list<\/a>.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">DeepSpeed Pricing &amp; Plans<\/h2>\n\n\n\n<p><strong>DeepSpeed is fully open source and free<\/strong>. The source code is available on <a href=\"https:\/\/github.com\/microsoft\/DeepSpeed\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >GitHub<\/a>, and both commercial and research users can adopt it with no barrier to entry. The project also offers:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"1595\" height=\"918\" src=\"https:\/\/aicats.wiki\/wp-content\/uploads\/2025\/08\/image-213.png\" alt=\"Open source on GitHub\" class=\"wp-image-23629\"\/><figcaption class=\"wp-element-caption\">Photo\/<a href=\"https:\/\/github.com\/microsoft\/DeepSpeed\" title=\"\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >Open source on GitHub<\/a><\/figcaption><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Deep integration with Hugging Face Transformers, PyTorch, and PyTorch Lightning, which lowers the barrier to model migration and custom development.<\/li>\n\n\n\n<li>Microsoft Azure 
is fully supported, and distributed training can be launched directly through AzureML. See the <a href=\"https:\/\/github.com\/Azure\/azureml-examples\/tree\/main\/python-sdk\/workflows\/train\/deepspeed\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >official AzureML guide<\/a>.<\/li>\n<\/ul>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Plan<\/th><th>Price<\/th><th>Applicable Scenarios<\/th><\/tr><\/thead><tbody><tr><td><strong>Open-source local deployment<\/strong><\/td><td>Free<\/td><td>Internal or research data, customizable environments<\/td><\/tr><tr><td><strong>AI training on Azure<\/strong><\/td><td>Billed by cloud resource usage<\/td><td>Elastic scaling, high-performance clusters, large-scale production<\/td><\/tr><tr><td><strong>Community support and enterprise partnerships<\/strong><\/td><td>Fees for some commercial services<\/td><td>Customized technical support, industry-grade continuous integration<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>DeepSpeed charges no separate license or usage fee. For community and enterprise services, see the <a href=\"https:\/\/www.deepspeed.ai\/support\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >DeepSpeed community<\/a>.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">How to Use DeepSpeed<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\">Install<\/h3>\n\n\n\n<p>DeepSpeed supports Linux, Windows, and mainstream GPU architectures (including NVIDIA, AMD, and Intel), and installs with a single pip command:<\/p>\n\n\n\n<!--wp-compress-html--><!--wp-compress-html no compression-->\n<pre class=\"wp-block-code\"><code>pip install 
deepspeed<\/code><\/pre>\n<!--wp-compress-html no compression--><!--wp-compress-html-->\n\n\n\n<p>For details on supported environments and accelerators, see the <a href=\"https:\/\/www.deepspeed.ai\/tutorials\/accelerator-setup-guide\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >device compatibility list<\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Quickly Integrating Training\/Inference Code<\/h3>\n\n\n\n<p>Taking PyTorch as an example, it takes three steps:<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">1. Configure DeepSpeed (ds_config.json)<\/h4>\n\n\n\n<!--wp-compress-html--><!--wp-compress-html no compression-->\n<pre class=\"wp-block-code\"><code>{\n  \"train_batch_size\": 8,\n  \"gradient_accumulation_steps\": 1,\n  \"optimizer\": {\"type\": \"Adam\", \"params\": {\"lr\": 0.00015}},\n  \"fp16\": {\"enabled\": true},\n  \"zero_optimization\": {\"stage\": 1}\n}<\/code><\/pre>\n<!--wp-compress-html no compression--><!--wp-compress-html-->\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1595\" height=\"918\" src=\"https:\/\/aicats.wiki\/wp-content\/uploads\/2025\/08\/image-214.png\" alt=\"Device compatibility list\" class=\"wp-image-23632\"\/><figcaption class=\"wp-element-caption\">Photo\/<a href=\"https:\/\/www.deepspeed.ai\/tutorials\/accelerator-setup-guide\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >Device compatibility list<\/a><\/figcaption><\/figure>\n\n\n\n<h4 class=\"wp-block-heading\">2. 
Initialize the DeepSpeed engine (training example)<\/h4>\n\n\n\n<!--wp-compress-html--><!--wp-compress-html no compression-->\n<pre class=\"wp-block-code\"><code>import deepspeed\n\n# cmd_args: parsed command-line arguments; model: a torch.nn.Module; params: its trainable parameters\nmodel_engine, optimizer, _, _ = deepspeed.initialize(args=cmd_args, model=model, model_parameters=params)<\/code><\/pre>\n<!--wp-compress-html no compression--><!--wp-compress-html-->\n\n\n\n<h4 class=\"wp-block-heading\">3. Launch distributed training<\/h4>\n\n\n\n<!--wp-compress-html--><!--wp-compress-html no compression-->\n<pre class=\"wp-block-code\"><code>deepspeed --num_gpus=4 &lt;train_script.py&gt; --deepspeed --deepspeed_config ds_config.json<\/code><\/pre>\n<!--wp-compress-html no compression--><!--wp-compress-html-->\n\n\n\n<p>For detailed usage, see the <a href=\"https:\/\/www.deepspeed.ai\/getting-started\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >Official Getting Started Tutorial<\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">One-Click Trial in the Cloud<\/h3>\n\n\n\n<p>Microsoft Azure provides DeepSpeed templates through <a href=\"https:\/\/github.com\/Azure\/azureml-examples\/issues\" title=\"\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >AzureML<\/a>, suited to large-scale AI training on elastic cloud resources.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Native Hugging Face and Lightning Integration<\/h3>\n\n\n\n<p>A single command plus a config file is enough to run Transformers and PyTorch Lightning jobs with DeepSpeed acceleration; see the <a href=\"https:\/\/huggingface.co\/docs\/transformers\/deepspeed\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >HF integration tutorial<\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1595\" height=\"918\" 
src=\"https:\/\/aicats.wiki\/wp-content\/uploads\/2025\/08\/image-215-1.jpg\" alt=\"HF integration tutorial\" class=\"wp-image-23636\"\/><figcaption class=\"wp-element-caption\">Photo\/<a href=\"https:\/\/huggingface.co\/docs\/transformers\/deepspeed\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >HF integration tutorial<\/a><\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">Who DeepSpeed Is For<\/h2>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>Role<\/th><th>Value<\/th><\/tr><\/thead><tbody><tr><td><strong>AI R&amp;D teams<\/strong><\/td><td>Train models with hundreds of billions of parameters at much lower cost and development effort<\/td><\/tr><tr><td><strong>AI startups<\/strong><\/td><td>Use open-source cluster tooling to reach industry-leading model training with a low barrier to entry<\/td><\/tr><tr><td><strong>Academic research teams<\/strong><\/td><td>Scale up paper-grade experiments and advance large-model theory and new algorithms<\/td><\/tr><tr><td><strong>Cloud providers &amp; large tech companies<\/strong><\/td><td>Support high-concurrency, low-latency large-model inference for SaaS\/PaaS AI services<\/td><\/tr><tr><td><strong>DL framework developers<\/strong><\/td><td>Integrate distributed optimization and model compression deeply, with an open framework for custom optimizations and plugins<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>Suitable for any company or individual that needs high-performance, low-cost, scalable <strong>AI model training<\/strong>.<\/p>\n\n\n\n<h2 
class=\"wp-block-heading\">DeepSpeed Integrations and Ecosystem<\/h2>\n\n\n\n<p><strong>DeepSpeed has an open and rich ecosystem:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Seamless integration with mainstream DL frameworks<\/strong>: Hugging Face Transformers, Accelerate, PyTorch Lightning, MosaicML, and others connect with minimal setup.\n<ul class=\"wp-block-list\">\n<li>For examples, see the <a href=\"https:\/\/www.deepspeed.ai\/integrations\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >integrations documentation<\/a>.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Cloud-native and local deployment<\/strong>: official deployment examples exist for platforms such as Azure and Kubernetes.<\/li>\n\n\n\n<li><strong>Standard tooling for mainstream large models<\/strong>: MT-530B, BLOOM, Jurassic-1, GLM, GPT-NeoX, and others were trained or served with DeepSpeed.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Frequently Asked Questions<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\">What types of distributed AI training does DeepSpeed support?<\/h3>\n\n\n\n<p>DeepSpeed supports the mainstream training paradigms: data parallelism, model parallelism, pipeline parallelism, tensor parallelism, and expert parallelism. Combined with algorithms such as ZeRO, it covers scenarios from a single multi-GPU machine to clusters of a thousand or more GPUs and greatly eases GPU memory bottlenecks.<\/p>\n\n\n\n<ul 
class=\"wp-block-list\">\n<li>For more on ZeRO and distributed training, see the <a href=\"https:\/\/www.deepspeed.ai\/training\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >official training technology page<\/a>.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">Is DeepSpeed suitable for small and mid-sized teams on a limited budget?<\/h3>\n\n\n\n<p>Yes. DeepSpeed's memory management and distributed strategies let small and mid-sized teams use off-the-shelf single-node servers or low- to mid-range cloud GPUs to train very large models, work that previously only the largest AI companies could do. Its heterogeneous use of CPU memory and NVMe storage further lowers the need for high-end hardware. For deployment, see the <a href=\"https:\/\/www.deepspeed.ai\/getting-started\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >resource configuration documentation<\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1595\" height=\"918\" src=\"https:\/\/aicats.wiki\/wp-content\/uploads\/2025\/08\/image-215.png\" alt=\"Resource configuration documentation\" class=\"wp-image-23639\"\/><figcaption class=\"wp-element-caption\">Photo\/<a href=\"https:\/\/www.deepspeed.ai\/getting-started\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >Resource configuration documentation<\/a><\/figcaption><\/figure>\n\n\n\n<h3 
class=\"wp-block-heading\">How does DeepSpeed help with inference deployment and model compression?<\/h3>\n\n\n\n<p>DeepSpeed-Inference and DeepSpeed-Compression can fit models with tens of billions of parameters onto GPUs with 8 GB or more of memory, and they significantly improve inference concurrency and speed. Techniques such as ZeroQuant also produce quantized models, for example W4A8 (4-bit weights, 8-bit activations), at very low cost, which suits edge and low-bandwidth deployments. See the <a href=\"https:\/\/www.deepspeed.ai\/inference\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >inference technology page<\/a> and the <a href=\"https:\/\/www.deepspeed.ai\/compression\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" >compression features<\/a> for details and open-source tools.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p>With the era of large models fully under way, <strong>DeepSpeed has become a core engine for AI model training<\/strong>. Its openness and performance let everyone from beginners to AI giants build the models they need. Whether you work in AI research, industrial deployment, or cloud-native services, DeepSpeed is a strong ally for pushing AI's limits, improving efficiency, and controlling cost. Visit the <a href=\"https:\/\/www.deepspeed.ai\/\" target=\"_blank\"  rel=\"nofollow noopener\"  class=\"external\" 
>DeepSpeed website<\/a> to explore what it can do for your AI training.<\/p>","protected":false},"author":3,"comment_status":"open","ping_status":"closed","template":"","meta":{"_crsspst_to_aicatswiki":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0},"content_visibility":[262],"sitetag":[17,812,811,830],"favorites":[577],"class_list":{"0":"post-19891","1":"sites","2":"type-sites","3":"status-publish","4":"hentry","5":"sitetag-ai","9":"favorites-ai-models"},"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/sites\/19891","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/sites"}],"about":[{"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/types\/sites"}],"author":[{"embeddable":true,"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/comments?post=19891"}],"version-history":[{"count":2,"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/sites\/19891\/revisions"}],"predecessor-version":[{"id":23642,"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/sites\/19891\/revisions\/23642"}],"wp:attachment":[{"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/media?parent=19891"}],"wp:term":[{"taxonomy":"content_visibility","embeddable":true,"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/content_visibility?post=19891"},{"taxonomy":"sitetag","embeddable":true,"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/sitetag?post=19891"},{"taxonomy":"favorites","embeddable":true,"href":"https:\/\/aicats.wiki\/en\/wp-json\/wp\/v2\/favorites?post=19891"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}