amazon-web-services - 客户端错误 : train channel is not specified with AWS object_detection_augmented_manifest

amazon-web-services - 客户端错误 : train channel is not specified with AWS object_detection_augmented_manifest_training using ground truth images

转载作者：行者123 更新时间：2023-12-04 08:10:57

我已经完成了 AWS ground truth 中的标记工作，并开始研究用于对象检测的笔记本模板。

我有 2 个 list ，其中包含 293 个带标签的火车中鸟类图像和验证集，如下所示:

{"source-ref":"s3://XXXXXXX/Train/Blackbird_1.JPG","Bird-Label-Train":{"workerId":XXXXXXXX,"imageSource":{"s3Uri":"s3://XXXXXXX/Train/Blackbird_1.JPG"},"boxesInfo":{"annotatedResult":{"boundingBoxes":[{"width":1612,"top":841,"label":"Blackbird","left":1276,"height":757}],"inputImageProperties":{"width":3872,"height":2592}}}},"Bird-Label-Train-metadata":{"type":"groundtruth/custom","job-name":"bird-label-train","human-annotated":"yes","creation-date":"2019-01-16T17:28:23+0000"}}

以下是我为笔记本实例使用的参数:

training_params = \
{
    "AlgorithmSpecification": {
        "TrainingImage": training_image, # NB. This is one of the named constants defined in the first cell.
        "TrainingInputMode": "Pipe"
    },
    "RoleArn": role,
    "OutputDataConfig": {
        "S3OutputPath": s3_output_path
    },
    "ResourceConfig": {
        "InstanceCount": 1,   
        "InstanceType": "ml.p3.2xlarge",
        "VolumeSizeInGB": 5
    },
    "TrainingJobName": job_name,
    "HyperParameters": { # NB. These hyperparameters are at the user's discretion and are beyond the scope of this demo.
         "base_network": "resnet-50",
         "use_pretrained_model": "1",
         "num_classes": "1",
         "mini_batch_size": "16",
         "epochs": "5",
         "learning_rate": "0.001",
         "lr_scheduler_step": "3,6",
         "lr_scheduler_factor": "0.1",
         "optimizer": "rmsprop",
         "momentum": "0.9",
         "weight_decay": "0.0005",
         "overlap_threshold": "0.5",
         "nms_threshold": "0.45",
         "image_shape": "300",
         "label_width": "350",
         "num_training_samples": str(num_training_samples)
    },
    "StoppingCondition": {
        "MaxRuntimeInSeconds": 86400
    },
 "InputDataConfig": [
    {
        "ChannelName": "train",
        "DataSource": {
            "S3DataSource": {
                "S3DataType": "AugmentedManifestFile", # NB. Augmented Manifest
                "S3Uri": s3_train_data_path,
                "S3DataDistributionType": "FullyReplicated",
                "AttributeNames": ["source-ref","Bird-Label-Train"] # NB. This must correspond to the JSON field names in your augmented manifest.
            }
        },
        "ContentType": "image/jpeg",
        "RecordWrapperType": "None",
        "CompressionType": "None"
    },
    {
        "ChannelName": "validation",
        "DataSource": {
            "S3DataSource": {
                "S3DataType": "AugmentedManifestFile", # NB. Augmented Manifest
                "S3Uri": s3_validation_data_path,
                "S3DataDistributionType": "FullyReplicated",
                "AttributeNames": ["source-ref","Bird-Label"] # NB. This must correspond to the JSON field names in your augmented manifest.
            }
        },
        "ContentType": "image/jpeg",
        "RecordWrapperType": "None",
        "CompressionType": "None"
    }
]

我最终会在运行我的 ml.p3.2xlarge 实例后打印出这个:

InProgress Starting
InProgress Starting
InProgress Starting
InProgress Training
Failed Failed

后跟此错误消息:“ClientError:未指定火车 channel 。”

有没有人想过如何让它无错误地运行？非常感谢任何帮助!

成功运行:下面是使用的参数，以及成功运行的增强 list JSON 对象。

training_params = \
{
    "AlgorithmSpecification": {
        "TrainingImage": training_image, # NB. This is one of the named constants defined in the first cell.
        "TrainingInputMode": "Pipe"
    },
    "RoleArn": role,
    "OutputDataConfig": {
        "S3OutputPath": s3_output_path
    },
    "ResourceConfig": {
        "InstanceCount": 1,   
        "InstanceType": "ml.p3.2xlarge",
        "VolumeSizeInGB": 50
    },
    "TrainingJobName": job_name,
    "HyperParameters": { # NB. These hyperparameters are at the user's discretion and are beyond the scope of this demo.
         "base_network": "resnet-50",
         "use_pretrained_model": "1",
         "num_classes": "3",
         "mini_batch_size": "1",
         "epochs": "5",
         "learning_rate": "0.001",
         "lr_scheduler_step": "3,6",
         "lr_scheduler_factor": "0.1",
         "optimizer": "rmsprop",
         "momentum": "0.9",
         "weight_decay": "0.0005",
         "overlap_threshold": "0.5",
         "nms_threshold": "0.45",
         "image_shape": "300",
         "label_width": "350",
         "num_training_samples": str(num_training_samples)
    },
    "StoppingCondition": {
        "MaxRuntimeInSeconds": 86400
    },
    "InputDataConfig": [
        {
            "ChannelName": "train",
            "DataSource": {
                "S3DataSource": {
                    "S3DataType": "AugmentedManifestFile", # NB. Augmented Manifest
                    "S3Uri": s3_train_data_path,
                    "S3DataDistributionType": "FullyReplicated",
                    "AttributeNames": attribute_names # NB. This must correspond to the JSON field names in your **TRAIN** augmented manifest.
                }
            },
            "ContentType": "application/x-recordio",
            "RecordWrapperType": "RecordIO",
            "CompressionType": "None"
        },
        {
            "ChannelName": "validation",
            "DataSource": {
                "S3DataSource": {
                    "S3DataType": "AugmentedManifestFile", # NB. Augmented Manifest
                    "S3Uri": s3_validation_data_path,
                    "S3DataDistributionType": "FullyReplicated",
                    "AttributeNames": ["source-ref","ValidateBird"] # NB. This must correspond to the JSON field names in your **VALIDATION** augmented manifest.
                }
            },
            "ContentType": "application/x-recordio",
            "RecordWrapperType": "RecordIO",
            "CompressionType": "None"
        }
    ]
}

Training Augmented Manifest File 在训练作业运行时生成

Line 1
{"source-ref":"s3://XXXXX/Train/Blackbird_1.JPG","TrainBird":{"annotations":[{"class_id":0,"width":1613,"top":840,"height":766,"left":1293}],"image_size":[{"width":3872,"depth":3,"height":2592}]},"TrainBird-metadata":{"job-name":"labeling-job/trainbird","class-map":{"0":"Blackbird"},"human-annotated":"yes","objects":[{"confidence":0.09}],"creation-date":"2019-02-09T14:21:29.829003","type":"groundtruth/object-detection"}}


Line 2
{"source-ref":"s3://xxxxx/Train/Blackbird_2.JPG","TrainBird":{"annotations":[{"class_id":0,"width":897,"top":665,"height":1601,"left":1598}],"image_size":[{"width":3872,"depth":3,"height":2592}]},"TrainBird-metadata":{"job-name":"labeling-job/trainbird","class-map":{"0":"Blackbird"},"human-annotated":"yes","objects":[{"confidence":0.09}],"creation-date":"2019-02-09T14:22:34.502274","type":"groundtruth/object-detection"}}


Line 3
{"source-ref":"s3://XXXXX/Train/Blackbird_3.JPG","TrainBird":{"annotations":[{"class_id":0,"width":1040,"top":509,"height":1695,"left":1548}],"image_size":[{"width":3872,"depth":3,"height":2592}]},"TrainBird-metadata":{"job-name":"labeling-job/trainbird","class-map":{"0":"Blackbird"},"human-annotated":"yes","objects":[{"confidence":0.09}],"creation-date":"2019-02-09T14:20:26.660164","type":"groundtruth/object-detection"}}

然后我解压缩 model.tar 文件以获得以下文件:hyperparams.JSON、model_algo_1-0000.params 和 model_algo_1-symbol

hyperparams.JSON 看起来像这样:

{"label_width": "350", "early_stopping_min_epochs": "10", "epochs": "5", "overlap_threshold": "0.5", "lr_scheduler_factor": "0.1", "_num_kv_servers": "auto", "weight_decay": "0.0005", "mini_batch_size": "1", "use_pretrained_model": "1", "freeze_layer_pattern": "", "lr_scheduler_step": "3,6", "early_stopping": "False", "early_stopping_patience": "5", "momentum": "0.9", "num_training_samples": "11", "optimizer": "rmsprop", "_tuning_objective_metric": "", "early_stopping_tolerance": "0.0", "learning_rate": "0.001", "kv_store": "device", "nms_threshold": "0.45", "num_classes": "1", "base_network": "resnet-50", "nms_topk": "400", "_kvstore": "device", "image_shape": "300"}

最佳答案

不幸的是，image/jpeg 内容类型不支持带有 AugmentedManifestFile 的管道模式。为了能够使用此功能，您需要将 RecordWrapperType 指定为 RecordIO 并将 ContentType 指定为 application/x-recordio.

关于amazon-web-services - 客户端错误 : train channel is not specified with AWS object_detection_augmented_manifest_training using ground truth images，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/54171261/

文章推荐： ruby-on-rails-3 - Ransack或查询

文章推荐： amazon-web-services - 如何从 Glue Dev Endpoint 运行胶水脚本

文章推荐： d3.js - 在 d3js 上停止强制布局并开始自由拖动节点

service - start 和 service start 有什么区别
我正在使用 choronos，它建议使用 start/stop 命令开始停止，如下所示开始计时停止计时但是，我正在编写 puppet manifest，它只适用于下面的服务命令。服务计时开始
Services.exe是什么进程？Services.exe病毒吗？Services.exe占CPU情况
来历及作用 services.exe进程程序文件是由微软公司为其发布的Windows操作系统定义的一个系统进程，常见于Windows 2000/XP/Vista/2007等系统中，被描述为服务和控
windows-services - Installutil不会卸载: “The specified service does not exist as an installed service”
我一直在尝试使用installutil:installutil /u GSIS.FileMoverService.exe安装Windows服务。我得到的输出是: Uninstalling assem
service-worker - 在一个域中推荐一个顶级 Service Worker 或多个 Service Worker？
如果一个域有多个团队和多个 Web 应用程序，那么注册 Service Worker 来管理整个站点的最佳建议是什么？具有范围的顶级服务 worker /或子域中的多个服务 worker ？由于一个域
java - org.jboss.msc.service.ServiceNotFoundException : Service service jboss. 找不到 ejb.default-resource-adapter-name-service
我已经在 eclipse 中创建了企业项目。动态web项目和ejb项目对企业项目有借鉴意义。当我运行管理员(企业项目)运行时选择 wildfly 服务器 18。我收到以下错误。谁能告诉我我错过了什么。
service - 类 javax.xml.ws.Service 中的构造函数 Service 无法应用于给定类型
我已经使用 apache-cxf-2.7.4 创建了一个 Web 服务。我进入了我的项目中制作的类(class)。我的项目中的库是: math3-commons-3.2.jar XStream-1.4
windows-services - AppFabric缓存错误:The AppFabric Caching Service service terminated unexpectedly
我在域中的 Virtual Box 中运行集群计算机，默认情况下服务在 Network 服务下运行，服务一直停止，事件日志中出现以下错误。请从下面的错误日志中查找错误详细信息。任何帮助都会很棒。 L
c# - 用于用户表示的 Service Fabric Service 与 Service Fabric Actors
在我的应用程序中，用户可以在 map 上发布事件。应用程序的入口点是一个无状态的 web api 服务。为了在内部代表用户，我想要一个用户服务。我应该何时使用 Reliable Stateful Ac
service - "Service failed to start - Verify that you have sufficient privileges to start system services"
当我尝试运行在WIX中创建的安装程序时，出现以下错误消息: “服务'Report Generator Service'(报告生成器服务)无法启动。请验证您是否具有启动系统服务的足够特权”。我已经在这
amazon-web-services - AWS ECS : Invalid service in ARN (Service: AmazonECS; . ..)
尝试使用 cloudformation 创建 ECS 服务(在 Fargate 上)但出现错误: Invalid service in ARN (Service: AmazonECS; Status
windows-services - 如何以编程方式停止Windows Service？
我正在编写一个简单的Windows服务，该服务每个月向所有员工发送一封电子邮件。我的问题是，完成后如何停止自我？我是该领域的新手，请帮帮我。非常感谢。它将部署在服务器上以每月运行。我没有开始做这件事
service-worker - 从 Service Worker 中获取 Service Worker id 或 date
有谁知道是否有办法在 service worker 中获取此号码或日期: 将我的服务 worker 缓存命名为 cache-1182 会很方便或 cache-20171127171448 我想在安装事
powershell - 启动服务: Failed to start service 'Microsoft Service Fabric Host Service (FabricHostSvc)'
我想开始使用 Azure Service Fabric 技术。我按照this document工作并安装最新的SDK。安装后，我打开 PowerShell(“以管理员身份运行”)命令行窗口并写入这些
ruby-on-rails - PG::UndefinedTable: 错误:关系 "services"不存在 LINE 1: SELECT "services".* FROM "services"
我在使用 whenever gem 时遇到了一些问题。我创建了一个 rake 任务，当我自己启动它时它工作得很好但是当我在日志中收到以下消息时尝试自动执行它: ActiveRecord::Statem
azure-service-fabric - "HTTP Error 503. The service is unavailable"与 Service Fabric 上的 WebListener 共享端口
我想在 service fabric 集群中为两个不同的 web 应用程序(webpi/website)共享 http/80 端口，应用程序必须有 2 个不同的主机名: mywebapi.com 和
java - org.hibernate.service.UnknownServiceException : Unknown service requested [org. hibernate.ogm.service.impl.OgmConfigurationService]
我创建了一个使用 MongoDB 实现 hibernate OGM 的应用程序。它在 Eclipse 中运行得很好，但是，当我构建一个 fat jar 并尝试运行它时，出现以下错误: Exceptio
Python Selenium 异常 AttributeError : "' Service' object has no attribute 'process' "in selenium. webdriver.ie.service.Service
我有一个 Selenium Python 测试套件。它开始运行，但几分钟后抛出以下错误: Exception AttributeError: "'Service' object has no attr
service - Centos 7 - 来自/etc/systemd/system/san.service 的服务未使用 systemctl start san.service 运行
我按照此链接的说明进行操作:https://www.thegeekdiary.com/centos-rhel-7-how-to-make-custom-script-to-run-automatica
web-services - JAVA JAX-WS NullPointerException 在 javax.xml.ws.Service.getPort(Service.java :188)
我在 ubuntu 下的 jboss 上部署了简单的“HelloWorld”Web 服务。我创建了简单的客户端，但我无法让它工作。每次运行客户端时，我都会收到 NullPointerExceptio
service-worker - Service Worker 中未触发定期同步
我正在尝试为我的网站使用后台定期同步。我正在使用 localhost 并在 1*1000 毫秒时注册 periodicsync 事件，但这根本不会触发。我看过这个demo ，但即使我将该网站安装为应

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

amazon-web-services - 客户端错误 : train channel is not specified with AWS object_detection_augmented_manifest_training using ground truth images