Troubleshoot Models in AI Catalog

Troubleshoot model deployment and inference errors in JFrog AI Catalog, including validation failures, 401/404 responses, and NVIDIA NGC authentication.

Use this topic to diagnose common JFrog AI Catalog model errors during deployment, inference, access control, and NVIDIA NGC remote repository setup. For setup steps, see AI Catalog Quick Start. To allow models before deployment, see Discover and Allow Models.

Deployment and Validation Issues

Expand the accordion that matches your deployment or validation symptom.

Deploy model validate fails?

When model deployment validation fails, use the table to match your symptom to a cause and resolution.

SymptomCauseResolution
Local system does not recognize the frogml commandThe frogml command is not on your system PATHAdd frogml to your system PATH environment variable
Deployed model not supportedThe model is not supported for deployment in AI CatalogConfirm the model is on the supported models list in Discover and Allow Models

Inference Errors

Expand the accordion that matches your inference HTTP status code or symptom.

Execute inference fails with 404?

When inference returns HTTP 404, review the causes in the following table.

SymptomCauseResolution
Inference request after blockingInference was attempted after the model was blockedConfirm the model is not blocked for the project
Execute inference fails with 401?

When inference returns HTTP 401, review the causes in the following table. For example, the response might indicate a bad token or an incorrect model name.

SymptomCauseResolution
Bad tokenThe token value is invalid or expiredVerify the token value
Incorrect model nameThe model name is incorrect or does not match a known modelVerify the model name matches a known model in the project

Model Access and Connection Issues

Expand the accordion that matches your allow, block, or delete symptom.

Block model fails?

When blocking a model fails, review the causes in the following table.

SymptomCauseResolution
Block action rejectedThe model is still deployedUndeploy the model for the project, then block it
Delete connection fails?

When deleting a model connection fails, review the causes in the following table.

SymptomCauseResolution
Delete action rejectedThe model is still allowed or connectedBlock the model first, then delete the connection
Error allowing model usage?

When you allow model usage for an open source model, you might see an error similar to the following screenshot.

Curation error when allowing model usage

Review the causes in the following table.

SymptomCauseResolution
Allow action fails for open source modelCuration is not enabled for the repository or package typeTurn on Curation and enable the package type and repository. For more information, see Set Up Curation Settings for Model Packages

NVIDIA NGC Remote Repository Authentication

Expand the accordion if your Artifactory remote repository cannot authenticate against the NVIDIA NGC registry.

NVIDIA NGC remote repository authentication errors?

The Artifactory remote repository might fail to authenticate against the NVIDIA NGC registry. You might see errors such as Error Response from Daemon: ... Invalid username/password configured for Remote Docker repository or Can't fetch token for repo... proxy_auth.

Review the causes in the following table.

SymptomCauseResolution
Remote repository authentication failureThe NVIDIA NGC API key in the remote repository settings is missing or incorrectAsk your administrator to verify the NVIDIA NGC API key in the remote repository settings
Remote repository authentication failureThe API key is valid, but Governing Terms (EULA) were not accepted for this gated model on the NVIDIA NGC websiteLog in to the NVIDIA NGC Catalog, navigate to the specific model page, and select Accept Terms

Related Topics