Storage-based Execution

The low-level batch execution API lets you start an execution from a specific remote path and write the results to another path.

Each file in the input path is translated into a single unit of processing (a JFrog ML task): the file is read and passed as a single chunk to the model's predict function.

⚠️

Warning - Task Processing

Because each file becomes a single prediction request, ensure that the model is deployed on an instance with sufficient resources to handle the batch request.

Where batch predictions are resource-heavy, consider splitting the input dataset into multiple smaller files.
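As a minimal sketch (not part of the JFrog ML SDK), a large CSV can be pre-split into smaller input files before uploading, so that each batch task fits the instance's resources. The chunk size and paths here are assumptions to tune for your data:

```python
import csv
import itertools
import os

def split_csv(src: str, dest_dir: str, rows_per_file: int) -> int:
    """Split src into smaller CSVs, repeating the header in each part.

    Returns the number of part files written.
    """
    os.makedirs(dest_dir, exist_ok=True)
    with open(src, newline="") as f:
        reader = csv.reader(f)
        header = next(reader)
        part = 0
        while True:
            # Read the next slice of rows without loading the whole file.
            rows = list(itertools.islice(reader, rows_per_file))
            if not rows:
                break
            part_path = os.path.join(dest_dir, f"part_{part:04d}.csv")
            with open(part_path, "w", newline="") as out:
                writer = csv.writer(out)
                writer.writerow(header)
                writer.writerows(rows)
            part += 1
    return part
```

Each resulting part file then becomes its own task, keeping any single prediction request small.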

Execution Configuration

| Parameter | Description | Default Value |
| --- | --- | --- |
| Model ID [Required] | The model ID, as displayed on the model header. | |
| Build ID | The JFrog ML-assigned build ID. Optionally set this to run the execution against a different build. | |
| Bucket | The source and destination bucket. Use this when reading and writing to the same bucket. Note: this parameter is required if Source Bucket and Destination Bucket are not set. | |
| Source Bucket | The bucket from which the input files are read. Note: this parameter is required if Bucket is not set. | |
| Destination Bucket | The bucket into which the execution output files are written. Note: this parameter is required if Bucket is not set. | |
| Source Folder [Required] | The path in the source bucket where the inference files are located. | |
| Destination Folder [Required] | The path in the destination bucket where the result files are stored. | |
| Input File Type | The input file format. Supported formats: CSV, Parquet, and Feather. | CSV |
| Output File Type | The output file format. Supported formats: CSV, Parquet, and Feather. | CSV |
| Access Token Name | The name of the secret (created using the Secret Service) that contains the access token with permission to the source and destination buckets. | |
| Access Secret Name | The name of the secret (created using the Secret Service) that contains the access secret with permission to the source and destination buckets. | |
| Job Timeout | The job timeout, in seconds. Limits the total execution time; the execution fails if it does not complete in time. | 0 (no timeout) |
| File Timeout | A single-file timeout, in seconds. Limits the processing time of a single file; the entire execution fails if any file does not finish in time. | 0 (no timeout) |
| IAM Role ARN | A user-provided custom AWS IAM role. | None |
| Pods | The number of Kubernetes pods used at batch inference time, which sets the maximum parallelism for the job. Each pod handles one or more files (tasks). This configuration takes precedence over the deployed model configuration. | The number of executors/pods defined in the deployment |
| Instance | The instance type used to run the batch operation. | Small |
| Parameters | Key-value pairs passed to the execution request. They are passed to the inference container as environment variables. | |
| Service Account Key Secret Name | The name of the GCP service account key secret used to access Google Cloud. | None |
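Since values passed through the Parameters option surface inside the inference container as environment variables, a model's predict function can read them at runtime. A hedged sketch, where THRESHOLD is a hypothetical parameter name and the record layout is an assumption:

```python
import os

def predict(records):
    # records: rows parsed from one input file, e.g. [{"score": 0.8}, ...]
    # THRESHOLD is a hypothetical key passed via the `parameters` option;
    # it reaches the inference container as an environment variable.
    threshold = float(os.environ.get("THRESHOLD", "0.5"))
    return [{**r, "prediction": r["score"] > threshold} for r in records]
```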

Running Batch Execution Using S3 Buckets

Currently, only S3 buckets located in the region configured for your JFrog ML environment are supported for input and output paths.

To use a different IAM role ARN to grant permissions to an S3 location, include the trust policy below. For customized parameters, please reach out to our support team.

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Principal": {
                "Federated": "arn:aws:iam::<AWS_ACCOUNT_ID>:oidc-provider/oidc.eks.us-east-1.amazonaws.com/id/<OIDC_EKS_CLUSTER_ID>"
            },
            "Action": "sts:AssumeRoleWithWebIdentity",
            "Condition": {
                "StringEquals": {
                    "oidc.eks.us-east-1.amazonaws.com/id/<OIDC_PROVIDER_EKS_CLUSTER_ID>:aud": "sts.amazonaws.com"
                },
                "ForAnyValue:StringEquals": {
                    "oidc.eks.us-east-1.amazonaws.com/id/<OIDC_PROVIDER_EKS_CLUSTER_ID>:sub": [
                        "system:serviceaccount:qwak:kube-deployment-captain-access"
                    ]
                }
            }
        }
    ]
}

Using Custom AWS IAM Role

A custom IAM role grants access to both private buckets and the source and destination folders. Provide the custom IAM role ARN when calling a batch execution.

The IAM role should be created with the following trust policy:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": {
        "AWS": "arn:aws:iam::<account-id>:root"
      },
      "Action": "sts:AssumeRole",
      "Condition": {
        "ArnLike": {
          "aws:PrincipalArn": "arn:aws:iam::<account-id>:role/qwak-eks-base*"
        }
      }
    }
  ]
}

Batch Execution

To start an execution from the SDK:

from frogml.core.clients.batch_job_management.client import BatchJobManagerClient
from frogml.core.clients.batch_job_management.results import StartExecutionResult
from frogml.core.clients.batch_job_management.executions_config import ExecutionConfig

# The execution configuration
execution_spec = ExecutionConfig.Execution(
    model_id=<model-id>,
    source_bucket=<source-bucket-name>,
    destination_bucket=<destination-bucket-name>,
    source_folder=<source-folder-path>,
    destination_folder=<destination-folder-path>,
    input_file_type=<input-file-type>,
    output_file_type=<output-file-type>,
    access_token_name=<access_token_name>,
    access_secret_name=<access-secret-name>,
    job_timeout=<job-timeout>,
    file_timeout=<file-timeout>,
    parameters=<dictionary of user-provided parameters>
)

resources_config = ExecutionConfig.Resources(
    pods=<number-of-pods>,
    instance=<instance-type>,
)

execution_config = ExecutionConfig(execution=execution_spec, resources=resources_config)
batch_job_manager_client = BatchJobManagerClient()

execution_result: StartExecutionResult = batch_job_manager_client.start_execution(execution_config)
execution_id = execution_result.execution_id
To start the same execution from the CLI:

frogml models execution start \
    --model-id <model-id> \
    --source-bucket <source-bucket-name> \
    --source-folder <source-folder-path> \
    --destination-bucket <destination-bucket-name> \
    --destination-folder <destination-folder-path> \
    --input-file-type <input-file-type> \
    --output-file-type <output-file-type> \
    --access-token-name <buckets-access-token-secret-name> \
    --access-secret-name <buckets-access-secret-secret-name> \
    --job-timeout <entire-job-timeout-in-seconds> \
    --file-timeout <single-file-timeout-in-seconds> \
    --pods <pods-count> \
    --cpus <cpus-fraction> \
    --memory <memory-size> \
    --build-id <alternate-build-id>

Here is a simplified version that relies on the default values:

from frogml.core.clients.batch_job_management.client import BatchJobManagerClient
from frogml.core.clients.batch_job_management.results import StartExecutionResult
from frogml.core.clients.batch_job_management.executions_config import ExecutionConfig

# The execution configuration
execution_spec = ExecutionConfig.Execution(
    model_id=<model-id>,
    bucket=<bucket-name>,
    source_folder=<source-folder-path>,
    destination_folder=<destination-folder-path>,
    access_token_name=<access_token_name>,
    access_secret_name=<access-secret-name>
)

execution_config = ExecutionConfig(execution=execution_spec)
batch_job_manager_client = BatchJobManagerClient()

execution_result: StartExecutionResult = batch_job_manager_client.start_execution(execution_config)
execution_id = execution_result.execution_id
To start the simplified execution from the CLI:

frogml models execution start \
    --model-id <model-id> \
    --bucket <bucket-name> \
    --source-folder <source-folder-path> \
    --destination-folder <destination-folder-path> \
    --access-token-name <buckets-access-token-secret-name> \
    --access-secret-name <buckets-access-secret-secret-name>

Batch Job Parallelism

Each file in the input path is treated as a single task, which is the main unit of parallelism.

For example, if you request five pods (whether specified by the deployment or the job itself) to process 10 files, five files (or tasks) will run in parallel. Consequently, requesting more pods than the total number of input files is unnecessary.
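The pod-and-file arithmetic above can be sketched as follows (an illustration, not SDK code):

```python
import math

def processing_waves(num_files: int, num_pods: int) -> int:
    # Each pod processes one file (task) at a time, so up to num_pods
    # tasks run in parallel and the job completes in ceil(files / pods)
    # sequential "waves" of work.
    return math.ceil(num_files / num_pods)

# 10 input files on 5 pods run as 2 waves of 5 parallel tasks;
# requesting more than 10 pods would leave the extra pods idle.
```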

📘

Note - Concurrent Executions

You can run multiple executions concurrently, with one limitation: To prevent redundancy, you cannot run executions with identical values for the following parameters:

  • Model ID
  • Build ID
  • Source Bucket
  • Source Folder
  • Destination Bucket
  • Destination Folder

Switching Between On-demand and Spot Instances

To override the instance type for a specific batch execution, specify the purchase_option (either "on-demand" or "spot") within advanced_options.

If you do not specify this option, the execution defaults to the configuration defined in the currently deployed model.

from frogml.core.clients.batch_job_management.executions_config import ExecutionConfig

# Using an on-demand instance for this execution
execution_spec = ExecutionConfig.Execution(
    ...
    advanced_options=ExecutionConfig.AdvancedOptions(
      purchase_option="on-demand",
    )
)

# Using a spot instance for this execution
execution_spec = ExecutionConfig.Execution(
    ...
    advanced_options=ExecutionConfig.AdvancedOptions(
      purchase_option="spot",
    )
)

Local File Mode

It is also possible to run a batch execution using files stored locally.

SDK

Local file mode can be started using the local_file_run function from the BatchInferenceClient:

from frogml_inference.batch_client.batch_client import BatchInferenceClient

client = BatchInferenceClient()

client.local_file_run(
  model_id=...,
  source_folder=...,
  destination_folder=...,
  input_file_type=...,
  output_file_type=...,
  job_timeout=...,
  task_timeout=...,
  executors=...,
  instance=...,
  iam_role_arn=...,
  build_id=...,
  parameters=...,
)

Parameters have the same meaning as in the execution configuration above. The source and destination folder paths must start with the file:// prefix.

You must also provide the input file type that will be used to select files from the source folder (by file extension).
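For illustration only, selecting input files by extension might look like the following sketch (select_input_files is a hypothetical helper, not part of the SDK):

```python
from pathlib import Path

def select_input_files(source_folder: str, input_file_type: str) -> list:
    # In local file mode the folder carries a file:// prefix; strip it
    # before globbing for files with the requested extension.
    root = Path(source_folder.removeprefix("file://"))
    return sorted(p.name for p in root.glob(f"*.{input_file_type.lower()}"))
```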

❗️

Important - Local File Mode Parameters

Remember that the source and destination folder paths MUST start with the file:// prefix.

model_id, source_folder, destination_folder, and input_file_type are required parameters.

❗️

Important - Dependencies

The local file mode requires additional dependencies that can be installed using pip install "frogml-inference[batch,feedback]"

If the dependencies are missing, the SDK will display the following error: Notice that BatchInferenceClient and FeedbackClient are not available in the skinny package. In order to use them, please install them as extras: pip install "frogml-inference[batch,feedback]".

⚠️

Warning - Output Directory

If the destination directory doesn't exist, it will be created.

If the destination directory exists and contains files with the same names as the ones created by the batch job, those files WILL BE OVERWRITTEN!

CLI

Alternatively, the same local file mode can be run using the CLI.

In this case, the frogml models execution start command is run with the source folder and destination folder beginning with the file:// prefix.

The meaning of the other parameters is the same as in the execution configuration above. Local file mode started from the CLI requires the same parameters and dependencies as local file mode started from the SDK. (See the warnings above.)

Example: frogml models execution start --model-id the_model_id --source-folder file://path_to_a_directory --destination-folder file://path_to_output_directory --input-file-type csv

REST API

Use the following curl command template to send requests to the Batch Job Manager. Replace <your-environment> with the name of your account, which can be found in the bottom left corner of the JFrog ML Dashboard, and fill in the batch job details as per your needs.

curl --location --request POST 'https://grpc.<your-environment>.qwak.ai/api/v1/batch-job/start-job' \
--header 'Content-Type: application/json' \
--header "Authorization: Bearer $JFROG_TOKEN" \
--data '{
    "model_id": "<your_model_id>",
    "source_bucket": "...",
    "destination_bucket": "...",
    "source_folder": "batch_execution/my_model/input",
    "destination_folder": "batch_execution/my_model/output",
    "batch_job_deployment_size": {
        "number_of_pods": 4,
        "cpu": 1.0,
        "memory_amount": 512,
        "memory_units": "MIB"
    },
    "input_file_type": "PARQUET_INPUT_FILE_TYPE",
    "output_file_type": "PARQUET_OUTPUT_FILE_TYPE",
    "advanced_deployment_options": {
        "custom_iam_role_arn": "..."}
}'

Ensure the header is enclosed in double quotes (") when passing the JFROG_TOKEN as an environment variable.

Example Output:

{
  "batch_id": "<some_execution_id>",
  "success": true,
  "failure_message": ""
}
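The same request can be sent from Python with only the standard library. This is a hedged sketch mirroring the curl template above; build_start_job_request is a hypothetical helper, and you must supply your own environment name, token, and payload:

```python
import json
import urllib.request

def build_start_job_request(environment: str, token: str, payload: dict) -> urllib.request.Request:
    # Mirrors the curl template: POST the batch job spec as JSON with a
    # bearer token to the Batch Job Manager endpoint.
    url = f"https://grpc.{environment}.qwak.ai/api/v1/batch-job/start-job"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )

# request = build_start_job_request("my-environment", my_token, {"model_id": "..."})
# response = json.load(urllib.request.urlopen(request))
```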

📘

Note

Before executing the curl command, ensure the JFROG_TOKEN environment variable is correctly set in your terminal with the value of your generated JFrog ML Token. This step is crucial for authenticating your requests to the JFrog ML Platform.