GenAIModel

Params

A human readable description of the object.

description to be done

description to be done

An instance of GPUServer.

generative ai model nb of bits per parameter from hypothesis in dimensionless.

generative ai model ratio between gpu memory footprint and model size from ecologits in dimensionless.

gpu latency per active parameter and output token from ecologits in second.

base gpu latency per output_token from ecologits in second.

number of bits per token from hypothesis in dimensionless.

ExplainableQuantity in dimensionless, representing the open-mistral-7b from mistralai nb of active parameters from ecologits.

Example value: 7300000000.0 dimensionless

Depends directly on:

through the following calculations:

ExplainableQuantity in dimensionless, representing the open-mistral-7b from mistralai total nb of parameters from ecologits.

Example value: 7300000000.0 dimensionless

Depends directly on:

through the following calculations:

ExplainableQuantity in gigabyte, representing the generative ai model base ram consumption.

Example value: 17.52 gigabyte

Depends directly on:

through the following calculations: