
Cloud

Managed Transformer Inference

  • High-performance, low-latency model inference.

Now Available

Positron Performance and Efficiency Advantages in Software V1.x

Software Release | Models Benchmarked | Relative Performance | Performance per Watt Advantage | Performance per $ Advantage | Confidence
V1.0 (August 2024) | Mixtral 8x7B | 0.65* | 2.1 | 1.5 | Measured
V1.1 (September 2024) | Mixtral 8x7B, Llama 3.1 70B | 1.1* | 3.9 | 2.6 | In development, measured
* Nvidia performance is based on vLLM 0.5.4 for Mixtral 8x7B, Llama 3.1 8B, and Llama 3.1 70B.
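
Read together, the relative-performance and per-watt columns imply a power figure. A rough back-of-the-envelope reading, assuming "Relative Performance" means throughput normalized to the Nvidia baseline named in the footnote:

# V1.0 row: implied power draw relative to the Nvidia baseline
# (assumption: "Relative Performance" = Positron throughput / Nvidia throughput)
relative_performance = 0.65        # Positron / Nvidia throughput
perf_per_watt_advantage = 2.1      # (Positron perf/W) / (Nvidia perf/W)
power_ratio = relative_performance / perf_per_watt_advantage
print(round(power_ratio, 2))       # ~0.31, i.e. roughly a third of the baseline power

The V1.1 row (1.1 and 3.9) gives about 0.28 by the same arithmetic.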

Software & Systems Overview

  • Chat interface example

  • GitHub

  • OpenAI-compatible LLM API (see the request sketch at the end of this section)

  • Load balancer and scheduler

  • Transformer engine

  • Configurable accelerator with field updates

[System diagram: network switch, system software server, and eight Atlas systems]
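
Because the LLM API follows the OpenAI chat-completions convention, it can be exercised over plain HTTP as well as through the official SDK. A minimal sketch using Python's requests library; the host matches the client example later on this page, while the "/v1/chat/completions" path and bearer-token authentication are assumptions based on the OpenAI convention:

import requests

# OpenAI-style chat completion against Positron's endpoint
# (URL path and auth scheme assumed, not confirmed by this page)
resp = requests.post(
    "https://api.positron.ai/v1/chat/completions",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    json={
        "model": "mixtral8x7b",
        "messages": [{"role": "user", "content": "Hello!"}],
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])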

Positron Atlas Hardware

[Atlas block diagram: host CPU with system memory, network and scale-up IO, and a transformer engine comprising an AI math accelerator with dedicated accelerator memory]

Every Transformer Runs on Positron

Supports all Transformer models seamlessly, with zero porting time and zero effort

Model Deployment on Positron in 4 Easy Steps

Positron maps any trained HuggingFace Transformers Library model directly onto hardware for maximum performance and ease of use

  1. Develop or procure a model using the Hugging Face Transformers library.

  2. Upload or link the trained model file (.pt or .safetensors) to the Positron Model Manager (see the export sketch after these steps).

  3. Update client applications to use Positron's OpenAI API-compliant endpoint.

  4. Issue API requests and get maximum performance.
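
For steps 1 and 2, any model built with the Hugging Face Transformers library can be exported as .safetensors weights for upload. A minimal sketch; the Mixtral checkpoint mirrors the Model Manager example below, and the output directory name is arbitrary:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mixtral-8x7B-Instruct-v0.1"

# Load (or fine-tune) the model with the Transformers library...
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# ...then save the weights in .safetensors format for the Positron Model Manager
model.save_pretrained("mixtral-8x7b-instruct", safe_serialization=True)
tokenizer.save_pretrained("mixtral-8x7b-instruct")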

[Model Manager diagram: models arrive as .pt or .safetensors files via drag-and-drop or file-browse upload, Google Cloud Storage (GCS), or Amazon S3, or by Hugging Face model ID (e.g., "mistralai/Mixtral-8x7B-Instruct-v0.1"); a REST API fronts the Model Manager, Model Loader, and HF Model Fetcher]

from openai import OpenAI

# Standard OpenAI SDK pointed at Positron's OpenAI-compatible endpoint
# (the "/v1" path follows the OpenAI convention and may differ)
client = OpenAI(base_url="https://api.positron.ai/v1", api_key="YOUR_API_KEY")

response = client.chat.completions.create(
    model="mixtral8x7b",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

OpenAI-compatible Python client
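
Because the endpoint is OpenAI-compatible, standard SDK features such as streaming should work unchanged. A minimal sketch; the base URL and model name are carried over from the example above, and server-side streaming support is assumed:

from openai import OpenAI

client = OpenAI(base_url="https://api.positron.ai/v1", api_key="YOUR_API_KEY")

# Stream tokens as they are generated (assumes the endpoint honors stream=True)
stream = client.chat.completions.create(
    model="mixtral8x7b",
    messages=[{"role": "user", "content": "Summarize the Transformer attention mechanism."}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)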
