AI Inference

ื”ืจื™ืฆื• ืžื•ื“ืœื™ AI ื‘ืชืฉืชื™ืช ื™ืฉืจืืœื™ืช

ื”ืกืงื” (Inference) ืฉืœ ืžื•ื“ืœื™ AI ืžื”ื™ืจื” ื•ืืžื™ื ื” ืขืœ GPU ื™ื™ืขื•ื“ื™ื™ื ื‘ืชืœ-ืื‘ื™ื‘. ืชืžื™ื›ื” ื‘-LLMs, Diffusion Models, ื•-Custom Models.

Features

GPU ื™ื™ืขื•ื“ื™ื™ื ืœืื™ื ืคืจื ืก

NVIDIA A100 ื•-H100 GPUs ืขื ื–ื™ื›ืจื•ืŸ ื’ื‘ื•ื” ืœืื™ื ืคืจื ืก ืžื”ื™ืจ.

ืžื•ื“ืœื™ื ืคื•ืคื•ืœืจื™ื™ื ืžื•ื›ื ื™ื

Llama, Mistral, Stable Diffusion ื•ืขื•ื“ โ€” ืžื•ื›ื ื™ื ืœื”ืจืฆื” ื‘ืœื—ื™ืฆื” ืื—ืช.

API ืชื•ืื OpenAI

Endpoint ืชื•ืื OpenAI API โ€” ืขื‘ืจื• ืžืžื•ื“ืœื™ OpenAI ืœืœื ืฉื™ื ื•ื™ ืงื•ื“.

ื ืชื•ื ื™ื ื ืฉืืจื™ื ื‘ื™ืฉืจืืœ

ื›ืœ ื”ื‘ืงืฉื•ืช ืžืขื•ื‘ื“ื•ืช ื‘ื“ืื˜ื”-ืกื ื˜ืจ ืชืœ-ืื‘ื™ื‘ ืœืขืžื™ื“ื” ื‘ืจื’ื•ืœืฆื™ื” ื™ืฉืจืืœื™ืช.

Pricing

Simple, transparent ILS pricing

Pay-as-you-go

โ‚ช0/mo

  • ื—ื™ื•ื‘ ืœืคื™ Token
  • ื—ื™ื•ื‘ ืœืคื™ GPU Minute
  • ืื™ืŸ ืžื™ื ื™ืžื•ื

Ready to get started?

Deploy AI Inference in under a minute

AI Inference | CloudMarket | CloudMarket