The trusted market for AI compute

Buy the right AI hardware. Run the right models.

Inference Exchange helps founders, agencies, and small teams choose, buy, set up, and run private AI compute. Every listing shows what it runs, what it doesn’t, and what setup we include.

Browse hardware · Book a consult

AI-ready SKUs

Edge → workstation

Models in fit matrix

Llama, Qwen, Phi, vision

Agent templates

Docs, coding, edge

Quote-first pricing

24h

Stock + build confirmed

Featured · LLM workstation

Demand signal · 7D

Premium demand

Flagship local LLM

RTX 5090 Local LLM Workstation

Premium local AI workstation for the highest-end consumer GPU class.

IX score

From

Request build quote

Run tier

Largest practical local workstation tier

Status

Supplier listed

Supplier listed · Setup supported · Estimated

What it runs

Qwen Coder 32B: Good
Llama 70B quantized: Good
Very large MoE models: Works with limits

Buy configured

Live signals

7D demand

Qwen Coder 32B

Coding

+18%

Llama 70B local

Open-weight

+11%

Edge vision models

Vision

+9%

Pi 5 16GB

Tiny local models, automation, edge services

Entry demand

Jetson Orin Nano

Edge vision, small generative AI, robotics

Edge rising

Mac mini M4

Small local models, dev workflows, hybrid agents

Starter demand

Private document agent

Business

+22%

Hermes agent runtime

Agent sandbox

+14%

Edge camera agent

Edge

+8%

Hot models

What people are actually trying to run

Demand for these models is what drives hardware recommendations on IX.

All models

Coding

Qwen Coder 32B

+18%

Hardware target

RTX 4090+

Best first SKU path is a 24GB+ workstation.

Find hardware

Open-weight

Llama 70B local

+11%

Hardware target

RTX 5090 / cloud fallback

Works locally with quantization; quote larger builds carefully.

Compare setups

Vision

Edge vision models

+9%

Hardware target

Jetson / Pi

Best fit is cameras, sensors, and robotics workflows.

Build edge kit

Trending hardware

AI-ready SKUs and configured builds

Every machine ships with a setup guide and a time-boxed 30-min consult.

Starter

Entry demand

Raspberry Pi 5 16GB

From

R2,999.90 inc VAT

IX score

Runs

Tiny local models, automation, edge services

Supplier listed · Setup supported

View build · Get quote

Edge kit

Edge rising

NVIDIA Jetson Orin Nano Super Developer Kit

From

$249 reference

IX score

Runs

Edge vision, small generative AI, robotics

Supplier listed · Public benchmark

View build · Get quote

Starter

Starter demand

Apple Mac mini M4 16GB / 512GB

From

Request quote

IX score

Runs

Small local models, dev workflows, hybrid agents

Supplier listed · Setup supported

View build · Get quote

LLM workstation

High intent

RTX 4090 Local LLM Workstation

From

Request build quote

IX score

Runs

7B-34B strong, larger models quantized

Supplier listed · Setup supported

View build · Get quote

LLM workstation

Premium demand

RTX 5090 Local LLM Workstation

From

Request build quote

IX score

Runs

Largest practical local workstation tier

Supplier listed · Setup supported

View build · Get quote

Trending agents

Setups people are paying us to build

Real agent patterns — not fake marketplace listings.

Business

Private document agent

+22%

Suggested hardware

RTX workstation

Setup supported · Best business wedge

Set up this agent

Agent sandbox

Hermes agent runtime

+14%

Suggested hardware

Cloud or local workstation

Setup pending · Developer interest

Plan compute

Edge

Edge camera agent

+8%

Suggested hardware

Jetson Orin Nano

Setup supported · Hardware-led demand

Quote edge kit

Configured bundles

Goal-first builds, quote-first pricing

Tell us the job. We confirm stock, configure the build, and quote in 24h.

Bundle

Local coding workstation

Qwen Coder 32B, private repos, no token bill.

Runs

Qwen Coder 32B · Llama 70B (quantized)

RTX 4090 build
Coding agent template
60-min install

From R85,000

Bundle

Private docs appliance

Document agent over your team's drive, fully on-prem.

Runs

Llama 70B · Embeddings · RAG agent

RTX 5090 build
Private docs agent
Install + handover

From R125,000

Bundle

Edge AI starter kit

Cameras, sensors, robotics — small models at the edge.

Runs

Vision · Phi · Small Llama (INT4)

Jetson Orin Nano
Edge agent guide
30-min consult

From R8,500

Setup guides

Made for the questions buyers actually ask

Plain-English answers from the team configuring these machines every day.

Coding

What hardware runs Qwen Coder 32B?

The 24 GB rule, quantization tradeoffs, and the cheapest practical build.

Read guide
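
A rough sketch of the arithmetic behind the 24 GB rule, not the guide's own method: weight memory is roughly parameter count times bits per weight, divided by 8. The function names and the 4 GB `overhead_gb` allowance for KV cache and runtime buffers are illustrative assumptions. Real usage varies with context length and runtime.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         vram_gb: float, overhead_gb: float = 4.0) -> bool:
    """True if weights plus an assumed overhead fit in the given VRAM.

    overhead_gb is a placeholder for KV cache and runtime buffers (assumption).
    """
    return weight_memory_gb(params_billion, bits_per_weight) + overhead_gb <= vram_gb


if __name__ == "__main__":
    # 32B model at 4-bit on a 24 GB card
    print(weight_memory_gb(32, 4), fits(32, 4, vram_gb=24))
    # 70B model at 4-bit on a 32 GB card
    print(weight_memory_gb(70, 4), fits(70, 4, vram_gb=32))
```

Under these assumptions a 32B model at 4-bit needs about 16 GB for weights and fits a 24 GB card. A 70B model at 4-bit needs about 35 GB, so a single 32 GB card would need a lower-bit quantization or partial offload.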

Workstations

RTX 4090 vs 5090 for local LLMs

Which to buy at which budget, and when the 5090 actually pays back.

Read guide

Agents

Build a local document agent

Stack, hardware sizing, RAG pipeline, and the model that actually works.

Read guide