AI datacenter aisle with GPU server racks
International supplier of enterprise AI infrastructure

AI servers,
built around your workload.

NVIDIA and AMD GPU platforms — from compact inference nodes to HGX-class training systems. Entry inference from $8,900.

AI infrastructure is not just a server. It is a workload-specific system.

Modern AI workloads depend on GPU memory, interconnect, storage, networking, cooling and software compatibility. We help customers choose the right configuration for their model size, workload type, latency target, number of users and data security requirements.

Private LLM deployment

For companies that want to run language models on their own infrastructure.

RAG and document AI

For search, tender analysis, legal review, knowledge bases and enterprise assistants.

Fine-tuning and model adaptation

For teams adapting open-source or proprietary models to specific business tasks.

AI training and research

For labs, universities and engineering teams requiring multi-GPU platforms.

HGX baseboard
HGX baseboard
8-GPU NVLink platform
GPU modules
GPU modules
H100 · H200 · B200 · MI300X
Rackmount AI nodes
Rackmount AI nodes
PCIe and SXM/HGX
High-speed fabric
High-speed fabric
NDR InfiniBand · 400G Ethernet
Datacenter integration
Datacenter integration
Rack-scale clusters
Aerial view of AI data center
Built for production AI

From a single inference node to multi-rack AI infrastructure.

NVIDIA and AMD platforms validated for inference, fine-tuning, training and HPC workloads — with networking, power and cooling planned together.

Product line

From inference nodes to AI factories

Three tiers of validated configurations. Pick the class that matches your workload — we configure the rest.

Inference & Edge

Compact to mid-range GPU servers for private LLMs, RAG and computer vision.

COMTRADE AI-L4 Edge — Entry inference / edge AI
Entry inference / edge AI

COMTRADE AI-L4 Edge

from $8,900

A compact and energy-efficient inference platform for entry-level AI workloads, computer vision and document automation.

Best for
Video analyticsOCRLightweight inferenceAI agents
Typical configuration
  • ·1–2× NVIDIA L4 24GB
  • ·AMD EPYC or Intel Xeon CPU
  • ·128–256 GB ECC RAM
  • ·2–8 TB NVMe
  • ·10/25 GbE networking
  • ·Air-cooled 1U/2U chassis
Request configuration
COMTRADE AI-96 Inference — Private LLM / enterprise RAG
Private LLM / enterprise RAG

COMTRADE AI-96 Inference

from $29,000

A practical server class for companies that need large GPU memory for private LLMs without moving immediately to HGX-class systems.

Best for
Private LLM inferenceRAG systemsEnterprise chatbotsMultimodal AI
Typical configuration
  • ·1–2× NVIDIA RTX PRO 6000 Blackwell Server Edition 96GB
  • ·256–512 GB ECC RAM
  • ·4–16 TB NVMe
  • ·25/100 GbE networking
  • ·2U/4U air-cooled chassis
Request configuration
COMTRADE AI-L40S Pro — Inference / fine-tuning / visual AI
Inference / fine-tuning / visual AI

COMTRADE AI-L40S Pro

from $45,000

A balanced platform for teams that need more GPU capacity than entry inference servers, but do not require H100/H200-class systems yet.

Best for
LLM inferenceFine-tuningGenerative AIComputer vision
Typical configuration
  • ·2–4× NVIDIA L40S 48GB
  • ·512 GB – 1 TB ECC RAM
  • ·8–32 TB NVMe
  • ·25/100 GbE networking
  • ·Air-cooled 4U chassis
Request configuration
Training-class

Multi-GPU H100 platforms for fine-tuning and serious ML research.

COMTRADE AI-H100 Training Node — Training / fine-tuning
Training / fine-tuning

COMTRADE AI-H100 Training Node

from $165,000

A professional training-class system for serious AI teams that require H100 performance, multi-GPU scaling and enterprise support.

Best for
Model fine-tuningML researchMulti-GPU workloadsHigh-throughput inference
Typical configuration
  • ·4× or 8× NVIDIA H100 80GB
  • ·PCIe or SXM/HGX options
  • ·1–2 TB ECC RAM
  • ·High-speed NVMe storage
  • ·100 / 200 / 400G networking options
  • ·Air or liquid cooling depending on platform
Request configuration
HGX & AI factory

Rack-scale H200 / B200 / MI300X platforms for large-LLM and HPC workloads.

COMTRADE AI-H200 HGX — Large LLM / high-performance AI
Large LLM / high-performance AI

COMTRADE AI-H200 HGX

configured on request, typically from $360,000+

A high-end HGX-class platform for memory-intensive AI workloads where HBM capacity and interconnect performance matter.

Best for
Large LLM inferenceLarge-context workloadsTraining and fine-tuningHPC and scientific AI
Typical configuration
  • ·8× NVIDIA H200 141GB HBM3e
  • ·1.13 TB total GPU memory
  • ·NVLink / NVSwitch fabric
  • ·2–4 TB system RAM
  • ·30 TB+ NVMe
  • ·NDR InfiniBand or 400G Ethernet
  • ·Advanced cooling options
Request configuration
COMTRADE AI-B200 HGX — Blackwell AI factory platform
Blackwell AI factory platform

COMTRADE AI-B200 HGX

configured on request

A next-generation Blackwell-class platform for customers planning serious AI infrastructure at data-center scale.

Best for
Next-generation AI factoriesLarge-scale trainingHigh-density inferenceEnterprise AI platforms
Typical configuration
  • ·8× NVIDIA B200-class GPUs
  • ·Up to 1.44 TB total GPU memory depending on platform
  • ·NVLink / NVSwitch architecture
  • ·400G networking options
  • ·Liquid cooling planning
  • ·Rack integration support
Request configuration
COMTRADE AI-Memory Max — AMD GPU platform / memory-heavy AI
AMD GPU platform / memory-heavy AI

COMTRADE AI-Memory Max

from $280,000, configured on request

A high-memory AI platform for teams evaluating AMD Instinct accelerators for large model workloads. Software stack and model compatibility are validated per project.

Best for
Memory-heavy LLM inferenceLarge models requiring high GPU memoryAlternative to NVIDIA-only infrastructureCost/performance-sensitive AI deployments
Typical configuration
  • ·8× AMD Instinct MI300X 192GB or MI325X 256GB options
  • ·Up to 1.5–2.0 TB total GPU memory depending on GPU generation
  • ·AMD EPYC CPU platform
  • ·High-speed NVMe
  • ·100 / 400G networking options
  • ·Liquid or advanced air cooling depending on chassis
Request configuration

All prices are indicative and exclude VAT, customs duties, special logistics, installation and country-specific certification unless stated otherwise. Final pricing depends on configuration, GPU availability, warranty terms, Incoterms and destination.

Configuration starts with the workload, not the hardware.

Instead of selling a random GPU box, we map the workload first and then recommend the correct platform: PCIe GPU server, SXM/HGX node, AMD high-memory system or rack-scale cluster.

Open rackmount AI server with CPU sockets, RAM and NVMe
  1. 01Model size and architecture
  2. 02Required GPU memory
  3. 03Inference latency and throughput target
  4. 04Number of concurrent users
  5. 05Dataset and storage requirements
  6. 06Networking and cluster scaling
  7. 07Power and cooling constraints
  8. 08Delivery country, warranty and compliance requirements

Typical AI infrastructure scenarios

Corporate RAG server

Private knowledge base, document search, legal/tender analysis and internal assistants.

Private LLM deployment

Local inference for organizations that cannot send sensitive data to public APIs.

Computer vision and video analytics

Object detection, security analytics, industrial vision and edge AI.

Fine-tuning lab

Adaptation of open-source models to corporate datasets and specialized workflows.

AI research cluster

Multi-GPU systems for universities, labs and engineering teams.

AI infrastructure for integrators

GPU servers supplied for system integrators, SaaS teams and enterprise AI solution providers.

Why customers work with COMTRADE LTD

01

Workload-based configuration

We help select hardware based on the actual AI task, not just GPU availability.

02

Realistic specifications

We use clear product classes, correct GPU memory figures and transparent configuration logic.

03

Multi-vendor sourcing

NVIDIA, AMD, Supermicro, Dell, HPE, Lenovo and compatible OEM platforms depending on the project.

04

Logistics and compliance support

We support international sourcing, documentation, export compliance checks and destination-specific delivery planning.

05

Scalable architecture

From one inference node to multi-node GPU clusters with high-speed networking.

06

Technical documentation

Each proposal can include bill of materials, system configuration, power/cooling requirements and warranty conditions.

Transparent by design

What we do not promise

Enterprise AI infrastructure depends on fast-changing GPU availability, export rules, OEM allocation, destination country and warranty coverage. We do not promise unrealistic delivery dates, fixed global prices or unsupported configurations.

Liquid-cooled AI server with glowing coolant loops
  • No fake "always in stock" claims
  • No unrealistic GPU pricing
  • No unsupported mixed terminology
  • No configuration without workload validation

AI server terms explained simply

A short reference for business buyers and procurement teams.

GPU memory / VRAM / HBM
On-GPU memory holding model weights and activations. HBM (High Bandwidth Memory) is used on accelerators like H100, H200 and MI300X for very high throughput.
Inference
Running a trained model to generate answers, predictions or embeddings. The dominant production workload for most companies.
Fine-tuning
Adapting an existing model to a specific domain or dataset. Requires more compute than inference, but much less than training from scratch.
Training
Building a model from raw data. Typically requires multi-GPU or multi-node systems and large datasets.
RAG
Retrieval-Augmented Generation. The model retrieves relevant documents from a vector database and uses them as context for the answer.
PCIe GPU server
A server with GPUs connected over PCIe. Flexible and easier to source; suitable for inference and small-to-medium training.
SXM / HGX platform
GPU modules mounted on a dedicated baseboard with NVLink/NVSwitch interconnect. Used in H100/H200/B200-class training and large-LLM systems.
NVLink / NVSwitch
High-bandwidth GPU-to-GPU interconnect that lets multiple GPUs behave closer to a single large accelerator.
InfiniBand / 400G Ethernet
High-speed networking used between nodes in AI clusters. NDR InfiniBand and 400G Ethernet are common options for large training fabrics.
Air cooling vs liquid cooling
Air-cooled designs are simpler to deploy; liquid cooling becomes necessary for high-density H200/B200 and MI300X-class platforms with high power per rack.

Frequently asked questions

What AI server should we start with?+

For most companies, the right starting point is an inference or RAG server, not an H100/H200 training cluster. The correct choice depends on model size, number of users, data security and latency requirements.

Can we run ChatGPT-like models locally?+

Yes, but the required hardware depends on the model size, quantization, context length and number of concurrent users. Smaller models can run on inference-class servers, while larger models require high-memory GPUs or multi-GPU platforms.

Why are H100/H200/B200 systems so expensive?+

The cost is driven by GPU memory, HBM bandwidth, interconnect, OEM platform, high-speed networking, power, cooling, warranty and limited availability.

Do you sell servers for training models from scratch?+

Yes, but training from scratch requires a much larger budget than inference or fine-tuning. We recommend starting with a workload assessment.

Can you supply AMD Instinct systems?+

Yes, AMD MI300X/MI325X-class platforms can be considered for memory-heavy workloads. Software compatibility is validated per project.

Are prices fixed?+

No. The prices on the website are indicative. Final pricing depends on configuration, availability, logistics, warranty and destination.

Do you help with installation?+

We can provide configuration documentation, delivery support, and integration planning. On-site services depend on country and partner availability.

Request an AI server configuration

Send us your workload, model size or target use case. We will recommend a realistic configuration and provide an indicative price range — typically within 1 business day.

Direct contact
sales@comtradeltd.com
Phone
+66 96 824 9885
Registered office
COMMTRADE CO., LTD.
140/127 Moo 11, Nongprue, Banglamung,
Chonburi 20150, Thailand
Reg. No. 0205567056255

By submitting this form you agree to our Terms of Use. Your data is processed in line with international data-protection law (GDPR, UK GDPR, Thailand PDPA).