When I run `rm -rf build/TRTLLM && make clone_trt_llm && make build_trt_llm` to prepare for quantization, I see the error below:
Environment:
- The PyTorch version seems correct
- CUDA 12.4
- NVIDIA Ampere GPUs
- System ID: KnownSystem.A100_SXM4_40GBx8
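As a quick way to rule out a version mismatch before digging into the build itself, here is a minimal shell sketch that compares dot-separated version strings with `sort -V`. The `12.2` minimum is an assumed threshold for illustration only; check the actual requirement in the repo's documentation.

```shell
# version_ge A B: succeeds if version A >= version B
# (compares dot-separated numeric versions via GNU sort -V).
version_ge() {
  [ "$(printf '%s\n' "$2" "$1" | sort -V | head -n1)" = "$2" ]
}

# Example: check the installed CUDA version against an
# assumed minimum of 12.2 (placeholder, not the real requirement).
cuda_version="12.4"
if version_ge "$cuda_version" "12.2"; then
  echo "CUDA version OK"
else
  echo "CUDA version too old"
fi
```

In practice you would fill `cuda_version` from `nvcc --version` or `nvidia-smi` output rather than hard-coding it.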
After downloading the model, downloading the data, and preprocessing the data, I do not see the model listed under the GPTJ-6B folder.
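To narrow down whether the checkpoint landed somewhere unexpected, a case-insensitive `find` over the build tree can help. The sketch below demonstrates the search pattern in a throwaway temp directory; the `build/models/GPTJ-6B` layout and the stand-in file name are assumptions, not the repo's actual layout.

```shell
# Demo in a temp dir: recreate the kind of search used to
# locate a downloaded checkpoint. Paths here are stand-ins.
tmp=$(mktemp -d)
mkdir -p "$tmp/build/models/GPTJ-6B"
touch "$tmp/build/models/GPTJ-6B/model.bin"   # placeholder checkpoint file

# Case-insensitive search for anything GPT-J-related under build/.
find "$tmp/build" -iname '*gptj*'

rm -rf "$tmp"   # clean up the demo directory
```

Running the same `find` from the repo root (against the real `build/` directory) shows whether the download step produced anything at all, or put the files under a different folder name.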
I am running `make generate_engines RUN_ARGS="--benchmarks=gptj --scenarios=Offline --config_ver=default --test_mode=AccuracyOnly"` in my Docker container.
For reference, both the BERT and 3D-UNet benchmarks are working.