
Installation Guide #14

Closed
spacewalk01 opened this issue Feb 2, 2024 · 12 comments
Labels: documentation (Improvements or additions to documentation)


spacewalk01 (Owner) commented Feb 2, 2024

Installation

  1. Download the pretrained model and install Depth-Anything:

    git clone https://github.com/LiheYoung/Depth-Anything
    cd Depth-Anything
    pip install -r requirements.txt
  2. Copy and paste dpt.py from this repo into the <depth_anything_installpath>/depth_anything folder. Note that I've only removed a squeeze operation at the end of the model's forward function in dpt.py to avoid conflicts with TensorRT; the sketch below illustrates the change.
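
    For reference, a minimal sketch of what the change amounts to (illustrative only; the dpt.py shipped in this repo is authoritative):

    import torch

    # Upstream Depth-Anything returns depth.squeeze(1), turning the [B, 1, H, W]
    # prediction into [B, H, W]. The modified dpt.py keeps the channel axis so
    # the exported ONNX graph has a single fixed-rank output.
    depth = torch.rand(1, 1, 518, 518)   # stand-in for the raw head output
    print(depth.squeeze(1).shape)        # upstream:  torch.Size([1, 518, 518])
    print(depth.shape)                   # this repo: torch.Size([1, 1, 518, 518])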

  3. Export the model to ONNX format using export_to_onnx.py. You will get an ONNX file named depth_anything_vit{}14.onnx, such as depth_anything_vitb14.onnx. (A sketch of the export is shown below.)
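
    For orientation, the export boils down to a plain torch.onnx.export call. A minimal sketch, assuming the Depth-Anything repo is on your PYTHONPATH and the vitb constructor arguments below are correct; export_to_onnx.py in this repo is the authoritative version:

    import torch
    from depth_anything.dpt import DPT_DINOv2  # the modified dpt.py from step 2

    # Assumed constructor arguments for the vitb variant; check export_to_onnx.py.
    model = DPT_DINOv2(encoder="vitb", features=128, out_channels=[96, 192, 384, 768])
    model.load_state_dict(torch.load("depth_anything_vitb14.pth", map_location="cpu"))
    model.eval()

    # Depth-Anything's default input is 518x518 (37 x 37 patches of size 14).
    dummy = torch.randn(1, 3, 518, 518)
    torch.onnx.export(model, dummy, "depth_anything_vitb14.onnx",
                      opset_version=17,  # assumption; match export_to_onnx.py
                      input_names=["input"], output_names=["output"])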

  4. Install TensorRT following the official TensorRT installation guide.

    Windows guide:
    1. Download the TensorRT zip file that matches the Windows version you are using.
    2. Choose where you want to install TensorRT. The zip file will install everything into a subdirectory called TensorRT-8.x.x.x. This new subdirectory will be referred to as <installpath> in the steps below.
    3. Unzip the TensorRT-8.x.x.x.Windows10.x86_64.cuda-x.x.zip file to the location that you chose, where:
    • 8.x.x.x is your TensorRT version
    • cuda-x.x is CUDA version 11.6, 11.8 or 12.0
    4. Add the TensorRT library files to your system PATH. To do so, copy the DLL files from <installpath>/lib to your CUDA installation directory, for example, C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\vX.Y\bin, where vX.Y is your CUDA version. The CUDA installer should have already added the CUDA path to your system PATH.

    For Linux, follow the official TensorRT installation guide for Linux.

  5. Find trtexec and use it to convert the ONNX model to a TensorRT engine:

    trtexec --onnx=depth_anything_vitb14.onnx --saveEngine=depth_anything_vitb14.engine
    

    Add --fp16 if you want to enable FP16 precision:

    trtexec --onnx=depth_anything_vitb14.onnx --saveEngine=depth_anything_vitb14.engine --fp16
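
    If you prefer the TensorRT Python API over trtexec, the same engine can be built programmatically. A minimal sketch, assuming the tensorrt Python bindings from your TensorRT 8.x installation are importable:

    import tensorrt as trt

    logger = trt.Logger(trt.Logger.WARNING)
    builder = trt.Builder(logger)
    # The exported model has an explicit batch dimension.
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, logger)

    with open("depth_anything_vitb14.onnx", "rb") as f:
        if not parser.parse(f.read()):
            for i in range(parser.num_errors):
                print(parser.get_error(i))
            raise RuntimeError("failed to parse ONNX model")

    config = builder.create_builder_config()
    config.set_flag(trt.BuilderFlag.FP16)  # optional, same effect as --fp16

    with open("depth_anything_vitb14.engine", "wb") as f:
        f.write(builder.build_serialized_network(network, config))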
    
  6. Download and install any recent OpenCV for Windows.

  7. Modify the TensorRT and OpenCV paths in CMakeLists.txt:

    # Find and include OpenCV
    set(OpenCV_DIR "your path to OpenCV")
    find_package(OpenCV REQUIRED)
    include_directories(${OpenCV_INCLUDE_DIRS})
    
    # Set TensorRT path if not set in environment variables
    set(TENSORRT_DIR "your path to TensorRT")
    
  8. Build the project using the following commands, or with cmake-gui on Windows.

    1. Windows:
    mkdir build
    cd build
    cmake ..
    cmake --build . --config Release
    2. Linux (not tested):
    mkdir build
    cd build && mkdir out_dir
    cmake ..
    make

Tested Environment

  • TensorRT 8.6
  • CUDA 11.6
  • Windows 10
jshin10129 commented Feb 9, 2024

Hi,
I have an error when converting .onnx to .engine by following the above instructions (trtexec --onnx=depth_anything_vitb14.onnx --saveEngine=depth_anything_vitb14.engine).

The error message is: "Invalid Node - /If
/If_OutputLayer: IIfConditionalOutputLayer inputs must have the same shape. Shapes are [1,518,518] and [1,1,518,518]."

BTW, I successfully created "depth_anything_vitb14.onnx" from "depth_anything_vitb14.pth".

I'm not sure where or what caused this error.
Could you point me to where I should look for debugging?

My tested environment:
Nvidia Jetson AGX Orin
Ubuntu 22.04
JetPack 6.0 DP
PyTorch 2.2.0
Python 3.10
CUDA 12.2
TensorRT 8.6
OpenCV 4.8
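
For anyone hitting the same error: a quick way to check whether the exported graph still contains the data-dependent If node (and what the declared output shapes are) is the onnx Python package. A minimal sketch, assuming pip install onnx:

import onnx

model = onnx.load("depth_anything_vitb14.onnx")

# The squeeze that causes this typically exports as an If node in the graph.
print("If nodes:", [n.name for n in model.graph.node if n.op_type == "If"])

# Declared graph outputs and their shapes:
for out in model.graph.output:
    dims = [d.dim_value if d.dim_value else d.dim_param
            for d in out.type.tensor_type.shape.dim]
    print(out.name, dims)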

spacewalk01 (Owner, Author) commented:

> Copy and paste dpt.py from this repo into the <depth_anything_installpath>/depth_anything folder.

jshin10129 commented Feb 9, 2024

That works beautifully! The .engine file was successfully created by following the instructions.
Thank you so much!

jshin10129 commented:

Hi, just a question regarding performance.

Is there a way to display/process faster by modifying something in the code, such as the resolution? Or is this 118 ms per frame the maximum we can get?

As mentioned above, my test environment is a Jetson AGX Orin with 64 GB of memory and 12 cores.

Attached a sample run video and image for your reference:
1. speed_2160p
2. check_speed_2160p.webm
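
Note: the input resolution is baked in when the ONNX graph is exported (518x518 in step 3), so speeding things up via resolution means re-exporting with a smaller dummy input and rebuilding the engine. A hypothetical sketch; the side length must stay a multiple of the ViT patch size, 14, and the C++ pipeline may also assume the original input size:

import torch
from depth_anything.dpt import DPT_DINOv2  # assumed import, as in step 3

model = DPT_DINOv2(encoder="vitb", features=128, out_channels=[96, 192, 384, 768])
model.load_state_dict(torch.load("depth_anything_vitb14.pth", map_location="cpu"))
model.eval()

# 308 = 22 * 14: patch-aligned, with roughly 2.8x fewer pixels than 518x518,
# trading depth-map detail for lower per-frame latency.
dummy = torch.randn(1, 3, 308, 308)
torch.onnx.export(model, dummy, "depth_anything_vitb14_308.onnx",
                  input_names=["input"], output_names=["output"])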

MrAviator93 commented:

> Hi, just a question regarding performance.
>
> Is there a way to display/process faster by modifying something in the code, such as the resolution? Or is this 118 ms per frame the maximum we can get?
>
> As mentioned above, my test environment is a Jetson AGX Orin with 64 GB of memory and 12 cores.
>
> Attached a sample run video and image for your reference: 1. speed_2160p 2. check_speed_2160p.webm

Maybe attach an external GPU to your Jetson, if that's possible?

MrAviator93 commented Feb 14, 2024

To the authors I would like to say: great work! There doesn't seem to be a CMakeLists.txt in place. I would like to contribute to this project as well; maybe I should set up a CMakeLists.txt and an auto-install or something? Also, we could move this to a newer C++ standard like C++20, what do you think?

I am a Linux developer myself, so this could be quite useful.

spacewalk01 (Owner, Author) commented Feb 14, 2024

Yes, sure. There is indeed a [CMakeLists.txt](https://github.com/spacewalk01/depth-anything-tensorrt/blob/main/CMakeLists.txt). But it would be great if you could make it work for Linux.

spacewalk01 (Owner, Author) commented:

Please send a pull request.

MrAviator93 commented Feb 14, 2024

Ahh yes, I missed it, it's late for me, 1 am 😆 I will do it tomorrow after work and will have to test it as well, so it may not be that quick 😄

MrAviator93 commented Feb 14, 2024

For example, stuff like this:

#include <cstdio>
#include <string>

#ifdef _WIN32
#include <windows.h>
#else
#include <unistd.h>
#include <sys/stat.h>
#endif

bool IsPathExist(const std::string& path) {
#ifdef _WIN32
    // On Windows, any valid attribute mask means the path exists.
    DWORD fileAttributes = GetFileAttributesA(path.c_str());
    return (fileAttributes != INVALID_FILE_ATTRIBUTES);
#else
    // On POSIX, access() with F_OK only checks for existence.
    return (access(path.c_str(), F_OK) == 0);
#endif
}

bool IsFile(const std::string& path) {
    if (!IsPathExist(path)) {
        printf("%s:%d %s not exist\n", __FILE__, __LINE__, path.c_str());
        return false;
    }

#ifdef _WIN32
    // A file is anything that exists and is not a directory.
    DWORD fileAttributes = GetFileAttributesA(path.c_str());
    return ((fileAttributes != INVALID_FILE_ATTRIBUTES) && ((fileAttributes & FILE_ATTRIBUTE_DIRECTORY) == 0));
#else
    // stat() + S_ISREG identifies regular files on POSIX.
    struct stat buffer;
    return (stat(path.c_str(), &buffer) == 0 && S_ISREG(buffer.st_mode));
#endif
}

This can be replaced by the cross-platform std::filesystem facilities (introduced in C++17):

#include <cstdio>
#include <filesystem>
#include <string>

[[nodiscard]] bool IsFile(const std::string& path)
{
    if (!std::filesystem::exists(path)) {
        printf("%s:%d %s not exist\n", __FILE__, __LINE__, path.c_str());
        return false;
    }

    return std::filesystem::is_regular_file(path);
}

And IsPathExist is simply replaced by std::filesystem::exists. I will try to make a PR tomorrow.

spacewalk01 (Owner, Author) commented:

Thanks, looks good to me :)

MrAviator93 commented Feb 15, 2024

I got stuck this evening with some unplanned issues. It seems that CUDA only works with specific GCC versions, and as I am using the latest one, it doesn't like it; there are also other problems with installing the TensorRT library. I have slightly refactored the code and made some improvements, but I will only be able to submit the PR on the weekend.

cmake_minimum_required(VERSION 3.28)

project(depth-anything-tensorrt-simplified VERSION 1.0 LANGUAGES C CXX CUDA)

set(CMAKE_CXX_STANDARD 20)
set(CMAKE_CXX_STANDARD_REQUIRED TRUE)

set(CMAKE_C_STANDARD 11)
set(CMAKE_C_STANDARD_REQUIRED TRUE)

# TODO: Add stricter compiler flags
if(CMAKE_CXX_COMPILER_ID MATCHES "GNU|Clang")
    add_compile_options("-Wall" "-Wextra" "-Wcast-align" "-Wunused" "-O2" "-fexceptions" "-pedantic")
elseif(CMAKE_CXX_COMPILER_ID STREQUAL "MSVC")
    add_compile_options("/W4" "/WX" "/we4715" "/O2" "/EHsc" "/permissive-")
endif()

I am thinking maybe it would be a better idea to stick all of this into a Docker container? 🤔
