[Feature](mluOpCore): access variable in tensor struct through function in tensor.cpp and tensor.h #1116

nth-BYTE · 2024-10-16T10:58:59Z

Thanks for your contribution and we appreciate it a lot. 🚀🚀

1. Motivation

Please describe your motivation and the goal you want to achieve through this pull request.

2. Modification

Please briefly describe what modification is made in this pull request, and indicate where to make the modification.

Are new test cases added? If so, please post the corresponding generator-PR link here.

3. Test Report

If you want to know how to do operator testing, you can see GTest-User-Guide-zh.

3.1 Modification Details

3.1.1 Accuracy Acceptance Standard

For static threshold standard details, see: MLU-OPS™ Accuracy Acceptance Standard.

static threshold
- diff1
  - float32 mlu diff1 <= 1e-5
  - float32 mlu diff1 <= 3e-3
  - float16 mlu diff1 <= 3e-3
- diff2
  - float32 mlu diff2 <= 1e-5
  - float32 mlu diff2 <= 3e-3
  - float16 mlu diff2 <= 3e-3
- diff3
  - mlu diff3 == 0
  - mlu diff3_1 == 0
  - mlu diff3_2 == 0
dynamic threshold
- diff1: mlu diff1 <= max(baseline diff1 * 10, static threshold)
- diff2: mlu diff2 <= max(baseline diff2 * 10, static threshold)
- diff3: mlu diff3 <= max(baseline diff3 * 10, static threshold)
  - float32, threshold = 1e-5
  - float16, threshold = 1e-3

3.1.2 Operator Scheme checklist

Supported hardware
- MLU370
- MLU590
Job types
- BLOCK
- UNION1
- UNION2
- UNION4
- The operator will dynamically select the most suitable task type, for example, UNION8

3.2 Accuracy Test

3.2.1 Accuracy Test

If you have checked the following items, please tick the relevant box.

3.2.2 Parameter Check

Test Point-1: When a new operator is submitted, the test points are given and the test results are stated. Acceptance Standard: Normal error.

Please fill your test results(Error Message) in here, ...

Test Point-2: Whether illegal parameters are passed. Acceptance Standard: Normal error.

Test results...

3.3 Performance Test

See MLU-OPS™ Performance Acceptance Standard for details.

Platform：MLU370

# The test results should contain Op name, Shape, Data type,  
#   MLU Hardware Time(us), MLU Interface Time(us), MLU IO Efficiency, 
#   MLU Compute Efficiency, and Mlu Workspace Size(Bytes)
# 
# for example:
#
# ----------- case0 -----------
# case0
# [Op name                ]: abs
# [Shape                  ]: input.shape=[1024,1024,3,4], output.shape=[1024,1024,3,4]
# [Data type]             ]: float32
# [MLU Hardware Time      ]: 15728 (us)
# [MLU Interface Time     ]: 369.008 (us)
# [MLU IO Efficiency      ]: 0.23275
# [MLU Compute Efficiency ]: 0.5
# [Mlu Workspace Size     ]: -1 (Bytes)
# 
# ----------- case1 -----------
# ...

Platform：MLU590

[----------] 1 test from yolo_box/TestSuite
[ RUN      ] yolo_box/TestSuite.mluOp/0
[MLU Hardware Time      ]: 57 (us)
[MLU Interface Time     ]: 70.981 (us)
[MLU IO Efficiency      ]: 0.00171107
[MLU Compute Efficiency ]: 0.00614035
[MLU Workspace Size     ]: -1 (Bytes)
[MLU Kernel Name(s)     ]: {}
[MLU TheoryOps          ]: 806400 (Ops)
[MLU TheoryIOs          ]: 199744 (Bytes)
[MLU ComputeForce       ]: 2.304e+12 (op/s)
[MLU IoBandWidth        ]: 2048 (GB/s)
[GPU Hardware Time      ]: -1 (us)
[GPU IO Efficiency      ]: -1
[GPU Compute Efficiency ]: -1
[GPU Workspace Size     ]: -1 (Bytes)
[Diffs]:
[output1]
DIFF1: 6.567290e-09
DIFF2: 2.416225e-08
[output2]
DIFF1: 4.524716e-08
DIFF2: 6.251300e-08
[^      OK ] ../../test/mlu_op_gtest/pb_gtest/src/zoo/yolo_box/test_case/case_0.prototxt
[       OK ] yolo_box/TestSuite.mluOp/0 (2 ms)
[----------] 1 test from yolo_box/TestSuite (2 ms total)

[----------] Global test environment tear-down
[ SUMMARY  ] Total 112 cases of 72 op(s).
ALL PASSED.
[==========] 114 test cases from 74 test suites ran. (48078 ms total)
[  PASSED  ] 114 test cases.

3.4 Summary Analysis

Please give a brief overview here, if you want to note and summarize the content.

… tensor.cpp and tensor.h

nth-BYTE force-pushed the nl21427 branch 13 times, most recently from 8f25c7c to 1a49537 Compare October 30, 2024 07:27

[CNNLCORE-21117]

431e6b8

nth-BYTE force-pushed the nl21427 branch from 3dbe1d3 to 1a3a093 Compare November 13, 2024 02:48

nth-BYTE changed the title ~~Nl21427~~ [Feature](mluOpCore): Change public variable in tensor.h to private variable Nov 13, 2024

nth-BYTE changed the title ~~[Feature](mluOpCore): Change public variable in tensor.h to private variable~~ [Feature](mluOpCore): access variable in tensor struct through function in tensor.cpp and tensor.h Nov 13, 2024

nth-BYTE force-pushed the nl21427 branch from 1a3a093 to 34a696c Compare November 14, 2024 03:02

[CNNLCORE-21427] access variable in tensor struct through function in…

be61508

… tensor.cpp and tensor.h

nth-BYTE force-pushed the nl21427 branch from 34a696c to be61508 Compare November 19, 2024 02:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature](mluOpCore): access variable in tensor struct through function in tensor.cpp and tensor.h #1116

[Feature](mluOpCore): access variable in tensor struct through function in tensor.cpp and tensor.h #1116

nth-BYTE commented Oct 16, 2024 •

edited

Loading

[Feature](mluOpCore): access variable in tensor struct through function in tensor.cpp and tensor.h #1116

Are you sure you want to change the base?

[Feature](mluOpCore): access variable in tensor struct through function in tensor.cpp and tensor.h #1116

Conversation

nth-BYTE commented Oct 16, 2024 • edited Loading

1. Motivation

2. Modification

3. Test Report

3.1 Modification Details

3.1.1 Accuracy Acceptance Standard

3.1.2 Operator Scheme checklist

3.2 Accuracy Test

3.2.1 Accuracy Test

3.2.2 Parameter Check

3.3 Performance Test

3.4 Summary Analysis

nth-BYTE commented Oct 16, 2024 •

edited

Loading