[Feature](mluOpCore): access variable in tensor struct through function in tensor.cpp and tensor.h #1151

nth-BYTE · 2024-11-20T09:34:14Z

Thanks for your contribution and we appreciate it a lot. 🚀🚀

1. Motivation

Please describe your motivation and the goal you want to achieve through this pull request.

2. Modification

In mluOpTensorStruct , pubilc variables have been changed into private variables, except those will be removed in future， e.g. position,positions,scale,scales,offset,offsets. And then, some unused methods are deleted ,including tensorDimN,tensorDimC,tensorDimH,tensorDimW

3. Test Report

If you want to know how to do operator testing, you can see GTest-User-Guide-zh.

3.1 Modification Details

3.1.1 Accuracy Acceptance Standard

For static threshold standard details, see: MLU-OPS™ Accuracy Acceptance Standard.

static threshold
- diff1
  - float32 mlu diff1 <= 1e-5
  - float32 mlu diff1 <= 3e-3
  - float16 mlu diff1 <= 3e-3
- diff2
  - float32 mlu diff2 <= 1e-5
  - float32 mlu diff2 <= 3e-3
  - float16 mlu diff2 <= 3e-3
- diff3
  - mlu diff3 == 0
  - mlu diff3_1 == 0
  - mlu diff3_2 == 0
dynamic threshold
- diff1: mlu diff1 <= max(baseline diff1 * 10, static threshold)
- diff2: mlu diff2 <= max(baseline diff2 * 10, static threshold)
- diff3: mlu diff3 <= max(baseline diff3 * 10, static threshold)
  - float32, threshold = 1e-5
  - float16, threshold = 1e-3

3.1.2 Operator Scheme checklist

Supported hardware
- MLU370
- MLU590
Job types
- BLOCK
- UNION1
- UNION2
- UNION4
- The operator will dynamically select the most suitable task type, for example, UNION8

3.2 Accuracy Test

3.2.1 Accuracy Test

If you have checked the following items, please tick the relevant box.

3.2.2 Parameter Check

Test Point-1: When a new operator is submitted, the test points are given and the test results are stated. Acceptance Standard: Normal error.

Please fill your test results(Error Message) in here, ...

Test Point-2: Whether illegal parameters are passed. Acceptance Standard: Normal error.

Test results...

3.3 Performance Test

See MLU-OPS™ Performance Acceptance Standard for details.

Platform：MLU370

# The test results should contain Op name, Shape, Data type,  
#   MLU Hardware Time(us), MLU Interface Time(us), MLU IO Efficiency, 
#   MLU Compute Efficiency, and Mlu Workspace Size(Bytes)
# 
# for example:
#
# ----------- case0 -----------
# case0
# [Op name                ]: abs
# [Shape                  ]: input.shape=[1024,1024,3,4], output.shape=[1024,1024,3,4]
# [Data type]             ]: float32
# [MLU Hardware Time      ]: 15728 (us)
# [MLU Interface Time     ]: 369.008 (us)
# [MLU IO Efficiency      ]: 0.23275
# [MLU Compute Efficiency ]: 0.5
# [Mlu Workspace Size     ]: -1 (Bytes)
# 
# ----------- case1 -----------
# ...

Platform：MLU590

# ----------- case0 -----------
# ----------- case1 -----------
# ...

3.4 Summary Analysis

Please give a brief overview here, if you want to note and summarize the content.

core/tensor.h

core/tensor.cpp

core/tensor.h

… tensor.cpp and tensor.h

PetrelYy

LGTM

nth-BYTE force-pushed the nl21427_2 branch 3 times, most recently from aedefba to abfc867 Compare November 22, 2024 10:00

zhouyue624 reviewed Nov 27, 2024

View reviewed changes

core/tensor.h Show resolved Hide resolved

zhouyue624 reviewed Nov 27, 2024

View reviewed changes