Dear all,
the function evaluate_accuracy that appears in various examples in the tutorial is written in a suboptimal way, e.g. from deep conv nets:
def evaluate_accuracy(data_iterator, net):
    acc = mx.metric.Accuracy()
    for d, l in data_iterator:
        data = d.as_in_context(ctx)
        label = l.as_in_context(ctx)  # This is unnecessary
        output = net(data)
        predictions = nd.argmax(output, axis=1)
        acc.update(preds=predictions, labels=label)
    return acc.get()[1]
Usually in the examples, ctx refers to mx.gpu(); however, within the definition of mx.metric.Accuracy we see that both predictions and labels are converted to numpy arrays (i.e. copied to the mx.cpu() context). Therefore, if ctx = mx.gpu(), this function definition makes an unnecessary copy of the labels into GPU memory and hurts performance.
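For what it's worth, a version without the extra label copy could look like the sketch below (assuming the usual ctx = mx.gpu() and the mxnet/nd imports from the tutorial; only the handling of the labels changes):
import mxnet as mx
from mxnet import nd

ctx = mx.gpu()  # as in the tutorial examples; use mx.cpu() when no GPU is available

def evaluate_accuracy(data_iterator, net):
    acc = mx.metric.Accuracy()
    for d, l in data_iterator:
        data = d.as_in_context(ctx)              # inputs must live on the context the net runs on
        output = net(data)
        predictions = nd.argmax(output, axis=1)
        # Accuracy.update() converts both arrays to numpy internally,
        # so the labels can stay in CPU memory; no as_in_context(ctx) copy needed.
        acc.update(preds=predictions, labels=l)
    return acc.get()[1]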
Thank you for the awesome tutorial you've created!
Regards