simple code suggestions to improve #50

hvgazula · 2024-03-20T12:43:54Z

tissue_labeling/scripts/mit_kwyk_data.py

Line 150 in 8611ada

pixel_counts = {label:count for label,count in zip(unique,counts)}

dict(zip(unique,count))

https://github.com/sabeenlohawala/tissue_labeling/blob/8611ada4596771e1ab25cb02faf7be557509593b/scripts/mit_kwyk_data.py#L180C1-L181C85

shapes, pixel_counts = zip(*shapes_and_pixel_counts)

tissue_labeling/scripts/mit_kwyk_data.py

Lines 249 to 292 in 8611ada

    
           for i in range(label_vol.shape[d]): 
        
               # get the slice 
        
               if d == 0: 
        
                   feature_slice = feature_vol[i, :, :] 
        
                   label_slice = label_vol[i, :, :] 
        
               elif d == 1: 
        
                   feature_slice = feature_vol[:, i, :] 
        
                   label_slice = label_vol[:, i, :] 
        
               elif d == 2: 
        
                   feature_slice = feature_vol[:, :, i] 
        
                   label_slice = label_vol[:, :, i] 
        
               # discard slices with < 20% brain (> 80% background) 
        
               count_background = np.sum(label_slice == 0) 
        
               if count_background > 0.8 * (label_slice.shape[0] * label_slice.shape[1]): 
        
                   continue 
        
               # pad slices 
        
               pad_rows = max(0,max_shape[0] - label_slice.shape[0]) 
        
               pad_cols = max(0,max_shape[1] - label_slice.shape[1]) 
        
               # padding for each side 
        
               pad_top = pad_rows // 2 
        
               pad_bottom = pad_rows - pad_top 
        
               pad_left = pad_cols // 2 
        
               pad_right = pad_cols - pad_left 
        
               padded_feature_slice = np.pad(feature_slice, ((pad_top, pad_bottom), (pad_left, pad_right)), mode='constant', constant_values=0) 
        
               padded_label_slice = np.pad(label_slice, ((pad_top, pad_bottom), (pad_left, pad_right)), mode='constant', constant_values=0) 
        
               # save .npy files 
        
               feature_slice_filename = f"{os.path.basename(feature).split('.')[0]}_{slice_idx:03d}.npy" 
        
               label_slice_filename = f"{os.path.basename(label).split('.')[0]}_{slice_idx:03d}.npy" 
        
               np.save(os.path.join(feature_slice_dest_dir,feature_slice_filename), padded_feature_slice[np.newaxis,:]) 
        
               np.save(os.path.join(label_slice_dest_dir,label_slice_filename), padded_label_slice[np.newaxis,:]) 
        
               # Done: get pixel_counts 
        
               if get_pixel_counts: 
        
                   unique,counts = np.unique(padded_label_slice,return_counts = True) 
        
                   pixel_counts.update({label:count for label,count in zip(unique,counts)}) 
        
               # increase slice_idx 
        
               slice_idx += 1

Run this example and tell me if the above cannot be improved in the same way

import numpy as np
a = np.random.rand(10, 5, 3)
b  = list(map(sum, a)  # sum can be any function
print(len(b), b[0].shape)

The text was updated successfully, but these errors were encountered:

hvgazula · 2024-03-20T12:53:07Z

tissue_labeling/scripts/mit_kwyk_data.py

Line 229 in 8611ada

label_vol = (utils.load_volume(label, im_only=True)).astype('int32')

uint16 will do for the label vols

edit: please see the table at the bottom of this page

hvgazula · 2024-03-20T13:12:25Z

tissue_labeling/scripts/mit_kwyk_data.py

Line 183 in 8611ada

all_keys = {key for d in pixel_counts for key in d.keys()}

couldn't this be written as {*d.keys() for d in pixel_counts}?

update: iterable unpacking cannot be used in comprehension

hvgazula · 2024-03-20T14:43:59Z

tissue_labeling/scripts/mit_kwyk_data.py

Lines 392 to 393 in 8611ada

    
           max_rows = max(max_dims[0], max_dims[1]) 
        
           max_cols = max(max_dims[1], max_dims[2])

Not sure if I agree with this. What if the middle value is the largest? You will end up with a square and that's unnecessary. Am I missing something?

hvgazula · 2024-03-20T14:53:30Z

tissue_labeling/scripts/mit_kwyk_data.py

Lines 379 to 381 in 8611ada

    
           if mode == 'train': 
        
               for item in pixel_counts: 
        
                   train_pixel_counts += item

does this have to be done within the context manager?

hvgazula · 2024-03-20T14:57:11Z

Also, pixel_counts is a dict. Could you not simply write sum(pixel_counts), although i am not sure yet why the keys are added and not the values?

sabeenlohawala · 2024-03-20T21:07:23Z

https://github.com/sabeenlohawala/tissue_labeling/blob/8f9b20506740c2364051e9ca6975efd7f7ace38b/scripts/mit_kwyk_data.py#L293C9-L298C436

tissue_labeling/scripts/mit_kwyk_data.py

Line 411 in 8f9b205

pixel_counts = pool.starmap(

Using list(map(...)) slows down the computation, but removing list results in error thrown in line 411: 'map' object is not subscriptable.

sabeenlohawala · 2024-03-21T17:54:44Z

tissue_labeling/scripts/mit_kwyk_data.py

Lines 392 to 393 in 8611ada

max_rows = max(max_dims[0], max_dims[1])

max_cols = max(max_dims[1], max_dims[2])

Not sure if I agree with this. What if the middle value is the largest? You will end up with a square and that's unnecessary. Am I missing something?

slice[i,:,:] → shape is dim[1] x dim[2]
slice[:,i,:] → shape is dim[0] x dim[2]
slice[:,:,i] → shape is dim[0] x dim[1]
Therefore, in order for all slices to be the same shape, slice shape should be (max(dim[0], dim[1]), max(dim[1], dim[2]))

hvgazula · 2024-03-21T18:40:33Z

cool..please create a separate function with this docstring so people like me will know why 😄

Convert label vol and slices to int16 Add helper functions and docstrings Remove unnecessary comments

sabeenlohawala added a commit that referenced this issue Mar 21, 2024

Some code improvements based on #50

d37ec41

Convert label vol and slices to int16 Add helper functions and docstrings Remove unnecessary comments

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

simple code suggestions to improve #50

simple code suggestions to improve #50

hvgazula commented Mar 20, 2024

hvgazula commented Mar 20, 2024 •

edited

Loading

hvgazula commented Mar 20, 2024 •

edited

Loading

hvgazula commented Mar 20, 2024

hvgazula commented Mar 20, 2024

hvgazula commented Mar 20, 2024 •

edited

Loading

sabeenlohawala commented Mar 20, 2024

sabeenlohawala commented Mar 21, 2024

hvgazula commented Mar 21, 2024 •

edited

Loading

simple code suggestions to improve #50

simple code suggestions to improve #50

Comments

hvgazula commented Mar 20, 2024

hvgazula commented Mar 20, 2024 • edited Loading

hvgazula commented Mar 20, 2024 • edited Loading

hvgazula commented Mar 20, 2024

hvgazula commented Mar 20, 2024

hvgazula commented Mar 20, 2024 • edited Loading

sabeenlohawala commented Mar 20, 2024

sabeenlohawala commented Mar 21, 2024

hvgazula commented Mar 21, 2024 • edited Loading

hvgazula commented Mar 20, 2024 •

edited

Loading

hvgazula commented Mar 20, 2024 •

edited

Loading

hvgazula commented Mar 20, 2024 •

edited

Loading

hvgazula commented Mar 21, 2024 •

edited

Loading