finetune for different dataset #6

kocemir · 2024-06-14T08:48:04Z

Hi,

I would like to ask a question again. In the code below, you are reducing the number of genes in the newcoming datasets, making it different from 16906. But, isn't it problematic when you want to apply gene2vec positional embedding to this data, since gene2vec is applied by assuming that the input have 16906 genes (columns). I think it does not give an error, however the position vectors of the indexed genes are misleading.

if args.small_geneset:
data = preprocess_data_smallgeneset(args.data_path)
print("Filtered data to include {} genes present in at least 5% of cells".format(data.shape[1]))

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

finetune for different dataset #6

finetune for different dataset #6

kocemir commented Jun 14, 2024 •

edited

Loading

finetune for different dataset #6

finetune for different dataset #6

Comments

kocemir commented Jun 14, 2024 • edited Loading

kocemir commented Jun 14, 2024 •

edited

Loading