finetune for different dataset #6

Open
kocemir opened this issue Jun 14, 2024 · 0 comments

kocemir commented Jun 14, 2024

Hi,

I have another question. In the code below, you reduce the number of genes in incoming datasets, so the gene count is no longer 16906. Isn't this problematic when applying the gene2vec positional embedding to this data, since gene2vec assumes the input has 16906 genes (columns)? I don't think it raises an error, but the position vectors assigned to the indexed genes would be misleading.

if args.small_geneset:
    data = preprocess_data_smallgeneset(args.data_path)
    print("Filtered data to include {} genes present in at least 5% of cells".format(data.shape[1]))
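To make the concern concrete, here is a minimal sketch, assuming the gene2vec table holds one row per gene of the full vocabulary in a fixed order. The gene names, vectors, and variable names are hypothetical and not taken from the repository; six genes stand in for the 16906.

```python
# Toy stand-in for the full 16906-gene vocabulary (hypothetical names).
full_genes = ["G0", "G1", "G2", "G3", "G4", "G5"]

# Hypothetical gene2vec table: row i is the embedding of full_genes[i].
gene2vec = [[float(i), float(i) * 10.0] for i in range(len(full_genes))]

# Suppose the small-geneset filter keeps only these columns.
kept_genes = ["G1", "G3", "G4"]

# Naive lookup: column position j of the filtered matrix -> row j of gene2vec.
# This runs without error but fetches the vectors of G0, G1, G2.
naive_rows = [gene2vec[j] for j in range(len(kept_genes))]

# Aligned lookup: map each kept gene back to its index in the full vocabulary.
index_of = {gene: i for i, gene in enumerate(full_genes)}
kept_idx = [index_of[gene] for gene in kept_genes]
aligned_rows = [gene2vec[i] for i in kept_idx]

# Genes whose naive position vector differs from their own gene2vec vector.
mismatched = [g for j, g in enumerate(kept_genes) if naive_rows[j] != aligned_rows[j]]

print("kept indices in full vocabulary:", kept_idx)
print("genes with a misleading position vector:", mismatched)
```

If the filtered data keeps track of which original columns survived, selecting the gene2vec rows by those indices (as in `aligned_rows`) would keep each gene paired with its own vector. Whether the repository already does this is exactly what the question asks.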
