Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

what categories is each paper associated with? #2

Open
bhpayne opened this issue May 29, 2021 · 2 comments
Open

what categories is each paper associated with? #2

bhpayne opened this issue May 29, 2021 · 2 comments

Comments

@bhpayne
Copy link
Member

bhpayne commented May 29, 2021

For a given paper, identify which topics are in that paper. The purpose of this would be to help narrow the scope of what a variable is referring to. For example, c is relativity is usually different from c used in algebra. Categorizing the topic(s) of a paper could provide context for how to interpret the variable.

@bhpayne
Copy link
Member Author

bhpayne commented May 29, 2021

Some mixture of TF-IDF plus citation tracing?

@bhpayne
Copy link
Member Author

bhpayne commented Nov 24, 2023

A more concrete framing is the following:

  1. suppose you have 1000 .tex files
  2. of the 1000 files, 50 files reference the variable c
  3. of the 50 papers referencing c,
    • do subsets of the 50 papers have citations overlapping? If yes, those references to c might be in the same domain.
    • do subsets of the 50 papers use similar jargon? If yes, those references to c might be in the same domain.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant