-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create visualization for haploid variant analysis workflow results #155
Comments
Took a look quickly at the attached ex data today, and notice there doesn't appear to be any run info in there. How do we mean to grab things like geographic location? |
SRR Ids |
well.. yes. i more was interested in whether i should assume one data input or two, or if the run info contains something like country names should we attempt to turn those into GIS coords, or if there isnt anything relating to location at all what wed like to do then? run info isnt very controlled is my understanding, in terms of allowed values etc.. or i suppose, if we should even assume the ids will be srr ids? |
mmkay so i quickly put together a pretty rough demo of an idea i had, and will include a screenshot here and some of my rationale and thoughts on possible directions this could go. feel free to throw tomatoes hahaha rationalei find most 'traditional' views of large numbers of variants across large numbers of samples kind of visually overwhelming. like theyre throwing data at you rather than presenting information, if that makes sense. i wanted to try to think of a thing that might make the information more digestable, enable discovery of high-level patterns more easily. if we prefer traditional, on the grounds its maybe what people will be expecting or something, let me know. caveats/ growth areasthis, being a rough ex i made quickly, has a lot of room for growth. some things that would make it a lot better:
explainer/ demothe left column are loci that have variants. from the data @nekrut links, i arbitrarily chose two samples, a chromosome and included the first 10,000 positions from it for now as a demo. choosing chromosomes and positions to view would need inputs, with some reasonable restrictions probably. right column nodes are samples. edges are variants which can be colored by various attributes realistic examplethis is the same data as the mini demo but without subsetting to just 2 samples |
looked at this again just now to see how i felt about it after having walked away a bit. i still think i personally like it, but realize i didnt really finish my rationale before. so... color and position are precognitive attributes, and given the density of data here thought relying on them as heavily as possible would help people to more quickly/ effectively identify patterns. representing this data as a network diagram lets us do things like 'physically' co-locate data from all samples for a particular locus. |
Given these data https://usegalaxy.org/api/datasets/f9cad7b01a472135d0cbdeeffd6c9a1e/display?to_ext=tabular create an obervabel notebook that displaying various graphs including:
The notebook should allow dynamic filtering by various attributes such as, for example, variant effect
The text was updated successfully, but these errors were encountered: