Graph-Based Biomarker Identification
Disease mechanisms rarely arise from single-gene perturbations — they emerge from coordinated disruptions across interacting molecular networks. I model disease-associated molecular data as graphs where nodes carry multi-omics features and edges encode known biological interactions: protein-protein, regulatory, and co-expression.
A GCN architecture learns node-level representations that integrate expression features, ontology embeddings, and sequence-level information. Attention weights surface which interactions are driving predictions, turning a classification output into a biological hypothesis worth testing.