Introduction
We want to create a Python script that uses HelixDB to help students find professors based on their research areas and bio. For example a user could ask- “What professors does High-Energy Physics research?”
- “What professors are working in the X University?”
- “What professors are working in the Computer Science department?”
- “What professors are working in the X University and are working in the Computer Science department?”
- “I like doing research in Large Language Models, can you recommend me some professors doing this in X University?”
The dataset
In this example, we have a dataset on Professors with fields like:- Name
- Department
- University
- Short biography
- Key Research Areas
Building a Graph
Based on this data, we can create a Vector Graph RAG with the following nodes and edges: Nodes:- Professor Node with properties
name
,bio
- Research Area Node with properties
research_area
- Department Node with the property
name
- University Node with the property
name
- Vector Node with the property
research_area
- Professor to Research Area Edge
- Professor to Department Edge
- Professor to University Edge
- Professor to Vector Node Edge