An end-to-end, multi-agent pipeline that:
- Crawls the web for pages matching a research query (via Tavily)
- Parses pages into structured triples and stores them in a Neo4j knowledge graph
- Drafts an answer by querying the graph with a LangChain + Ollama agent
Every function in the code maps directly to one step in this workflow.
📋 Prerequisites

- Python 3.8+
- Tavily account & API key
- Neo4j (Desktop or Docker) running at bolt://localhost:7687
- Ollama installed, a local model (e.g. phi4) pulled, and the ollama daemon running
- Environment variables set:

```bash
export TAVILY_API_KEY="your_tvly_key"
export NEO4J_URI="bolt://localhost:7687"
export NEO4J_USER="neo4j"
export NEO4J_PASS="your_neo4j_password"
```
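If you want to verify the configuration from Python before running the pipeline, a minimal sketch (using the same variable names as the exports above):

```python
import os

# Fail fast if a required key is missing; fall back to sensible defaults otherwise.
TAVILY_API_KEY = os.environ["TAVILY_API_KEY"]
NEO4J_URI = os.environ.get("NEO4J_URI", "bolt://localhost:7687")
NEO4J_USER = os.environ.get("NEO4J_USER", "neo4j")
NEO4J_PASS = os.environ["NEO4J_PASS"]
```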
🔧 Installation

Clone or copy this repository to your machine.
Install the Python dependencies:
```bash
pip install tavily-python neo4j chromadb langchain langchain-core langchain-ollama typing-extensions
```
Make sure Neo4j is running and you’ve set an initial password for the neo4j user.
Start Ollama (if not already):
```bash
ollama serve
ollama pull phi4
```

📂 Code Structure

crawl_node(state)
Fetches web pages matching state["query"] via Tavily and returns a list of pages.
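A minimal sketch of what crawl_node might look like (the `pages` keys and `max_results=5` are illustrative choices, not necessarily what the script uses):

```python
import os
from tavily import TavilyClient

tavily_client = TavilyClient(api_key=os.environ["TAVILY_API_KEY"])

def crawl_node(state: dict) -> dict:
    """Fetch web pages matching the research query via Tavily."""
    response = tavily_client.search(state["query"], max_results=5)
    # Keep only the fields the downstream nodes need.
    pages = [
        {"url": r["url"], "title": r.get("title", ""), "content": r.get("content", "")}
        for r in response["results"]
    ]
    return {"pages": pages}
```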
parse_node(state)
Stub for entity/relation extraction: runs extract_entities_relations() on each page's content and MERGEs the resulting triples into Neo4j.
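A sketch of how parse_node could MERGE triples into Neo4j; the Entity label and REL relationship type are assumptions, and extract_entities_relations is the stub described above:

```python
import os
from neo4j import GraphDatabase

driver = GraphDatabase.driver(
    os.environ.get("NEO4J_URI", "bolt://localhost:7687"),
    auth=(os.environ.get("NEO4J_USER", "neo4j"), os.environ["NEO4J_PASS"]),
)

def parse_node(state: dict) -> dict:
    """Extract (subject, relation, object) triples from each page and MERGE them into Neo4j."""
    triples = []
    for page in state["pages"]:
        triples.extend(extract_entities_relations(page["content"]))  # stub described above
    with driver.session() as session:
        for subj, rel, obj in triples:
            session.run(
                "MERGE (s:Entity {name: $subj}) "
                "MERGE (o:Entity {name: $obj}) "
                "MERGE (s)-[:REL {type: $rel}]->(o)",
                subj=subj, rel=rel, obj=obj,
            )
    return {"triples": triples}
```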
draft_node(state)
Queries the Neo4j graph (GraphQuery tool) for facts matching state["query"], builds a prompt, and invokes the Ollama-powered LangChain agent to generate the final answer.
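A simplified sketch of the same idea, prompting the model directly with the retrieved facts rather than running a full agent loop (the prompt wording and the `answer` key are illustrative):

```python
from langchain_ollama import ChatOllama

llm = ChatOllama(model="phi4", temperature=0)

def draft_node(state: dict) -> dict:
    """Pull relevant facts from the graph and ask the local model for a grounded answer."""
    facts = query_graph(state["query"])  # the GraphQuery tool's underlying function, see the orchestration sketch below
    prompt = (
        "Answer the research question using only the facts below.\n\n"
        f"Facts:\n{facts}\n\nQuestion: {state['query']}"
    )
    answer = llm.invoke(prompt)
    return {"answer": answer.content}
```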
StateGraph orchestration
Defines the workflow:

START → crawl_node → parse_node → draft_node → END

query_graph()
A plain Python function wrapped as a LangChain Tool named GraphQuery, issuing a Cypher query to fetch up to 5 matching triples.
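A sketch of how the orchestration and the GraphQuery tool could be wired up, reusing the node functions and `driver` from the sketches above; note that langgraph (which provides StateGraph) would need to be installed in addition to the packages listed earlier:

```python
from typing import List, TypedDict
from langgraph.graph import StateGraph, START, END
from langchain_core.tools import Tool

class ResearchState(TypedDict, total=False):
    query: str
    pages: List[dict]
    triples: List[tuple]
    answer: str

def query_graph(query: str) -> str:
    """Fetch up to 5 triples whose subject or object mentions the query terms."""
    with driver.session() as session:  # driver defined in the parse_node sketch
        records = session.run(
            "MATCH (s:Entity)-[r:REL]->(o:Entity) "
            "WHERE toLower(s.name) CONTAINS toLower($q) OR toLower(o.name) CONTAINS toLower($q) "
            "RETURN s.name, r.type, o.name LIMIT 5",
            q=query,
        )
        return "\n".join(f"{r['s.name']} {r['r.type']} {r['o.name']}" for r in records)

graph_query_tool = Tool(
    name="GraphQuery",
    func=query_graph,
    description="Look up facts about a topic in the Neo4j knowledge graph.",
)

# Wire the three nodes into the START → crawl → parse → draft → END flow.
workflow = StateGraph(ResearchState)
workflow.add_node("crawl_node", crawl_node)
workflow.add_node("parse_node", parse_node)
workflow.add_node("draft_node", draft_node)
workflow.add_edge(START, "crawl_node")
workflow.add_edge("crawl_node", "parse_node")
workflow.add_edge("parse_node", "draft_node")
workflow.add_edge("draft_node", END)
app = workflow.compile()

# Example invocation:
# result = app.invoke({"query": "latest advances in retrieval-augmented generation"})
```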
🚀 Usage

Ensure all services (Neo4j, Ollama) are running.
Set your environment variables.
Run the script:
```bash
python deep_research_agentic_system.py
```

When prompted:
```
Enter your research question:
```

Type any query (e.g. “latest advances in retrieval-augmented generation”) and press Enter.
Watch the multi-agent pipeline execute and display a structured, context-rich answer.
⚙️ Customization

NER / Relation Extraction
Replace the extract_entities_relations() stub in parse_node() with your preferred method (e.g. spaCy, an LLM call, custom regex).
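As one illustration, a minimal spaCy-based replacement might look like this (spaCy and its en_core_web_sm model must be installed separately; it only extracts naive subject–verb–object triples):

```python
from typing import List, Tuple
import spacy

# Requires: pip install spacy && python -m spacy download en_core_web_sm
nlp = spacy.load("en_core_web_sm")

def extract_entities_relations(text: str) -> List[Tuple[str, str, str]]:
    """Naive triple extraction: one (subject, root verb, object) tuple per sentence."""
    triples = []
    for sent in nlp(text).sents:
        subj = next((t for t in sent if t.dep_ in ("nsubj", "nsubjpass")), None)
        obj = next((t for t in sent if t.dep_ in ("dobj", "pobj", "attr")), None)
        if subj is not None and obj is not None:
            triples.append((subj.text, sent.root.lemma_, obj.text))
    return triples
```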
Graph & Tool Extensions
Add more LangChain tools (e.g. semantic search over ChromaDB) or new LangGraph nodes for additional processing steps.
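For instance, a ChromaDB-backed semantic-search tool could be registered alongside GraphQuery (the collection name and result count are arbitrary):

```python
import chromadb
from langchain_core.tools import Tool

chroma_client = chromadb.Client()  # in-memory; use chromadb.PersistentClient(path=...) to persist
collection = chroma_client.get_or_create_collection("pages")

def semantic_search(query: str) -> str:
    """Return the three stored page snippets most similar to the query."""
    results = collection.query(query_texts=[query], n_results=3)
    return "\n\n".join(results["documents"][0])

semantic_search_tool = Tool(
    name="SemanticSearch",
    func=semantic_search,
    description="Retrieve crawled page snippets semantically similar to the query.",
)
```

For this tool to return anything, the crawled page contents would also need to be added to the collection (e.g. with collection.add(documents=..., ids=...)) somewhere in the pipeline, such as in parse_node.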
Model & Prompt Tuning
Swap in a different Ollama model or adjust the prompt templates for specialized domains.
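Swapping the model is typically a one-line change (the model name below is illustrative and must first be pulled with ollama pull):

```python
from langchain_ollama import ChatOllama

# Any locally pulled Ollama model works here; "llama3.1" is just an example.
llm = ChatOllama(model="llama3.1", temperature=0.2)
```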
📝 Assignment for kairon.co.in

This project was built for the kairon.co.in take-home assignment.