Project Scribe - Intelligent Note-Taking and Journaling

Project Scribe helps you organize your thoughts, notes, and journal entries intelligently. Built on the foundation of the GenAI Stack, it leverages Large Language Models (LLMs) and a Neo4j graph database to provide features like semantic search, automated tagging, summarization, and question answering over your personal knowledge base.

This application uses Docker for containerization, ensuring a consistent development and deployment environment.

Features

User Management: Secure registration and login using JWT.
Notes & Journals: Create, read, update, and delete notes and journals. Notes can be organized within journals.
Rich Content: Notes support text, base64 encoded images, and audio (future enhancement).
Tagging: Assign tags to notes for organization and retrieval.
Journal Templates: Define structures for consistent journal entries.
Intelligent Search:
- Semantic search using vector embeddings (powered by Sentence Transformers and Neo4j's vector index) to find conceptually similar notes/journals. Cosine similarity is used by the underlying vector index for comparison.
- Full-text keyword search (used as a fallback).
- Tag-based filtering.
LLM Integration (via Ollama):
- Ask Your Notes: Chat with your knowledge base using Retrieval-Augmented Generation (RAG).
- Auto-Tagging: Generate relevant tags for notes automatically.
- Summarization: Create concise summaries of your notes.
- Template Generation: Generate structured note templates based on type/topic.
Database Management: Includes constraints, indexing (including vector indexes), initialization logic, and a reset function.

Screenshots

Registration Page	Dashboard / Note Creation

Technology Stack

Backend: Python, FastAPI
Frontend: JavaScript, SvelteKit, Tailwind CSS
Database: Neo4j (Graph Database with Vector Index support)
LLM Engine: Ollama (for local model serving)
Embedding Model: Sentence Transformers (specifically all-MiniLM-L6-v2 by default)
Containerization: Docker, Docker Compose
Core Libraries:
- langchain-neo4j: Interacting with Neo4j Graph Database.
- langchain-huggingface: Loading embedding models.
- sse-starlette: Server-Sent Events for streaming responses.
- passlib[bcrypt]: Password hashing.
- python-jose[cryptography]: JWT token handling.
- uvicorn: ASGI server.
- python-dotenv: Environment variable management.
- requests: HTTP requests to Ollama.

Getting Started

Prerequisites

Git: Ensure Git is installed (git --version). Used for version control and cloning the repository.
Docker Installation: Before proceeding with setup, you must install Docker Desktop (MacOS/Windows) or Docker Engine (Linux) on your system. This is required to build and run the application containers.
- Install Docker
[!WARNING] There was a performance issue impacting python applications in Docker Desktop 4.24.x. Please upgrade to the latest release.
Ollama Installation (MacOS/Windows Host): If running on MacOS or Windows, you must install Ollama locally on your host machine before running Docker Compose.
- Start the Ollama server (ollama serve) before running Docker Compose.
- Download the required LLM via Ollama (e.g., ollama pull llama3). (Note: For Linux, Ollama can optionally be run inside a Docker container using the specific profiles mentioned in the next section.)

Installation & Setup

Clone the Repository:

git clone https://github.com/b3hr0uz/Project-Scribe
cd Project-Scribe # Or your repository name

Configure Environment:
- Copy the example environment file: cp env.example .env
- Edit the .env file with your specific configurations (Database credentials, Ollama URL if not default, Secret Key, etc.). See the Configuration section below for details.
Build and Run Containers:
- Standard:
```
docker compose up --build
```
- Linux (with Ollama in Docker): If you prefer to run Ollama in a container on Linux instead of the host:
```
# Ensure OLLAMA_API_URL=http://llm:11434/api/chat in .env
docker compose --profile linux up --build
```
- Linux GPU (with Ollama in Docker): For GPU acceleration on Linux:
```
# Ensure OLLAMA_API_URL=http://llm-gpu:11434/api/chat in .env
docker compose --profile linux-gpu up --build
```
  The --build flag is only necessary the first time or when Dockerfiles or build dependencies change.
Access the Application:
- Frontend UI: http://localhost:8505
- Backend API Docs: http://localhost:8585/docs
- Neo4j Browser: http://localhost:7474

Configuration

Create a .env file from env.example and modify the values as needed.

Required:

Variable Name	Default value	Description
NEO4J_URI	neo4j://database:7687	URL to Neo4j database container
NEO4J_USERNAME	neo4j	Username for Neo4j database
NEO4J_PASSWORD	password	Password for Neo4j database
SECRET_KEY	your-secret-key	IMPORTANT: Change this! Secret key for JWT token generation.
OLLAMA_API_URL	http://host.docker.internal:11434/api/chat	URL to Ollama chat API endpoint (adjust if using Linux profile/different host)
OLLAMA_MODEL	llama3	The specific Ollama model tag to use (e.g., llama3, mistral)
EMBEDDING_MODEL_NAME	all-MiniLM-L6-v2	The sentence-transformer model for embeddings
EMBEDDING_MODEL_CACHE_FOLDER	/embedding_model	Docker volume path to cache the embedding model

Optional (Integrations & Tracing):

Variable Name	Default value	Description
AWS_ACCESS_KEY_ID		Only needed for potential future AWS integrations
AWS_SECRET_ACCESS_KEY		Only needed for potential future AWS integrations
AWS_DEFAULT_REGION		Only needed for potential future AWS integrations
OPENAI_API_KEY		Only needed for potential future OpenAI integrations
GOOGLE_API_KEY		Only needed for potential future Google GenAI integrations
LANGCHAIN_ENDPOINT	"https://api.smith.langchain.com"	URL to Langchain Smith API for tracing
LANGCHAIN_TRACING_V2	false	Enable Langchain tracing v2
LANGCHAIN_PROJECT		Langchain project name for tracing
LANGCHAIN_API_KEY		Langchain API key for tracing

LLM Configuration Details

Model: Ensure the model specified in OLLAMA_MODEL is available in your Ollama instance (use ollama list or ollama pull <model>).
- Tested Models: Project Scribe has been primarily tested using llama3 as the OLLAMA_MODEL and the default all-MiniLM-L6-v2 Sentence Transformer for embeddings (EMBEDDING_MODEL_NAME). Other models might work but may require prompt adjustments.
Ollama URL:
- For Docker Desktop (MacOS/Windows) with Ollama running on the host: http://host.docker.internal:11434/api/chat is typically correct.
- For Linux with Ollama on the host: You might need to use your host IP address instead of host.docker.internal.

Development

Start Services: docker compose up
Watch Mode (Auto-rebuild): After starting services, run docker compose watch in a separate terminal for automatic container rebuilding on file changes (useful for frontend development).
Rebuild Manually: docker compose up --build
Shutdown: docker compose down (use docker compose down -v to also remove volumes like the database and model cache).

Application Components

Project Scribe consists of the following services managed by Docker Compose:

Service Name	Main files/folders	Compose name	URLs	Description
Backend API	`back-end.py`	`back-end`	http://localhost:8585	FastAPI application providing RESTful endpoints. Handles auth, CRUD, search, and LLM features.
Frontend UI	`front-end/`	`front-end`	http://localhost:8505	SvelteKit web application providing the user interface. Interacts with the Backend API.
Database	(Neo4j Image)	`database`	http://localhost:7474	Neo4j graph database storing user data, notes, journals, relationships, and vector embeddings. Access via Neo4j Browser.
LLM Service	(Ollama Image)	`llm` / `llm-gpu`	N/A (internal)	(Optional - Linux profile only) Runs the Ollama LLM service within Docker. Accessed by the Backend API.

Based on GenAI Stack

Project Scribe leverages the foundational setup provided by the GenAI Stack template. This template offered the initial Docker configuration, Neo4j integration, and examples for LLM interaction. Building upon this base, the entire full-stack application, particularly the backend API (back-end.py), was bootstrapped specifically for Project Scribe's note-taking and knowledge management features.

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
.github/media		.github/media
embedding_model		embedding_model
front-end		front-end
images		images
.dockerignore		.dockerignore
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
api.Dockerfile		api.Dockerfile
api.py		api.py
back-end.Dockerfile		back-end.Dockerfile
back-end.py		back-end.py
bot.Dockerfile		bot.Dockerfile
bot.py		bot.py
chains.py		chains.py
docker-compose.yml		docker-compose.yml
env.example		env.example
front-end.Dockerfile		front-end.Dockerfile
install_ollama.sh		install_ollama.sh
loader.Dockerfile		loader.Dockerfile
loader.py		loader.py
pdf_bot.Dockerfile		pdf_bot.Dockerfile
pdf_bot.py		pdf_bot.py
pull_model.Dockerfile		pull_model.Dockerfile
requirements.txt		requirements.txt
running_on_wsl.md		running_on_wsl.md
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project Scribe - Intelligent Note-Taking and Journaling

Features

Screenshots

Technology Stack

Getting Started

Prerequisites

Installation & Setup

Configuration

LLM Configuration Details

Development

Application Components

Based on GenAI Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

1ncompleteness/Project-Scribe

Folders and files

Latest commit

History

Repository files navigation

Project Scribe - Intelligent Note-Taking and Journaling

Features

Screenshots

Technology Stack

Getting Started

Prerequisites

Installation & Setup

Configuration

LLM Configuration Details

Development

Application Components

Based on GenAI Stack

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages