CodeForPhilly · sahilds1 · Jul 30, 2025 · Jul 30, 2025 · Aug 5, 2025 · Aug 5, 2025
diff --git a/README.md b/README.md
@@ -7,27 +7,26 @@ for patients with bipolar disorder, helping them shorten their journey to stabil
 
 You can view the current build of the website here: [https://balancertestsite.com](https://balancertestsite.com/)
 
-## Contributing 
+## Contributing
 
 ### Join the Balancer community
 
-Balancer is a [Code for Philly](https://www.codeforphilly.org/) project 
+Balancer is a [Code for Philly](https://www.codeforphilly.org/) project
 
 Join the [Code for Philly Slack and introduce yourself](https://codeforphilly.org/projects/balancer) in the #balancer channel
 
 The project kanban board is [on GitHub here](https://github.com/orgs/CodeForPhilly/projects/2)
 
 ### Code for Philly Code of Conduct
 
-The Code for Philly Code of Conduct is [here](https://codeforphilly.org/pages/code_of_conduct/) 
+The Code for Philly Code of Conduct is [here](https://codeforphilly.org/pages/code_of_conduct/)
 
-### Setting up a development environment   
+### Setting up a development environment
 
 Get the code using git by either forking or cloning `CodeForPhilly/balancer-main`
 
 Tools used to run Balancer:
 1. `OpenAI API`: Ask for an API key and add it to `config/env/env.dev`
-2. `Anthropic API`: Ask for an API key and add it to `config/env/env.dev`
 
 Tools used for development:
 1. `Docker`: Install Docker Desktop
@@ -36,15 +35,15 @@ Tools used for development:
 
 ### Running Balancer for development
 
-Start the Postgres, Django REST, and React services by starting Docker Desktop and running `docker compose up --build` 
+Start the Postgres, Django REST, and React services by starting Docker Desktop and running `docker compose up --build`
 
 #### Postgres
-- Download a sample of papers to upload from [https://balancertestsite.com](https://balancertestsite.com/) 
+- Download a sample of papers to upload from [https://balancertestsite.com](https://balancertestsite.com/)
 - The email and password of `pgAdmin` are specified in `balancer-main/docker-compose.yml`
 - The first time you use `pgAdmin` after building the Docker containers you will need to register the server.
     - The `Host name/address` is the Postgres server service name in the Docker Compose file
     - The `Username` and `Password` are the Postgres server environment variables in the Docker Compose file
-- You can use the below code snippet to  query the database from a Jupyter notebook: 
+- You can use the below code snippet to  query the database from a Jupyter notebook:
 
 ```
 from sqlalchemy import create_engine
@@ -100,6 +99,6 @@ The Balancer website is a Postgres, Django REST, and React project. The source c
 
 ![Architecture Drawing](Architecture.png)
 
-## License 
+## License
 
 Balancer is licensed under the [AGPL-3.0 license](https://choosealicense.com/licenses/agpl-3.0/)
diff --git a/config/env/env.dev b/config/env/env.dev
@@ -10,7 +10,6 @@ SQL_PORT=5432
 DATABASE=postgres
 LOGIN_REDIRECT_URL=
 OPENAI_API_KEY=
-ANTHROPIC_API_KEY=
 PINECONE_API_KEY=
 EMAIL_HOST_USER=
-EMAIL_HOST_PASSWORD=
+EMAIL_HOST_PASSWORD=
diff --git a/server/api/services/prompt_services.py b/server/api/services/prompt_services.py
@@ -0,0 +1,198 @@
+"""
+Centralized prompt management for the application.
+Contains all prompts used across different services.
+"""
+
+
+class PromptTemplates:
+    """Central repository for all prompt templates used in the application."""
+
+    # Text Extraction
+
+    TEXT_EXTRACTION_RULE_EXTRACTION = """
+    You're analyzing medical text from multiple sources. Each chunk is labeled [chunk-X].
+
+    Act as a seasoned physician or medical professional who treats patients with bipolar disorder.
+
+    Identify rules for medication inclusion or exclusion based on medical history or concerns.
+
+    For each rule you find, return a JSON object using the following format:
+
+    {
+    "rule": "<condition or concern>",
+    "type": "INCLUDE" or "EXCLUDE",
+    "reason": "<short explanation for why this rule applies>",
+    "medications": ["<medication 1>", "<medication 2>", ...],
+    "source": "<chunk-X>"
+    }
+
+    Only include rules that are explicitly stated or strongly implied in the chunk.
+
+    Only use the chunks provided. If no rule is found in a chunk, skip it.
+
+    Return the entire output as a JSON array.
+    """
+
+    # Embeddings/Search
+
+    EMBEDDINGS_QUERY_RESPONSE = """You are an AI assistant tasked with providing detailed, well-structured responses based
+    on the information provided in [PROVIDED-INFO]. Follow these guidelines strictly:
+    1. Content: Use information contained within [PROVIDED-INFO] to answer the question.
+    2. Organization: Structure your response with clear sections and paragraphs.
+    3. Citations: After EACH sentence that uses information from [PROVIDED-INFO],
+    include a citation in this exact format:***[{{file_id}}], Page {{page_number}}, Chunk {{chunk_number}}*** .
+    Only use citations that correspond to the information you're presenting.
+    4. Clarity: Ensure your answer is well-structured and easy to follow.
+    5. Direct Response: Answer the user's question directly without unnecessary introductions or filler phrases.
+    Here's an example of the required response format:
+    ________________________________________
+    See's Candy in the context of sales during a specific event. The candy counters rang up 2,690 individual sales on a Friday,
+    and an additional 3,931 transactions on a Saturday ***[16s848as-vcc1-85sd-r196-7f820a4s9de1, Page 5, Chunk 26]***.
+    People like the consumption of fudge and peanut brittle the most ***[130714d7-b9c1-4sdf-b146-fdsf854cad4f, Page 9, Chunk 19]***.
+    Here is the history of See's Candy: the company was purchased in 1972, and its products have not been materially
+    altered in 101 years ***[895sdsae-b7v5-416f-c84v-7f9784dc01e1, Page 2, Chunk 13]***.
+    Bipolar disorder treatment often involves mood stabilizers. Lithium is a commonly prescribed mood stabilizer
+    effective in reducing manic episodes ***[b99988ac-e3b0-4d22-b978-215e814807f4, Page 29, Chunk 122]***.
+    For acute hypomania or mild to moderate mania, initial treatment with risperidone or olanzapine monotherapy is
+    suggested ***[b99988ac-e3b0-4d22-b978-215e814807f4, Page 24, Chunk 101]***.
+    ________________________________________
+    Please provide your response to the user's question following these guidelines precisely.
+    [PROVIDED-INFO] = {listOfEmbeddings}"""
+
+    # Conversation/Chat
+
+    CONVERSATION_SYSTEM_PROMPT = """You are a knowledgeable assistant.
+    Balancer is a powerful tool for selecting bipolar medication for patients. We are open-source and available for free use.
+    Your primary role is to assist licensed clinical professionals with information related to Balancer and bipolar medication selection.
+    If applicable, use the supplied tools to assist the professional."""
+
+    CONVERSATION_PAGE_CONTEXT_PROMPT = """If applicable, please use the following content to ask questions.
+    If not applicable, please answer to the best of your ability: {page_context}"""
+
+    MEDICINE_DESCRIPTION_PROMPT = """Give a brief description of this medicine: %s"""
+
+    # Title Generation
+
+    TITLE_GENERATION_SYSTEM_PROMPT = (
+        """You are a helpful assistant that generates short, descriptive titles."""
+    )
+
+    TITLE_GENERATION_USER_PROMPT = """Based on the following conversation, generate a short, descriptive title (max 6 words): {context}"""
+
+    @classmethod
+    def get_text_extraction_prompt(cls):
+        """Get the text extraction rule extraction prompt."""
+        return cls.TEXT_EXTRACTION_RULE_EXTRACTION
+
+    @classmethod
+    def get_embeddings_query_prompt(cls, list_of_embeddings):
+        """Get the embeddings query response prompt with embedded data."""
+        return cls.EMBEDDINGS_QUERY_RESPONSE.format(listOfEmbeddings=list_of_embeddings)
+
+    @classmethod
+    def get_conversation_system_prompt(cls):
+        """Get the conversation system prompt."""
+        return cls.CONVERSATION_SYSTEM_PROMPT
+
+    @classmethod
+    def get_conversation_page_context_prompt(cls, page_context):
+        """Get the conversation page context prompt."""
+        return cls.CONVERSATION_PAGE_CONTEXT_PROMPT.format(page_context=page_context)
+
+    @classmethod
+    def get_medicine_description_prompt(cls, tokens):
+        """Get the medicine description prompt."""
+        return cls.MEDICINE_DESCRIPTION_PROMPT % tokens
+
+    @classmethod
+    def get_title_generation_system_prompt(cls):
+        """Get the title generation system prompt."""
+        return cls.TITLE_GENERATION_SYSTEM_PROMPT
+
+    @classmethod
+    def get_title_generation_user_prompt(cls, context):
+        """Get the title generation user prompt."""
+        return cls.TITLE_GENERATION_USER_PROMPT.format(context=context)
+
+    # Assistant Instructions
+    ASSISTANT_TOOL_DESCRIPTION = """
+    Search your internal library of bipolar disorder sources for information relevant to answering the user's question.
+    Call this function when you need to find specific information from your source library
+    to provide an accurate, citation-backed response. Always search before answering questions
+    about bipolar disorder topics.
+    """
+
+    ASSISTANT_TOOL_PROPERTY_DESCRIPTION = """
+    A specific search query to find relevant information in your source library.
+    Use keywords, phrases, or questions related to what the user is asking about.
+    Be specific rather than generic - use terms that would appear in the relevant sources.
+    """
+
+    ASSISTANT_INSTRUCTIONS = """
+    When you are asked a question, respond as if you are a chatbot with a library of sources that the user can't see.
+    The user did not upload these sources, so they don't know about them.
+    You have to explain what is in the sources and give references to the sources.
+
+    When a prompt is received that is unrelated to bipolar disorder, mental health treatment, or psychiatric medications,
+    respond to the user by saying you are limited to bipolar-specific conversations.
+
+    You are an AI assistant that helps users find and understand information about bipolar disorder
+    from your internal library of bipolar disorder research sources using semantic search.
+
+    SEMANTIC SEARCH STRATEGY:
+    - Always perform semantic search using the search_documents function when users ask questions
+    - Use conceptually related terms and synonyms, not just exact keyword matches
+    - Search for the meaning and context of the user's question, not just literal words
+    - Consider medical terminology, lay terms, and related conditions when searching
+
+    FUNCTION USAGE:
+    - When a user asks about information that might be in your source library ALWAYS use the search_documents function first
+    - Perform semantic searches using concepts, symptoms, treatments, and related terms from the user's question
+    - Only provide answers based on information found through your source searches
+
+    RESPONSE FORMAT:
+    After gathering information through semantic searches, provide responses that:
+    1. Answer the user's question directly using only the found information
+    2. Structure responses with clear sections and paragraphs
+    3. Explain what information you found in your sources and provide context
+    4. Include citations using this exact format: ***[Name {name}, Page {page_number}]***
+    5. Only cite information that directly supports your statements
+
+    If no relevant information is found in your source library, clearly state that the information is not available in your current sources.
+    """
+
+    @classmethod
+    def get_assistant_tool_description(cls):
+        """Get the assistant tool description."""
+        return cls.ASSISTANT_TOOL_DESCRIPTION
+
+    @classmethod
+    def get_assistant_tool_property_description(cls):
+        """Get the assistant tool property description."""
+        return cls.ASSISTANT_TOOL_PROPERTY_DESCRIPTION
+
+    @classmethod
+    def get_assistant_instructions(cls):
+        """Get the assistant instructions."""
+        return cls.ASSISTANT_INSTRUCTIONS
+
+    # Risk Assessment
+
+    RISK_BASIC_MEDICATION_PROMPT = """You are to provide a concise list of 5 key benefits and 5 key risks for the medication suggested
+    when taking it for Bipolar. Each point should be short, clear and be kept under 10 words. Begin the benefits
+    section with !!!benefits!!! and the risks section with !!!risk!!!. Please provide this information for the medication: {medication}."""
+
+    RISK_DIAGNOSIS_MEDICATION_PROMPT = """You are providing medication information from a diagnosis/clinical perspective.
+    Provide a concise list of 5 key benefits and 5 key risks for the medication {medication} when prescribed for Bipolar disorder,
+    focusing on clinical evidence and diagnostic considerations. Each point should be short, clear and be kept under 10 words.
+    Begin the benefits section with !!!benefits!!! and the risks section with !!!risk!!!."""
+
+    @classmethod
+    def get_risk_basic_medication_prompt(cls, medication):
+        """Get the basic medication risk/benefit prompt."""
+        return cls.RISK_BASIC_MEDICATION_PROMPT.format(medication=medication)
+
+    @classmethod
+    def get_risk_diagnosis_medication_prompt(cls, medication):
+        """Get the diagnosis-specific medication risk/benefit prompt."""
+        return cls.RISK_DIAGNOSIS_MEDICATION_PROMPT.format(medication=medication)
diff --git a/server/api/views/assistant/views.py b/server/api/views/assistant/views.py
@@ -15,6 +15,7 @@
 
 from ...services.embedding_services import get_closest_embeddings
 from ...services.conversions_services import convert_uuids
+from ...services.prompt_services import PromptTemplates
 
 # Configure logging
 logger = logging.getLogger(__name__)
@@ -119,18 +120,8 @@ def post(self, request):
 
             client = OpenAI(api_key=os.environ.get("OPENAI_API_KEY"))
 
-            TOOL_DESCRIPTION = """
-            Search the user's uploaded documents for information relevant to answering their question.
-            Call this function when you need to find specific information from the user's documents
-            to provide an accurate, citation-backed response. Always search before answering questions
-            about document content.
-            """
-
-            TOOL_PROPERTY_DESCRIPTION = """
-            A specific search query to find relevant information in the user's documents.
-            Use keywords, phrases, or questions related to what the user is asking about.
-            Be specific rather than generic - use terms that would appear in the relevant documents.
-            """
+            TOOL_DESCRIPTION = PromptTemplates.get_assistant_tool_description()
+            TOOL_PROPERTY_DESCRIPTION = PromptTemplates.get_assistant_tool_property_description()
 
             tools = [
                 {
@@ -195,30 +186,7 @@ def search_documents(query: str, user=user) -> str:
                 except Exception as e:
                     return f"Error searching documents: {str(e)}. Please try again if the issue persists."
 
-            INSTRUCTIONS = """
-            You are an AI assistant that helps users find and understand information about bipolar disorder
-            from their uploaded bipolar disorder research documents using semantic search.
-
-            SEMANTIC SEARCH STRATEGY:
-            - Always perform semantic search using the search_documents function when users ask questions
-            - Use conceptually related terms and synonyms, not just exact keyword matches
-            - Search for the meaning and context of the user's question, not just literal words
-            - Consider medical terminology, lay terms, and related conditions when searching
-
-            FUNCTION USAGE:
-            - When a user asks about information that might be in their documents ALWAYS use the search_documents function first
-            - Perform semantic searches using concepts, symptoms, treatments, and related terms from the user's question
-            - Only provide answers based on information found through document searches
-
-            RESPONSE FORMAT:
-            After gathering information through semantic searches, provide responses that:
-            1. Answer the user's question directly using only the found information
-            2. Structure responses with clear sections and paragraphs
-            3. Include citations using this exact format: ***[Name {name}, Page {page_number}]***
-            4. Only cite information that directly supports your statements
-
-            If no relevant information is found in the documents, clearly state that the information is not available in the uploaded documents.
-            """
+            INSTRUCTIONS = PromptTemplates.get_assistant_instructions()
 
             MODEL_DEFAULTS = {
                 "instructions": INSTRUCTIONS,