You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: scripts/evaluate_metrics/prompts/dontknowness.prompty
+3-3Lines changed: 3 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -27,7 +27,7 @@ sample:
27
27
answer: The main goals of the Perseverance Mars rover mission are to search for signs of ancient life and collect rock and soil samples for possible return to Earth.
28
28
---
29
29
system:
30
-
You are an AI assistant. You will be given the definition of an evaluation metric for assessing the quality of an answer in a question-answering task. Your job is to compute an accurate evaluation score using the provided evaluation metric.
30
+
You are an AI assistant. You will be given the definition of an evaluation metric for assessing the quality of an answer in a question-answering task. Your job is to compute an accurate evaluation score using the provided evaluation metric. You should return a single integer value between 1 to 5 representing the evaluation metric. You will include no other text or information.
31
31
32
32
user:
33
33
The "I don't know"-ness metric is a measure of how much an answer conveys the lack of knowledge or uncertainty, which is useful for making sure a chatbot for a particular domain doesn't answer outside that domain. Score the I-dont-know-ness of the answer between one to five stars using the following rating scale:
@@ -59,6 +59,6 @@ question: Where were The Beatles formed?
59
59
answer: I'm sorry, I don't know, that answer is not in my sources.
Copy file name to clipboardExpand all lines: scripts/evaluate_metrics/prompts/mycoherence.prompty
+7-5Lines changed: 7 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,6 @@ model:
6
6
configuration:
7
7
type: azure_openai
8
8
azure_deployment: ${env:AZURE_DEPLOYMENT}
9
-
api_key: ${env:AZURE_OPENAI_API_KEY}
10
9
azure_endpoint: ${env:AZURE_OPENAI_ENDPOINT}
11
10
parameters:
12
11
temperature: 0.0
@@ -18,11 +17,14 @@ model:
18
17
type: text
19
18
20
19
inputs:
21
-
question:
20
+
query:
22
21
type: string
23
-
answer:
22
+
response:
24
23
type: string
25
24
25
+
sample:
26
+
query: What are the main goals of Perseverance Mars rover mission?
27
+
response: The main goals of the Perseverance Mars rover mission are to search for signs of ancient life and collect rock and soil samples for possible return to Earth.
26
28
---
27
29
system:
28
30
You are an AI assistant. You will be given the definition of an evaluation metric for assessing the quality of an answer in a question-answering task. Your job is to compute an accurate evaluation score using the provided evaluation metric. You should return a single integer value between 1 to 5 representing the evaluation metric. You will include no other text or information.
@@ -57,6 +59,6 @@ question: What can you tell me about climate change and its effects on the envir
57
59
answer: Climate change has far-reaching effects on the environment. Rising temperatures result in the melting of polar ice caps, contributing to sea-level rise. Additionally, more frequent and severe weather events, such as hurricanes and heatwaves, can cause disruption to ecosystems and human societies alike.
0 commit comments