Create 03_evaluation-results.json
03_evaluation-results.json +85 -0
03_evaluation-results.json
ADDED
@@ -0,0 +1,85 @@
{
"groundedness": {
"0": {
"groundedness": 5.0,
"gpt_groundedness": 5.0,
"groundedness_reason": "The RESPONSE thoroughly addresses the QUERY by providing specific benefits of the PerksPlus program, making it both relevant and complete based on the CONTEXT. It does not introduce unrelated information and accurately reflects the purpose of the program."
},
"1": {
"groundedness": 1.0,
"gpt_groundedness": 1.0,
"groundedness_reason": "The response does not provide any relevant information or attempt to answer the query about the number of windows in New York. It merely states that the context does not contain the necessary information, which does not address the question at all. Thus, it deserves a score of 1 for being completely unrelated."
},
"2": {
"groundedness": 3.0,
"gpt_groundedness": 3.0,
"groundedness_reason": "The RESPONSE provides a general overview of what annual evaluations might consider, but it does not relate to any specific information from the CONTEXT. Since the CONTEXT does not provide any details about evaluation criteria, the RESPONSE cannot be considered fully grounded. It is relevant to the topic but lacks specific anchoring in the provided information."
}
},
"query": {
"0": "What are the benefits of the PerksPlus program?",
"1": "How many windows are there in new york?",
"2": "How will I be evaluated each year?"
},
"response": {
"0": "You are asking about the benefits offered by the PerksPlus program. \n\nThe PerksPlus program provides a variety of health and wellness reimbursement options designed for Contoso Electronics employees. Key benefits include financial support for health-related expenses, which may encompass wellness activities, fitness memberships, and other health initiatives aimed at promoting employees' overall well-being.\n\nThis program encourages a healthier lifestyle among employees by mitigating some of the costs associated with health and wellness activities, thereby enhancing their work-life balance and overall job satisfaction.\n\nOverall, the PerksPlus program aims to support the health and wellness of employees while fostering a positive workplace environment.",
"1": "Your question is about the number of windows in New York. \n\nUnfortunately, the provided context does not contain any information or statistics regarding the number of windows in New York or related data. \n\nAs a result, I am unable to provide a precise answer to your question. The number of windows in New York would depend on numerous factors, including the types of buildings, their sizes, and designs, which would require a comprehensive study or estimation to determine accurately.",
"2": "You are inquiring about the process and criteria for your annual evaluation. \n\nTypically, annual evaluations often consider several key factors including job performance, achievement of goals, adherence to company values, collaboration with team members, and overall contributions to the organization. Evaluations may involve feedback from supervisors, self-assessments, and possibly peer reviews.\n\nIf there is a specific evaluation model or set of criteria used within your organization, it would allow for a more tailored understanding of how you will be assessed each year.\n\nIn summary, you can expect your annual evaluation to be based on your performance, accomplishment of set objectives, and contributions to team and company goals."
},
"context": {
"0": [
[
{
"id": "UGVya3NQbHVzLnBkZjE=",
"content": "Title: PerksPlus.pdfThis document contains information generated using a language model (Azure OpenAI ). The information contained in this document is only for demonstration purposes and does not reflect the opinions or beliefs of Microsoft. Microsoft makes no representations or warranties of any kind, express or implied, about the completeness, accuracy, reliability, suitability or availability with respect to the information contained in this document. All rights reserved to Microsoft",
"filepath": "PerksPlus.pdf",
"title": "",
"url": "PerksPlus.pdf"
},
{
"id": "UGVya3NQbHVzLnBkZjA=",
"content": "Title: PerksPlus.pdf \n \n \nPerksPlus Health and Wellness \nReimbursement Program for \nContoso Electronics Employees \n \n \n \n \n \n \n",
"filepath": "PerksPlus.pdf",
"title": "",
"url": "PerksPlus.pdf"
}
]
],
"1": [
[
{
"id": "QmVuZWZpdF9PcHRpb25zLnBkZjE=",
"content": "Title: Benefit_Options.pdfThis document contains information generated using a language model (Azure OpenAI). The information \ncontained in this document is only for demonstration purposes and does not reflect the opinions or \nbeliefs of Microsoft. Microsoft makes no representations or warranties of any kind, express or implied, \nabout the completeness, accuracy, reliability, suitability or availability with respect to the information \ncontained in this document. \nAll rights reserved to Microsoft",
"filepath": "Benefit_Options.pdf",
"title": "",
"url": "Benefit_Options.pdf"
},
{
"id": "UGVya3NQbHVzLnBkZjE=",
"content": "Title: PerksPlus.pdfThis document contains information generated using a language model (Azure OpenAI ). The information \ncontained in this document is only for demonstration purposes and does not reflect the opinions or \nbeliefs of Microsoft. Microsoft makes no representations or warranties of any kind, express or implied, \nabout the completeness, accuracy, reliability, suitability or availability with respect to the information \ncontained in this document. All rights reserved to Microsoft",
"filepath": "PerksPlus.pdf",
"title": "",
"url": "PerksPlus.pdf"
}
]
],
"2": [
[
{
"id": "QmVuZWZpdF9PcHRpb25zLnBkZjE=",
"content": "Title: Benefit_Options.pdfThis document contains information generated using a language model (Azure OpenAI). The information \ncontained in this document is only for demonstration purposes and does not reflect the opinions or \nbeliefs of Microsoft. Microsoft makes no representations or warranties of any kind, express or implied, \nabout the completeness, accuracy, reliability, suitability or availability with respect to the information \ncontained in this document. All rights reserved to Microsoft",
"filepath": "Benefit_Options.pdf",
"title": "",
"url": "Benefit_Options.pdf"
},
{
"id": "cm9sZV9saWJyYXJ5LnBkZjE=",
"content": "Title: role_library.pdfThis document contains information generated using a language model (Azure OpenAI). The \ninformation contained in this document is only for demonstration purposes and does not \nreflect the opinions or beliefs of Microsoft. Microsoft makes no representations or \nwarranties of any kind, express or implied, about the completeness, accuracy, reliability, \nsuitability or availability with respect to the information contained in this document. All rights reserved to Microsoft",
"filepath": "role_library.pdf",
"title": "",
"url": "role_library.pdf"
}
]
]
}
}
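
The file above is column-oriented: each top-level key ("groundedness", "query", "response", "context") maps a row index ("0", "1", "2") to that row's value. A minimal sketch of regrouping it into one record per row, assuming this layout holds for the whole file; the helper name `columns_to_rows` and the abbreviated `sample` are hypothetical and not part of the commit:

```python
import json


def columns_to_rows(columns):
    """Transpose {column: {row_index: value}} into a list of row dicts."""
    # Collect every row index seen in any column, ordered numerically.
    indices = sorted({i for col in columns.values() for i in col}, key=int)
    return [
        {"index": i, **{name: col.get(i) for name, col in columns.items()}}
        for i in indices
    ]


# Abbreviated stand-in for 03_evaluation-results.json
# (scores and queries copied from the file, other fields omitted).
sample = json.loads("""
{
  "groundedness": {
    "0": {"groundedness": 5.0},
    "1": {"groundedness": 1.0},
    "2": {"groundedness": 3.0}
  },
  "query": {
    "0": "What are the benefits of the PerksPlus program?",
    "1": "How many windows are there in new york?",
    "2": "How will I be evaluated each year?"
  }
}
""")

rows = columns_to_rows(sample)
for row in rows:
    # One line per evaluated query: index, score, question.
    print(row["index"], row["groundedness"]["groundedness"], row["query"])
```

Reading the real file would only swap `json.loads(...)` for `json.load(open(path))`; the row view makes it easier to compare each score with its query, response and retrieved context.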