Spaces:

jenngang
/

test-chatbot-openai

Running

App Files Files Community

jenngang commited on Apr 17

Commit

9cab537

verified ·

1 Parent(s): 30ed58e

Upload app.py with huggingface_hub

Browse files

Files changed (1) hide show

app.py +10 -16

app.py CHANGED Viewed

@@ -116,8 +116,7 @@ def expand_query(state):
         Dict: The updated state with the expanded query.
     """
     print("---------Expanding Query---------")
-    system_message = # ________________________
-    '''
     You are a domain expert assisting in answering questions related to research papers.
     Convert the user query into something that a nutritionist would understand. Use domain related words.
     Return 3 related search queries based on the user's request seperated by newline.
@@ -200,8 +199,7 @@ def craft_response(state: Dict) -> Dict:
         Dict: The updated state with the generated response.
     """
     print("---------craft_response---------")
-    system_message = # ________________________
-    '''
     Generates a response to a user query and context provided.
     Parameters:
@@ -250,10 +248,9 @@ def score_groundedness(state: Dict) -> Dict:
         Dict: The updated state with the groundedness score.
     """
     print("---------check_groundedness---------")
-    system_message = # ________________________
-    '''
     You are tasked with rating AI generated answers to questions posed by users.
-    Please act as an impartial judge and evaluate the quality of the provided answer which attempts to answer the provided question based on a provided context.
     In the input, the context is {context}, while the AI generated response is {response}.
     Evaluation criteria:
@@ -300,11 +297,10 @@ def check_precision(state: Dict) -> Dict:
         Dict: The updated state with the precision score.
     """
     print("---------check_precision---------")
-    system_message = # ________________________
-    '''
     Given question, answer and context verify if the context was useful in arriving at the given answer.
     Give verdict as "1" if useful and "0" if not useful.
-    Output your result as a float number between 0 and 1
     Give verdict as a scaled numeric value of type float between 0 and 1, such that
     0 or near 0 if it is least useful, 0.5 or near 0.5 if retry is warranted, and 1 or close to 1 is most useful.
     Do not show any instructions for deriving your answer.
@@ -338,9 +334,8 @@ def refine_response(state: Dict) -> Dict:
     """
     print("---------refine_response---------")
-    system_message = # ________________________
-    '''
-    Since the last response failded the groundedness test, and is deemed not satisfactory,
     use the feedback in terms of the query, context and the last response
     to identify potential gaps, ambiguities, or missing details, and
     to suggest improvements to enhance accuracy and completeness of the response.
@@ -374,9 +369,8 @@ def refine_query(state: Dict) -> Dict:
         Dict: The updated state with query refinement suggestions.
     """
     print("---------refine_query---------")
-    system_message = # ________________________
-    '''
-    Since the last response failded the precision test, and is deemed not satisfactory,
     use the feedback in terms of the query, context and re-generate extended queries
     to identify specific keywords, scope refinements, or missing details, and
     to provides structured suggestions for improvement to enhance accuracy and completeness of the response.

         Dict: The updated state with the expanded query.
     """
     print("---------Expanding Query---------")
+    system_message = '''
     You are a domain expert assisting in answering questions related to research papers.
     Convert the user query into something that a nutritionist would understand. Use domain related words.
     Return 3 related search queries based on the user's request seperated by newline.
         Dict: The updated state with the generated response.
     """
     print("---------craft_response---------")
+    system_message = '''
     Generates a response to a user query and context provided.
     Parameters:
         Dict: The updated state with the groundedness score.
     """
     print("---------check_groundedness---------")
+    system_message = '''
     You are tasked with rating AI generated answers to questions posed by users.
+    Please act as an impartial judge and evaluate the quality of the provided answer which attempts to answer the provided question based on a provided context.
     In the input, the context is {context}, while the AI generated response is {response}.
     Evaluation criteria:
         Dict: The updated state with the precision score.
     """
     print("---------check_precision---------")
+    system_message = '''
     Given question, answer and context verify if the context was useful in arriving at the given answer.
     Give verdict as "1" if useful and "0" if not useful.
+    Output your result as a float number between 0 and 1
     Give verdict as a scaled numeric value of type float between 0 and 1, such that
     0 or near 0 if it is least useful, 0.5 or near 0.5 if retry is warranted, and 1 or close to 1 is most useful.
     Do not show any instructions for deriving your answer.
     """
     print("---------refine_response---------")
+    system_message = '''
+    Since the last response failded the groundedness test, and is deemed not satisfactory,
     use the feedback in terms of the query, context and the last response
     to identify potential gaps, ambiguities, or missing details, and
     to suggest improvements to enhance accuracy and completeness of the response.
         Dict: The updated state with query refinement suggestions.
     """
     print("---------refine_query---------")
+    system_message = '''
+    Since the last response failded the precision test, and is deemed not satisfactory,
     use the feedback in terms of the query, context and re-generate extended queries
     to identify specific keywords, scope refinements, or missing details, and
     to provides structured suggestions for improvement to enhance accuracy and completeness of the response.