Changed Directory structure, small fix, prompts #3850

joachim-danswer · 2025-01-30T04:42:13Z

Description

Changed Directory structure, small fix, prompts

How Has This Been Tested?

Locally

Backporting (check the box to trigger backport action)

Note: You have to check that the action passes, otherwise resolve the conflicts manually and tag the patches.

This PR should be backported (make sure to check that the backport attempt succeeds)
[Optional] Override Linear Check

vercel · 2025-01-30T04:42:17Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
internal-search	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Feb 1, 2025 5:27am

evan-danswer

Directory structure looks a lot better! Asked ChatGPT about a few I took issue with, but mostly it's time to focus on filename/function/variable renames now. Check out the comments on the main PR #3749 before you start that, since Yuhong had some things to say on that front

evan-danswer · 2025-01-30T16:11:29Z

...end/onyx/agents/agent_search/deep_search_a/initial/individual_sub_answer_generation/edges.py

@@ -3,10 +3,10 @@

 from langgraph.types import Send

-from onyx.agents.agent_search.deep_search_a.initial__individual_sub_answer__subgraph.states import (
+from onyx.agents.agent_search.deep_search_a.initial.individual_sub_answer_generation.states import (


LLM recommends individual_sub_answer

Here was the prompt:
For a langgraph graph created with the purpose of answering a subquestion, choose the best folder name or create your own that you believe is better:

individual_sub_answer

individual_sub_answer_generation

answer_subquestion

^^ the above was using Cursor with knowledge of our current codebase, plain ChatGPT picked answer_subquestion. Seems like either is fine

evan-danswer · 2025-01-30T16:17:57Z

...ent_search/deep_search_a/initial/individual_sub_answer_generation/nodes/answer_generation.py

@@ -102,7 +102,9 @@ def answer_generation(
        )

    answer_citation_ids = get_answer_citation_ids(answer_str)
-    cited_docs = [context_docs[id] for id in answer_citation_ids]
+    cited_docs = [
+        context_docs[id] for id in answer_citation_ids if id < len(context_docs)


out of bounds citation should never happen unless the LLM is really weak, is this a difference in the docs we give the LLM vs the context docs here? Probably we prefer failing loudly

evan-danswer · 2025-01-30T16:37:50Z

...nd/onyx/agents/agent_search/deep_search_a/initial/initial_answer_generation/graph_builder.py

    generate_initial_answer,
 )
-from onyx.agents.agent_search.deep_search_a.initial__retrieval_sub_answers__subgraph.nodes.initial_answer_quality_check import (
+from onyx.agents.agent_search.deep_search_a.initial.initial_answer_generation.nodes.initial_answer_quality_check import (


Cursor thinks we should go with "individual_sub_answer" which I think is pretty weird, ChatGPT likes your choice but also thinks "generate_first_answer" is good. I'm inclined to stick with your choice since it's backed up by ChatGPT and doesn't seem clearly wrong like individual_sub_answer

prompt: For a langgraph graph created with the purpose of creating the first answer to a user question, choose the best folder name or create your own that you believe is better (keep in mind that this will be a subfolder of a new directory "initial" that contains the subgraphs used for gathering information and generating the first answer to the user question):

retrieval_sub_answers

initial_answer_generation

answer_question

evan-danswer · 2025-01-30T16:43:45Z

...nyx/agents/agent_search/deep_search_a/refininement/sub_answer_consolidation/graph_builder.py

    AnswerQuestionState,
 )
-from onyx.agents.agent_search.deep_search_a.refinement__consolidate_sub_answers__subgraph.edges import (
+from onyx.agents.agent_search.deep_search_a.refininement.sub_answer_consolidation.edges import (


refininement -> refinement

evan-danswer · 2025-01-30T16:45:27Z