Clarification on long-term memory #28

korbinian-hoermann · 2025-01-29T19:00:47Z

Hi,

first of all, congrats on the great work !
I was wondering, if you could clarify a few points on "long-term memory" for me.

Q1: As I understand it, you do not have an explicit long-term memory module (as e.g. in Agent Workflow Memory),
it's rather distributed across its neural network parameters. Is that correct ?

And as a follow-up: in this issue you mention, you're using 'history 5' for multi step tasks and give the following example:

# To predict third action
messages.append({
    "role": "user",
    "content": [
        {
            "type": "text",
            "text": PROMPT_FOR_COMPUTER + f"{instruction}"
        },
        {
            "type": "image_url",
            "image_url": screenshot_from_init
        },
        {
            "type": "text",
            "text": previous_actions[0],
        },
        {
            "type": "image_url",
            "image_url": screenshot_from_state_0
        },
        {
            "type": "text",
            "text": previous_actions[1],
        },
        {
            "type": "image_url",
            "image_url": screenshot_from_state_1
        }
    ],
})

Q2: Does this mean, the agent never sees a full action history (not even the textual representation), but maximum the last 5 time steps?
Q3: If this is the case, do you think the agent "long-term memory" would benefit from seeing and thereby connecting whole workflows with task execution ?
Q4: In the given example, does

{
            "type": "text",
            "text": previous_actions[1],
},

contain the full prediction (thought + action) ?

The text was updated successfully, but these errors were encountered:

JjjFangg · 2025-01-31T05:48:02Z

We truly appreciate your attention to our work. Here are the answers to you question.

A1: Yes, you're correct. It's distributed across its neural network parameters.

A2: Yes, at most 5 history images are given.

A3: Yes, we believe that seeing all historical images is beneficial. However, the history5 approach is designed to balance computational efficiency and performance.

A4: Yes, your understanding is correct.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarification on long-term memory #28

Clarification on long-term memory #28

korbinian-hoermann commented Jan 29, 2025

JjjFangg commented Jan 31, 2025

Clarification on long-term memory #28

Clarification on long-term memory #28

Comments

korbinian-hoermann commented Jan 29, 2025

JjjFangg commented Jan 31, 2025