Configurable image descriptor fn `get_image_description` and support for list type content in messages #2286

suneeta-mall · 2025-03-03T04:39:24Z

🚀 The feature

Hey, thanks for your work on mem0. I was wondering what the vision is for increasing support for multi-modal input/messages. At the moment I am running into a few issues, namely:

List-based content cannot be added/indexed by mem0 (with a Redis backend). An example of list-based content is shown here:

    [
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "Describe this image",
                },
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://someimage_somewhere_jumbo.jpeg"
                    },
                },
            ],
        }
    ]

These fail with `TypeError: list indices must be integers or slices, not str`, even though this is a valid format for multi-modal OpenAI messages.
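As a possible stopgap, list-type content can be expanded into one message per part before it is handed to mem0. The sketch below is only an illustration: `split_list_messages` is a hypothetical helper written for this issue, not part of mem0, and it assumes each part is a dict in the OpenAI message format shown above.

```python
from typing import Any, Dict, List

Message = Dict[str, Any]


def split_list_messages(messages: List[Message]) -> List[Message]:
    """Expand every message whose content is a list into one message per part.

    Hypothetical helper (not a mem0 API). Messages whose content is a
    string or a dict are passed through unchanged.
    """
    expanded: List[Message] = []
    for message in messages:
        content = message["content"]
        if isinstance(content, list):
            # One output message per content part, keeping role and other keys.
            for part in content:
                expanded.append({**message, "content": part})
        else:
            expanded.append(message)
    return expanded


messages = [
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe this image"},
            {
                "type": "image_url",
                "image_url": {"url": "https://someimage_somewhere_jumbo.jpeg"},
            },
        ],
    }
]

flat_messages = split_list_messages(messages)
```

After this step every `content` value is a dict or a string, so code that indexes it with a string key such as `content["type"]` no longer hits a list.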

I can see that JSON/dict content is processed fine, however; i.e., the following is okay:

```json
[
    {
        "role": "user",
        "content": {
            "type": "text",
            "text": "Describe this image"
        }
    },
    {
        "role": "user",
        "content": {
            "type": "image_url",
            "image_url": {
                "url": "https://someimage_somewhere_jumbo.jpeg"
            }
        }
    }
]
```

The image descriptor method `get_image_description` assumes a call to OpenAI. Can we make this method configurable for `base_url`, `api_key`, and `model` (i.e., in general, via `BaseLlmConfig`) so that it can point to any LLM?

mem0/mem0/memory/utils.py

Line 48 in f4dc5f6

def get_image_description(image_url):
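To make the request concrete, here is a rough sketch of what a configurable descriptor could look like. Everything in it is an assumption for illustration: `ImageDescriberConfig`, its `prompt` field, `build_messages`, and the `complete` parameter are hypothetical names, not mem0's actual `BaseLlmConfig` or implementation. The default backend assumes the `openai` Python SDK (v1 client), pointed at any OpenAI-compatible endpoint through `base_url`.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Optional

Messages = List[Dict[str, Any]]


@dataclass
class ImageDescriberConfig:
    """Hypothetical config; fields mirror the ones requested in this issue."""

    model: str
    base_url: Optional[str] = None  # None lets the SDK use its default endpoint
    api_key: Optional[str] = None   # None lets the SDK read its environment variable
    prompt: str = "Describe this image in a few sentences."


def build_messages(image_url: str, prompt: str) -> Messages:
    """Build a multi-modal chat message in the OpenAI list-content format."""
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url", "image_url": {"url": image_url}},
            ],
        }
    ]


def openai_complete(config: ImageDescriberConfig, messages: Messages) -> str:
    """Default backend: an OpenAI-compatible chat completion call."""
    from openai import OpenAI  # imported lazily so other backends need no SDK

    client = OpenAI(base_url=config.base_url, api_key=config.api_key)
    response = client.chat.completions.create(model=config.model, messages=messages)
    return response.choices[0].message.content


def get_image_description(
    image_url: str,
    config: ImageDescriberConfig,
    complete: Callable[[ImageDescriberConfig, Messages], str] = openai_complete,
) -> str:
    """Describe an image with whichever model and endpoint the config names."""
    return complete(config, build_messages(image_url, config.prompt))
```

Passing a different `complete` callable would let a non-OpenAI-compatible model be used as the descriptor without changing the call site.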

Motivation, pitch

To make mem0 more usable in a multi-modal setting where custom image descriptors can be more valuable for cost and domain fit purposes.


deshraj · 2025-03-04T07:25:12Z

Thanks for opening the issue @suneeta-mall. We are happy to add support for it.

Dev-Khant · 2025-03-04T18:48:44Z

Hey @suneeta-mall, I'm working on this issue. Can you please elaborate on why the text field needs to be passed in the content key? I want to understand the use case here. Thanks!

suneeta-mall changed the title ~~Configurable image descriptor~~ Configurable image descriptor fn get_image_description and support for list type content in messages Mar 3, 2025

deshraj assigned Dev-Khant Mar 4, 2025

Dev-Khant linked a pull request Mar 4, 2025 that will close this issue

Improve multimodal functionality #2297

Open

2 tasks
