AI

Dynamic Memory Networks for Visual and Textual Question Answering

[Image: Yurii / Adobe Stock]

Neural network architectures with memory and attention mechanisms exhibit certain reasoning capabilities required for question answering.

Caiming Xiong

April 4, 2016 1 min read

Neural network architectures with memory and attention mechanisms exhibit certain reasoning capabilities required for question answering. One such architecture, the dynamic memory network (DMN), obtained high accuracy on a variety of language tasks. However, it was not shown whether the architecture achieves strong results for question answering when supporting facts are not marked during training or whether it could be applied to other modalities such as images.

Based on an analysis of the DMN, we propose several improvements to its memory and input modules. Together with these changes we introduce a novel input module for images in order to be able to answer visual questions. Our new DMN+ model improves the state of the art on both the Visual Question Answering dataset and the babi-10k text question-answering dataset without supporting fact supervision.

Citation credit

Caiming Xiong, Stephen Merity, Richard Socher. 2016
Dynamic Memory Networks for Visual and Textual Question Answering

An image of a woman and an AI agent standing together

LLMs and Copilots Alone Won’t Save You: Why You’re Doing Enterprise AI Wrong

11 min read

Illustration showing the concept of AI agents

How Agents Can Take Smarter Actions With Prompt Builder

6 min read

Caiming Xiong

Caiming Xiong VP Salesforce Research

More by Caiming

An AI agent and a human worker hold a flag with an image of a checkmark on it: trustworthy AI

5 Ways To Build Trustworthy AI Agents

6 min read

Illustration showing AI prompts helping small business sales

5 AI Prompts for Small Business Sales

7 min read

Sales rep looking at a laptop screen: sales tech consolidation

Want the Most of AI? Start By Consolidating Your Tech Stack

2 min read

Female software developer students working on computer and laptop

Red Teaming xGen Text-Generation Model for Safety

8 min read

An illustration of a Salesforce AI agent robot interacting with floating task bubbles of human representatives.

AI-Powered Service Trends Reshaping the Telecom Industry

6 min read

8 Ways Marketing Agents Can Help You Build, Launch, and Track Campaigns Like Never Before

6 min read

An AI agent working alongside a human representative with floating task icons between them.

Human-AI Synergy in Media & Entertainment: The Next Frontier of Personalization

5 min read

Itai, Peter and Silvio at Dreamforce

The State of AI: How We Got Here (and What’s To Come)

14 min read

Get the latest articles in your inbox.

Enter a valid e-mail address

Select your Country

Select a state/province

Select a state/province

Select a state/province

Yes, I would like to receive the Salesforce 360 Highlights newsletter as well as marketing emails regarding Salesforce products, services, and events. I can unsubscribe at any time.

I agree to the Privacy Statement and to the handling of my personal information. In particular, I consent to the transfer of my personal information to other countries, including the United States, for the purpose of hosting and processing the information as set forth in the Privacy Statement. Learn More

I understand that these countries may not have the same data protection laws as the country from which I provide my personal information. For more information, click here.

Please read and agree to the Master Subscription Agreement

By registering, you confirm that you agree to the processing of your personal data by Salesforce as described in the Privacy Statement.

New to Salesforce?

About Salesforce

Popular Links