Towards Multi-modal Graph Large Language Model
Multi-modal graphs are everywhere in the digital world.
Yet the tools used to understand them haven't evolved as much as one would expect.
What if the same model could handle your social network analysis, molecular discovery, AND urban planning tasks?
A new paper from Tsinghua University proposes Multi-modal Graph Large Language Models (MG-LLM) - a paradigm shift in how we process complex interconnected data that combines text, images, audio, and structured relationships.
Think of it as ChatGPT for graphs, but with eyes, ears, and an understanding of structure.
Their key insight? Treating all graph tasks as generative problems.
Instead of training separate models for node classification, link prediction, or graph reasoning, MG-LLM frames everything as transforming one multi-modal graph into another.
This unified approach means the same model that predicts protein interactions could also analyze social media networks or urban traffic patterns.
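To make the framing concrete, here is a minimal sketch (the class and function names are hypothetical, not the paper's actual interface) of how node classification and link prediction can both be cast as graph-to-graph generation: the model always takes a multi-modal graph in and emits a multi-modal graph out, with the task's answer written back as nodes, edges, or attributes.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class MultiModalGraph:
    """A toy multi-modal graph: nodes carry arbitrary payloads (text, images,
    audio features, ...), edges are typed (source, relation, target) triples."""
    nodes: dict[str, dict[str, Any]] = field(default_factory=dict)
    edges: list[tuple[str, str, str]] = field(default_factory=list)


def mg_llm_generate(task: str, graph: MultiModalGraph) -> MultiModalGraph:
    """Stand-in for the generative model: every task maps an input graph to an
    output graph. The 'generation' here is faked with trivial rules so the
    unified interface is runnable."""
    out = MultiModalGraph(
        nodes={node_id: dict(attrs) for node_id, attrs in graph.nodes.items()},
        edges=list(graph.edges),
    )
    if task == "node_classification":
        # The generated graph is the input graph with predicted labels
        # attached as node attributes.
        for attrs in out.nodes.values():
            attrs["predicted_label"] = "protein" if "sequence" in attrs else "unknown"
    elif task == "link_prediction":
        # The generated graph adds predicted edges between existing nodes.
        ids = list(out.nodes)
        if len(ids) >= 2:
            out.edges.append((ids[0], "predicted_interaction", ids[1]))
    return out


# Usage: the same interface serves both tasks.
g = MultiModalGraph(
    nodes={"p1": {"sequence": "MKTAYIAK"}, "p2": {"sequence": "GAVLIPFW"}},
    edges=[("p1", "binds", "p2")],
)
print(mg_llm_generate("node_classification", g).nodes["p1"]["predicted_label"])  # protein
print(mg_llm_generate("link_prediction", g).edges[-1])  # ('p1', 'predicted_interaction', 'p2')
```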
What makes this particularly exciting is the vision for natural language interaction with graph data. Imagine querying complex molecular structures or editing knowledge graphs using plain English, without learning specialized query languages.
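As a rough illustration of that contrast (the Cypher query is one example of a specialized graph query language; the plain-English prompt format is hypothetical, since the paper does not prescribe exact syntax):

```python
# Today: a specialized graph query language such as Cypher.
cypher_query = """
MATCH (d:Drug)-[:INHIBITS]->(p:Protein {name: 'EGFR'})
RETURN d.name
"""

# The MG-LLM vision: the same intent expressed in plain English.
natural_language_query = "Which drugs in this graph inhibit the protein EGFR?"
```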
The challenges remain substantial, from handling multi-granular data (a single pixel up to a full image) to managing multi-scale tasks (an entire graph as input, a single node as output).
But if successful, this could fundamentally change how industries that have barely scratched the surface of AI adoption draw insights from graph data.
↓
𝐖𝐚𝐧𝐭 𝐭𝐨 𝐤𝐞𝐞𝐩 𝐮𝐩? Join my newsletter with 50k+ readers and be the first to learn about the latest AI research: llmwatch.com 💡