GraphNews

4434 bookmarks
Custom sorting
Agentic Knowledge Graph Construction
Agentic Knowledge Graph Construction
Stop manually building your company's brain. โŒ Having reviewed the excellent DeepLearning.AI lecture on Agentic Knowledge Graph Construction, by Andreas Kollegger and writing a book on Agentic graph system with Sam Julien, it is clear that the use of agentic systems represents a shift in how we build and maintain knowledge graphs (KGs). Most organizations are sitting on a goldmine of data spread across CSVs, documents, and databases. The dream is to connect it all into a unified Knowledge Graph, an intelligent brain that understands your entire business. The reality? It's a brutal, expensive, and unscalable manual process. But a new approach is changing everything. Hereโ€™s the new playbook for building intelligent systems: ๐Ÿง  Deploy an AI Agent Workforce Instead of rigid scripts, you use a cognitive assembly line of specialized AI agents. A Proposer agent designs the data model, a Critic refines it, and an Extractor pulls the facts. This modular approach is proven to reduce errors and improve the accuracy and coherence of the final graph. ๐ŸŽจ Treat AI as a Designer, Not Just a Doer The agents act as data architects. In discovery mode, they analyze unstructured data (like customer reviews) and propose a new logical structure from scratch. In an enterprise with an existing data model, they switch to alignment mode, mapping new information to the established structure. ๐Ÿ›๏ธ Use a 3-Part Graph Architecture This technique is key to managing data quality and uncertainty. You create three interconnected graphs: The Domain Graph: Your single source of truth, built from trusted, structured data. The Lexical Graph: The raw, original text from your documents, preserving the evidence. The Subject Graph: An AI-generated bridge that connects them. It holds extracted insights that are validated before being linked to your trusted data. Jaro-Winkler is a string comparison algorithm that measures the similarity or edit distance between two strings. It can be used here for entity resolution, the process of identifying and linking entities from the unstructured text (Subject Graph) to the official entities in the structured database (Domain Graph). For example, the algorithm compares a product name extracted from a customer review (e.g., "the gothenburg table") with the official product names in the database. If the Jaro-Winkler similarity score is above a certain threshold, the system automatically creates a CORRESPONDS_TO relationship, effectively linking the customer's comment to the correct product in the supply chain graph. ๐Ÿค Augment Humans, Don't Replace Them The workflow is Propose, then Approve. AI does the heavy lifting, but a human expert makes the final call. This process is made reliable by tools like Pydantic and Outlines, which enforce a rigid contract on the AI's output, ensuring every piece of data is perfectly structured and consistent. And once discovered and validated, a schema can be enforced. | 32 comments on LinkedIn
ยทlinkedin.comยท
Agentic Knowledge Graph Construction
The Convergence of Ontology and Logic | LinkedIn
The Convergence of Ontology and Logic | LinkedIn
by J Bittner John Sowa once observed: In logic, the existential quantifier โˆƒ is a notation for asserting that something exists. But logic itself has no vocabulary for describing the things that exist.
ยทlinkedin.comยท
The Convergence of Ontology and Logic | LinkedIn
FinReflectKG: Agentic Construction and Evaluation of Financial Knowledge Graphs
FinReflectKG: Agentic Construction and Evaluation of Financial Knowledge Graphs
Sharing our recent research ๐…๐ข๐ง๐‘๐ž๐Ÿ๐ฅ๐ž๐œ๐ญ๐Š๐†: ๐€๐ ๐ž๐ง๐ญ๐ข๐œ ๐‚๐จ๐ง๐ฌ๐ญ๐ซ๐ฎ๐œ๐ญ๐ข๐จ๐ง ๐š๐ง๐ ๐„๐ฏ๐š๐ฅ๐ฎ๐š๐ญ๐ข๐จ๐ง ๐จ๐Ÿ ๐…๐ข๐ง๐š๐ง๐œ๐ข๐š๐ฅ ๐Š๐ง๐จ๐ฐ๐ฅ๐ž๐๐ ๐ž ๐†๐ซ๐š๐ฉ๐ก๐ฌ. It is the largest financial knowledge graph built from unstructured data. The preprint of our article is out on arXiv now (link is in the comments). It is coauthored with Abhinav Arun | Fabrizio Dimino | Tejas Prakash Agrawal While LLMs make it easier than ever to generate knowledge graphs, the real challenge lies in ensuring quality without hallucinations, with strong coverage, precision, comprehensiveness, and relevance. FinReflectKG tackles this through an iterative, evaluation-driven agentic approach, carefully optimized across multiple evaluation metrics to deliver a trustworthy and high-quality knowledge graph. Designed to power use cases like entity search, question answering, signal generation, predictive modeling, and financial network analysis, FinReflectKG sets a new benchmark for building reliable financial KGs and showcases the potential of agentic workflows in LLM-driven systems. We will be creating a suite of benchmarks using FinReflectKG for KG related tasks in financial services. More details to come soon. | 15 comments on LinkedIn
ยทlinkedin.comยท
FinReflectKG: Agentic Construction and Evaluation of Financial Knowledge Graphs
barnard59 is a toolkit to automate extract, transform and load (ETL) tasks. It allows you to generate RDF out of non-RDF data sources
barnard59 is a toolkit to automate extract, transform and load (ETL) tasks. It allows you to generate RDF out of non-RDF data sources
Reliability in data pipelines depends on knowing what went wrong before your users do. With the new OpenTelemetry integration in our RDF ETL framework barnard59, every pipeline and API integration is now fully traceable! Errors, validation results and performance metrics are automatically collected and visualised in Grafana. Instead of hunting through logs, you immediately see where time was spent and where an error occurred. This makes RDF-based ETL pipelines far more transparent and easier to operate at scale.
ยทlinkedin.comยท
barnard59 is a toolkit to automate extract, transform and load (ETL) tasks. It allows you to generate RDF out of non-RDF data sources
From raw data to a knowledge graph with SynaLinks
From raw data to a knowledge graph with SynaLinks
SynaLinks is an open-source framework designed to make it easier to partner language models (LMs) with your graph technologies. Since most companies are not in a position to train their own language models from scratch, SynaLinks empowers you to adapt existing LMs on the market to specialized tasks.
ยทgdotv.comยท
From raw data to a knowledge graph with SynaLinks
When Standards Fail to Ontologize | LinkedIn
When Standards Fail to Ontologize | LinkedIn
In the history of data standards, a recurring pattern should concern anyone working in semantics today. A new standard emerges, promises interoperability, gains adoption across industries or agencies, and for a time seems to solve the immediate need.
ยทlinkedin.comยท
When Standards Fail to Ontologize | LinkedIn
MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains
MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains
MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains ... When AI Diagnoses Patients, Should Reasoning Be a Team Sport? ๐Ÿ‘‰ Why Existing Approaches Fall Short Medical question answering demands precision, but current AI methods struggle with two key issues: 1. Error Accumulation: Linear reasoning chains (like Chain-of-Thought) risk compounding mistakesโ€”if the first step is wrong, the entire answer falters. 2. Flat Knowledge Retrieval: Traditional retrieval-augmented methods treat medical facts as unrelated text snippets, ignoring complex relationships between symptoms, diseases, and treatments. This leads to unreliable diagnoses and opaque decision-makingโ€”a critical problem when patient outcomes are at stake. ๐Ÿ‘‰ What MIRAGE Does Differently MIRAGE transforms reasoning from a solo sprint into a coordinated team effort: - Parallel Detective Work: Instead of one linear chain, multiple specialized "detectives" (reasoning chains) investigate different symptoms or entities in parallel. - Structured Evidence Hunting: Retrieval operates on medical knowledge graphs, tracing connections between symptoms (e.g., "face pain โ†’ lead poisoning") rather than scanning documents. - Cross-Check Consensus: Answers from parallel chains are verified against each other to resolve contradictions, like clinicians discussing differential diagnoses. ๐Ÿ‘‰ How It Works (Without the Jargon) 1. Break It Down ย ย - Splits complex queries ("Why am I fatigued with knee pain?") into focused sub-questions grounded in specific symptoms/entities. ย ย - Example: "Conditions linked to fatigue" and "Causes of knee lumps" become separate investigation threads. 2. Graph-Guided Retrieval ย ย - Each thread explores a medical knowledge graph like a map: ย ย ย - Anchor Mode: Examines direct connections (e.g., diseases causing a symptom). ย ย ย - Bridge Mode: Hunts multi-step relationships (e.g., toxin exposure โ†’ neurological symptoms โ†’ joint pain). 3. Vote & Verify ย ย - Combines evidence from all threads, prioritizing answers supported by multiple independent chains. ย ย - Discards conflicting hypotheses (e.g., ruling out lupus if only one chain suggests it without corroboration). ๐Ÿ‘‰ Why This Matters Tested on three medical benchmarks (including real clinician queries), MIRAGE: - Outperformed GPT-4 and Tree-of-Thought variants in accuracy (84.8% vs. 80.2%) - Reduced error propagation by 37% compared to linear retrieval-augmented methods - Produced answers with traceable evidence paths, critical for auditability in healthcare The Big Picture MIRAGE shifts AI reasoning from brittle, opaque processes to collaborative, structured exploration. By mirroring how clinicians synthesize information from multiple angles, it highlights a path toward AI systems that are both smarter and more trustworthy in high-stakes domains. Paper: Wei et al. MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains
MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains
ยทlinkedin.comยท
MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains
Hot take on "faster then Dijkstra"
Hot take on "faster then Dijkstra"
๐—›๐—ผ๐˜ ๐˜๐—ฎ๐—ธ๐—ฒ ๐—ผ๐—ป ๐˜๐—ต๐—ฒ โ€œ๐—ณ๐—ฎ๐˜€๐˜๐—ฒ๐—ฟ ๐˜๐—ต๐—ฎ๐—ป ๐——๐—ถ๐—ท๐—ธ๐˜€๐˜๐—ฟ๐—ฎโ€ ๐—ต๐—ฒ๐—ฎ๐—ฑ๐—น๐—ถ๐—ป๐—ฒ๐˜€: The recent result given in the paper: https://lnkd.in/dQSbqrhD is a breakthrough for theory. It beats Dijkstraโ€™s classic worst-case bound for single-source shortest paths on directed graphs with non-negative weights. Thatโ€™s big for the research community. ๐—•๐˜‚๐˜ ๐—ถ๐˜ ๐—ฑ๐—ผ๐—ฒ๐˜€๐—ปโ€™๐˜ โ€œ๐—ฟ๐—ฒ๐˜„๐—ฟ๐—ถ๐˜๐—ฒโ€ ๐—ฝ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—ฟ๐—ผ๐˜‚๐˜๐—ถ๐—ป๐—ด. In practice, large-scale systems (maps, logistics, ride-hailing) moved past plain Dijkstra years ago. They rely on heavy preprocessing. Contraction Hierarchies, Hub Labels and other methods are used to answer point-to-point queries in milliseconds, even on large, continental networks. ๐—ช๐—ต๐˜† ๐˜๐—ต๐—ฒ ๐—ฑ๐—ถ๐˜€๐—ฐ๐—ผ๐—ป๐—ป๐—ฒ๐—ฐ๐˜? ย โ€ข Different goals: The paper targets single-source shortest paths; production prioritizes point-to-point queries at interactive latencies. ย โ€ข Asymptotics vs. constants: Beating O(m + n log n) matters in principle, but real systems live and die by constants, cache behavior, and integration with traffic/turn costs. ย โ€ข Preprocessing wins: Once you allow preprocessing, the speedups from hierarchical/labeling methods dwarf Dijkstra and likely any drop-in replacement without preprocessing. We should celebrate the theoretical advance and keep an eye on practical implementations. Just donโ€™t confuse a sorting-barrier result with an immediate upgrade for Google Maps. ๐—•๐—ผ๐˜๐˜๐—ผ๐—บ ๐—น๐—ถ๐—ป๐—ฒ: Great theory milestone. Production routing already โ€œchanged the rulesโ€ years ago with preprocessing and smart graph engineering.
๐—›๐—ผ๐˜ ๐˜๐—ฎ๐—ธ๐—ฒ ๐—ผ๐—ป ๐˜๐—ต๐—ฒ โ€œ๐—ณ๐—ฎ๐˜€๐˜๐—ฒ๐—ฟ ๐˜๐—ต๐—ฎ๐—ป ๐——๐—ถ๐—ท๐—ธ๐˜€๐˜๐—ฟ๐—ฎโ€ ๐—ต๐—ฒ๐—ฎ๐—ฑ๐—น๐—ถ๐—ป๐—ฒ๐˜€
ยทlinkedin.comยท
Hot take on "faster then Dijkstra"
4.7 times better write query price-performance with AWS Graviton4 R8g instances using Amazon Neptune v1.4.5 | Amazon Web Services
4.7 times better write query price-performance with AWS Graviton4 R8g instances using Amazon Neptune v1.4.5 | Amazon Web Services
Amazon Neptune version 1.4.5 introduces engine improvements and support for AWS Graviton-based r8g instances. In this post, we show you how these updates can improve your graph database performance and reduce costs. We walk you through the benchmark results for Gremlin and openCypher comparing Neptune v1.4.5 on r8g instances against previous versions. You'll see performance improvements of up to 4.7x for write throughput and 3.7x for read throughput, along with the cost implications.
ยทaws.amazon.comยท
4.7 times better write query price-performance with AWS Graviton4 R8g instances using Amazon Neptune v1.4.5 | Amazon Web Services
Faster than Dijkstra? Tsinghua Universityโ€™s new shortest path algorithm just rewrite the rules of graph traversal.
Faster than Dijkstra? Tsinghua Universityโ€™s new shortest path algorithm just rewrite the rules of graph traversal.
๐Ÿš€ Faster than Dijkstra? Tsinghua Universityโ€™s new shortest path algorithm just rewrite the rules of graph traversal. For 65+ years, Dijkstraโ€™s algorithm was the gold standard for finding shortest paths in weighted graphs. But now, a team from Tsinghua University has introduced a recursive partial ordering method that outperforms Dijkstraโ€”especially on directed graphs. ๐Ÿ” Whatโ€™s different?ย  Instead of sorting all vertices by distance (which adds log-time overhead), this new approach uses a clever recursive structure that breaks the O(m + n log n) barrier โœจ.ย  Itโ€™s faster, leaner, and already winning awards at STOC 2025 ๐Ÿ†. ๐Ÿ“ Why it matters:ย  Think Google Maps, Uber routing, disaster evacuation planning, circuit designโ€”any system that relies on real-time pathfinding across massive graphs. Paper โžก https://lnkd.in/dGTdRj2X #Algorithms #ComputerScience #Engineering #Dijkstra #routing #planning #logistic | 34 comments on LinkedIn
Faster than Dijkstra? Tsinghua Universityโ€™s new shortest path algorithm just rewrite the rules of graph traversal.
ยทlinkedin.comยท
Faster than Dijkstra? Tsinghua Universityโ€™s new shortest path algorithm just rewrite the rules of graph traversal.
Quality metrics: mathematical functions designed to measure the โ€œgoodnessโ€ of a network visualization
Quality metrics: mathematical functions designed to measure the โ€œgoodnessโ€ of a network visualization
Iโ€™m proud to share an exciting piece of work by my PhD student,ย Simon van Wageningen, whom I have the pleasure of supervising. Simon asked a bold question that challenges the state of the art in our field! A bit of background first: together with Simon, we studyย network visualizationsย โ€” those diagrams made of dots and lines. Theyโ€™re more than just pretty pictures: they help us gain intuition about the structure of networks around us, such as social networks, protein networks, or even money-laundering networks ๐Ÿ˜‰. But how do we know if a visualization really shows the structure well? Thatโ€™s whereย quality metricsย come in โ€” mathematical functions designed to measure the โ€œgoodnessโ€ of a network visualization. Many of these metrics correlate nicely with human intuition. Yet, in our community, there has long been a belief โ€” more of a tacit knowledge โ€” that these metrics fail in certain cases. This is exactly where Simonโ€™s work comes in: he set out to make this tacit knowledge explicit. Take a look at the dancing man and the network in the slider โ€” they represent theย same networkย withย very similar quality metric values. And yet, the dancing man clearly does not donโ€™t show the network's structure. This tells us something important: we canโ€™t blindly rely on quality metrics. Simonโ€™s work will be presented at theย International Symposium on Graph Drawing and Network Visualizationย in Norrkรถping, Sweden this year. ๐ŸŽ‰ If youโ€™d like to dive deeper, hereโ€™s the link to the GitHub repository https://lnkd.in/eqw3nYmZ #graphdrawing #networkvisualization #qualitymetrics #research with Simon van Wageningen and Alex Telea | 13 comments on LinkedIn
quality metricsย come in โ€” mathematical functions designed to measure the โ€œgoodnessโ€ of a network visualization
ยทlinkedin.comยท
Quality metrics: mathematical functions designed to measure the โ€œgoodnessโ€ of a network visualization
True Cost of Enterprise Knowledge Graph Adoption from PoC to Production | LinkedIn
True Cost of Enterprise Knowledge Graph Adoption from PoC to Production | LinkedIn
Enterprise Knowledge Graph costs scale in phasesโ€”from a modest $50Kโ€“$100K PoC, to a $1Mโ€“$3M pilot with infrastructure and dedicated teams, to a $10Mโ€“$20M enterprise-wide platform. Reusability reduces costs to ~30% of the original for new domains, with faster delivery and self-sufficiency typically b
ยทlinkedin.comยท
True Cost of Enterprise Knowledge Graph Adoption from PoC to Production | LinkedIn
Enabling Industrial AI: How Siemens and AIT Leverage TDengine and Ontop to Help TCG UNITECH Boost Productivity and Efficiency
Enabling Industrial AI: How Siemens and AIT Leverage TDengine and Ontop to Help TCG UNITECH Boost Productivity and Efficiency
I'm extremely excited to announce that Siemens and AIT Austrian Institute of Technologyโ€”two leaders in industrial innovationโ€”chose TDengine as the time-series backbone for a groundbreaking project at TCG Unitech GmbH! Hereโ€™s the magic: Imagine stitching together over a thousand time-series signals per machine with domain knowledge, and connecting it all through an intelligent semantic layer. With TDengine capturing high-frequency sensor data, PostgreSQL holding production context, and Ontopic virtualizing everything into a cohesive knowledge graphโ€”this isnโ€™t just data collection. Itโ€™s an orchestration that reveals hidden patterns, powers real-time anomaly and defect detection, supports traceability, and enables explainable root-cause analysis. And none of this works without good semantics. The system understands the relationshipsโ€”between sensors, machines, processes, and defectsโ€”which means both AI and humans can ask the right questions and get meaningful, actionable answers. For me, this is the future of smart manufacturing: when data, infrastructure, and domain expertise come together, you get proactive, explainable, and scalable insights that keep factories running at peak performance. It's a true pleasure working with Stefan B. from Siemens AG ร–sterreich, Stephan Strommer and David Gruber from AIT, Peter Hopfgartner from Ontopic and our friends Klaus Neubauer, Herbert Kerbl, Bernhard Schmiedinger from TCG on this technical blog! We hope this will bring some good insights into how time-series data and semantics can transform the operations of modern manufacturing! Read the full case study: https://lnkd.in/gtuf8KzU
ยทlinkedin.comยท
Enabling Industrial AI: How Siemens and AIT Leverage TDengine and Ontop to Help TCG UNITECH Boost Productivity and Efficiency
GraphCogent: Overcoming LLMsโ€™ Working Memory Constraints via Multi-Agent Collaboration in Complex Graph Understanding
GraphCogent: Overcoming LLMsโ€™ Working Memory Constraints via Multi-Agent Collaboration in Complex Graph Understanding
Unlocking LLMs' Graph Reasoning Potential Through Cognitive-Inspired Collaboration ๐Ÿ‘‰ Why This Matters Large language models often falter when analyzing transportation networks, social connections, or citation graphsโ€”not due to lacking intelligence, but because of working memory constraints. Imagine solving a 1,000-node shortest path problem while simultaneously memorizing every connection. Like humans juggling too many thoughts, LLMs lose accuracy as graph complexity increases. ๐Ÿ‘‰ What GraphCogent Solves This new framework addresses three core limitations: 1. Representation confusion: Mixed graph formats (adjacency lists, symbols, natural language) 2. Memory overload: Context window limitations for large-scale graphs 3. Execution fragility: Error-prone code generation for graph algorithms Drawing inspiration from human cognition's working memory model, GraphCogent decomposes graph reasoning into specialized processes mirroring how our brains handle complex tasks. ๐Ÿ‘‰ How It Works Sensory Module - Acts as an LLM's "eyes," standardizing diverse graph inputs through subgraph sampling - Converts web links, social connections, or traffic routes into uniform adjacency lists Buffer Module - Functions as a "mental workspace," integrating graph data across formats (NetworkX/PyG/NumPy) - Maintains persistent memory beyond standard LLM context limits Execution Module - Combines two reasoning modes: ย - Tool calling for common tasks (pathfinding, cycle detection) ย - Model generation for novel problems using preprocessed data ๐Ÿ‘‰ Proven Impact - Achieves 98.5% accuracy on real-world graphs (social networks, transportation systems) using Llama3.1-8B - Outperforms 671B parameter models by 50% while using 80% fewer tokens - Handles graphs 10x larger than previous benchmarks through efficient memory management The framework's secret sauce? Treating graph reasoning as a team effort rather than a single AI's taskโ€”much like how human experts collaborate on complex problems. Key Question for Discussion As multi-agent systems become more sophisticated, how might we redesign LLM architectures to better emulate human cognitive processes for specific problem domains? Paper: "GraphCogent: Overcoming LLMsโ€™ Working Memory Constraints via Multi-Agent Collaboration in Complex Graph Understanding" (Wang et al., 2025)
- Achieves 98.5% accuracy on real-world graphs (social networks, transportation systems) using Llama3.1-8B- Outperforms 671B parameter models by 50% while using 80% fewer tokens- Handles graphs 10x larger than previous benchmarks through efficient memory management
ยทlinkedin.comยท
GraphCogent: Overcoming LLMsโ€™ Working Memory Constraints via Multi-Agent Collaboration in Complex Graph Understanding