Skip to content

Triplet Store

Store and query RDF triplets with SPARQL support and semantic reasoning using industry-standard triplet stores.


🎯 Overview

The Triplet Store Module provides storage and querying for RDF (Resource Description Framework) triplets. It supports industry-standard triplet stores with SPARQL querying and semantic reasoning capabilities.

What is a Triplet Store?

A triplet store (also called an RDF store) is a database designed to store and query RDF triplets. RDF triplets are statements in the form:

  • Subject: The entity being described
  • Predicate: The relationship or property
  • Object: The value or related entity

Example: `(Apple Inc., foundedBy, Steve Jobs)`

Why Use the Triplet Store Module?

  • W3C Standards: Full support for RDF and SPARQL standards
  • Semantic Reasoning: RDFS and OWL reasoning for inference
  • Multiple Backends: Support for Blazegraph, Apache Jena, RDF4J
  • SPARQL Queries: Powerful SPARQL 1.1 query language
  • Federation: Query across multiple stores
  • Bulk Loading: High-performance data loading

How It Works

  1. Store Selection: Choose a backend (Blazegraph, Jena, RDF4J)
  2. Triplet Storage: Store subject-predicate-object triplets
  3. SPARQL Queries: Query using SPARQL 1.1
  4. Reasoning: Apply RDFS/OWL reasoning for inference
  5. Federation: Query across multiple stores if needed
  • RDF Storage


    Store subject-predicate-object triplets in W3C-compliant RDF format

  • SPARQL Queries


    Full W3C SPARQL 1.1 query language support for powerful semantic queries

  • Reasoning


    RDFS and OWL reasoning for inference and knowledge discovery

  • Multiple Backends


    Blazegraph, Apache Jena, and RDF4J support

  • Federation


    Query across multiple triplet stores with SPARQL federation

  • Bulk Loading


    High-performance bulk data loading with progress tracking

Choosing the Right Backend

  • Blazegraph: High-performance, excellent for large datasets, GPU acceleration
  • Apache Jena: Full-featured, TDB2 storage, SHACL validation
  • RDF4J: Java-based, excellent tooling, multiple storage backends

⚙️ Algorithms Used

Query Algorithms

  • SPARQL Query Optimization: Join reordering with selectivity estimation
  • Triplet Pattern Matching: Index-based lookup with B+ trees
  • Graph Pattern Matching: Subgraph isomorphism with backtracking
  • Query Planning: Cost-based optimization with statistics
  • Join Algorithms: Hash join, merge join, nested loop join
  • Filter Pushdown: Early filter application for performance

Indexing

  • SPO Index: Subject-Predicate-Object index for subject lookups
  • POS Index: Predicate-Object-Subject index for predicate lookups
  • OSP Index: Object-Subject-Predicate index for object lookups
  • Six-Index Scheme: All permutations (SPO, SOP, PSO, POS, OSP, OPS) for optimal query performance
  • B+ Tree Indexing: Efficient range queries and sorted access
  • Hash Indexing: O(1) exact match lookups

Reasoning Algorithms

  • RDFS Reasoning: Subclass/subproperty inference, domain/range inference
  • OWL Reasoning: Class hierarchy, property characteristics, cardinality constraints
  • Forward Chaining: Materialization of inferred triplets
  • Backward Chaining: On-demand inference during query execution
  • Rule-Based Inference: Custom SWRL rules

Bulk Loading

  • Batch Processing: Chunked triplet insertion with configurable batch size
  • Parallel Loading: Multi-threaded data loading
  • Index Building: Deferred index construction for faster loading
  • Transaction Management: Atomic batch commits with rollback support

Main Classes

TripletStore

Main interface for triplet store operations.

Methods:

Method Description Algorithm
__init__(backend, endpoint) Initialize triplet store Factory pattern
add_triplet(triplet) Add single triplet Single insert
add_triplets(triplets, batch_size) Add multiple triplets Bulk load with batching
get_triplets(s, p, o) Retrieve triplets Pattern matching
delete_triplet(triplet) Delete triplet Pattern matching deletion
execute_query(query) Execute SPARQL Query engine delegation

BulkLoader

High-volume data loading utility.

Methods:

Method Description Algorithm
load_triplets(triplets, store) Bulk load triplets Batch processing with retries

QueryEngine

SPARQL query execution and optimization engine.

Methods:

Method Description Algorithm
execute(query) Execute SPARQL query Query execution
optimize(query) Optimize SPARQL query Query rewriting

Cookbook

Interactive tutorials that use triplet stores:

  • Reasoning and Inference: Use logical reasoning with SPARQL and triplet stores
  • Topics: SPARQL reasoning, RDF stores, inference engines
  • Difficulty: Advanced
  • Use Cases: Semantic reasoning, SPARQL queries, RDF-based knowledge graphs

🚀 Usage

Initialization

from semantica.triplet_store import TripletStore

# Initialize Blazegraph store
store = TripletStore(
    backend="blazegraph",
    endpoint="http://localhost:9999/blazegraph"
)

Adding Data

from semantica.semantic_extract.triplet_extractor import Triplet

# Single triplet
triplet = Triplet("http://s", "http://p", "http://o")
store.add_triplet(triplet)

# Bulk load
triplets = [Triplet(f"http://s{i}", "http://p", "http://o") for i in range(1000)]
store.add_triplets(triplets)

Querying

query = """
SELECT ?s ?p ?o
WHERE {
  ?s ?p ?o
}
LIMIT 10
"""
results = store.execute_query(query)