Every module works independently — import only what you need. This page maps developer goals to starting points. The Module Reference covers every module in depth.

Quick Reference

Find your goal below. The Module column is your import path; Key class is what you instantiate first.
I want to…ModuleKey class
Load a PDF, DOCX, HTML, CSV, or archiveingestFileIngestor
Crawl a websiteingestWebIngestor
Load Parquet files or partitioned datasetsingestParquetIngestor
Ingest XML with schema validationingestXMLIngestor
Ingest from SQL, Snowflake, Kafka, or emailingestDBIngestor, SnowflakeIngestor, StreamIngestor
Extract clean text and tables from a documentparseDocumentParser
Parse complex PDFs with OCR or multi-column layoutparseDoclingParser
Chunk text for embedding or RAGsplitTextSplitter
Normalize text, dates, entities, or encodingsnormalizeTextNormalizer, EntityNormalizer
Find named entities (people, orgs, locations) in textsemantic_extractNERExtractor
Extract typed relationships from textsemantic_extractRelationExtractor
Extract RDF subject–predicate–object tripletssemantic_extractTripletExtractor
Build a queryable knowledge graphkgGraphBuilder
Add time-validity (valid_from / valid_until) to factskgTemporalGraphQuery
Run graph algorithms (centrality, communities, paths)kgGraphAnalyzer, CentralityCalculator
Generate vector embeddingsembeddingsEmbeddingGenerator
Store and search vectorsvector_storeVectorStore
Persist a graph in Neo4j or FalkorDBgraph_storeNeo4jStore, FalkorDBStore
Store RDF triples and query with SPARQLtriplet_storeTripletStore
Deduplicate entities across sourcesdeduplicationDuplicateDetector, EntityMerger
Detect and resolve contradictory factsconflictsConflictDetector, ConflictResolver
Give an AI agent persistent memorycontextAgentContext
Ground LLM responses in a knowledge graph (GraphRAG)contextAgentContext.query_with_reasoning()
Record AI decisions with a full audit trailcontextAgentContext.record_decision()
Search past decisions before making a new onecontextAgentContext.find_precedents()
Trace the causal chain of a decisioncontextAgentContext.get_causal_chain()
Track where every fact came from (W3C PROV-O)provenanceProvenanceManager
Version a graph with checksums and rollbackchange_managementTemporalVersionManager
Auto-generate an OWL schema from a graphontologyOntologyGenerator
Validate a graph against SHACL constraintsontologySHACLGenerator, OntologyValidator
Derive new facts from existing knowledgereasoningReasoner, GraphReasoner
Export to RDF Turtle, JSON-LD, or N-TriplesexportRDFExporter
Export to Parquet for Spark / BigQueryexportParquetExporter
Export for ArangoDBexportArangoAQLExporter
Export to Neo4j or Memgraph via CypherexportLPGExporter
Visualize a knowledge graph interactivelyvisualizationKGVisualizer
Run a reproducible multi-step pipelinepipelinePipelineBuilder
Use Semantica from Claude Desktop or Cursormcp_serversemantica-mcp
Bootstrap a graph from verified seed dataseedSeedDataManager
Extend Semantica with a custom componentcorePluginRegistry

Goal-by-Goal Starting Points

Pick your goal to see the minimum imports and a working skeleton.
Turn documents, web pages, or databases into a structured, queryable graph.Pipeline: ingestparsesemantic_extractkg
from semantica.ingest import FileIngestor
from semantica.parse import DocumentParser
from semantica.semantic_extract import NERExtractor, RelationExtractor
from semantica.kg import GraphBuilder

sources       = FileIngestor().ingest("report.pdf")
parsed        = DocumentParser().parse_document("report.pdf")

# No API key required — pattern-based extraction
entities      = NERExtractor(method="pattern").extract(parsed)
relationships = RelationExtractor(method="rule").extract(parsed, entities=entities)

graph = GraphBuilder(merge_entities=True).build(
    sources=[{"entities": entities, "relationships": relationships}]
)
print(f"{len(graph.nodes)} nodes, {len(graph.edges)} edges")
Pass method="pattern" to NERExtractor for zero-cost, zero-API-key extraction. Switch to method="llm" with any of the supported providers for higher recall.
Next: Quickstart → — full pipeline with visualization and export.

Still Unsure?

Use a knowledge graph (kg) when you need structured reasoning, multi-hop traversal, provenance, or compliance audit trails.Use a vector store (vector_store) when you need fast fuzzy similarity search over large text corpora and relationships between items don’t matter.Use both together via AgentContext (GraphRAG) to get grounded LLM responses where every claim traces back to a source node.See also: Core Concepts
Start with the Quickstart. It builds a complete pipeline (ingest → parse → extract → graph → visualize → export) with no API key required.
Add AgentContext. It wraps your existing agent with memory, decision tracking, and precedent search — no changes to your LLM provider or agent framework needed.
from semantica.context import AgentContext, ContextGraph
from semantica.vector_store import VectorStore

context = AgentContext(
    vector_store=VectorStore(backend="faiss", dimension=768),
    knowledge_graph=ContextGraph(advanced_analytics=True),
    decision_tracking=True,
)
Context module reference →
LayerModuleKey class
IngestioningestFileIngestor
Extractionsemantic_extractNERExtractor
GraphkgGraphBuilder
LineageprovenanceProvenanceManager
Versioningchange_managementTemporalVersionManager
Audit exportexportRDFExporter
Supports HIPAA, SOX, GDPR, and FDA 21 CFR Part 11 audit requirements.