Every module works independently — import only what you need. This page maps developer goals to starting points. The Module Reference covers every module in depth.
Quick Reference
Find your goal below. The Module column is your import path; Key class is what you instantiate first.| I want to… | Module | Key class |
|---|---|---|
| Load a PDF, DOCX, HTML, CSV, or archive | ingest | FileIngestor |
| Crawl a website | ingest | WebIngestor |
| Load Parquet files or partitioned datasets | ingest | ParquetIngestor |
| Ingest XML with schema validation | ingest | XMLIngestor |
| Ingest from SQL, Snowflake, Kafka, or email | ingest | DBIngestor, SnowflakeIngestor, StreamIngestor |
| Extract clean text and tables from a document | parse | DocumentParser |
| Parse complex PDFs with OCR or multi-column layout | parse | DoclingParser |
| Chunk text for embedding or RAG | split | TextSplitter |
| Normalize text, dates, entities, or encodings | normalize | TextNormalizer, EntityNormalizer |
| Find named entities (people, orgs, locations) in text | semantic_extract | NERExtractor |
| Extract typed relationships from text | semantic_extract | RelationExtractor |
| Extract RDF subject–predicate–object triplets | semantic_extract | TripletExtractor |
| Build a queryable knowledge graph | kg | GraphBuilder |
Add time-validity (valid_from / valid_until) to facts | kg | TemporalGraphQuery |
| Run graph algorithms (centrality, communities, paths) | kg | GraphAnalyzer, CentralityCalculator |
| Generate vector embeddings | embeddings | EmbeddingGenerator |
| Store and search vectors | vector_store | VectorStore |
| Persist a graph in Neo4j or FalkorDB | graph_store | Neo4jStore, FalkorDBStore |
| Store RDF triples and query with SPARQL | triplet_store | TripletStore |
| Deduplicate entities across sources | deduplication | DuplicateDetector, EntityMerger |
| Detect and resolve contradictory facts | conflicts | ConflictDetector, ConflictResolver |
| Give an AI agent persistent memory | context | AgentContext |
| Ground LLM responses in a knowledge graph (GraphRAG) | context | AgentContext.query_with_reasoning() |
| Record AI decisions with a full audit trail | context | AgentContext.record_decision() |
| Search past decisions before making a new one | context | AgentContext.find_precedents() |
| Trace the causal chain of a decision | context | AgentContext.get_causal_chain() |
| Track where every fact came from (W3C PROV-O) | provenance | ProvenanceManager |
| Version a graph with checksums and rollback | change_management | TemporalVersionManager |
| Auto-generate an OWL schema from a graph | ontology | OntologyGenerator |
| Validate a graph against SHACL constraints | ontology | SHACLGenerator, OntologyValidator |
| Derive new facts from existing knowledge | reasoning | Reasoner, GraphReasoner |
| Export to RDF Turtle, JSON-LD, or N-Triples | export | RDFExporter |
| Export to Parquet for Spark / BigQuery | export | ParquetExporter |
| Export for ArangoDB | export | ArangoAQLExporter |
| Export to Neo4j or Memgraph via Cypher | export | LPGExporter |
| Visualize a knowledge graph interactively | visualization | KGVisualizer |
| Run a reproducible multi-step pipeline | pipeline | PipelineBuilder |
| Use Semantica from Claude Desktop or Cursor | mcp_server | semantica-mcp |
| Bootstrap a graph from verified seed data | seed | SeedDataManager |
| Extend Semantica with a custom component | core | PluginRegistry |
Goal-by-Goal Starting Points
Pick your goal to see the minimum imports and a working skeleton.- Build a Knowledge Graph
- Build GraphRAG
- Add Agent Memory
- Track Provenance
- Export
- MCP — Claude / Cursor
Turn documents, web pages, or databases into a structured, queryable graph.Pipeline: Next: Quickstart → — full pipeline with visualization and export.
ingest → parse → semantic_extract → kgStill Unsure?
Knowledge graph vs. vector store — which do I need?
Knowledge graph vs. vector store — which do I need?
Use a knowledge graph (
kg) when you need structured reasoning, multi-hop traversal, provenance, or compliance audit trails.Use a vector store (vector_store) when you need fast fuzzy similarity search over large text corpora and relationships between items don’t matter.Use both together via AgentContext (GraphRAG) to get grounded LLM responses where every claim traces back to a source node.See also: Core ConceptsI just want to run something quickly.
I just want to run something quickly.
Start with the Quickstart. It builds a complete pipeline (ingest → parse → extract → graph → visualize → export) with no API key required.
I'm adding Semantica to an existing agent — what's the minimum?
I'm adding Semantica to an existing agent — what's the minimum?
Add Context module reference →
AgentContext. It wraps your existing agent with memory, decision tracking, and precedent search — no changes to your LLM provider or agent framework needed.I need a compliance-ready pipeline — what's the minimum stack?
I need a compliance-ready pipeline — what's the minimum stack?
| Layer | Module | Key class |
|---|---|---|
| Ingestion | ingest | FileIngestor |
| Extraction | semantic_extract | NERExtractor |
| Graph | kg | GraphBuilder |
| Lineage | provenance | ProvenanceManager |
| Versioning | change_management | TemporalVersionManager |
| Audit export | export | RDFExporter |
| Supports HIPAA, SOX, GDPR, and FDA 21 CFR Part 11 audit requirements. |
- Quickstart — Full pipeline in 5 minutes.
- Module Reference — Every module with examples and common chains.
- API Reference — Complete class and method documentation.
