graph technologies
PyTextRank
a popular Python implementation of multiple textgraph graph algorithms for phrase extraction, provided as a configurable spaCy pipeline component.
- lightweight phrase extraction (unsupervised learning, CPU-based)
- leverages available NER capabilities within a pipeline
- uses: enhance indexing and document similarity; building knowledge graphs
kglab
a simple Python abstraction layer for Graph Data Science, integrating NetworkX, RAPIDS, RDFLib, Morph-KGC, pythonPSL, and more.
- provides PyData-esque interfaces to other popular graph libraries
- includes several Jupyter-based graph data science tutorials
- uses: building knowledge graphs; integrating disjoint graph technologies
- biblio – semantic bibliography entries, generated from RDF
- glossary – semantic glossary entries, generated from RDF
- apidocs – semantic apidocs supporting the Diátaxis grammar for documentation, generated as RDF from Python modules
disparity_filter
implements a disparity filter in Python, based on NetworkX, to extract the multiscale backbone of a complex weighted network (Serrano, et al., 2009)
- analogous to centrality calculated on the edges of a graph rather than its nodes; in other words, consider this as a "dual" problem of the typical graph analysis of social networks
- uses: paring down automatically-generated graphs, such as those produced by NLP methods
recommended projects within our community:
Our focus is on Python open source libraries for: building knowledge graphs, natural language, dataset quality embeddings, graph representation learning, probabilistic data structures, leveraging motifs, hybrid reasoning systems, semantic inference, graph visualization, network analysis, following the general criteria:
- based on reasonably current dependencies
- uses a business-friendly license
- installs correctly with
pip
orconda
- provides example code which runs without exceptions
- reproduces published results
- affords data integration (not optimized for benchmark/paper)
- called as a library, not requiring container/microservice orchestration nor CLI
- supports concurrency and parallelization
- maintained within the past six months
- passes a reasonable level of security audit
Argilla
the open-source data curation platform for LLMs, integrating closely with Hugging Face, LlamaIndex, and more.
KùzuDB
a highly scalable, extremely fast, and very easy-to-use embeddable graph database, implementing openCypher in Python for labeled property graphs
RDFLib
a pure Python library for working with RDF, a simple yet powerful language for representing information
StellarGraph
a Python library for machine learning on graphs and networks, including implementations of GraphSAGE and other graph embedding algorithms
SpanMarker
a framework for training powerful Named Entity Recognition models using familiar encoders such as BERT, RoBERTa and ELECTRA, built atop 🤗 Transformers
dotmotif
a performant, powerful query framework for identifying subgraphs or motifs in a large graph, based on NetworkX
pySHACL
a pure Python module which allows for the validation of RDF graphs against Shapes Constraint Language (SHACL) rules
pyvis
a Python-based approach for constructing and visualizing interactive network graphs within the same space, built on VisJS
We can help your organization build and deploy advanced AI applications!
Derwen, Inc., provides evaluation, integration, packaging, concurrency, performance optimization, and security hardening for open source software used in machine learning use cases, with expertise in natural language, graph technologies, and operations research. Our mission is to bridge between open source communities of practice in artificial intelligence, and the enterprise customers who need to work with them.
Rates ⊗ Contact ⊗ Policy ⊗ Newsletter
Areas of expertise within our firm include:
- graph data science applications
- developing précis, executive briefings, expert presentations
- technical co-authoring for books, industry reports, whitepapers, etc.
- methods for using natural language (LLMs, etc.) to build knowledge graphs
- leveraging W3C semantic graphs and LPG labeled property graphs together
- integrating graph visualization frameworks and customized UI/UX
- applications of probabilistic graphs
- expert software engineering+packaging in Python and C++
- developing APIs and services based on FastAPI/Pydandtic/Redis/Thespian
- parallelization and performance analysis+optimization
- building human-in-the-loop pipelines to enrich your data
- using distributed systems and cloud computing in general
Partner firms include:
Disclaimer: all trademarks, service marks, trade names, trade dress, product names, and logos appearing above are the property of their respective owners
Derwen on Medium
Derwen publication on Medium.com: newsletter summaries, articles, conferences, announcements