HO HolmesGPT
An AI agent platform for cloud-native environments that automates alert investigation, root cause analysis, and remediation suggestions.
Cloud-native AI platform capabilities and infrastructure.
Kubernetes-native AI infrastructure and scheduling.
HO An AI agent platform for cloud-native environments that automates alert investigation, root cause analysis, and remediation suggestions.
OP A scalable, feature-rich web interface for interacting with large language models, providing a ChatGPT-like experience with support for multiple models and customization options.
VO Volcano is a Kubernetes-native batch scheduling system (a CNCF project) that enhances kube-scheduler with advanced features for batch, HPC, and AI workloads.
Data platforms, lakehouse stacks, and data services.
3F A high-performance distributed file system designed for AI training and inference workloads, optimizing parallel I/O and data locality to support large-scale training.
AI An open-source tool that integrates an interactive Python environment with LLMs for natural-language-driven Python execution and automation.
AP Apache Doris is an easy-to-use, high-performance unified analytics database for real-time and offline analysis.
CU A GPU DataFrame library for accelerating data analysis and tabular computing with GPU acceleration.
CV CVAT is an industry-leading computer vision annotation tool suitable for annotation at any scale.
DA ETL, analytics, and versioning for unstructured data to build reproducible and auditable data pipelines.
DA A data preparation and pipeline platform for domain training and retrieval-augmented generation.
DE A database for AI optimized for storing, querying and versioning vectors and multimodal data (images, video, audio, text) for LLM and deep learning workflows.
DO DocuTranslate is a lightweight document translation tool leveraging LLMs and multiple parsing engines.
DU An analytical, in-process SQL database suited for interactive queries, ETL, and local analytics.
FA A high-performance library for similarity search and clustering of dense vectors, suitable for large-scale vector retrieval.
JU Interactive computing environment widely used for data science and machine learning development.
LA Label Studio is a multi-type data labeling and annotation tool with standardized output formats.
PG pgvector is an open-source PostgreSQL extension that adds vector data types and similarity search, supporting exact and approximate search (HNSW, IVFFlat) inside Postgres.
PR Proton is a single-binary C++ high-performance SQL stream processing engine designed for real-time analytics and stream ETL.
PY A high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF and other documents.
TE Google's open-source end-to-end machine learning platform for building and training deep learning models.
UN A no-code LLM platform to convert unstructured documents into structured data and quickly launch APIs and ETL pipelines.
VA A high-performance distributed key-value database optimized for caching and real-time workloads.
VE A reinforcement learning training framework for large models, designed for scalable RLHF and agent training.
Deployment pipelines and operations tooling.
AG ADK is an open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
AP An open-source data visualization and exploration platform supporting interactive dashboards, SQL-based analysis, and multiple data sources.
AU AutoGPT — a platform to build, deploy and run continuous AI agents, supporting self-hosting and platform deployments.
AX An extensible deep learning library built on JAX/XLA, designed for developing, training and deploying large-scale models.
CA Candle by Hugging Face: a minimalist, high-performance ML framework in Rust designed for serverless inference and lightweight deployments.
CO ByteDance's open-source platform-level solution for AI Agent development and operations, providing full lifecycle management capabilities.
CR An open-source web crawler and scraper optimized for large language model workflows, producing clean Markdown and structured data with browser control and Docker deployment.
DA Dask is a Python library for parallel computing and task scheduling, suited for scaling NumPy, Pandas and machine learning workloads across clusters.
DI Open-source LLM application development platform providing visual AI application building tools and enterprise-grade deployment solutions.
DL DLRover is an automatic distributed deep learning system that provides elastic scheduling, flash checkpointing and auto-scaling to simplify large-scale model training on Kubernetes and Ray.
DY Explore Dynamo by NVIDIA, an open-source framework for efficient multi-GPU inference, optimizing throughput and latency for large-scale deployments.
GO Google Research aggregates open-source research code and datasets from Google, covering machine learning, vision, NLP and other research areas.
GP Open-source GPU cluster manager for efficient model training and high-performance inference orchestration.
KI KitOps is a CNCF-backed open-source project that standardizes packaging AI/ML projects into signable, versioned OCI artifacts.
LI A high-performance, engineering-focused LLM toolkit that provides end-to-end recipes and practical tutorials for training and deploying large models.
LM LMDeploy is a toolkit for compressing, deploying and serving large language models, providing optimized inference engines, quantization and distribution features.
MA A high-performance, highly scalable open-source LLM library and reference implementation built with Python and JAX, targeting Google Cloud TPUs and GPUs.
ME Reference implementation from NVIDIA for large-scale model training and inference with distributed optimizations.
ML MLC LLM is a machine learning compiler and deployment engine that enables high-performance LLM inference across platforms using compilation and runtime optimizations.
ML MLflow is an open-source platform for managing the machine learning lifecycle, including experiment tracking, packaging, model registry and deployment.
ML MLRun is an open-source MLOps platform for building and managing continuous ML applications across their lifecycle.
ML An array framework for machine learning optimized for Apple Silicon, offering NumPy-like Python APIs plus C++, C and Swift bindings.
MO An open, production-grade AI platform including the MAX inference server and Mojo libraries to accelerate model deployment across hardware.
PA Parlant is a compliance-first AI agent framework designed for real-world business scenarios. Deploy in minutes and ensure agents follow your rules.
PH Phoenix is a high-performance web framework built with Elixir, optimized for realtime, distributed, and scalable web applications.
PY An open-source deep learning framework for fast, flexible research and production, featuring dynamic computation graphs and strong GPU acceleration.
SG High-performance open-source framework for LLM and VLM inference, supporting multimodal, extreme concurrency, and flexible frontend programming.
SW SwanLab is an open-source, modern training tracking and visualization tool that supports cloud and self-hosted deployment.
TA Tabby is an open-source, self-hosted AI coding assistant designed for teams that need on-premises deployment and code privacy.
TE NVIDIA's open-source toolbox for optimized LLM inference, designed for efficient GPU serving and enterprise deployment.
TO A PyTorch-native platform for generative model pretraining and distributed optimization.
TR Transformer Engine is an NVIDIA library focused on low-precision training and inference optimizations for Transformer models, supporting formats like FP8 to improve speed and memory efficiency.
TR Explore Transformer Lab, the open-source app for downloading and fine-tuning large models locally or in the cloud with powerful tools and multi-engine support.
VL High-throughput, memory-efficient inference and serving engine for large language models.
WE High-performance in-browser LLM inference engine that leverages WebGPU for hardware-accelerated, privacy-preserving inference in the browser.
WE A machine learning development and observability platform for tracking experiments, managing models and artifacts, and visualizing results across the ML lifecycle.
WO workerd is Cloudflare's open-source JavaScript/Wasm server runtime designed to run Workers-compatible nanoservices and edge applications in local or self-hosted environments.
ZE A unified MLOps framework to develop, evaluate and deploy everything from classical models to multi-agent AI systems.
No projects match the current filters.