Platform & Infrastructure

Cloud-native AI platform capabilities and infrastructure.

61 projects, 3 subcategories, and 22 tags tracked.

Kubernetes-native AI infrastructure and scheduling.

HolmesGPT

An AI agent platform for cloud-native environments that automates alert investigation, root cause analysis, and remediation suggestions.

Open WebUI

A scalable, feature-rich web interface for interacting with large language models, providing a ChatGPT-like experience with support for multiple models and customization options.

Volcano

Volcano is a Kubernetes-native batch scheduling system (a CNCF project) that extends Kubernetes scheduling with advanced features for batch, HPC, and AI workloads.

Data platforms, lakehouse stacks, and data services.

3FS

A high-performance distributed file system designed for AI training and inference workloads, optimizing parallel I/O and data locality to support large-scale training.

AIPyApp

An open-source tool that integrates an interactive Python environment with LLMs for natural-language-driven Python execution and automation.

Apache Doris

Apache Doris is an easy-to-use, high-performance unified analytics database for real-time and offline analysis.

cuDF

A GPU DataFrame library that accelerates data analysis and tabular computing.

CVAT

CVAT is an industry-leading computer vision annotation tool suitable for annotation at any scale.

Datachain

ETL, analytics, and versioning for unstructured data to build reproducible and auditable data pipelines.

DataFlow

A data preparation and pipeline platform for domain training and retrieval-augmented generation.

Deep Lake

A database for AI optimized for storing, querying and versioning vectors and multimodal data (images, video, audio, text) for LLM and deep learning workflows.

DocuTranslate

DocuTranslate is a lightweight document translation tool leveraging LLMs and multiple parsing engines.

DuckDB

An analytical, in-process SQL database suited for interactive queries, ETL, and local analytics.

Faiss

A high-performance library for similarity search and clustering of dense vectors, suitable for large-scale vector retrieval.

Jupyter Notebook

Interactive computing environment widely used for data science and machine learning development.

Label Studio

Label Studio is a multi-type data labeling and annotation tool with standardized output formats.

pgvector

pgvector is an open-source PostgreSQL extension that adds vector data types and similarity search, supporting exact search as well as approximate search through HNSW and IVFFlat indexes inside Postgres.

Proton

Proton is a single-binary C++ high-performance SQL stream processing engine designed for real-time analytics and stream ETL.

PyMuPDF

A high-performance Python library for data extraction, analysis, conversion, and manipulation of PDF and other documents.

TensorFlow

Google's open-source end-to-end machine learning platform for building and training deep learning models.

Unstract

A no-code LLM platform to convert unstructured documents into structured data and quickly launch APIs and ETL pipelines.

Valkey

A high-performance distributed key-value database optimized for caching and real-time workloads.

verl

A reinforcement learning training framework for large models, designed for scalable RLHF and agent training.

Deployment pipelines and operations tooling.

Agent Development Kit (ADK)

ADK is an open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Apache Superset

An open-source data visualization and exploration platform supporting interactive dashboards, SQL-based analysis, and multiple data sources.

AutoGPT

AutoGPT is a platform to build, deploy, and run continuous AI agents, supporting self-hosted and platform deployments.

AXLearn

An extensible deep learning library built on JAX/XLA, designed for developing, training and deploying large-scale models.

Candle

Candle by Hugging Face: a minimalist, high-performance ML framework in Rust designed for serverless inference and lightweight deployments.

Coze Loop

ByteDance's open-source platform-level solution for AI Agent development and operations, providing full lifecycle management capabilities.

Crawl4AI

An open-source web crawler and scraper optimized for large language model workflows, producing clean Markdown and structured data with browser control and Docker deployment.

Dask

Dask is a Python library for parallel computing and task scheduling, suited for scaling NumPy, Pandas and machine learning workloads across clusters.

Dify

Open-source LLM application development platform providing visual AI application building tools and enterprise-grade deployment solutions.

DLRover

DLRover is an automatic distributed deep learning system that provides elastic scheduling, flash checkpointing and auto-scaling to simplify large-scale model training on Kubernetes and Ray.

Dynamo

Dynamo is NVIDIA's open-source framework for efficient multi-GPU inference, optimizing throughput and latency for large-scale deployments.

Google Research

Google Research aggregates open-source research code and datasets from Google, covering machine learning, vision, NLP and other research areas.

gpustack

Open-source GPU cluster manager for efficient model training and high-performance inference orchestration.

KitOps

KitOps is a CNCF-backed open-source project that standardizes packaging AI/ML projects into signable, versioned OCI artifacts.

LitGPT

A high-performance, engineering-focused LLM toolkit that provides end-to-end recipes and practical tutorials for training and deploying large models.

LMDeploy

LMDeploy is a toolkit for compressing, deploying, and serving large language models, providing optimized inference engines, quantization, and distributed serving features.

MaxText

A high-performance, highly scalable open-source LLM library and reference implementation built with Python and JAX, targeting Google Cloud TPUs and GPUs.

Megatron-LM

Reference implementation from NVIDIA for large-scale model training and inference with distributed optimizations.

MLC LLM

MLC LLM is a machine learning compiler and deployment engine that enables high-performance LLM inference across platforms using compilation and runtime optimizations.

MLflow

MLflow is an open-source platform for managing the machine learning lifecycle, including experiment tracking, packaging, model registry and deployment.

MLRun

MLRun is an open-source MLOps platform for building and managing continuous ML applications across their lifecycle.

MLX

An array framework for machine learning optimized for Apple Silicon, offering NumPy-like Python APIs plus C++, C and Swift bindings.

Modular Platform

An open, production-grade AI platform including the MAX inference server and Mojo libraries to accelerate model deployment across hardware.

Parlant

Parlant is a compliance-first AI agent framework designed for real-world business scenarios, built for quick deployment and for keeping agents within user-defined rules.

Phoenix

Phoenix is a high-performance web framework built with Elixir, optimized for realtime, distributed, and scalable web applications.

PyTorch

An open-source deep learning framework for fast, flexible research and production, featuring dynamic computation graphs and strong GPU acceleration.

SGLang

High-performance open-source framework for LLM and VLM inference, supporting multimodal inputs, high-concurrency serving, and flexible frontend programming.

SwanLab

SwanLab is an open-source, modern training tracking and visualization tool that supports cloud and self-hosted deployment.

Tabby

Tabby is an open-source, self-hosted AI coding assistant designed for teams that need on-premises deployment and code privacy.

TensorRT-LLM

NVIDIA's open-source toolbox for optimized LLM inference, designed for efficient GPU serving and enterprise deployment.

TorchTitan

A PyTorch-native platform for generative model pretraining and distributed optimization.

Transformer Engine

Transformer Engine is an NVIDIA library focused on low-precision training and inference optimizations for Transformer models, supporting formats like FP8 to improve speed and memory efficiency.

Transformer Lab

Transformer Lab is an open-source app for downloading and fine-tuning large models locally or in the cloud, with multi-engine support.

vLLM

High-throughput, memory-efficient inference and serving engine for large language models.

WebLLM

High-performance in-browser LLM inference engine that leverages WebGPU for hardware-accelerated, privacy-preserving inference in the browser.

Weights & Biases (W&B)

A machine learning development and observability platform for tracking experiments, managing models and artifacts, and visualizing results across the ML lifecycle.

workerd

workerd is Cloudflare's open-source JavaScript/Wasm server runtime designed to run Workers-compatible nanoservices and edge applications in local or self-hosted environments.

ZenML

A unified MLOps framework to develop, evaluate and deploy everything from classical models to multi-agent AI systems.
