Skip to content
OpenCatalogcurated by FLOSSK

Browse & filter

Filter by platform, license text, maturity, maintenance cadence, and editorial tags like privacy-focused or self-hosted. Search matches names, summaries, tags, and use cases.

22 tools match your filters

Alibaba’s lightweight inference engine for mobile and edge—used for on-device LLMs and classic CV models with aggressive optimization.

inferenceedgemobilellmtaaft-repositories

Alibaba’s high-performance LLM inference engine (CUDA-focused) for production serving of diverse decoder architectures.

llminferenceservinggputaaft-repositories

Physics-ML / scientific deep learning framework: neural operators, PINNs, and domain-parallel training on GPUs.

physics-mlsimulationpytorchgputaaft-repositories

NVIDIA library for FP8/FP4 and fused kernels on Hopper/Ada-class GPUs to accelerate Transformer training and inference.

trainingtransformersfp8nvidiataaft-repositories

NVIDIA research-oriented toolkit for LLM KV-cache compression to stretch context within fixed VRAM budgets.

llmkv-cachecompressioninferencetaaft-repositories

Flexible, high-performance serving system for TensorFlow (and related) models with versioning, batching, and gRPC/REST.

servingtensorflowinferencegrpctaaft-repositories

Retargetable MLIR-based compiler and runtime to lower ML graphs to CPUs, GPUs, and accelerators from multiple frontends.

compilermlirruntimedeploymenttaaft-repositories

AutoTrain Advanced: low-code training flows for classification, LLM fine-tunes, and diffusion tasks tied to the Hub.

fine-tuningautomlhuggingfacetaaft-repositories

TypeScript/JavaScript libraries to call Inference API, manage Hub assets, and build browser or Node AI features.

huggingfacejavascripttypescriptinferencetaaft-repositories

Open-source Svelte/TypeScript app that powers HuggingChat—multi-model chat, tools, and self-hostable UI patterns.

chatuillmself-hostedtaaft-repositories

Rust LSP server that plugs LLM-backed completions into editors—designed to pair with local or API models.

lspidellmdeveloper-toolstaaft-repositories

Contrastive vision–language pretraining reference implementation: map images and text to a shared embedding space.

multimodalvisionnlpembeddingstaaft-repositories

Google Research pretrained time-series foundation model for forecasting with open Apache-2.0 code and checkpoints.

time-seriesforecastingfoundation-modeltaaft-repositories

Google library to extract structured fields from unstructured text with LLMs, source grounding, and visualization helpers.

llmextractionstructured-outputtaaft-repositories

ByteDance open agent harness for long-horizon research, coding, and creation with tools, memory, and subagents.

agentsorchestrationllmtaaft-repositories

DeepSeek Janus series: unified multimodal understanding and generation models with MIT-licensed research code.

multimodalvisionllmdeepseektaaft-repositories

Open-source TypeScript ‘AI coworker’ framework with memory, tool use, and agent workflows for product integration.

agentstypescriptmemorytaaft-repositories

Apple’s Python utilities to convert, compress, and validate models for Core ML deployment on Apple devices.

coremlon-deviceappleconversiontaaft-repositories