Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
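A minimal sketch of schema-enforced generation, assuming the node-llama-cpp v3 API (`getLlama`, `createGrammarForJsonSchema`); the model path is a placeholder:

```typescript
import path from "path";
import { getLlama, LlamaChatSession } from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({
    modelPath: path.join("models", "model.gguf") // placeholder path
});
const context = await model.createContext();
const session = new LlamaChatSession({
    contextSequence: context.getSequence()
});

// Constrain generation so the output always matches this JSON schema.
const grammar = await llama.createGrammarForJsonSchema({
    type: "object",
    properties: {
        name: { type: "string" },
        age: { type: "number" }
    }
});

const answer = await session.prompt("Describe a person", { grammar });
const person = grammar.parse(answer); // typed, schema-valid object
```

Because the schema is enforced at the token level during sampling, the output never needs retry-and-validate loops.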
This library provides a high-performance Node.js client for Baseten.co endpoints, including embeddings, reranking, and classification. It is built for massive concurrent POST requests to any URL, including endpoints outside of baseten.co.
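A hypothetical usage sketch based on the description above; the constructor signature, method names, and option names are assumptions, not verified against the package:

```typescript
// Names below are illustrative assumptions modeled on the described features.
import { PerformanceClient } from "baseten-performance-client";

const client = new PerformanceClient(
    "https://model-xxxxx.api.baseten.co/environments/production/sync", // placeholder URL
    process.env.BASETEN_API_KEY
);

const texts = Array.from({ length: 10_000 }, (_, i) => `document ${i}`);

// Fan the workload out as many concurrent batched POST requests.
const response = await client.embed({
    input: texts,
    model: "my-embedding-model",   // placeholder model name
    batchSize: 16,
    maxConcurrentRequests: 128
});
```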
A native Capacitor plugin that embeds llama.cpp directly into mobile apps, enabling offline AI inference with a chat-first API design. Supports both simple text generation and advanced chat conversations with system prompts, as well as multimodal processing, TTS, and LoRA adapters.
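A hypothetical sketch of the chat-first design described above; the plugin import and method signatures are assumptions for illustration only:

```typescript
// Plugin name and API are assumptions, not the plugin's verified interface.
import { LlamaCpp } from "capacitor-llama-cpp";

await LlamaCpp.loadModel({ path: "models/model.gguf" }); // placeholder path

// Simple one-shot text generation.
const { text } = await LlamaCpp.complete({ prompt: "Summarize this note: ..." });

// Chat conversation with a system prompt, all inference on-device.
const reply = await LlamaCpp.chat({
    messages: [
        { role: "system", content: "You are a concise assistant." },
        { role: "user", content: "What runs offline here?" }
    ]
});
```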
Advanced n8n community node for intelligent document retrieval with multi-step reasoning, reranking, and comprehensive debugging
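A generic sketch of the multi-step retrieve-then-rerank pattern this node implements; the function names are illustrative, not the node's actual code:

```typescript
// Illustrative pattern only: over-fetch candidates, rerank, keep the best.
interface Doc { id: string; text: string; score: number }

async function retrieveWithRerank(
    query: string,
    vectorSearch: (q: string, k: number) => Promise<Doc[]>,
    rerank: (q: string, docs: Doc[]) => Promise<Doc[]>,
    topK = 5
): Promise<Doc[]> {
    // Step 1: over-fetch candidates from the vector store.
    const candidates = await vectorSearch(query, topK * 4);
    // Step 2: rescore candidates against the query with a reranker.
    const reranked = await rerank(query, candidates);
    // Step 3: keep the top results; scores remain attached for debugging.
    return reranked.slice(0, topK);
}
```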