Stars
Refine high-quality datasets and visual AI models
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Supercharge Your LLM Application Evaluations 🚀
Modeling, training, eval, and inference code for OLMo
Browser automation system that uses AI-driven planning to navigate web pages and perform goals.
Label, clean and enrich text datasets with LLMs.
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
a state-of-the-art-level open visual language model | multimodal pre-trained model
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
A simple screen parsing tool towards pure vision based GUI agent
nomic-ai / usearch
Forked from unum-cloud/usearch
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
A RAG LLM co-pilot for browsing the web, powered by local LLMs
Open source Claude Artifacts – built with Llama 3.1 405B
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
🧙 Build, run, and manage data pipelines for integrating and transforming data.
The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.
A simple, easy-to-hack GraphRAG implementation
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
📻 Terminal/ssh/telnet/serialport/RDP/VNC/sftp client (linux, mac, win)
Sensitive-rs is a Rust library for finding, validating, filtering, and replacing sensitive words. It provides efficient algorithms to handle sensitive words, suitable for various application scenar…
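General-purpose body-text extractor for news web pages, Beta version.
Awada is a team knowledge assistant agent built for WeChat scenarios. It can learn online autonomously from sources such as group chats, official accounts, and websites (and also accepts manually uploaded documents), building a private team knowledge base and providing team members with Q&A, document lookup, and writing (Word) services.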
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
A modular graph-based Retrieval-Augmented Generation (RAG) system
Building a modern alternative to Salesforce, powered by the community.
Free and Open Source Enterprise Resource Planning (ERP)