lighteval Public
Forked from huggingface/lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Python MIT License Updated Sep 3, 2024
datatrove Public
Forked from huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Python Apache License 2.0 Updated Jun 14, 2024
ai-town Public
Forked from a16z-infra/ai-town
An MIT-licensed, deployable starter kit for building and customizing your own version of AI town, a virtual town where AI characters live, chat and socialize.
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python MIT License Updated May 6, 2024
wimbd Public
Forked from allenai/wimbd
What's In My Big Data (WIMBD): a toolkit for analyzing large text datasets
Python Apache License 2.0 Updated Mar 12, 2024
azure-search-openai-demo Public
Forked from Azure-Samples/azure-search-openai-demo
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
Python MIT License Updated Feb 19, 2024
This repository contains scripts for downloading the CZE-NEC dataset.
Klokan Public
Repository with scraping scripts for the Klokan dataset
Jupyter Notebook Updated Jan 5, 2024
lilac Public
Forked from lilacai/lilac
Curate better data for LLMs
Python Apache License 2.0 Updated Jan 5, 2024
jetbrains-fc-name Public
Contains my solution for the JetBrains repository fc name prediction
Jupyter Notebook Updated Dec 29, 2023
runpod-templates Public
Collection of templates I use for running run-pod nodes
Shell Updated Dec 26, 2023
pyairtable Public
Forked from gtalarico/pyairtable
Python API Client for Airtable
Python MIT License Updated Dec 8, 2023
GPTTextMaster Public
Chrome extension for text manipulation using GPT
Apache License 2.0 Updated Dec 2, 2023
pebble Public
Forked from noxdafox/pebble
Multi threading and processing eye-candy.
Python GNU Lesser General Public License v3.0 Updated Sep 22, 2023
axolotl Public
Forked from axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
Python Apache License 2.0 Updated Sep 20, 2023
mlflow Public
Forked from mlflow/mlflow
Open source platform for the machine learning lifecycle
Python Apache License 2.0 Updated Aug 24, 2023
openai-python Public
Forked from openai/openai-python
The OpenAI Python library provides convenient access to the OpenAI API from applications written in the Python language.
Python MIT License Updated Aug 23, 2023
unstructured Public
Forked from Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
HTML Apache License 2.0 Updated Aug 4, 2023