wenjie-yuan

Follow

wenjie-yuan

Follow

0 followers · 4 following

Stars

horseee / LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 874 103 Updated Oct 7, 2024

XingYu-Zhong / LangChainStudy

这个项目是一个Jupyter notebook的集合，专门用于学习和探索LangChain框架。

Jupyter Notebook 331 76 Updated Jan 16, 2024

tianyi-lab / MoE-Embedding

Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"

Python 36 5 Updated Oct 15, 2024

wzk1015 / Scraper

小红书、微信公众号、马蜂窝爬虫

Python 12 1 Updated Sep 12, 2023

YHPeter / Awesome-RAG-Evaluation

The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.

99 8 Updated Oct 9, 2024

deepset-ai / haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

Python 17,745 1,918 Updated Nov 18, 2024

YihongT / ITINERA

[EMNLP 2024 Industry Track & KDD UrbComp 2024 Best Paper Award] ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning

Python 21 2 Updated Nov 9, 2024

princeton-nlp / tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,817 450 Updated Jun 22, 2024

OSU-NLP-Group / TravelPlanner

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Python 247 32 Updated Oct 24, 2024

chenluda / zhihu-download

将知乎专栏文章转换为 Markdown 文件保存到本地

Python 251 40 Updated Jun 6, 2024

ryuichi-sumida / LUFY

LUFY: A RAG Chatbot that forgets unimportant conversations

Python 2 Updated Sep 20, 2024

ContextualAI / HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 741 45 Updated Nov 2, 2024

wuhy68 / Parameter-Efficient-MoE

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

Python 129 18 Updated Sep 20, 2024

jingyaogong / minimind

「大模型」3小时完全从0训练26M的小参数GPT，个人显卡即可推理训练！

Python 2,719 330 Updated Nov 10, 2024

WowCZ / LongMIT

LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets

Python 34 1 Updated Sep 30, 2024

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,134 454 Updated Nov 6, 2024

allenai / OLMoE

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 455 35 Updated Nov 4, 2024

togethercomputer / RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,572 350 Updated Oct 17, 2024

NVlabs / Minitron

A family of compressed models obtained via pruning and knowledge distillation

282 17 Updated Nov 13, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,488 4,123 Updated Nov 15, 2024

answerlink / IntelliQ

Advanced Multi-Turn QA System with LLM and Intent Recognition. 基于LLM大语言模型意图识别、参数抽取结合slot词槽技术实现多轮问答、NL2API. 打造Function Call多轮问答最佳实践

Python 483 65 Updated Aug 15, 2024

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,348 120 Updated Jun 13, 2024

deepseek-ai / ESFT

Expert Specialized Fine-Tuning

Python 145 13 Updated Sep 22, 2024

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 11,883 1,536 Updated Feb 29, 2024

modelscope / agentscope

Start building LLM-empowered multi-agent applications in an easier way.

Python 5,280 324 Updated Nov 11, 2024

yuanzhoulvpi2017 / zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,016 368 Updated Oct 29, 2024

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 38,817 4,346 Updated Nov 18, 2024

TencentARC / SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 745 56 Updated Oct 11, 2024

mem0ai / mem0

The Memory layer for your AI apps

Python 22,863 2,103 Updated Nov 18, 2024

vinusankars / BEAST

Implementation of BEAST adversarial attack for language models (ICML 2024)

Python 73 4 Updated May 14, 2024