[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 874 103 Updated Oct 7, 2024

A collection of Jupyter notebooks dedicated to learning and exploring the LangChain framework.

Jupyter Notebook 331 76 Updated Jan 16, 2024

Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"

Python 36 5 Updated Oct 15, 2024

Crawlers for Xiaohongshu, WeChat Official Accounts, and Mafengwo

Python 12 1 Updated Sep 12, 2023

The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.

99 8 Updated Oct 9, 2024

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

Python 17,745 1,918 Updated Nov 18, 2024

[EMNLP 2024 Industry Track & KDD UrbComp 2024 Best Paper Award] ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning

Python 21 2 Updated Nov 9, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,817 450 Updated Jun 22, 2024

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Python 247 32 Updated Oct 24, 2024

Converts Zhihu column articles to Markdown files and saves them locally

Python 251 40 Updated Jun 6, 2024

LUFY: A RAG Chatbot that forgets unimportant conversations

Python 2 Updated Sep 20, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 741 45 Updated Nov 2, 2024

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

Python 129 18 Updated Sep 20, 2024

[LLM] Train a small 26M-parameter GPT entirely from scratch in 3 hours; inference and training run on a personal GPU!

Python 2,719 330 Updated Nov 10, 2024

LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets

Python 34 1 Updated Sep 30, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,134 454 Updated Nov 6, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 455 35 Updated Nov 4, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,572 350 Updated Oct 17, 2024

A family of compressed models obtained via pruning and knowledge distillation

282 17 Updated Nov 13, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,488 4,123 Updated Nov 15, 2024

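Advanced Multi-Turn QA System with LLM and Intent Recognition. Implements multi-turn QA and NL2API using LLM-based intent recognition and parameter extraction combined with slot filling. Aims to establish best practices for Function Call multi-turn QA.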

Python 483 65 Updated Aug 15, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,348 120 Updated Jun 13, 2024

Expert Specialized Fine-Tuning

Python 145 13 Updated Sep 22, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 11,883 1,536 Updated Feb 29, 2024

Start building LLM-empowered multi-agent applications in an easier way.

Python 5,280 324 Updated Nov 11, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,016 368 Updated Oct 29, 2024

Making large AI models cheaper, faster and more accessible

Python 38,817 4,346 Updated Nov 18, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 745 56 Updated Oct 11, 2024

The Memory layer for your AI apps

Python 22,863 2,103 Updated Nov 18, 2024

Implementation of BEAST adversarial attack for language models (ICML 2024)

Python 73 4 Updated May 14, 2024