uvafan

Eli Lifland uvafan

Achievements

Stars

princeton-nlp / SWE-agent

[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challen…

Python 13,714 1,392 Updated Nov 15, 2024

elicit / machine-learning-list

A curriculum for learning about foundation models, from scratch to the frontier

958 72 Updated Jun 6, 2024

thestephencasper / everything-you-need

we got you bro

33 Updated Jul 29, 2024

princeton-nlp / SWE-bench

[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 1,992 345 Updated Nov 17, 2024

neoneye / arc-notes

My writings about ARC (Abstraction and Reasoning Corpus)

59 2 Updated Nov 15, 2024

lukasberglund / reversal_curse

Python 266 20 Updated Nov 17, 2023

Sage-Future / fatebook

The fastest way to make and track predictions

TypeScript 33 9 Updated Nov 12, 2024

hamishhuggard / AI-alignment-map

A map of the AI alignment landscape

JavaScript 10 2 Updated Sep 3, 2024

anthropics / evals

239 24 Updated Jul 2, 2024

collin-burns / discovering_latent_knowledge

Python 252 37 Updated Mar 2, 2024

rethinkpriorities / squigglepy

Squiggle programming language for intuitive probabilistic estimation features in Python

Python 65 8 Updated Nov 10, 2024

quantified-uncertainty / squiggle

An estimation language

TypeScript 155 23 Updated Nov 16, 2024

quantified-uncertainty / squiggle-models

Experimental models by the QURI team & others

4 1 Updated Aug 2, 2023

manifoldmarkets / manifold

Manifold Markets: A market for every question

TypeScript 421 157 Updated Nov 16, 2024

nyu-mll / quality

Python 122 8 Updated Sep 3, 2024

QData / TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

Python 2,974 398 Updated Jul 25, 2024

orpatashnik / StyleCLIP

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

HTML 4,000 560 Updated May 30, 2023

csinva / imodels

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

Jupyter Notebook 1,399 124 Updated Nov 6, 2024

hendrycks / math

The MATH Dataset (NeurIPS 2021)

Python 893 86 Updated Aug 5, 2024

nilesc / Long-Structured-Debate-Generation-and-Evaluation

Python 13 8 Updated Dec 8, 2022

kingoflolz / mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Python 6,295 892 Updated Jan 21, 2023

sylinrl / TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Jupyter Notebook 618 71 Updated Nov 6, 2023

AI21Labs / lm-evaluation

Evaluation suite for large-scale language models.

Python 124 14 Updated Aug 15, 2021

reglab / casehold

Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings"

Python 84 17 Updated Mar 27, 2023

hendrycks / test

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,213 92 Updated May 28, 2023

hendrycks / ethics

Aligning AI With Shared Human Values (ICLR 2021)

Python 253 44 Updated Apr 21, 2023

quantified-uncertainty / metaforecast

Fetch forecasts from prediction markets/forecasting platforms to make them searchable. Integrate these forecasts into other services.

TypeScript 63 6 Updated Nov 7, 2024

gruns / icecream

🍦 Never use print() to debug again.

Python 9,149 186 Updated Nov 12, 2024

textflint / textflint

Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

Python 642 95 Updated Sep 27, 2022

rrmenon10 / ADAPET

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

Python 153 15 Updated Jun 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eli Lifland uvafan

Achievements

Achievements

Block or report uvafan

Stars

princeton-nlp / SWE-agent

elicit / machine-learning-list

thestephencasper / everything-you-need

princeton-nlp / SWE-bench

neoneye / arc-notes

lukasberglund / reversal_curse

Sage-Future / fatebook

hamishhuggard / AI-alignment-map

anthropics / evals

collin-burns / discovering_latent_knowledge

rethinkpriorities / squigglepy

quantified-uncertainty / squiggle

quantified-uncertainty / squiggle-models

manifoldmarkets / manifold

nyu-mll / quality

QData / TextAttack

orpatashnik / StyleCLIP

csinva / imodels

hendrycks / math

nilesc / Long-Structured-Debate-Generation-and-Evaluation

kingoflolz / mesh-transformer-jax

sylinrl / TruthfulQA

AI21Labs / lm-evaluation

reglab / casehold

hendrycks / test

hendrycks / ethics

quantified-uncertainty / metaforecast

gruns / icecream

textflint / textflint

rrmenon10 / ADAPET