Social Foundations of Computation

All

12 repositories

causal-features
Public
Code to reproduce the paper "Predictors from causal features do not generalize better to new domains"
Python
•
Other
•7•5•0•0•Updated Oct 23, 2024Oct 23, 2024
folktexts
Public
Get classification risk scores on tabular tasks using LLMs
python machine-learning tabular-data transformers large-language-models
Jupyter Notebook
•
MIT License
•0•9•0•0•Updated Oct 4, 2024Oct 4, 2024
surveying-language-models
Public
Code to reproduce the paper "Questioning the Survey Responses of Large Language Models"
Jupyter Notebook
•
MIT License
•1•7•0•0•Updated Sep 23, 2024Sep 23, 2024
lm-evaluation-harness
Public
A framework for few-shot evaluation of language models.
Python
•
MIT License
•1.8k•1•0•0•Updated Sep 20, 2024Sep 20, 2024
lawma
Public
Lawma: A lightly fine-tuned Llama model for legal classification tasks.
language-model legaltech legaltools
Jupyter Notebook
•0•9•0•0•Updated Sep 14, 2024Sep 14, 2024
benchbench
Public
BenchBench is a Python package to evaluate multi-task benchmarks.
Python
•
MIT License
•1•11•0•0•Updated Jul 18, 2024Jul 18, 2024
training-on-the-test-task
Public
Code to reproduce the experiments in the paper Training on the Test Task Confounds Evaluation and Emergence.
Jupyter Notebook
•0•6•0•0•Updated Jul 14, 2024Jul 14, 2024
folktables
Public
Datasets derived from US census data
Python
•
MIT License
•20•236•5•3•Updated May 15, 2024May 15, 2024
error-parity
Public
Achieve error-rate fairness between societal groups for any score-based classifier.
Python
•
MIT License
•5•16•0•1•Updated Apr 26, 2024Apr 26, 2024
tttlm
Public
Test-time-training on nearest neighbors for large language models
Python
•
MIT License
•4•24•0•0•Updated Apr 18, 2024Apr 18, 2024
backward_baselines
Public
Code for "Is your model predicting the past?"
Jupyter Notebook
•
MIT License
•0•1•0•0•Updated Mar 10, 2024Mar 10, 2024
whynot
Public
A Python sandbox for decision making in dynamics
Python
•
MIT License
•45•417•8•2•Updated Aug 21, 2023Aug 21, 2023