Curriculum, personal interests, and reference material.
- Category Theory - Bartosz Milewski
- Category Theory 2 - Bartosz Milewski
- Category Theory 3 - Bartosz Milewski
- Applied Category Theory - David Spivak / Brendan Fong, MIT 18.S097
- Categorical Databases talk, David Spivak
- Category theory is a universal modeling language
- Probabilistic Systems Analysis and Applied Probability - MIT 6.041
- MIT 6.262 Discrete Stochastic Processes, Spring 20
- MIT 18.650 Statistics for Applications, Fall 20
- Artificial Intelligence - Patrick Winston, MIT 6.034
- CS480/680 Machine Learning - University of Waterloo
- Machine Learning, Andrew Ng - Stanford
- Machine Learning, Andrew Ng
- Deep Learning 2015, Nando de Freitas - Oxford
- Cornell CS4780 Machine Learning for Intelligent Systems
- The spelled-out intro to neural networks and backpropagation series - Andrej Karpathy
- Introduction to Data-Centric AI - MIT IAP 2023
- Natural Language Processing - Dan Jurafsky / Chris Manning - Broken link :(
- Stanford CS224N: NLP with Deep Learning | Winter 2019
- Deep Learning for NLP at Oxford with DeepMind 2017
- Intro to Reinforcement Learning - David Silver
- Advanced Deep Learning & Reinforcement Learning
- CS885 Reinforcement Learning - University of Waterloo
- Neural Networks and the Chomsky Hierarchy (A comparative study on model generalization)
- Liquid Time-constant Networks
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
- Event-Based Backpropagation can compute Exact Gradients for Spiking Neural Networks
- The Forward-Forward Algorithm: Some Preliminary Investigations
- The Predictive Forward-Forward Algorithm
- Knowledge is a Region in Weight Space for Fine-tuned Language Models
- Beyond neural scaling laws: beating power law scaling via data pruning
- LoRA Learns Less and Forgets Less
- Attention Is All You Need
- Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
- Rethinking Search: Making Experts out of Dilettantes
- Toolformer: Language Models Can Teach Themselves to Use Tools
- BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
- Reformer: The Efficient Transformer
- Semantic Tokenizer for Enhanced Natural Language Processing
- Unlimiformer: Long-Range Transformers with Unlimited Length Input
- The Power of Scale for Parameter-Efficient Prompt Tuning
- LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
- Prompting Is Programming: A Query Language for Large Language Models (LMQL)
- Fine-Tuning Language Models with Just Forward Passes - code
- Orca: Progressive Learning from Complex Explanation Traces of GPT-4
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models
- MemGPT: Towards LLMs as Operating Systems
- The Curious Case of Neural Text Degeneration
- LoRA: Low-Rank Adaptation of Large Language Models
- Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
- FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
- RWKV: Reinventing RNNs for the Transformer Era
- Think before you speak: Training Language Models With Pause Tokens
- The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces
- In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
- Chameleon: Mixed-Modal Early-Fusion Foundation Models
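As a pointer into the transformer papers above ("Attention Is All You Need"), here is a minimal NumPy sketch of scaled dot-product attention, softmax(QK^T / sqrt(d_k))V; the function name and shapes are illustrative, not from any particular codebase:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V for 2-D query/key/value matrices."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    # Numerically stable softmax over the key dimension.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V
```

Each output row is a convex combination of the rows of V, weighted by query-key similarity.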
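The LoRA papers listed above ("LoRA: Low-Rank Adaptation of Large Language Models", "LoRA Learns Less and Forgets Less") rest on one idea: freeze the pretrained weight W and train only a low-rank delta B @ A. A toy NumPy sketch (dimensions and names are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, r = 8, 8, 2                       # output dim, input dim, LoRA rank (r << min(d, k))

W = rng.standard_normal((d, k))         # frozen pretrained weight
A = rng.standard_normal((r, k)) * 0.01  # trainable down-projection
B = np.zeros((d, r))                    # trainable up-projection; zero init => delta is 0 at start

def lora_forward(x, scale=1.0):
    """Forward pass with the adapted weight W + scale * (B @ A)."""
    return x @ (W + scale * (B @ A)).T
```

Because B starts at zero, the adapted model is exactly the pretrained model at initialization; only 2 * r * k-ish parameters are trained instead of d * k.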
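"The Curious Case of Neural Text Degeneration" is where nucleus (top-p) sampling comes from: keep only the smallest set of tokens whose cumulative probability reaches p, then renormalize. A hedged sketch of that filtering step (function name is mine, not from the paper):

```python
import numpy as np

def top_p_filter(probs, p=0.9):
    """Zero out all but the smallest prefix of tokens (by descending probability)
    whose cumulative mass reaches p, then renormalize."""
    order = np.argsort(probs)[::-1]          # token indices, most probable first
    cum = np.cumsum(probs[order])
    cutoff = np.searchsorted(cum, p) + 1     # size of the nucleus
    keep = order[:cutoff]
    filtered = np.zeros_like(probs)
    filtered[keep] = probs[keep]
    return filtered / filtered.sum()
```

Sampling from the filtered distribution truncates the unreliable low-probability tail while adapting the nucleus size to how peaked the distribution is.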