Evaluating LLMs is a minefield

Annotated slides from a recent talk

LLM Evaluation For Text Summarization

Text summarization is a prime use case of LLMs (Large Language Models). It aims ...

Building LLM Applications With Vector Databases

As a Machine Learning Engineer working with many companies, I repeatedly encount...

Is the future of AI open or closed? Watch today’s Princeton-Stanford workshop

By Sayash Kapoor, Rishi Bommasani, Percy Liang, Arvind Narayanan Perhaps the big...

LLM Training: RLHF and Its Alternatives

I frequently reference a process called Reinforcement Learning with Human Feedba...

From Self-Alignment to LongLoRA

Another month, another round of interesting research papers ranging from large l...

LLM Business and Busyness: Recent Company Investments and AI Adoption, New Small Openly Available LLMs, and LoRA Research

Discussing Recent Company Investments and AI Adoption, New Small Openly Availabl...

AI and Open Source in 2023

The Highs and Lows: A Year in Review

Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)

Things I Learned From Hundreds of Experiments

A Potential Successor to RLHF for Efficient LLM Alignment and the Resurgence of CNNs

From Vision Transformers to innovative large language model finetuning technique...

Tackling Hallucinations, Boosting Reasoning Abilities, and New Insights into the Transformer Architecture

This month, I want to focus on three papers that address three distinct problem ...

Ten Noteworthy AI Research Papers of 2023

This year has felt distinctly different. I've been working in, on, and with mach...

Understanding and Coding Self-Attention, Multi-Head Attention, Cross-Attention, and Causal-Attention in LLMs

This article will teach you about self-attention mechanisms used in transformer ...

Model Merging, Mixtures of Experts, and Towards Smaller LLMs

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

Low-rank adaptation (LoRA) is a machine learning technique that modifies a pretr...

Research Papers in February 2024: A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM Research

Once again, this has been an exciting month in AI research. This month, I'm cove...
