AI

AI leaderboards are no longer useful. It's time to swit...

What spending $2,000 can tell us about evaluating AI agents

On the Societal Impact of Open Foundation Models

Adding precision to the debate on openness in AI

AI existential risk probabilities are too unreliable to...

How speculation gets laundered through pseudo-quantification

What the executive order means for openness in AI

Good news on paper, but the devil is in the details

AI Snake Oil is now available to preorder

What artificial intelligence can do, what it can't, and how to tell the difference

Introducing Redesigned Navigation, Run Groups, Reports,...

We’ve been working on these improvements for quite some time, so it’s exciting t...

LLM For Structured Data

It is estimated that 80% to 90% of the data worldwide is unstructured. However, ...

3 Takes on End-to-End For the MLOps Stack: Was It Worth...

As machine learning (ML) drives innovation across industries, organizations seek...

How Transparent Are Foundation Model Developers?

Introducing the Foundation Model Transparency Index

ML/AI Platform Build vs Buy Decision: What Factors to C...

An ML/AI platform provides a coherent collection of tools and frameworks to buil...

Strategies For Effective Prompt Engineering

When I first delved into machine learning, prompt engineering seemed like a nich...

Adversarial Machine Learning: Defense Strategies

The growing prevalence of ML models in business-critical applications results in...

Evaluating LLMs is a minefield

Annotated slides from a recent talk

LLM Evaluation For Text Summarization

Text summarization is a prime use case of LLMs (Large Language Models). It aims ...

Building LLM Applications With Vector Databases

As a Machine Learning Engineer working with many companies, I repeatedly encount...

Is the future of AI open or closed? Watch today’s Princ...

By Sayash Kapoor, Rishi Bommasani, Percy Liang, Arvind Narayanan Perhaps the big...

This website uses cookies to enhance your browsing experience. By continuing to use this site, you consent to the use of cookies. Please review our Privacy Policy for more information on how we handle your data.