LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language
We propose LENS, a modular approach for tackling computer vision problems byleveraging the power of large language models (LLMs). Our system uses alanguage model to reason over outputs from a set of independent and highlydescriptive vision modules that provide exhaustive information about an imag…

ViNT: A Foundation Model for Visual Navigation
A foundation model for visual navigation that generalizes across environments and robots, and can be readily adapted to downstream tasks.

Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors
Generative AI and large language models hold great promise in enhancingcomputing education by powering next-generation educational technologies forintroductory programming. Recent works have studied these models for differentscenarios relevant to programming education; however, these works are li…

Understanding Social Reasoning in Language Models with Language Models
As Large Language Models (LLMs) become increasingly integrated into oureveryday lives, understanding their ability to comprehend human mental statesbecomes critical for ensuring effective interactions. However, despite therecent attempts to assess the Theory-of-Mind (ToM) reasoning capabilities o…

Bring Your Own Data! Self-Supervised Evaluation for Large Language Models
With the rise of Large Language Models (LLMs) and their ubiquitous deploymentin diverse domains, measuring language model behavior on realistic data isimperative. For example, a company deploying a client-facing chatbot mustensure that the model will not respond to client requests with profanity.…
