📢 검색 기능 추가 예정

주민건

기록 = 성장

zoomg

23.08.15 (Tue)

LLM As DBA
Database administrators (DBAs) play a crucial role in managing, maintainingand optimizing a database system to ensure data availability, performance, andreliability. However, it is hard and tedious for DBAs to manage a large numberof database instances (e.g., millions of instances on the cloud da…
AgentBench: Evaluating LLMs as Agents
Large Language Models (LLMs) are becoming increasingly smart and autonomous,targeting real-world pragmatic missions beyond traditional NLP tasks. As aresult, there has been an urgent need to evaluate LLMs as agents on challengingtasks in interactive environments. We present AgentBench, a multi-di…
Studying Large Language Model Generalization with Influence Functions
When trying to gain better visibility into a machine learning model in orderto understand and mitigate the associated risks, a potentially valuable sourceof evidence is: which training examples most contribute to a given behavior?Influence functions aim to answer a counterfactual: how would the m…
Pre-Trained Large Language Models for Industrial Control
For industrial control, developing high-performance controllers with fewsamples and low technical debt is appealing. Foundation models, possessing richprior knowledge obtained from pre-training with Internet-scale corpus, have thepotential to be a good controller with proper prompts. In this pape…
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models’ Alignment
Ensuring alignment, which refers to making models behave in accordance withhuman intentions [1,2], has become a critical task before deploying largelanguage models (LLMs) in real-world applications. For instance, OpenAI devotedsix months to iteratively aligning GPT-4 before its release [3]. Howev…

23.08.13 (Sun)

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Reinforcement learning from human feedback (RLHF) is a technique for trainingAI systems to align with human goals. RLHF has emerged as the central methodused to finetune state-of-the-art large language models (LLMs). Despite thispopularity, there has been relatively little public work systematizi…
Med-Flamingo: a Multimodal Medical Few-shot Learner
Medicine, by its nature, is a multifaceted domain that requires the synthesisof information across various modalities. Medical generative vision-languagemodels (VLMs) make a first step in this direction and promise many excitingclinical applications. However, existing models typically have to be…
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
Despite the advancements of open-source large language models (LLMs) andtheir variants, e.g., LLaMA and Vicuna, they remain significantly limited inperforming higher-level tasks, such as following human instructions to useexternal tools (APIs). This is because current instruction tuning largelyf…
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
We introduce OpenFlamingo, a family of autoregressive vision-language modelsranging from 3B to 9B parameters. OpenFlamingo is an ongoing effort to producean open-source replication of DeepMind’s Flamingo models. On sevenvision-language datasets, OpenFlamingo models average between 80 - 89% ofcor…
The Hydra Effect: Emergent Self-repair in Language Model Computations
We investigate the internal structure of language model computations usingcausal analysis and demonstrate two motifs: (1) a form of adaptive computationwhere ablations of one attention layer of a language model cause another layerto compensate (which we term the Hydra effect) and (2) a counterbal…
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning
The recent progress in large language models (LLMs), especially the inventionof chain-of-thoughts (CoT) prompting, makes it possible to solve reasoningproblems. However, even the strongest LLMs are still struggling with morecomplicated problems that require non-linear thinking and multi-step reas…
Learning to Model the World with Language
To interact with humans in the world, agents need to understand the diversetypes of language that people use, relate them to the visual world, and actbased on them. While current agents learn to execute simple languageinstructions from task rewards, we aim to build agents that leverage diversela…

23.08.09 (Wed)

23.08.08 (Tue)

Great! You’ve successfully signed up.

Welcome back! You've successfully signed in.

You've successfully subscribed to zoomg.

Success! Check your email for magic link to sign-in.

Success! Your billing info has been updated.

Your billing was not updated.