Papers Read on AI

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2024/4/24

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whos

Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

2024/4/23

Large-scale recommendation systems are characterized by their reliance on high cardinality, heteroge

Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

2024/4/22

We study how to apply large language models to write grounded and organized long-form articles from

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

2024/4/19

In this work, we introduce Mini-Gemini, a simple and effective framework enhancing multi-modality Vi

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

2024/4/18

We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a single image,

From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples

2024/4/16

We analyze how well pre-trained large language models (e.g., Llama2, GPT-4, Claude 3, etc) can do li

AutoCodeRover: Autonomous Program Improvement

2024/4/15

Researchers have made significant progress in automating the software development process in the pas

TrustLLM: Trustworthiness in Large Language Models

2024/4/15

Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their e

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

2024/4/12

In this study, we propose AniPortrait, a novel framework for generating high-quality animation drive

Fast Timing-Conditioned Latent Audio Diffusion

2024/4/11

Generating long-form 44.1kHz stereo audio from text prompts can be computationally demanding. Furthe

Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

2024/4/10

Creating high-fidelity 3D head avatars has always been a research hotspot, but there remains a great

ReFT: Representation Finetuning for Language Models

2024/4/9

Parameter-efficient fine-tuning (PEFT) methods seek to adapt large models via updates to a small num

Long-form factuality in large language models

2024/4/8

Large language models (LLMs) often generate content that contains factual errors when responding to

Jamba: A Hybrid Transformer-Mamba Language Model

2024/4/6

We present Jamba, a new base large language model based on a novel hybrid Transformer-Mamba mixture-

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

2024/4/5

Recently years have witnessed a rapid development of large language models (LLMs). Despite the stron

MegaBlocks: Efficient Sparse Training with Mixture-of-Experts

2024/4/4

We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE) training on GPUs. Our system

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

2024/4/3

We introduce VoiceCraft, a token infilling neural codec language model, that achieves state-of-the-a

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

2024/4/2

This paper focuses on task-agnostic prompt compression for better generalizability and efficiency. C

Evolutionary Optimization of Model Merging Recipes

2024/3/27

We present a novel application of evolutionary algorithms to automate the creation of powerful found

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

2024/3/26

Jailbreak attacks are crucial for identifying and mitigating the security vulnerabilities of Large L

Episodes