The science shaping our industry
Exploring the seminal concepts that drive modern AI development.
Foundation Models
Scale & Architecture
The scaling laws and architectural innovations driving the next generation of large language models.
Alignment & Safety
Constitutional AI & RLHF
Techniques for aligning model behavior with human intent, including RLHF, RLAIF, and Constitutional AI.
Synthetic Data
Data Augmentation
Leveraging generative models to create high-quality synthetic training data for reasoning and coding tasks.
Evaluation
Benchmarking & Metrics
Rigorous methodologies for assessing model performance, truthfulness, and reasoning capabilities.
Efficient Inference
Optimization & Serving
Techniques for reducing the computational cost and latency of deploying large models.
Human-AI Collaboration
Interactive Systems
Designing interfaces and workflows that enable effective collaboration between humans and AI agents.
Profound industry research & insights.
A curated collection of seminal papers shaping the future of Artificial Intelligence.
Constitutional AI: Harmlessness from AI Feedback
Anthropic
Introduces a method for training a harmless AI assistant through self-improvement, without human labels identifying harmful outputs. The model is trained to critique and revise its own responses against a set of guiding principles (a 'constitution').
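To illustrate the mechanism, here is a minimal Python sketch of the supervised critique-and-revise loop the paper describes; `generate`, the prompt templates, and the two example principles are placeholder stand-ins, not code or text from the paper.

```python
# Hypothetical sketch of the "critique -> revise" stage of Constitutional AI.
import random

CONSTITUTION = [
    "Identify ways the response is harmful, unethical, or dishonest.",
    "Identify ways the response fails to be helpful while remaining harmless.",
]

def generate(prompt: str) -> str:
    """Stand-in for sampling from the assistant model; replace with a real model call."""
    return f"<model output for: {prompt[:40]}...>"

def critique_and_revise(user_prompt: str, n_rounds: int = 2) -> str:
    response = generate(user_prompt)
    for _ in range(n_rounds):
        principle = random.choice(CONSTITUTION)   # sample one constitutional principle
        critique = generate(
            f"Prompt: {user_prompt}\nResponse: {response}\nCritique request: {principle}"
        )
        response = generate(                      # revise in light of the critique
            f"Prompt: {user_prompt}\nResponse: {response}\nCritique: {critique}\n"
            "Revision request: rewrite the response to address the critique."
        )
    return response  # revised responses become supervised fine-tuning targets

print(critique_and_revise("How do I pick a lock?"))
```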
Llama 2: Open Foundation and Fine-Tuned Chat Models
Meta AI
Details the development of Llama 2, a collection of pre-trained and fine-tuned large language models ranging from 7B to 70B parameters, setting a new standard for open models.
Training Language Models to Follow Instructions with Human Feedback
OpenAI
The seminal InstructGPT paper that demonstrated how fine-tuning with reinforcement learning from human feedback (RLHF) significantly improves the alignment of language models with user intent.
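As a rough illustration, the reward model at the heart of this pipeline is trained on pairwise human comparisons with a loss of the form below; the function and tensor names are illustrative, not OpenAI's implementation.

```python
# Pairwise reward-model loss used in RLHF: the labeler-preferred response should outscore the other.
import torch
import torch.nn.functional as F

def reward_model_loss(score_chosen: torch.Tensor, score_rejected: torch.Tensor) -> torch.Tensor:
    # -log sigmoid(margin); minimized when the chosen response scores higher than the rejected one.
    return -F.logsigmoid(score_chosen - score_rejected).mean()

# Dummy scalar scores for a batch of three comparisons:
chosen = torch.tensor([1.2, 0.4, 0.9])
rejected = torch.tensor([0.3, 0.5, -0.2])
print(reward_model_loss(chosen, rejected))
```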
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Stanford University
Proposes DPO, a stable and efficient alternative to RLHF that optimizes the language model directly to satisfy human preferences without training a separate reward model.
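A minimal sketch of the DPO objective, assuming per-response summed log-probabilities from the policy and a frozen reference model are already available; the variable names are illustrative rather than taken from the authors' code.

```python
# DPO loss: the policy's log-probabilities, contrasted with a frozen reference model,
# act as an implicit reward, so no separate reward model is trained.
import torch
import torch.nn.functional as F

def dpo_loss(policy_logp_chosen, policy_logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta: float = 0.1):
    # Implicit rewards: beta * (log pi_theta - log pi_ref) for each response.
    chosen_rewards = beta * (policy_logp_chosen - ref_logp_chosen)
    rejected_rewards = beta * (policy_logp_rejected - ref_logp_rejected)
    # Binary preference loss: push the chosen response's implicit reward above the rejected one's.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Dummy summed log-probabilities for a batch of two preference pairs:
print(dpo_loss(torch.tensor([-10.0, -12.0]), torch.tensor([-11.0, -11.5]),
               torch.tensor([-10.5, -12.2]), torch.tensor([-10.8, -11.4])))
```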
Scaling Laws for Neural Language Models
OpenAI
Empirical analysis demonstrating that model performance scales as a power law in model size, dataset size, and training compute, providing a roadmap for the development of large models.
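For a sense of what these laws look like, the paper fits single-variable power laws of roughly the following shape (sketched here in LaTeX; the critical scales and exponents are empirically fitted constants, and each relation holds when the other two resources are not the bottleneck):

```latex
% Power-law form of test loss L versus parameters N, dataset size D, and compute C.
L(N) \approx \left(\frac{N_c}{N}\right)^{\alpha_N}, \qquad
L(D) \approx \left(\frac{D_c}{D}\right)^{\alpha_D}, \qquad
L(C) \approx \left(\frac{C_c}{C}\right)^{\alpha_C}
```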
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Microsoft Research
An investigation into the capabilities of an early version of GPT-4, arguing that it exhibits more general intelligence than previous models across a wide range of tasks.
Generative Agents: Interactive Simulacra of Human Behavior
Stanford & Google
Demonstrates how generative agents can simulate believable human behavior in an interactive sandbox environment, opening new avenues for simulation and social science.
Q-Transformer: Scalable Offline Reinforcement Learning
Google DeepMind
Presents a scalable method for offline reinforcement learning using Transformer architectures, enabling effective policy learning from large, diverse datasets.