[NEW REQUEST] RLHF #84

@nerdai

Description

Book

nlp

Pocket Reference Title

RLHF

Proposed Content

A pocket reference on aligning LLMs with human preferences using RLHF (reinforcement learning from human feedback)

Rationale

RLHF is one of the most popular alignment methods for LLMs

Content Types

  • Theoretical foundations
  • Mathematical formulations
  • Code examples
  • Diagrams/visualizations
  • Practical applications
  • Common pitfalls/challenges

Additional Resources

Metadata

Type

No type

Projects

Status

In Progress

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests
