Open
Labels
Description
Book
nlp
Pocket Reference Title
RLHF
Proposed Content
A pocket reference on aligning LLMs with human preferences using reinforcement learning from human feedback (RLHF)
Rationale
RLHF is one of the most widely used alignment methods for LLMs
Content Types
- Theoretical foundations
- Mathematical formulations
- Code examples
- Diagrams/visualizations
- Practical applications
- Common pitfalls/challenges
Additional Resources
Metadata
Status
In Progress