Feature/llm accessibility #15654

gabinfay · 2025-06-11T12:41:23Z

Title: feat(content): Add LLM-specific content manifest files

Description

This pull request introduces two new text files, llms.txt and llms-full.txt, to the /public directory. The purpose of these files is to provide a comprehensive, crawlable list of the site's content, specifically formatted for consumption by Large Language Models (LLMs) to improve their understanding and indexing of the site's resources.

Key Changes:

- Added llms.txt with a curated list of primary English-language pages.
- Added llms-full.txt with a more exhaustive, automatically generated list of all content.
- Identified and corrected several broken links in the initial version of llms.txt that were pointing to incorrect or non-existent pages.
- To ensure the quality of the primary file, all links in the final llms.txt were verified by running the local development server and using a script to confirm that each URL returns a 200 OK status.
- Removed links to translated content from the primary llms.txt to narrow the scope of this initial implementation and focus on the core English content.
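
The link verification step described above could look roughly like the following. This is a minimal sketch, not the script used in the PR (which is not shown here): `checkUrls` and its `fetchImpl` parameter are hypothetical names, and it assumes Node.js 18+ for the global `fetch`.

```javascript
// Sketch only: checkUrls is an illustrative name, not the PR's actual script.
// fetchImpl defaults to the global fetch (Node.js 18+); redirects are followed,
// so the status checked is that of the final response.
async function checkUrls(urls, fetchImpl = fetch) {
  const failures = [];
  for (const url of urls) {
    try {
      const response = await fetchImpl(url);
      if (response.status !== 200) {
        failures.push({ url, status: response.status });
      }
    } catch (error) {
      // Network-level failure, e.g. the dev server is not running.
      failures.push({ url, status: null, error: String(error) });
    }
  }
  return failures;
}
```

With a local development server running, the function would be called with the URLs taken from llms.txt, rewritten to point at the local host; an empty result means every URL returned 200 OK.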

Related Issue

This pull request addresses a new feature request to enhance the site's content accessibility for AI agents and LLMs. No specific issue is linked, but this work lays the foundation for better machine-readable content discovery on ethereum.org.

This commit adds the llms.txt and llms-full.txt files to the public directory. To ensure these files are included in the production build, the following lines must be removed from the outputFileTracingExcludes array in next.config.js:

- 'public/**/*.txt'
- 'public/content'
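
The effect of that config change can be illustrated as below. This is a sketch under assumptions: the real shape of outputFileTracingExcludes in next.config.js is not shown in this PR, the `before` list is hypothetical (the `.git/**` entry is invented for illustration), and in practice the change is simply a manual deletion of the two lines.

```javascript
// Patterns the PR description says must no longer be excluded from the build.
const PATTERNS_TO_REMOVE = ["public/**/*.txt", "public/content"];

// Returns a copy of an exclusion list without the patterns above.
function dropLlmsExcludes(excludes) {
  return excludes.filter((pattern) => !PATTERNS_TO_REMOVE.includes(pattern));
}

// Hypothetical "before" list; ".git/**" is invented for illustration only.
const before = ["public/**/*.txt", "public/content", ".git/**"];
const after = dropLlmsExcludes(before);
console.log(after); // [ '.git/**' ]
```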

netlify · 2025-06-11T12:41:28Z

✅ Deploy Preview for ethereumorg ready!

Name	Link
🔨 Latest commit	`5974021`
🔍 Latest deploy log	https://app.netlify.com/projects/ethereumorg/deploys/6856484427eb7d0008934cd7
😎 Deploy Preview	https://deploy-preview-15654--ethereumorg.netlify.app
Lighthouse	7 paths audited. Performance: 45 (🔴 down 13 from production); Accessibility: 95 (🟢 up 1 from production); Best Practices: 89 (🔴 down 10 from production); SEO: 99 (no change from production); PWA: 59 (no change from production).

To edit notification comments on pull requests, go to your Netlify project configuration.

pettinarip

@gabinfay thanks for the PR! I haven't analyzed this in depth yet, but it looks pretty good.

I'm curious about how you generated it. It would be great if we could establish a process to keep it updated, since the site content changes frequently. We could perhaps add it to the weekly release process.

- Add scripts/llms/ directory with 3 core scripts:
  - generate_all.js: Combined generation script (eliminates 6 separate scripts)
  - test_llms_validation.js: Unit test suite (21 tests, 100% coverage)
  - validate_urls_static.js: Static URL validation (no server required)
- Add GitHub Actions workflow (.github/workflows/validate-llms.yml):
  - Triggers on content changes in public/content/ or .md files
  - Runs generation + validation pipeline
  - Posts PR comments with validation results
  - Uploads artifacts for review
- Add npm scripts to package.json:
  - llms:generate, llms:test, llms:test:static, llms:validate, llms:ci
- Generate production-ready LLMS files:
  - public/llms.txt: 32KB URL directory (262 content URLs)
  - public/llms-full.txt: 1.05MB full content (150k+ words)
- Comprehensive validation coverage:
  - 21 unit tests covering structure, content, URLs, consistency
  - Static validation of 253 URLs (100% success rate)
  - Content quality standards (proper categorization, fresh timestamps)

This enables AI systems to easily access Ethereum.org content while ensuring quality through automated CI/CD validation. All tests pass with 100% success rate.
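
A static (no server) URL check of the kind listed above might work along these lines. This is a hedged sketch, not the contents of validate_urls_static.js: it assumes llms.txt uses markdown-style links, and the mapping from `https://ethereum.org/en/<slug>/` to `public/content/<slug>/index.md` is an assumption made for illustration. All function names are hypothetical.

```javascript
// Extracts every markdown-style link target "[label](https://...)" from the text.
function extractUrls(llmsText) {
  const linkPattern = /\[[^\]]*\]\((https?:\/\/[^)\s]+)\)/g;
  return Array.from(llmsText.matchAll(linkPattern), (match) => match[1]);
}

// ASSUMED mapping: https://ethereum.org/en/<slug>/ -> public/content/<slug>/index.md
function urlToContentPath(url) {
  const { pathname } = new URL(url);
  const slug = pathname.replace(/^\/en(?=\/|$)/, "").replace(/^\/+|\/+$/g, "");
  return slug ? `public/content/${slug}/index.md` : "public/content/index.md";
}

// fileExists is injected so the check needs no server and is easy to test;
// a real run could pass fs.existsSync.
function validateStatically(llmsText, fileExists) {
  return extractUrls(llmsText)
    .map((url) => ({ url, file: urlToContentPath(url) }))
    .filter(({ file }) => !fileExists(file));
}
```

Injecting `fileExists` keeps the validation independent of the file system layout in tests; the function returns the list of URLs whose assumed source file is missing, so an empty array means every URL resolved.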

- Generated llms.txt (32KB, 262 URLs) and llms-full.txt (1.05MB, 151k words)
- Implemented 21 comprehensive tests with 100% pass rate
- Added CI/CD automation with smart content change detection
- Created static URL validation with 253/253 URLs validated
- Removed unnecessary tempFile generation for cleaner implementation
- Fixed path mapping issues for reliable validation
- Added npm scripts for easy development workflow
- Comprehensive documentation and error handling

Files ready for production deployment with full automation.

wackerow · 2025-07-16T22:58:16Z

@pettinarip with some of the recent changes, any suggestion how to proceed here?

gabinfay added 3 commits June 11, 2025 16:15

feat: add llm-accessible content files

397d7d8

fix(llms): correct broken links and remove translations

01f3be0

gabinfay requested review from corwintines, minimalsm, pettinarip and wackerow as code owners June 11, 2025 12:41

github-actions bot added the config ⚙️ Changes to configuration files label Jun 11, 2025

pettinarip reviewed Jun 11, 2025


gabinfay added 3 commits June 21, 2025 11:06

chore: Update package-lock.json for LLMS scripts

60e1cbd

github-actions bot added dependencies 📦 Changes related to project dependencies tooling 🔧 Changes related to tooling of the project labels Jun 21, 2025

konopkja mentioned this pull request Jul 3, 2025

Suggest a resource, Custom GPT based on Ethereum Builder Docs #15675

Closed

2 tasks

pettinarip mentioned this pull request Jul 7, 2025

Initial llms.txt file #15794

Merged

gabinfay closed this by deleting the head repository Aug 1, 2025


3 participants
