Discrepancy between CLI behavior (v0.6.3) and GitHub documentation for crwl command #1283
Unanswered
Sheld0n-Cooper
asked this question in
Forums - Q&A
Replies: 1 comment
Dear Crawl4AI Developer,
I've encountered some discrepancies between the crwl command-line interface (CLI) behavior of the installed Crawl4AI version and the examples provided in your main GitHub repository documentation.
Context:
Observed Discrepancies:
CLI Options (--deep-crawl, --max-pages vs. --crawler):
The GitHub documentation (e.g., ... --max-pages 10) suggests the use of --deep-crawl and
--max-pages for configuring the crawler.
However, when running the crwl executable from the venv (e.g., venv/bin/crwl crawl ...),
the command consistently returns the error:
Error: No such option: --deep-crawl Did you mean --crawler?
The help output of the installed CLI does not list --deep-crawl or --max-pages. Instead,
it indicates that crawler parameters should be passed via -c, --crawler TEXT (e.g., -c
strategy=bfs,max_pages=X). This latter syntax is the one I found to work in version 0.6.3.
Required Subcommand (crawl):
The documentation examples (e.g., ... https://www.nbcnews.com/business -o markdown) show
crwl being used directly with the URL and options, without a subcommand.
The installed 0.6.3 CLI, however, requires the crawl subcommand (i.e., crwl crawl [URL] ...).
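For scripts that have to work with either syntax, one option is to read the help text of the installed crwl and build the command from what it advertises. The following is only a minimal sketch under stated assumptions: `build_crwl_command` and `read_help_text` are hypothetical helper names, the value `bfs` for `--deep-crawl` is assumed from the `strategy=bfs` form mentioned above, and the detection is a plain substring check on the help output, not something the Crawl4AI project documents.

```python
import subprocess


def read_help_text(executable="crwl", subcommand="crawl"):
    """Return the help output of the installed CLI (stdout plus stderr)."""
    argv = [executable] + ([subcommand] if subcommand else []) + ["--help"]
    proc = subprocess.run(argv, capture_output=True, text=True, check=False)
    return proc.stdout + proc.stderr


def build_crwl_command(url, help_text, max_pages=10, strategy="bfs",
                       subcommand="crawl"):
    """Build an argv list using whichever option syntax the help text lists.

    Assumption: the documented form is `--deep-crawl <strategy> --max-pages N`
    and the 0.6.3 form is `-c strategy=<strategy>,max_pages=N`.
    """
    argv = ["crwl"]
    if subcommand:
        # 0.6.3 was observed to require the `crawl` subcommand.
        argv.append(subcommand)
    argv.append(url)

    if "--deep-crawl" in help_text and "--max-pages" in help_text:
        argv += ["--deep-crawl", strategy, "--max-pages", str(max_pages)]
    elif "--crawler" in help_text:
        argv += ["-c", f"strategy={strategy},max_pages={max_pages}"]
    else:
        raise ValueError("help text lists neither --deep-crawl nor --crawler")
    return argv
```

A caller would pass `read_help_text()` as `help_text` and hand the returned list to `subprocess.run`. Passing `subcommand=None` produces the subcommand-free form shown in the documentation.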
Questions for Clarification:
Could you please clarify why the CLI behavior of the 0.6.3 version installed via pip
differs from the command examples provided in the main GitHub documentation?
What is the correct equivalent of the --deep-crawl and --max-pages options in version 0.6.3?
For building interactive crawling workflows, it is crucial to programmatically obtain
a structured list of all discovered URLs during a crawl (not just by parsing the
generated Markdown output). Is there a recommended way or a specific crwl option to
get this list directly in a machine-readable format (e.g., JSON)?
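To make the third question concrete, this is roughly the output I am after. The sketch below is not a crwl feature. It assumes I fall back to the Python API and that each crawl result exposes a `url` attribute and a `links` dict with `internal` and `external` lists of `{"href": ...}` entries, which I have not verified against 0.6.3. `collect_discovered_urls` and `urls_to_json` are hypothetical helper names.

```python
import json
from urllib.parse import urldefrag


def collect_discovered_urls(results):
    """Return unique URLs, in first-seen order, from result-like objects.

    Assumption: each result has `.url` (str) and `.links`, a dict whose
    "internal" and "external" values are lists of {"href": ...} dicts.
    """
    seen = set()
    ordered = []

    def add(url):
        if not url:
            return
        clean = urldefrag(url).url  # drop "#fragment" so anchors do not duplicate pages
        if clean and clean not in seen:
            seen.add(clean)
            ordered.append(clean)

    for result in results:
        add(getattr(result, "url", None))
        links = getattr(result, "links", None) or {}
        for group in ("internal", "external"):
            for link in links.get(group, []):
                add(link.get("href") if isinstance(link, dict) else link)
    return ordered


def urls_to_json(results):
    """Serialize the discovered URLs as a JSON document."""
    return json.dumps({"urls": collect_discovered_urls(results)}, indent=2)
```

A built-in option that emits something equivalent to `urls_to_json` directly from the CLI would remove the need for this kind of post-processing.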
Impact:
These discrepancies lead to confusion for users attempting to follow the documented
examples and require trial-and-error to determine the correct command syntax for a given
installed version.
Suggestion:
Updating the GitHub documentation to accurately reflect the CLI behavior of the
currently released pip versions would be highly beneficial for users.
Thank you for your time and assistance.