- Feature Name: cargo-script
- Start Date: 2023-04-26
- Pre-RFC: internals
- eRFC PR: rust-lang/rfcs#3424
- Tracking Issue: rust-lang/cargo#12207
Summary
This experimental RFC adds unstable support for single-file packages in cargo so we can explore the design and resolve questions with an implementation to collect feedback on.
Single-file packages are .rs files with an embedded
manifest. These will be accepted just like Cargo.toml files with
--manifest-path. cargo will be modified to accept cargo <file>.rs as a
shortcut to cargo run --manifest-path <file>.rs. This allows placing
cargo in a #! line for directly running these files.
Example:
#!/usr/bin/env cargo
//! ```cargo
//! [dependencies]
//! clap = { version = "4.2", features = ["derive"] }
//! ```
use clap::Parser;
#[derive(Parser, Debug)]
#[clap(version)]
struct Args {
    #[clap(short, long, help = "Path to config")]
    config: Option<std::path::PathBuf>,
}
fn main() {
    let args = Args::parse();
    println!("{:?}", args);
}
$ ./prog --config file.toml
Args { config: Some("file.toml") }
See cargo-script-mvs for a demo.
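As a rough illustration of the `cargo <file>.rs` shortcut described in this summary, the rewrite into the longer `cargo run --manifest-path <file>.rs` form could look like the sketch below. The `expand_shortcut` helper is hypothetical, and detecting a single-file package by its `.rs` extension and separating script arguments with `--` are both assumptions made for the example, not decisions of this eRFC.

```rust
use std::path::Path;

/// Hypothetical helper: if the first argument after `cargo` looks like a
/// single-file package (assumed here: a path ending in `.rs`), rewrite the
/// invocation to the longer `run --manifest-path` form.
fn expand_shortcut(args: &[String]) -> Vec<String> {
    match args.first() {
        Some(first) if Path::new(first).extension().map_or(false, |ext| ext == "rs") => {
            let mut expanded = vec![
                "run".to_string(),
                "--manifest-path".to_string(),
                first.clone(),
            ];
            if args.len() > 1 {
                // Assumption: everything after the file belongs to the script.
                expanded.push("--".to_string());
                expanded.extend(args[1..].iter().cloned());
            }
            expanded
        }
        // Anything else (built-in commands, aliases, ...) is left untouched.
        _ => args.to_vec(),
    }
}

fn main() {
    let args: Vec<String> = std::env::args().skip(1).collect();
    println!("cargo {}", expand_shortcut(&args).join(" "));
}
```

With a `#!/usr/bin/env cargo` line, the operating system passes the script path as the first argument, which is the case this sketch handles.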
Motivation
Collaboration:
When sharing reproduction cases, it is much easier when everything exists in a single code snippet to copy/paste. Alternatively, people will either leave off the manifest or underspecify the details of it.
This similarly makes it easier to share code samples with coworkers or in books / blogs when teaching.
Interoperability:
One angle to look at proposals is if there is a single obvious solution. While this isn't the case for single-file packages, there is enough of a subset of one. By standardizing that subset, we allow greater interoperability between solutions (e.g. playground could gain support). This would make it easier to collaborate.
Prototyping:
Currently, to prototype or experiment with APIs or the language, you need to either
- Use the playground
  - Can't access local resources
  - Limited in the crates supported
  - Note: there are alternatives to the playground that might have fewer restrictions but are either less well known or have additional complexities.
- Find a place to do cargo new, edit Cargo.toml and main.rs as necessary, and cargo run it, then delete it
  - This is a lot of extra steps, increasing the friction to trying things out
  - This will fail if you create it in a place that cargo will think should be a workspace member
By having a single-file package,
- It is easier to set up and tear down these experiments, making it more likely to happen
- All crates will be available
- Local resources are available
One-Off Utilities:
It is fairly trivial to drop a bunch of single-file bash or python scripts into a directory and add it to the path. Compare this to Rust, where you need to
- cargo new each of the "scripts" into individual directories
- Create wrappers for each so you can access it in your path, passing --manifest-path to cargo run
Guide-level explanation
As an eRFC, this is meant to convey what we are looking to accomplish. Many of the details may change before stabilization.
Creating a New Package
(Adapted from the cargo book)
To start a new package with Cargo, create a file named hello_world.rs:
#!/usr/bin/env cargo
fn main() {
    println!("Hello, world!");
}
Let's run it
$ chmod +x hello_world.rs
$ ./hello_world.rs
Hello, world!
Dependencies
(Adapted from the cargo book)
crates.io is the Rust community's central package registry
that serves as a location to discover and download
packages. cargo is configured to use it by default to find
requested packages.
Adding a dependency
To depend on a library hosted on crates.io, you modify hello_world.rs:
#!/usr/bin/env cargo
//! ```cargo
//! [dependencies]
//! regex = "1.8.0"
//! ```
use regex::Regex;
fn main() {
    let re = Regex::new(r"^\d{4}-\d{2}-\d{2}$").unwrap();
    println!("Did our date match? {}", re.is_match("2014-01-01"));
}
The cargo section in the doc-comment (any module inner doc-comment style is supported) is
called a manifest, and it contains all of the metadata
that Cargo needs to compile your package. This is written in the TOML format
(pronounced /tɑməl/).
regex = "1.8.0" is the name of the crate and a SemVer version
requirement. The specifying
dependencies docs have more
information about the options you have here.
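To make the embedded manifest more concrete, here is a minimal sketch of pulling the `cargo` code fence out of a file's doc comments. It is an illustration only: `extract_manifest` is a hypothetical helper, it handles only `//!` line comments (not every inner doc-comment style), and it is not how cargo itself is or will be implemented.

```rust
/// Collects the body of the first ```cargo fence found in `//!` doc comments.
/// Returns `None` when there is no embedded manifest or the fence is never closed.
fn extract_manifest(source: &str) -> Option<String> {
    let mut manifest = String::new();
    let mut in_fence = false;
    for line in source.lines() {
        // Only inner line doc comments are considered in this sketch.
        let Some(doc) = line.trim_start().strip_prefix("//!") else {
            continue;
        };
        // Drop the single space that conventionally follows `//!`.
        let doc = doc.strip_prefix(' ').unwrap_or(doc);
        if !in_fence {
            in_fence = doc.trim_end() == "```cargo";
        } else if doc.trim_end() == "```" {
            return Some(manifest);
        } else {
            manifest.push_str(doc);
            manifest.push('\n');
        }
    }
    None
}

fn main() {
    let source = "#!/usr/bin/env cargo\n//! ```cargo\n//! [dependencies]\n//! regex = \"1.8.0\"\n//! ```\nfn main() {}\n";
    match extract_manifest(source) {
        Some(manifest) => print!("{manifest}"),
        None => println!("no embedded manifest"),
    }
}
```

The extracted text is ordinary TOML, so it could then be handed to whatever parser reads a regular Cargo.toml.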
You can then re-run this and Cargo will fetch the new dependencies and all of
their dependencies. You can see this by passing in --verbose:
$ cargo --verbose ./hello_world.rs
Updating crates.io index
Downloading memchr v0.1.5
Downloading libc v0.1.10
Downloading regex-syntax v0.2.1
Downloading memchr v0.1.5
Downloading aho-corasick v0.3.0
Downloading regex v0.1.41
Compiling memchr v0.1.5
Compiling libc v0.1.10
Compiling regex-syntax v0.2.1
Compiling memchr v0.1.5
Compiling aho-corasick v0.3.0
Compiling regex v0.1.41
Compiling hello_world v0.1.0 (file:///path/to/package/hello_world)
Did our date match? true
Cargo will cache the exact information (in a location referred to as
CARGO_HOME) about which revision of all of these dependencies we used.
Now, if regex gets updated, we will still build with the same revision until
we choose to cargo update --manifest-path hello_world.rs.
Package Layout
(Adapted from the cargo book)
When a single file is not enough, you can separately define a Cargo.toml file along with the src/main.rs file. Run
$ cargo new hello_world --bin
We’re passing --bin because we’re making a binary program: if we
were making a library, we’d pass --lib. This also initializes a new git
repository by default. If you don't want it to do that, pass --vcs none.
Let’s check out what Cargo has generated for us:
$ cd hello_world
$ tree .
.
├── Cargo.toml
└── src
    └── main.rs
1 directory, 2 files
Unlike the hello_world.rs, a little more context is needed in Cargo.toml:
[package]
name = "hello_world"
version = "0.1.0"
edition = "2021"
[dependencies]
Cargo uses conventions for file placement to make it easy to dive into a new Cargo package:
.
├── Cargo.lock
├── Cargo.toml
├── src/
│   ├── lib.rs
│   ├── main.rs
│   └── bin/
│       ├── named-executable.rs
│       ├── another-executable.rs
│       └── multi-file-executable/
│           ├── main.rs
│           └── some_module.rs
├── benches/
│   ├── large-input.rs
│   └── multi-file-bench/
│       ├── main.rs
│       └── bench_module.rs
├── examples/
│   ├── simple.rs
│   └── multi-file-example/
│       ├── main.rs
│       └── ex_module.rs
└── tests/
    ├── some-integration-tests.rs
    └── multi-file-test/
        ├── main.rs
        └── test_module.rs
- Cargo.toml and Cargo.lock are stored in the root of your package (package root).
- Source code goes in the src directory.
- The default library file is src/lib.rs.
- The default executable file is src/main.rs.
  - Other executables can be placed in src/bin/.
- Benchmarks go in the benches directory.
- Examples go in the examples directory.
- Integration tests go in the tests directory.
If a binary, example, bench, or integration test consists of multiple source
files, place a main.rs file along with the extra modules
within a subdirectory of the src/bin, examples, benches, or tests
directory. The name of the executable will be the directory name.
You can learn more about Rust's module system in the book.
See Configuring a target for more details on manually configuring targets. See Target auto-discovery for more information on controlling how Cargo automatically infers target names.
Reference-level explanation
The details will be deferred to the implementation.
Initial guidelines for evaluating decisions:
- Single-file packages should have a first-class experience
  - Provides a higher quality of experience (doesn't feel like a hack or tacked on)
  - Transferable knowledge, whether experience, stackoverflow answers, etc
  - Easier unassisted migration between single-file and multi-file packages
  - The more the workflows deviate, the higher the maintenance and support costs for the cargo team
  - Example implications:
    - Workflows, like running tests, should be the same as multi-file packages rather than being bifurcated
    - Manifest formats should be the same rather than using a specialized schema or data format
- Friction for starting a new single-file package should be minimal
  - Easy to remember, minimal syntax so people are more likely to use it in one-off cases, experimental or prototyping use cases without tool assistance
  - Example implications:
    - Embedded manifest is optional, which also means we can't require users to specify edition
    - See also the implications for first-class experience
    - Workspaces for single-file packages should not be auto-discovered, as that will break unless the workspace also owns the single-file package, which in turn breaks the workflow of just creating a file anywhere to try out an idea.
- Cargo/rustc diagnostics and messages (including cargo metadata) should be in terms of single-file packages and not any temporary files
  - Easier to understand the messages
  - Provides a higher quality of experience (doesn't feel like a hack or tacked on)
  - Example implications:
    - Most likely, we'll need single-file packages to be understood directly by rustc so cargo doesn't have to split out the .rs content into a temp file that gets passed to rustc, which would cause errors to point to the wrong file
    - Most likely, we'll want to muck with the errors returned by toml_edit so we render manifest errors based on the original source code, which will require accurate span information.
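To illustrate why accurate span information matters for manifest errors, the sketch below maps a position inside the extracted manifest back to a position in the original `.rs` file. Everything here is an assumption made for illustration: `LineOrigin`, `manifest_origins`, and `to_source_position` are hypothetical names, only `//!` comments starting at the beginning of a line are handled, columns are counted in bytes, and no `toml_edit` API is involved.

```rust
/// Where one line of the embedded manifest lives in the `.rs` file.
#[derive(Debug, PartialEq)]
struct LineOrigin {
    source_line: usize,   // 1-based line number in the `.rs` file
    source_column: usize, // 1-based byte column where the manifest text starts
}

/// Hypothetical helper: records the origin of every manifest line inside the
/// first ```cargo fence of `//!` doc comments.
fn manifest_origins(source: &str) -> Vec<LineOrigin> {
    let mut origins = Vec::new();
    let mut in_fence = false;
    for (index, line) in source.lines().enumerate() {
        let Some(doc) = line.strip_prefix("//!") else {
            continue;
        };
        // Account for the optional space after `//!` when computing columns.
        let (doc, prefix_len) = match doc.strip_prefix(' ') {
            Some(rest) => (rest, 4),
            None => (doc, 3),
        };
        if !in_fence {
            in_fence = doc.trim_end() == "```cargo";
        } else if doc.trim_end() == "```" {
            break;
        } else {
            origins.push(LineOrigin {
                source_line: index + 1,
                source_column: prefix_len + 1,
            });
        }
    }
    origins
}

/// Translates a 1-based (line, column) position inside the extracted manifest
/// into a (line, column) position in the original source file.
fn to_source_position(origins: &[LineOrigin], line: usize, column: usize) -> Option<(usize, usize)> {
    let origin = origins.get(line.checked_sub(1)?)?;
    Some((origin.source_line, origin.source_column + column - 1))
}

fn main() {
    let source = "#!/usr/bin/env cargo\n//! ```cargo\n//! [dependencies]\n//! regex = \"1.8.0\"\n//! ```\nfn main() {}\n";
    let origins = manifest_origins(source);
    // An error reported at manifest line 2, column 9 (the version string).
    println!("{:?}", to_source_position(&origins, 2, 9));
}
```

A diagnostic reported against the manifest alone would point at line 2, while the user needs to see line 4 of the file they actually wrote.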
Drawbacks
This will likely permeate cargo's code base. While we are fairly positive this has a path to stabilization and it won't extend out for too long, we will be paying for that cost with little benefit until then.
Then when this is stabilized, this increases the surface area of cargo for the cargo team to maintain and support.
We will not be reserving syntax for build.rs, [lib]
support, proc-macros, or other functionality to be added later
with the assumption that if these features are needed, a user
should be using a multi-file package.
Rationale and alternatives
Scope
The cargo-script family of tools has a single command
- Run .rs files with embedded manifests
- Evaluate command-line arguments (--expr, --loop)
This behavior (minus embedded manifests) mirrors what you might expect from a scripting environment, minus a REPL. We could design this with the future possibility of a REPL.
However
- The needs of .rs files and REPL / CLI args are different, e.g. where they get their dependency definitions
- A REPL is a much larger problem, needing to pull in a lot of interactive behavior that is unrelated to .rs files
- A REPL for Rust is a much more nebulous future possibility, making it premature to design with it in mind
Therefore, this eRFC is limited in scope to running single-file rust packages.
First vs Third Party
As mentioned, a reason for being first-party is to standardize the convention for this which also allows greater interop.
A default implementation ensures people will use it. For example, clap
received an issue with a reproduction case using a cargo-play script that
went unused because it just wasn't worth installing yet another, unknown tool.
This also improves the overall experience as you do not need the third-party command to replicate support for every potential feature including:
- cargo test and other built-in cargo commands
- cargo expand and other third-party cargo commands
- rust-analyzer and other editor/IDE integration
While other third-party cargo commands might not immediately adopt single-file packages, first-party support for them will help encourage their adoption.
This still leaves room for third-party implementations, either differentiating themselves or experimenting with
- Alternative caching mechanisms for lower overhead
- Support for implicit main, like doc-comment examples
- Template support for implicit main for customizing use, extern, #[feature], etc
- Short-hand dependency syntax (e.g. //# serde_json = "*")
- Prioritizing other workflows, like runtime performance
Prior art
See Single-file scripts that download their dependencies for a wide view of this space.
Existing Rust solutions:
- cargo-script
  - Single-file (.crs extension) rust code
    - Partial manifests in a cargo doc comment code fence or dependencies in a comment directive
    - run-cargo-script for shebangs and setting up file associations on Windows
  - Performance: Shares a CARGO_TARGET_DIR, reusing dependency builds
  - --expr <expr> for expressions as args (wraps in a block and prints the block's value as {:?})
    - --dep flags since directives don't work as easily
  - --loop <expr> for a closure to run on each line
  - --test, etc flags to make up for cargo not understanding these files
  - --force to rebuild and --clear-cache
  - Communicates with scripts through some env variables
- cargo-scripter
  - See above with 8 more commits
- cargo-eval
  - See above with a couple more commits
- rust-script
  - See above
  - Changed extension to .ers / .rs
  - Single binary without subcommands in primary case for ease of running
  - Implicit main support, including async main (different implementation than rustdoc)
  - --toolchain-version flag
- cargo-play
  - Allows multiple-file scripts, first specified is the main
  - Dependency syntax //# serde_json = "*"
  - Otherwise, seems like it has a subset of cargo-script's functionality
- cargo-wop
  - cargo wop is to single-file rust scripts as cargo is to multi-file rust projects
  - Dependency syntax is a doc comment code fence
Related Rust solutions:
- Playground
  - Includes top 100 crates
- Rust Explorer
  - Uses a comment syntax for specifying dependencies
- runner
  - Global Cargo.toml with dependencies added via runner --add <dep> and various commands / args to interact with the shared crate
  - Global, editable prelude / template
  - -e <expr> support
  - -i <expr> support for consuming and printing iterator values
  - -n <expr> runs per line
- evcxr
  - Umbrella project which includes a REPL and Jupyter kernel
  - Requires opting in to not ending on panics
  - Expressions starting with : are repl commands
  - Limitations on using references
- irust
  - Rust repl
  - Expressions starting with : are repl commands
  - Global, user-editable prelude crate
- papyrust
  - Not single file; just gives fast caching for a cargo package
D:
- dub
  - dub hello.d is shorthand for dub run --single hello.d
  - Regular nested block comment (not doc-comment) at top of file with dub.sdl: header
Java
- JEP 330: Launch Single-File Source-Code Programs
- jbang
  - jbang init w/ templates
  - jbang edit support, setting up a recommended editor w/ environment
  - Discourages #! and instead encourages looking like shell code with ///usr/bin/env jbang "$0" "$@" ; exit $?
  - Dependencies and compiler flags controlled via comment-directives, including
    - //DEPS info.picocli:picocli:4.5.0 (gradle-style locators)
      - Can declare one dependency as the source of versions for other dependencies (bom-pom)
    - //COMPILE_OPTIONS <flags>
    - //NATIVE_OPTIONS <flags>
    - //RUNTIME_OPTIONS <flags>
  - Can run code blocks from markdown
  - --code flag to execute code on the command-line
  - Accepts scripts from stdin
Haskell
- runghc / runhaskell
  - Users can use the file stem (i.e. leave off the extension) when passing it in
- cabal's single-file Haskell script
  - Command is just cabal, which could run into weird situations if a file has the same name as a subcommand
  - Manifest is put in a multi-line comment that starts with cabal:
  - Scripts are run with --quiet, regardless of which invocation is used
  - Documented in their "Getting Started" and then documented further under cabal run.
- stack script
  - stack acts as a shortcut for use in #!
  - Delegates resolver information but can be extended on the command-line
  - Command-line flags may be specified in a multi-line comment starting with stack script
Cross-language
- scriptisto
  - Supports any compiled language
  - Comment-directives give build commands
- nix-script
  - Nix version of scriptisto, letting you use any Nix dependency
Unresolved questions
Through the eRFC process, we particularly want to resolve:
What command should be used in #! lines?
- If cargo, what precedence does it have compared to built-in commands, aliases, and external commands.
- If something else, what to name it?
  - If cargo-<foo> how to deal with diverging behavior between cargo foo and cargo-foo since cargo foo won't play nice in a #! line across platforms
How to keep build-times down for the best exploratory experience?
- e.g. using a central CARGO_TARGET_DIR
- e.g. locking to similar dependencies across scripts for reusing more of the cache in CARGO_TARGET_DIR
How does the default RUST_BACKTRACE setting affect the use cases for single-file
packages, and is working around it worth it?
Whether single-file packages should be run within the
environment (.cargo/config.toml, rust-toolchain.toml) of
the current working directory (like cargo run) or a fixed
location like their own directory (more like cargo install)
How to embed the manifest within the file?
- How obvious it is for new users when they see it
- How easy it is for newer users to remember it and type it out
- How machine editable it is for cargo add and friends
- Needs to be valid Rust code based on the earlier stated design guidelines
- Lockfiles might also need to reuse how we attach metadata to the file
How do we handle the lockfile, balancing single-file package use case needs (single file, easy copy / paste, etc) with the expectations of Rust for reproducibility?
- Sharing of single-file projects should be easy
- In "copy/paste" scenarios, like reproduction cases in issues, how often have lockfiles been pertinent for reproduction?
- There is an expectation of a reproducible Rust experience
- Dropping of additional files might be frustrating for users to deal with (in addition to making it harder to share it all)
- We would need a way to store the lockfile for stdin without conflicting with parallel runs
- cargo already makes persisting of Cargo.lock optional for multi-file packages, encouraging not persisting it in some cases
- Newer users should feel comfortable reading and writing single-file packages
- A future possibility is allowing single-file packages to belong to a workspace at which point they would use the workspace's Cargo.lock file. This limits the scope of the conversation and allows an alternative to whatever is decided here.
- Read-only single-file packages (e.g. running /usr/bin/package.rs without root privileges)
How do we handle the package.edition field, balancing
single-file package use case needs (no boilerplate, modern
experience) with the expectations of Rust for reproducibility?
- Matching the expectation of a reproducible Rust experience
- Users wanting the latest experience, in general
- Boilerplate runs counter to experimentation and prototyping
- There might not be a backing file if we read from stdin
Smaller questions include:
- Should we support explicit stdin with -? Implicit stdin?
- Should we support workspaces as part of the initial MVP?
- Whether single-file packages need a distinct file extension or not?
- What, if any, file associations should be registered on Windows?
- As single-file packages aren't auto discovered (e.g. cargo test being short for cargo test --manifest-path Cargo.toml), is there a way we can make running cargo commands on single-file packages more convenient?
Potential answers to these questions were intentionally left out to help focus the conversation on the proposed experiment. For a previous enumeration of potential answers to these questions, see the Pre-RFC on Internals.
Future possibilities
Implicit main support
Like with doc-comment examples, we could support an implicit main.
Ideally, this would be supported at the language level
- Ensure a unified experience across the playground, rustdoc, and cargo
- cargo can directly run files rather than writing to intermediate files
  - This gets brittle with top-level statements like extern (more historical) or bin-level attributes
Behavior can be controlled through editions
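To show what a tool-level (rather than language-level) implicit main might involve, and why it gets brittle, here is a deliberately naive sketch. The `with_implicit_main` helper is hypothetical and uses a plain substring check instead of real parsing; it is not how rustdoc or any existing tool implements this.

```rust
/// Hypothetical helper: wraps a script body in `fn main` when it does not
/// already define one. A substring check stands in for real parsing.
fn with_implicit_main(source: &str) -> String {
    if source.contains("fn main") {
        return source.to_string();
    }
    let mut wrapped = String::from("fn main() {\n");
    for line in source.lines() {
        wrapped.push_str("    ");
        wrapped.push_str(line);
        wrapped.push('\n');
    }
    wrapped.push_str("}\n");
    wrapped
}

fn main() {
    print!("{}", with_implicit_main("println!(\"Hello, world!\");"));
}
```

A body that starts with a crate-level attribute such as `#![allow(unused)]` would end up inside `fn main` with this approach and fail to compile, which is the kind of brittleness that language-level support would avoid.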
A REPL
See the REPL exploration
In terms of the CLI side of this, we could name this cargo shell where it
drops you into an interactive shell within your current package, loading the
existing dependencies (including dev). This would then be a natural fit to also have a --eval <expr> flag.
Ideally, this repl would also allow the equivalent of python -i <file>, not
to run existing code but to make a specific file's API items available for use
to do interactive whitebox testing of private code within a larger project.