Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Check build target supports std when building with -Zbuild-std=std #14183

Open
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

harmou01
Copy link
Contributor

@harmou01 harmou01 commented Jul 3, 2024

What does this PR try to resolve?

Ensures that Cargo first verifies whether a given target supports building the standard library when the -Zbuild-std=std option is passed to Cargo (see issue here). This information can be obtained by querying rustc --print=target-spec-json. The target spec "metadata" contains an Option<bool> determining whether the target supports building std.

In the case this value is false, cargo will stop the build. This avoids the numerous "use of unstable library" errors, giving a cleaner, and simpler, "building std is not supported on this target".

How should we test and review this PR?

It can be manually tested by running cargo build --target <target> -Zbuild-std=std. If a target who's target-spec marks std as false is passed, cargo will exit. This works with multiple --target's, and if any of them don't support std, cargo will exit.

Additional Information

This change relies on two things:

  • The target-spec metadata in rustc needs to be filled out. Currently, most fields are None, which is treated as OK with this change. Meaning this can be merged before the rustc changes.
  • The new test case added with this change will fail without at least aarch64-unknown-none having it's target-spec metadata completed.
  • Some targets have std support marked as "?". If this state is properly represented in the target-spec in the future, it needs to be determined whether this is allowed, so the build can continue, or whether it fails.

@rustbot
Copy link
Collaborator

rustbot commented Jul 3, 2024

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @ehuss (or someone else) some time within the next two weeks.

Please see the contribution instructions for more information. Namely, in order to ensure the minimum review times lag, PR authors and assigned reviewers should ensure that the review label (S-waiting-on-review and S-waiting-on-author) stays updated, invoking these commands when appropriate:

  • @rustbot author: the review is finished, PR author should check the comments and take action accordingly
  • @rustbot review: the author is ready for a review, this PR will be queued again in the reviewer's queue

@rustbot rustbot added A-cfg-expr Area: Platform cfg expressions S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jul 3, 2024
@harmou01
Copy link
Contributor Author

harmou01 commented Jul 3, 2024

r? @ehuss

@rustbot
Copy link
Collaborator

rustbot commented Jul 3, 2024

Could not assign reviewer from: ehuss.
User(s) ehuss are either the PR author, already assigned, or on vacation, and there are no other candidates.
Use r? to specify someone else to assign.

@ehuss
Copy link
Contributor

ehuss commented Jul 3, 2024

Thanks for the PR! Unfortunately I'm not sure this is something we can move forward with in the short term. The --print target-spec-json option would need to be stabilized in order for this to be supported in build-std. I'm not sure how feasible that is. My impression is that it seems like that will be difficult, since some of the fields are somewhat inherently unstable.

There is some more information at rust-lang/rust#38338 and rust-lang/wg-cargo-std-aware#6.

I would suggest getting in contact with the compiler team to suss out the viability of that. There might be other alternatives, such as providing other rustc options for querying/extracting information like this.

@adamgemmell
Copy link

adamgemmell commented Jul 3, 2024

I didn't think that target-spec-json would need to be stable because Cargo and Rustc are versioned together - this creates a contract between the two, but one that can be changed, similar to the contract between Rustc and std (or stdarch as a closer example). Would a test in Rust that tests this integration alleviate your concerns? We only need to parse part of the spec.

Otherwise I doubt the spec can be stabilised, so I think we'd need some other mechanism of getting target info from Rustc. I think this might be useful for things like getting the default panic or unwind strategy too.

@ehuss
Copy link
Contributor

ehuss commented Jul 3, 2024

I didn't think that target-spec-json would need to be stable because Cargo and Rustc are versioned together - this creates a contract between the two, but one that can be changed, similar to the contract between Rustc and std (or stdarch as a closer example).

They aren't exactly tightly coupled with each other. Cargo is expected to work on the previous two versions of rustc (although build-std makes that complicated). Also, since cargo is developed in a separate repository, it makes it difficult to synchronize changes, since a change in rustc could immediately break development in cargo.


I think it might be possible to support this on an advisory basis, but not something we can strictly rely upon. It would likely need a different approach than what is taken in this PR, though. A few things to consider:

  • I'm uncomfortable with setting RUSTC_BOOTSTRAP in TargetInfo::new, since that could leak exposure of unstable options on stable (and generally uncomfortable setting it anywhere). I would suggest collecting the JSON information in a separate call to cached_output. That would also avoid needing to guess where the JSON boundaries are.
  • Ignore all errors from rustc, to handle the case where target-spec-json changes.
  • Ignore all errors from deserialization, to also handle the case if target-spec-json changes.

However, I'm reluctant to do that since I think it introduces risk of breaking cargo's CI and development (since cargo's testsuite could fail). I'm not sure how a test in rust-lang/rust would help, since if the compiler needs to make a change to the target-spec-json output or CLI option, there would be a problem where you can't update both rustc and cargo at the same time. That has been a fundamental problem that has prevented us from adding build-std tests to rust-lang/rust.

I would suggest opening a dialog with the compiler team to see if they have any input on how cargo and rustc can coordinate to obtain this information in a way that would be reliable.

@bors
Copy link
Contributor

bors commented Jul 5, 2024

☔ The latest upstream changes (presumably #14185) made this pull request unmergeable. Please resolve the merge conflicts.

@adamgemmell
Copy link

I appreciate the detailed answer, thanks. Agreed on the RUSTC_BOOTSTRAP=1 hack. On checking the target-spec-json tracking issue it does seem like there's some support for stabilising the print option which surprised me - I think I had it mixed up in my head with the idea of providing the JSON as an input target.

Considering the case where it was stabilised with all fields, current and future, optional. In the case of this PR, it would mean that Cargo handles the "not found" case for the Metadata::std field, probably by printing out a warning, but not error out or break Cargo's CI. Then we'd basically have to wait for a bug report from someone who got the warning and parse the new format while continuing to support the old one.

This seems not ideal, but might it be good enough? The alternative would be a stricter form of stabilisation that makes all current fields mandatory, any future fields optional and all enums non-exhaustive and strings have an undefined format.

@harmou01 harmou01 force-pushed the dev/harmou01/handle-target-build-std branch from d79de0d to db383e7 Compare October 14, 2024 13:09
Copy link
Member

@weihanglo weihanglo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Regardless of my review, Eric's concern in #14183 (comment) should be deeply considered.

src/cargo/core/compiler/build_context/target_info.rs Outdated Show resolved Hide resolved
src/cargo/core/compiler/standard_lib.rs Outdated Show resolved Hide resolved
tests/testsuite/standard_lib.rs Outdated Show resolved Hide resolved
tests/testsuite/freshness.rs Outdated Show resolved Hide resolved
@weihanglo weihanglo added the Z-build-std Nightly: build-std label Oct 15, 2024
@harmou01 harmou01 force-pushed the dev/harmou01/handle-target-build-std branch from db383e7 to e34950e Compare October 15, 2024 09:25
@harmou01
Copy link
Contributor Author

@weihanglo Regarding Eric's concerns about the use of the --print=target-spec-json and the possibility of it changing, @adamgemmell has an RFC open to add a --print=std-support flag, which gives us a simpler and more reliable method for determining std support. We wouldn't have to grab the entire target-spec-json anymore, and therefore wouldn't risk things changing between versions, because rustc would always give a true/false answer. I think this should address Eric's concerns.

The second rustc query wouldn't be needed either if this RFC is implemented, we could revert back to collecting everything with a single query.

rust-lang/rfcs#3693

@adamgemmell
Copy link

rust-lang/rfcs#3693

To elaborate on this we tried to make this patch improve things without waiting for the RFC's proposal to become stable by removing RUSTC_BOOTSTRAP and making Cargo tolerate all failure cases. If you'd feel more comfortable waiting for that solution then we're fine with that of course.

Running cargo with "-Zbuild-std=std" should check whether the build
target supports building the standard library. This information can be
obtained from rustc with the target-spec-json "--print" option. When
'std' is false for the build target, cargo should not attempt to build
the standard library.

This avoids the "use of unstable library" errors, as this check is
performed before Cargo starts trying to build restricted_std code.

Cargo will now emit a warning if the requested target does not support
building the standard library, or if there was an issue when collecting
the necessary information via rustc
Add a new test case to check cargo handles building a target which
doesn't support the standard library properly.
An additional rustc query is made in Cargo to collect the
target-spec-json from rustc to determine if the build target supports
compiling the standard library. As this functionality relies on a
nightly rustc, Cargo does not output any errors when parsing, and
continues as normal.

This commit adds a new test case to bad_cfg_discovery to ensure that
Cargo handles this properly. Unlike the other tests, there is no
expected output and an exit code 0.
@harmou01 harmou01 force-pushed the dev/harmou01/handle-target-build-std branch from e34950e to 37a326e Compare October 30, 2024 13:47
Comment on lines +61 to +67
#[derive(Deserialize)]
pub struct Metadata {
pub std: Option<bool>,
}

#[derive(Deserialize)]
pub struct TargetSpec {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No need to be pub.

Also, while not really a blocker, personally I may move these closer to where they are used, Like inside the gctx.cli_unstable().build_std.is_some() {} block.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

See also the suggestion in #14183 (comment)

// try to fetch this information if rustc allows nightly features. Additionally,
// to avoid making two rustc queries when not required, only try to fetch the
// target-spec when the '-Zbuild-std' option is passed.
if gctx.nightly_features_allowed && gctx.cli_unstable().build_std.is_some() {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You cannot set build_std = Some(…) when not on nightly, so it is redundant.

Suggested change
if gctx.nightly_features_allowed && gctx.cli_unstable().build_std.is_some() {
if gctx.cli_unstable().build_std.is_some() {

@@ -183,6 +184,11 @@ impl BuildStd for Execs {
self.arg("--target").arg(rustc_host());
self
}

fn target(&mut self, target: &str) -> &mut Self {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While this method looks quite convenient, I consider it as a refactor. There are a lot of places just calling cargo.run("build --target).arg("<triple>") now. I may put off this change a bit if it is only used by one place.

Comment on lines +77 to +84
for (target, target_info) in target_data.target_info() {
if target_info.supports_std == Some(false) {
anyhow::bail!(
"building std is not supported on this target: {:?}",
target.short_name()
);
}
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Instead of exposing fn target_info(), should we have a method say RustcTargetData::supports_std instead?

Suggested change
for (target, target_info) in target_data.target_info() {
if target_info.supports_std == Some(false) {
anyhow::bail!(
"building std is not supported on this target: {:?}",
target.short_name()
);
}
}
target_data.supports_std()?;

A perhaps could show all unsupported targets instead of the first one found.
(though I don't know if people really build multi-target std in one invocation)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is a good idea. I'll have a think about how to handle listing all unsupported targets.

Comment on lines +329 to +342

let target_spec_cached_output =
rustc.cached_output(&target_spec_process, extra_fingerprint);

if target_spec_cached_output.is_ok() {
let (stdout, _stderr) = target_spec_cached_output.unwrap();
match serde_json::from_str(&stdout) {
Result::Ok(val) => {
let target_spec: TargetSpec = val;
supports_std = target_spec.metadata.std;
}
Result::Err(_) => (),
}
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

rustc.cached_output caches the rustc output to target/.rustc_info.json file. I guess it is better not to cache it. A direct call to ProcessBuilder::output is probably good enough. Would be something like:

Suggested change
let target_spec_cached_output =
rustc.cached_output(&target_spec_process, extra_fingerprint);
if target_spec_cached_output.is_ok() {
let (stdout, _stderr) = target_spec_cached_output.unwrap();
match serde_json::from_str(&stdout) {
Result::Ok(val) => {
let target_spec: TargetSpec = val;
supports_std = target_spec.metadata.std;
}
Result::Err(_) => (),
}
}
#[derive(Deserialize)]
pub struct Metadata {
pub std: Option<bool>,
}
#[derive(Deserialize)]
pub struct TargetSpec {
pub metadata: Metadata,
}
if let Ok(output) = target_spec_process.output() {
if let Ok(spec) = serde_json::from_slice::<TargetSpec>(&output.stdout) {
supports_std = spec.metadata.std;
}
}

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As Eric has noted, target-spec-json is not going to be stabilized in a near feature, I would suggest not adding a new mode to this test. It is already complicated. The test_std_on_unsupported_target test should have covered the parsing end-to-end pretty much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-cfg-expr Area: Platform cfg expressions S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. Z-build-std Nightly: build-std
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants