How can I get bash to exit on backtick failure in a similar way to pipefail?

Question

So I like to harden my bash scripts wherever I can (and when not able to delegate to a language like Python/Ruby) to ensure errors do not go uncaught.

In that vein I have a strict.sh, which contains things like:

set -e
set -u
set -o pipefail

And source it in other scripts.
However, while pipefail would pick up:

false | echo it kept going | true

It will not pick up:

echo The output is '`false; echo something else`'

The output would be

The output is ''

False returns non-zero status, and no-stdout. In a pipe it would have failed, but here the error isn't caught. When this is actually a calculation stored in a variable for later, and the value is set to blank, this may then cause later problems.

So - is there a way to get bash to treat a non-zero returncode inside a backtick as reason enough to exit?

Ron Wolf · Answer

Solution
If you are running Bash 4.4 or later, you can use the shopt option inherit_errexit to do just that. You can check compatibility from within Bash using echo $BASH_VERSION.
Here is the shebang you would use if Bash 4.4 or later were installed and came before /bin in your $PATH:
#!/usr/bin/env -S bash -euET -o pipefail -O inherit_errexit

The -S is there to coax Linux’s env into accepting more than one argument for bash, as kindly pointed out by @UVV and explained further on StackOverflow.

Background
inherit_errexit is an option to shopt, while the rest of the arguments are options to set. In most modern iterations, they can be passed directly to bash when invoking the shell.
Let’s review the options you have already been using:

-u/-o nounset, as the name ambiguously hints, disallows dereferencing of variables that have not been set; e.g., $IJUSTMADETHISUP. This mostly helps guard against typos, or oversights when copy-pasting from Stack Overflow ?
-e/-o errexit does some of what you are requesting: if called directly†, shell commands with nonzero status codes will cause the shell to exit with an error. That is, everything preceding a line of shell script must “go right” in order for that line to execute.
-o pipefail is needed to extend this guarantee to commands whose output is redirected with an I/O pipe |.

† i.e., not from within a subshell (...)
Now for the options I’ve added:

-O inherit_errexit further extends this functionality (exiting on nonzero status code) to commands called from within subshells. This closes an important loophole, since subshells are used for command substitution $(...), `...` and process substitution <(...), >(...), and both are found in many shell scripts in the wild.‡
The -E/-o errtrace and -T/-o functrace options are there for the comparatively rare case that you use trap to perform an action when the shell receives a signal. These two options extend signal handlers to the inner bodies of shell functions for ERR signals and DEBUG/RETURN signals, respectively.

‡ This is just one reason to prefer the $(...) syntax for command substitution, since the parentheses make it explicit that you are entering a subshell. It also happens to be nestable.

See also

Doug Richardson: Fail Fast Bash Scripting
Unix & Linux Stack Exchange: Command substitution inside a function does not stop the script on a failure even if -e is set

errr · Answer

To exit on a command substitution failure you may explicitly set -e in a subshell, like this:

set -e
x=$(set -e; false; true)
echo "this will never be shown"

Will · Answer

As others have said, local will always return 0. The solution is to declare the variable first:

function testcase()
{
    local MYRESULT

MYRESULT=$(false)
    if (( $? != 0 )); then
        echo "False returned false!"
        return 1
    fi

return 0
}

Output:

$ testcase
False returned false!
$

mogsie · Answer

As the OP pointed out in his own answer, assigning the output of the subcommand to a variable does solve the problem; the $? is left unscathed.

However, one edge case can still puzzle you with false negatives (i.e. command fails but error doesn't bubble up), local variable declaration:

local myvar=$(subcommand) will always return 0!

bash(1) points this out:

local [option] [name[=value] ...]
          ... The return status is 0 unless local is used outside a function,
          an invalid name is supplied, or name is a readonly variable.

Here's a simple test case:

#!/bin/bash

function test1() {
  data1=$(false) # undeclared variable
  echo 'data1=$(false):' "$?"
  local data2=$(false) # declaring and assigning in one go
  echo 'local data2=$(false):' "$?"
  local data3
  data3=$(false) # assigning a declared variable
  echo 'local data3; data3=$(false):' "$?"
}

test1

The output:

data1=$(false): 1
local data2=$(false): 0
local data3; data3=$(false): 1

Gilles 'SO- stop being evil' · Answer

The exact language used in the Single UNIX specification to describe the meaning of set -e is:

When this option is on, if a simple command fails for any of the reasons listed in Consequences of Shell Errors or returns an exit status value >0, and is not [a conditional or negated command], then the shell shall immediately exit.

There is an ambiguity as to what happens when such a command occurs in a subshell. From a practical point of view, all the subshell can do is exit and return a nonzero status to the parent shell. Whether the parent shell will in turn exit depends on whether this nonzero status translates into a simple command failing in the parent shell.

One such problematic case is the one you encountered: a nonzero return status from a command substitution. Since this status is ignored, it does not cause the parent shell to exit. As you've already discovered, a way to take the exit status into account is to use the command substitution in a simple assignment: then the exit status of the assignment is the exit status of the last command substitution in the assignment(s).

Note that this will perform as intended only if there is a single command substitution, as only the last substitution's status is taken into account. For example, the following command is successful (both according to the standard and in every implementation I've seen):

a=$(false)$(echo foo)

Another case to watch for is explicit subshells: (somecommand). According to the interpretation above, the subshell may return a nonzero status, but since this is not a simple command in the parent shell, the parent shell should continue. In fact, all the shells I know of do make the parent return at this point. While this is useful in many cases such as (cd /some/dir && somecommand) where the parentheses are used to keep an operation such as a current directory change local, it violates the specification if set -e is turned off in the subshell, or if the subshell returns a nonzero status in a way that would not terminate it, such as using ! on a true command. For example, all of ash, bash, pdksh, ksh93 and zsh exit without displaying foo on the following examples:

set -e; (set +e; false); echo "This should be displayed"
set -e; (! true); echo "This should be displayed"

Yet no simple command has failed while set -e was in effect!

A third problematic case is elements in a nontrivial pipeline. In practice, all shells ignore failures of the elements of the pipeline other than the last one, and exhibit one of two behaviors regarding the last pipeline element:

ATT ksh and zsh, which execute the last element of the pipeline in the parent shell, do business as usual: if a simple command fails in the last element of the pipeline, the shell executing that command, which happens to be the parent shell, exits.
Other shells approximate the behavior by exiting if the last element of the pipeline returns a nonzero status.

Like before, turning off set -e or using a negation in the last element of the pipeline causes it to return a nonzero status in a way that should not terminate the shell; shells other than ATT ksh and zsh will then exit.

Bash's pipefail option causes a pipeline to exit immediately under set -e if any of its elements returns a nonzero status.

Note that as a further complication, bash turns off set -e in subshells unless it's in POSIX mode (set -o posix or have POSIXLY_CORRECT in the environment when bash starts).

All of this shows that the POSIX specification unfortunately does a poor job at specifying the -e option. Fortunately, existing shells are mostly consistent in their behavior.

ktf · Answer

Interesting point!

I've never stumbled across that, because I'm no friend of set -e (Instead I prefer to trap ... ERR) but already tested that: trap ... ERR also don't catch errors within $(...) (or the oldfashioned backticks).

I think the problem is (as so often) that here a subshell is called and -e explicitely means the current shell.

Only other solution that came to mind at this moment would be to use read:

ls -l ghost_under_bed | read name

This throws ERR and with -e the shell will be terminated. Only problem: this works only for commands with one line of output (or you pipe through something that joins lines).

Danny Staple · Answer

(Answering my own because I've found a solution)
One solution is to always assign this to an intermediate variable. 
This way the returncode ($?) is set.

So

ABC=`exit 1`
echo $?

Will output 1 (or instead exit if set -e is present), however:

echo `exit 1`
echo $?

Will output 0 after a blank line. The return code of the echo (or other command that ran with the backtick output) will replace the 0 return code.

I am still open to solutions that do not require the intermediate variable, but this gets me some of the way.

How can I get bash to exit on backtick failure in a similar way to pipefail?

7 Answers

Solution

Background

See also

Add your own answers!

Ask a Question