Unix & Linux Asked by Danny Staple on January 3, 2022
So I like to harden my bash scripts wherever I can (and when not able to delegate to a language like Python/Ruby) to ensure errors do not go uncaught.
In that vein I have a strict.sh, which contains things like:
set -e
set -u
set -o pipefail
And source it in other scripts.
However, while pipefail would pick up:
false | echo it kept going | true
It will not pick up:
echo The output is '`false; echo something else`'
The output would be
The output is ”
False returns non-zero status, and no-stdout. In a pipe it would have failed, but here the error isn’t caught. When this is actually a calculation stored in a variable for later, and the value is set to blank, this may then cause later problems.
So – is there a way to get bash to treat a non-zero returncode inside a backtick as reason enough to exit?
If you are running Bash 4.4 or later, you can use the shopt
option inherit_errexit
to do just that. You can check compatibility from within Bash using echo $BASH_VERSION
.
Here is the shebang you would use if Bash 4.4 or later were installed and came before /bin
in your $PATH
:
#!/usr/bin/env -S bash -euET -o pipefail -O inherit_errexit
The -S
is there to coax Linux’s env
into accepting more than one argument for bash
, as kindly pointed out by @UVV and explained further on StackOverflow.
inherit_errexit
is an option to shopt
, while the rest of the arguments are options to set
. In most modern iterations, they can be passed directly to bash
when invoking the shell.
Let’s review the options you have already been using:
-u
/-o nounset
, as the name ambiguously hints, disallows dereferencing of variables that have not been set; e.g., $IJUSTMADETHISUP
. This mostly helps guard against typos, or oversights when copy-pasting from Stack Overflow ?-e
/-o errexit
does some of what you are requesting: if called directly†, shell commands with nonzero status codes will cause the shell to exit with an error. That is, everything preceding a line of shell script must “go right” in order for that line to execute.-o pipefail
is needed to extend this guarantee to commands whose output is redirected with an I/O pipe |
.† i.e., not from within a subshell (...)
Now for the options I’ve added:
-O inherit_errexit
further extends this functionality (exiting on nonzero status code) to commands called from within subshells. This closes an important loophole, since subshells are used for command substitution $(...)
, `...`
and process substitution <(...)
, >(...)
, and both are found in many shell scripts in the wild.‡-E
/-o errtrace
and -T
/-o functrace
options are there for the comparatively rare case that you use trap
to perform an action when the shell receives a signal. These two options extend signal handlers to the inner bodies of shell functions for ERR
signals and DEBUG
/RETURN
signals, respectively.‡ This is just one reason to prefer the $(...)
syntax for command substitution, since the parentheses make it explicit that you are entering a subshell. It also happens to be nestable.
Answered by Ron Wolf on January 3, 2022
To exit on a command substitution failure you may explicitly set -e
in a subshell, like this:
set -e
x=$(set -e; false; true)
echo "this will never be shown"
Answered by errr on January 3, 2022
As others have said, local
will always return 0. The solution is to declare the variable first:
function testcase()
{
local MYRESULT
MYRESULT=$(false)
if (( $? != 0 )); then
echo "False returned false!"
return 1
fi
return 0
}
Output:
$ testcase
False returned false!
$
Answered by Will on January 3, 2022
As the OP pointed out in his own answer, assigning the output of the subcommand to a variable does solve the problem; the $?
is left unscathed.
However, one edge case can still puzzle you with false negatives (i.e. command fails but error doesn't bubble up), local
variable declaration:
local myvar=$(subcommand)
will always return 0
!
bash(1)
points this out:
local [option] [name[=value] ...]
... The return status is 0 unless local is used outside a function,
an invalid name is supplied, or name is a readonly variable.
Here's a simple test case:
#!/bin/bash
function test1() {
data1=$(false) # undeclared variable
echo 'data1=$(false):' "$?"
local data2=$(false) # declaring and assigning in one go
echo 'local data2=$(false):' "$?"
local data3
data3=$(false) # assigning a declared variable
echo 'local data3; data3=$(false):' "$?"
}
test1
The output:
data1=$(false): 1
local data2=$(false): 0
local data3; data3=$(false): 1
Answered by mogsie on January 3, 2022
The exact language used in the Single UNIX specification to describe the meaning of set -e
is:
When this option is on, if a simple command fails for any of the reasons listed in Consequences of Shell Errors or returns an exit status value >0, and is not [a conditional or negated command], then the shell shall immediately exit.
There is an ambiguity as to what happens when such a command occurs in a subshell. From a practical point of view, all the subshell can do is exit and return a nonzero status to the parent shell. Whether the parent shell will in turn exit depends on whether this nonzero status translates into a simple command failing in the parent shell.
One such problematic case is the one you encountered: a nonzero return status from a command substitution. Since this status is ignored, it does not cause the parent shell to exit. As you've already discovered, a way to take the exit status into account is to use the command substitution in a simple assignment: then the exit status of the assignment is the exit status of the last command substitution in the assignment(s).
Note that this will perform as intended only if there is a single command substitution, as only the last substitution's status is taken into account. For example, the following command is successful (both according to the standard and in every implementation I've seen):
a=$(false)$(echo foo)
Another case to watch for is explicit subshells: (somecommand)
. According to the interpretation above, the subshell may return a nonzero status, but since this is not a simple command in the parent shell, the parent shell should continue. In fact, all the shells I know of do make the parent return at this point. While this is useful in many cases such as (cd /some/dir && somecommand)
where the parentheses are used to keep an operation such as a current directory change local, it violates the specification if set -e
is turned off in the subshell, or if the subshell returns a nonzero status in a way that would not terminate it, such as using !
on a true command. For example, all of ash, bash, pdksh, ksh93 and zsh exit without displaying foo
on the following examples:
set -e; (set +e; false); echo "This should be displayed"
set -e; (! true); echo "This should be displayed"
Yet no simple command has failed while set -e
was in effect!
A third problematic case is elements in a nontrivial pipeline. In practice, all shells ignore failures of the elements of the pipeline other than the last one, and exhibit one of two behaviors regarding the last pipeline element:
Like before, turning off set -e
or using a negation in the last element of the pipeline causes it to return a nonzero status in a way that should not terminate the shell; shells other than ATT ksh and zsh will then exit.
Bash's pipefail
option causes a pipeline to exit immediately under set -e
if any of its elements returns a nonzero status.
Note that as a further complication, bash turns off set -e
in subshells unless it's in POSIX mode (set -o posix
or have POSIXLY_CORRECT
in the environment when bash starts).
All of this shows that the POSIX specification unfortunately does a poor job at specifying the -e
option. Fortunately, existing shells are mostly consistent in their behavior.
Answered by Gilles 'SO- stop being evil' on January 3, 2022
Interesting point!
I've never stumbled across that, because I'm no friend of set -e
(Instead I prefer to trap ... ERR
) but already tested that: trap ... ERR
also don't catch errors within $(...)
(or the oldfashioned backticks).
I think the problem is (as so often) that here a subshell is called and -e
explicitely means the current shell.
Only other solution that came to mind at this moment would be to use read:
ls -l ghost_under_bed | read name
This throws ERR
and with -e
the shell will be terminated. Only problem: this works only for commands with one line of output (or you pipe through something that joins lines).
Answered by ktf on January 3, 2022
(Answering my own because I've found a solution)
One solution is to always assign this to an intermediate variable.
This way the returncode ($?
) is set.
So
ABC=`exit 1`
echo $?
Will output 1
(or instead exit if set -e
is present), however:
echo `exit 1`
echo $?
Will output 0
after a blank line. The return code of the echo (or other command that ran with the backtick output) will replace the 0 return code.
I am still open to solutions that do not require the intermediate variable, but this gets me some of the way.
Answered by Danny Staple on January 3, 2022
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP