FLinux File Descriptor

The Linux Newbie Guide

⇒

Fundamentals

Advanced

Supplement

Command Index

ENG⇒中

File descriptor

1.0 Introduction to File Descriptors
File descriptors (fd) and redirection
Directory "/proc/<PID>/fd" and file descriptors
1.1 exec and fd Redirection
exec X>FILE : Redirect fd X to a file
exec X>&Y : Redirect fd X to fd Y
exec X<FILE : Redirect a file to fd X
exec X<&Y : Redirect fd Y to fd X
exec fd X<>FILE : Redirect file to fd X for both reading and writing
X>&- or X<&- : Close fd X
1.2 Who Stole stdin?

ENG⇒中 ENG⇒中
1.0 Introduction to File Descriptors
One day, I came across a certain book discussing Shell Scripting, which provided an example of reading a file line by line. The sample script is shown below:

(Example ex1.sh)

$ cat ex1.sh
#!/bin/bash
# read〝FILE.txt〞 line by line
while read -r line #←This line is one of the puzzling parts for me – how can "read" read a file instead of the keyboard?
do
echo $line
done <FILE.txt #←This line is another puzzling part for me – what does "done < FILE" mean???

Although the above example consists of only a few lines, its syntax is perplexing. As someone with limited knowledge, I find it hard to understand, and the author hasn't explained the principles thoroughly (probably copied from somewhere?).

Well, if I can't comprehend a piece of writing, it's not a big deal. I also want to adopt it for my application. I tested the example, and it works perfectly fine.

My application is also quite simple: I just want to display one line at a time and require pressing any key to proceed to the next line. So, I added an extra command to rewrite it as follows:

((Example ex2.sh)

$ cat ex2.sh
#!/bin/bash
while read -r line
do
echo $line
read -p "Press any key to continue" -n 1 #←This is the line I added
done <FILE.txt

Strange enough, my modified program, ex2.sh, doesn't work as expected! Who's the culprit? Is it something peculiar in my understanding? Or is it the line I added myself?

So, I turned into a keyboard detective, determined to catch the culprit and bring them to justice.

After several nights of intense investigation, I finally caught the culprit – it's none other than "File Descriptor," often abbreviated as "fd".

In the original example (ex1.sh), the ability to read a file line by line is achieved through a shell simplification that hides the intricacies of file descriptor operations. However, my modified example (ex2.sh) doesn't work properly because the shell's hidden file descriptor operations have stolen stdin (standard input).

In the literature within my country, mentions of "file descriptor" are rare, and even when they are touched upon, they fail to address the core issues. Therefore, I decided to jot down my insights from these past few days, serving as both my personal reminder and potentially assisting others who encounter the same problem. This is particularly relevant for shell scripts that involve file reading and keyboard input prompts, where the use of file descriptors might eliminate the need to rely on tools like awk or sed to accomplish tasks.

In simple terms, a "file descriptor" is a number assigned by Unix-like operating systems when reading files. This number acts as an index for the kernel to track the input/output of processes associated with the opened files.

For instance, while browsing this article, your browser may have opened around 20 files (HTML files and various image files), each with a unique index (such as 100, 101, 102, and so on). These index numbers are the file descriptors (fd). However, maintaining these file descriptors is the responsibility of the kernel, and regular users need not concern themselves with this level of detail.

^ back on top ^

file descriptors (fd) and redirection
There are three file descriptors (fd) that are always open: stdin (keyboard input), stdout (screen output), and stderr (error messages). The POSIX standard reserves the fd numbers 0 to 2 for these three files, allowing users to perform redirection for various applications.

fd Number	Name	Function
0	stdin	Standard Input
1	stdout	Standard Output
2	stderr	Standard Error

Redirection is essentially the process of redirecting the three uncloseable file descriptors (0 to 2) to other destinations (usually files or other fds).

	Function	Example	Example note
COMMAND 1>	Redirect stdout	echo '123' >fileA	fd 1 output to a file
COMMAND 1>>	Append stdout:	seq 100 200 >> fileA	fd 1 append output to a file
COMMAND 2>	Redirect stderr:	find / -name '*.conf' 2>/dev/null	fd 2 output to a file
COMMAND 2>>	Append stderr:	seq 1 10 >>fileA	fd 2 append output to a file
COMMAND 0<	Redirect stdin	cat < fileA	fd 0 replaced by a file

For output redirection, the syntax is COMMAND [fd]>, where fd defaults to 1.
if omitted. For input redirection, the syntax is COMMAND [fd]<, where fd defaults to 0 if omitted.

Redirection can also change the original output to go to stderr or vice versa.
The syntax is X>&Y (where X is the original fd, and Y is the redirected fd; if X is omitted, it defaults to 1). For example, redirecting stderr (2) to stdout (1) is written as "2>&1".

	Function	Example
2>&1	Redirect stderr(2) to stdout(1):	ls -R /home > fileA 2>&1
1>&2	Redirect stdout to stderr	find / -name '*readme.txt' 1>&2 2>/dev/null

Since stdin, stdout, and stderr (fd 0 to 2) are always open and can be used directly, if you need an fd greater than 3, you generally need to use the exec command to open it.

^ back on top ^

Directory "/proc/<PID>/fd" and file descriptors
When a process runs, it generates a Process ID (PID), and corresponding file descriptors fd are mapped to the directory "/proc/<PID>/fd" (where "<PID>" is the process's PID number). This directory allows you to observe the usage of file descriptors. Example:

$ seq 1 1000000
1
2
3
4 Ctrl+Z ←Press <Ctrl+Z> to pause
`
[1]+ Stopped seq 1 100000000 ←The program is stopped
$ jobs -p ←List the PIDs of paused commands
$ 2373 ←The PID of the command "seq 1 1000000" is 2373
ls -lgG /proc/2373/fd/ ←List /proc/<PID>/fd to observe fd usage
total 0
lrwx------ 1 64 2015-04-26 22:28 0 -> /dev/tty1
lrwx------ 1 64 2015-04-26 22:28 1 -> /dev/tty1
lrwx------ 1 64 2015-04-26 22:28 2 -> /dev/tty1

In the above example, the directory "/proc/<PID>/fd/" contains 3 files, corresponding to file descriptors 0, 1, and 2, respectively. These are linked to "/dev/tty1" (in graphical interface tests, it could be "/dev/pts/N").

This means that in the example, stdin (fd 0), stdout (fd 1), and stderr (fd 2) are all connected to the tty (terminal) or /dev/pts/N (virtual terminal).

Let's modify the experiment with the command seq 1 1000000 > fileA 2>&1 and observe the results:

lrwx------ 1 64 2015-04-26 15:04 0 -> /dev/tty1
l-wx------ 1 64 2015-04-26 15:04 1 -> /home/basalt/fileA
l-wx------ 1 64 2015-04-26 15:04 2 -> /home/basalt/fileA

In this example, stdin (fd 0) remains as the tty, but stdout (fd 1) and stderr (fd 2) are both redirected to "fileA."

Therefore, when a command becomes confusing due to piping and redirection, you can gain clarity by observing the information provided by the file descriptors in the directory "/proc/<PID>/fd/."

Now, if you modify it further to seq 1 100 > fileB >&2, after the computation, the contents of the file "fileB" are empty. Why is that? Take a look at "/proc/<PID>/fd/" to understand!

^ back on top ^

1.1 exec and fd Redirection

Excluding fd 0 (stdin), fd 1 (stdout), fd 2 (stderr), and system-reserved fd 10 to 255, general users are advised to only use fd 3 to 9 for redirection purposes.
(fd 255 is usually reserved for shell scripts, and process substitution may use fd 63 or fd 62, so it's best to avoid using the system's own fd 10 to 255 to prevent conflicts.)

To use fd 3 to 9, the exec command is used. In a process, the exec function serves two main purposes: it closes the parent process and runs the child process directly. Another important function of exec is fd redirection.

There are two types of redirection: redirecting one fd to another fd and redirecting an fd to a file. Let's explain each of them:

Redirecting one fd to another fd
For example, to create fd 6 and redirect it to fd 1, you write exec 6>&1. Why not write exec 1<&6, which has the redirection in the opposite direction? The principle is that the existing fd should be on the right side, and the new fd should be on the left side. Since fd 1 (stdout) already exists, it should be placed on the right side, hence exec 6>&1.
Another principle is that the input source for input redirection should be on the right side of "<:. For example, to redirect fd 0 (stdin) to fd 7, you write exec 7<&0.
Redirecting an fd to a file
To redirect output to a file, you use ">". For example, exec 3>FILE means creating fd 3 and redirecting it to the file "FILE".
To redirect a file to an fd, you use "<". For example, exec 0<FILE means the file replaces fd 0 (stdin).

Combining these principles, the possible usages are as follows:

exec X>FILE : Redirect fd X to a file
exec X>FILE, where X is the fd number (recommended to use 3 to 9), means creating fd X and redirecting it to the file FILE.

Is it a bit abstract? Let's test it step by step and observe the results. Here's an example of an operation:

$ exec 8>/tmp/fd_test ←Create fd 8 and redirect it to the file "tmp/fd_test"
$ echo $$ ←View the current shell's PID
2633 ←Current shell's PID
$ ls -lgG /proc/2633/fd ←Observe fd usage total 0
total 0
lr-x------ 1 64 2015-05-04 10:17 0 -> /dev/tty1
l-wx------ 1 64 2015-05-04 10:17 1 -> /dev/tty1
l-wx------ 1 64 2015-05-04 10:17 2 -> /dev/tty1
lrwx------ 1 64 2015-05-04 10:58 255 -> /dev/tty1
lr-x------ 1 64 2015-05-04 10:17 8 -> /tmp/fd_test ←fd 8 created and redirected to the file

From the above example, the exec 8>/tmp/fd_test command opens fd 8, and that fd 8 is then redirected to the file "/tmp/fd_test". Now, let's redirect fd 1 (stdout) to fd 8, which means writing stdout to the file "/tmp/fd_test".

Okay, let's continue the experiment.

$ echo 'hello world !' >&8 Write a string to fd 8
$ cat /tmp/fd_test ←Verify the content
hello world !

As a recap, redirecting stdout (1) to stderr (2) is written as "1>&2" or" >&2". Similarly, redirecting fd 1 to fd 8 in the above example is written as "1>&8" or" >&8".

Remember to close fd 8 if it's no longer needed.

The shell script "ex3.sh" below demonstrates opening multiple fds and closing them after use:

Example:

$ cat ex3.sh
#!/bin/bash

# flowing create fd 3~5 and redirect to file1~file3
exec 3>/tmp/file1
exec 4>/tmp/file2
exec 5>/tmp/file3

# flowing write string to fd1 then redirect to fd 3~5
echo '1234' >&3
echo 'abcd' >&4
echo 'I II III IV' >&5

# flowing close fd 3~5
exec 3>&-
exec 4>&-
exec 5>&-

^ back on top ^

exec X>&Y : Redirect fd X to fd Y
In the example provided, the exec X>&Y syntax is used to redirect fd X to fd Y. In this context, we can use "X>&Y" to handle stderr when using a pipeline.

The pipeline functionality allows the stdout of one command to become the stdin of the next command, meaning the stderr part cannot pass through the pipeline to the next command. However, if we want to process stderr after the pipeline, we can use "X>&Y" for redirection. In the following example, stderr is piped to tr to convert it to uppercase, but stdout is not piped.

Example: (Tested as a non-root user)

$ exec 6>&1 ←Redirect fd 6 to fd 1
$ ls -l /root /etc/fstab 2>&1 1>&6 | tr a-z A-Z ←stderr is piped to tr to convert to uppercase
-rw-r--r-- 1 root root 608 2014-09-26 15:47 /etc/fstab ←stdout remains unchanged as it is redirected to fd 6

LS: CANNOT OPEN DIRECTORY /ROOT: PERMISSION DENIED ←stderr is converted to uppercase by tr
$ exec 6>&- ←Close fd 6

In the above example, the "2>&1" redirects both stderr and stdout to stdout first, and then "1>&6" redirects stdout to fd 6. Therefore, after the pipeline and tr, only stderr is affected because stdout was redirected to fd 6.

^ back on top ^

exec X<FILE : Redirect a file to fd X
The exec X< FILE command redirects a file to file descriptor X, where X is optional and defaults to 0 (stdin).

If a file is redirected to fd 0 (most commonly used), a more accurate way to describe it would be to say "file replacing keyboard" (emphasizing "replacing").

For example, exec < FILE redirects the file to fd 0 (stdin), which means that the input source is no longer stdin (keyboard), but instead replaced by the file content (keyboard input becomes ineffective). This allows the original commands that were entered from the keyboard (stdin) to be read from the file, one line at a time.

In an interactive shell, the most essential interaction involves the keyboard and screen. Without a keyboard, how can interaction occur? Hence, inputting exec < FILE (if such a file exists) would exit the shell. However, in non-interactive shells (such as shell scripts), the file would replace stdin (keyboard).

An example of a script file, "ex4.sh", reads the first three lines of the file "/etc/fstab".

Example:

$ cat ex4.sh
#!/bin/bash

exec < /etc/fstab #fd 0 (stdin)= file〝/etc/fstab〞

# flowing read〝/etc/fstab〞 line1~3
read line1
read line2
read line3

# flowing print〝/etc/fstab〞 line1~3
echo $line1
echo $line2
echo $line3

If you want to read an entire file, you can use a loop, as shown in the shell script example below.

Example:

$ cat ex5.sh
#!/bin/bash

# read file "/etc/fstab" line by line
exec 0< /etc/fstab # fd 0 (stdin) = file

while read line # now command read from file instead of stdin
do
echo $line
done

^ back on top ^

exec X<&Y : Redirect fd Y to fd X
The syntax exec X<&Y is used to redirect fd Y to fd X and creates fd X. After this operation, fd X becomes equal to fd Y

.In the example you provided

$ exec 3>/tmp/fd_test ←Redirect fd 3 to the file "tmp/fd_test"
$ echo "line1" >&3 ←Write the string to fd 3
$ cat /tmp/fd_test ←Verify the content line1
line1
$ exec 9<&3 ←Open fd 9 and redirect fd 3 to fd 9 (now fd 9 is equal to fd 3)
$ echo "line2" >&9 ←Write the string to fd 9
$ cat /tmp/fd_test ←Verify the content
line1
line2
$ exec 9>&- ←Close fd 9
$ exec 3<&- ←Close fd 3

^ back on top ^

exec fd X<>FILE : Redirect file to fd X for both reading and writing
The exec fd X<> FILE command is used to redirect the file "FILE" bidirectionally to the File Descriptor (fd) X, allowing both reading from and writing to the file. Note that there should be no spaces between "X<>".

Example: [Note:1.1]

$ echo 1234567890 > File ←Create a file named "File"
$ exec 3<> File ←Open the file "File" and redirect it bidirectionally to fd 3
$ read -n 4 <&3 ←Read the 4th character from fd 3
$ echo -n "." >&3 ←Write a period "." to fd 3
$ exec 3>&- ←Close fd 3
$ cat File ←Verify the content
1234.67890

In the example provided, a file named "File" is created and populated with "1234567890". Then, the exec 3<> File command opens the file "File" and redirects it bidirectionally to fd 3. It reads the 4th character from fd 3 and writes a period "." to fd 3. Afterward, fd 3 is closed, and the content of the file "File" is verified, resulting in "1234.67890".

X>&- or X<&- : Close fd X
To close the input redirection of fd, use X<&-, and to close the output redirection of fd, use X>&-.

If closing an fd without using exec, it will be temporarily closed (its effect lasts for only one operation).

In the following example, stderr is temporarily closed. (Note: You cannot close stdin and stdout in an interactive shell.)

Example: (Tested as a non-root user)

$ find / -name 'readme.*' 2>&- ←Find all files named "readme." in the filesystem & close stderr
/usr/share/icons/Bluecurve/48x48/mimetypes/readme.png
/usr/share/doc/cyrus-sasl-lib-2.1.22/readme.html
/usr/share/doc/words-3.0/readme.txt
/usr/share/icons/Bluecurve/48x48/mimetypes/readme.png

In the above example, find / -name 'readme.*' 2>&- is equivalent to find / -name 'readme.*' 2> /dev/null, which filters out stderr.

For permanently closing an fd, it is necessary to use exec, for example, exec 9>&-. Remember to close unused fds to allow other programs to use them and to avoid potential conflicts where multiple programs compete for the same fd number, leading to hard-to-detect bugs.

Also, note that only stdout can pass through a pipeline. Other unrelated fds should be temporarily closed. For example, in the example provided with exec X>&Y, it is safer to rewrite it as follows:

Example:

$ exec 6>&1
$ ls -l /root /etc/fstab 2>&1 1>&6 6>&- | tr a-z A-Z 6>&- ←Temporarily close fd 6 for "tr"
$ exec 6>&- ←Permanently close fd 6 for the shell

^ back on top ^

1.2 Who Stole stdin?
Let's revisit the mysterious example ex1.sh. When running it and observing the "/proc/<PID>/fd" directory, the result is as follows:

lr-x------ 1 64 2015-04-26 14:45 0 -> /home/basalt/FILE.txt ←stdin changed to a file
lrwx------ 1 64 2015-04-26 14:45 1 -> /dev/tty1
lrwx------ 1 64 2015-04-26 14:45 10 -> /dev/tty1 ←additional fd 10 opened
lrwx------ 1 64 2015-04-26 14:45 2 -> /dev/tty1
lr-x------ 1 64 2015-04-26 14:45 255 -> /home/basalt/ex1.sh

The stdin has become a file, and an additional fd 10 is opened. We can boldly speculate that when encountering the "done < FILE" type of syntax in ex1.sh, the shell cleverly changes the stdin source to a file. Let's deduce the hidden file descriptor operations made by the shell:

exec 10<&0 #←Backup fd 0 to fd 10
exec < FILE.txt #← stdin=file
while read -r line
do
echo $line
done #←Original command was done<FILE.txt

exec 0<&10 #← Restore fd 0 from fd 10

Due to the" done < FILE", the stdin is changed to a file. Since the stdin is taken away by the file, this is the reason why the modified ex2.sh cannot read from the keyboard using the read command. After the modification, the script can read from the file and the stdin (keyboard). The modified example "ex6.sh" is as follows:

$ cat ex6.sh
#!/bin/bash
# read〝FILE.txt〞 line by line

exec 7<FILE.txt # ←fd 7=FILE.txt)

while read -u 7 line #←read reads from fd 7 instead of stdin
do
echo $line
read -p "Press any key to continue" -n 1
done

On the internet, I stumbled upon a classic case where many people encountered the issue of stdin being taken away without understanding the reason. A user wanted to write a shell script to list files in the working directory and then prompt whether to delete them, but it didn't work, so the user sought help online.

The problematic shell script looks like this:

$ cat ex7.sh
#!/bin/bash
while read file_name
do
rm -iv $file_name
done < <(ls)

If users are familiar with fd operations, they should be able to help the user find the issue (hint: the last part "<(ls)" is process substitution). (For a corrected version that works, refer to [Note 1.1a].)

^ back on top ^

[Note1.1] Example Source Advanced Bash-Scripting Guide

[Note1.1a]

$ cat ex8.sh
#!/bin/bash
exec 3<&0
while read file_name
do
rm -iv $file_name <&3
done < <(ls)

# To exclude directories from the list and avoid error messages, the last line can be modified as follows:
# done < <(ls -F | grep -v '/$')