linux Virtual block device

The Linux Newbie Guide

⇒

Fundamentals

Advanced

Supplement

Command Index

ENG⇒中

Virtual block device

1.0 Virtual Block Device
dd : Physical to virtual conversion
losetup : Virtual to physical conversion
kpartx : Mapping partitions in disk images to loop devices
1.1 Swap : Auxiliary Memory
swap-partition
swapon : Enabling swap
swapoff : Disabling swap
swap-file : Using a file for swap
mkswap : Formatting swap
swappiness : Adjusting the aggressiveness of swap usage

ENG⇒中 ENG⇒中
1.0 Virtual Block Device - Form is not different from emptiness, emptiness is not different from form

If a user has experience installing Linux on a virtual machine, they may find it difficult to distinguish between what is virtual and what is real. The entire operating system can be virtualized, including virtual block devices such as CD-ROMs, hard disks, and filesystems. For Linux, this virtualization is a common and essential feature, and Linux also comes with various built-in tools for virtualization and real device interaction.

The philosophy of UNIX/Linux is "Everything is a file," and the key to this concept is that a "file" can also represent a physical storage device like "/dev/sda." Therefore, in Linux, there is only a thin line between files and physical devices, and understanding the interchangeability of virtual and real can lead to infinite possibilities.

The Heart Sutra states, "Form is not different from emptiness, emptiness is not different from form." In this context, "form" refers to physical objects, and "emptiness" refers to the virtual. In simpler terms, it means "virtual and real are the same." In the world of UNIX/Linux, physical storage devices and the "device files" representing them embody the principle of "form is emptiness, emptiness is form."

dd : Physical to virtual conversion
In Windows, to convert a CD/DVD to an ISO file, commonly used tools are "WinISO" or "UltraISO." For complete disk duplication, commercial software like "Norton GHOST" or "Acronis True Image" is often used. However, in Linux, there is a powerful and widely known tool called "dd," which provides similar functionality to most of those tools and is free to use. The only difference is that dd does not have a graphical interface and requires command-line usage. Interestingly, the name dd has multiple interpretations despite its powerful features.

The primary purpose o fdd" is to read data and perform conversions, somewhat similar to tr, as shown in the example below:

Example:

$ cat fileA | dd conv=ucase > fileB ←Converts lowercase letters in fileA to uppercase and stores the result in fileB

The parameter "conv=ucase" converts lowercase to uppercase, and the opposite parameter "conv=lcase" converts uppercase to lowercase.

dd can also be used with input and output file options, represented by "if=" (input file) and "of=" (output file), respectively.

$ dd if=fileA of=fileB conv=lcase ←Converts uppercase letters in fileA to lowercase and stores the result in fileB

Though dd may seem simple at first glance, it is capable of more than just character conversions. It can be used to convert physical devices to virtual ones, bridging the gap between real and virtual. dd is able to convert physical devices to virtual device files, often referred to as "image files" or "disk images," which can be treated as real media when needed.

For instance, the following example converts a physical CD-ROM (CentOS 7 installation disc) to a virtual image-file (ISO file):

Example:

$ isoinfo -d -i /dev/cdrom ←Reads CD-ROM information
CD-ROM is in ISO 9660 format
System id: LINUX
Volume id: CentOS 7 x86_64
Volume set id:
Publisher id:
Data preparer id:
Application id: GENISOIMAGE ISO 9660/HFS FILESYSTEM CREATOR (C) 1993 E.YOUNGDALE
Copyright File id:
Abstract File id:
Bibliographic File id:
Volume set size is: 1
Volume set sequence number is: 1
Logical block size is: 2048
Volume size is: 2182204
. . .

By using isoinfo we can see that the format of this CD is ISO 9660, the Logical block size is 2048, and the Volume size is 2182204. Based on this information, we can use dd to convert the CD to an .iso file:

$ dd if=/dev/cdrom of=centos7.iso bs=2048 count=2182204 status=progress
4422363136 bytes (4.4 GB, 4.1 GiB) copied, 57 s, 77.6 MB/s
2182204+0 records in
2182204+0 records out 4469153792 bytes(4.5 GB,4.2 GiB)copied,60.7711 s, 73.5 MB/s

In this example, the parameter "bs" indicates the block size for reading/writing, and the "status=progress" parameter is added to show the conversion progress.

Similarly, dd can also be used to convert physical partitions or floppy disks into image files.

Example::

# dd if=/dev/sda5 of=sda5.img ←Convert a partition to an image-file
# dd if=/dev/fd0 of=floppy2.img ←Convert a floppy disk to an image-file
# dd if=floppy2.img of=/dev/fd0 ←Restore the image-file back to the floppy disk

Once an image-file is created, it can be mounted as a virtual device or restored using losetup, the reverse of dd.

IT professionals who have used Norton Ghost to copy/backup Windows systems can use dd as a free alternative.

Example:

# dd if=/dev/sda of=/dev/sdb conv=noerror ←Hard disk copying
# dd if=/dev/sda conv=noerror | gzip -c > win_xp.img.gz ←Backup the hard disk to a compressed image-file
# gzip -dc win_xp.img.gz | dd of=/dev/sda ←Restore the image-file back to the hard disk

The parameter "conv=noerror" means to continue reading despite errors. If you are sure there will be no errors, you can omit this parameter.

Sometimes, the MBR (Master Boot Record) or GPT (GUID Partition Table) of a hard disk may become corrupted due to malware or unknown reasons, preventing the system from booting. In such cases, dd can be used to create a recovery image-file, backing up the MBR or GPT table, which can later be used for disaster recovery.

MBR is stored in the first 512 bytes of the hard disk, and GPT can be backed up by saving the first 34 Logical Block Addresses (LBA). The following examples demonstrate backing up and restoring the MBR and GPT:

Example for MBR:

# dd if=/dev/sdb of=mbr bs=512 count=1 ←Backup the MBR table
# dd if=mbr of=/dev/sdb bs=512 count=1 ←Restore the MBR table

In the above example," bs=512" is used because the MBR table is only 512 bytes in size. The "count" parameter specifies how many times to execute the "bs" block size.

To backup the GPT table, you can simply backup the first 34 LBAs. The GPT scheme is as follows: (512e or traditional hard drive, one LBA = block = 512B)

CUID Partition Table (GPT) Scheme

LBA 0	Protective MBR
LBA 1	Primary GPT Header	Primary GUID Partition Entry Array
LBA 2	GUID Partition entries 1~4
LBA 3	GUID Partition entries 5~8
...	...
LBA 32	GUID Partition entries 121~124
LBA 33	GUID Partition entries 125~128
LBA 34	Partition 1	Partition or unused area
...	...
LBA N	Partition 2

[Note 1.0]。

Example for GPT:

# dd if=/dev/sdc of=gpt bs=512 count=34 ←Backup the primary GPT table
# dd if=gpt of=/dev/sdc bs=512 count=34 ←Restore the primary GPT table

dd can also be used to completely erase hard drive data. This is because mkfs only rebuilds the inodes when formatting an ext2/ext3/ext4 partition, and does not erase the blocks. This means that someone with the right tools may be able to recover the data. To completely erase a hard drive, you can use dd if=/dev/zero of=/dev/sda to fill the entire hard drive with zeros.

dd is also used to create empty image-files that can be used for various purposes. For example, creating a swap-file as a swap partition or creating a virtual partition that can be formatted and mounted to have additional partition space

Example:

# dd if=/dev/zero of=200M-image bs=200M count=1 ←Create a 200MB empty image-file
# mkfs 200M-image ←Format the image-file
# mount -t ext2 -o loop 200M-image /mnt ←Mount the virtual partition

(The command dd if=/dev/zero of=200M-image obs=1 count=0 seek=200M will create a 200MB empty image file more quickly than the command dd if=/dev/zero of=200M-image bs=200M count=1.)

When necessary, the virtual partition can be added to "/etc/fstab" for automatic mounting on startup. Such virtual image-files can be used not only as partitions but also as disks for partitioning using tools like parted.

Using dd provides the added benefit of time measurement, making it a useful tool for testing the speed of a hard disk. For example, the following commands can be used to test

# dd if=/dev/zero bs=1024 count=1000000 of=1G-file ←Test write speed (block=1k, size=1G)
# dd if=1G-file bs=1024 of=/dev/null ←Test read speed

The dd command is a powerful tool that can be used for a variety of purposes. It is important to use dd carefully, as it can be used to erase data permanently.

^ back on top ^

losetup : Virtual to physical conversion
Why is it possible to use the mount command to mount a file created with dd as a filesystem, even though the file is not a partition? When mounting, you can observe the behavior using the "-v" option. Here's an example of how this works:

Example:

# mount -v -t ext2 -o loop 200M-image /mnt ←Using the -v option during mounting
mount: going to use the loop device /dev/loop0 ←Using a loop device??
/root/200M-image on /mnt type ext2 (rw,loop=/dev/loop0)

When mounting an image file using mount -v -o loop, you might encounter the term "loop device." So, what is a "loop device"? A "loop device" is a virtual block device. Loop devices are available by default, usually eight of them, numbered from "/dev/loop0" to "/dev/loop7". The number of loop devices supported may vary among different Linux distributions. These devices cannot be directly used until you specify a file (an image file) to associate with the loop device. The tool used to associate a file with a loop device is called losetup (loop device setup).

The usage of losetup is as follows: losetup [-option] [/dev/loop#] device (where # is a number). For instance, losetup /dev/loop0 200M-image associates the virtual block device file "/dev/loop0" with the image file 200M-image. At this point, "/dev/loop0" becomes an interface for the image file. The operating system treats this virtual block device file, such as /dev/loop#, just like a real storage device, making operations indistinguishable from those performed on physical storage devices.

When using the mount -o loop command, it is actually utilizing the losetup command behind the scenes. If you were to break down the process without using mount -o loop, it would look something like this:

Example:

# losetup /dev/loop0 200M-image ←Associating the image file with the virtual block device file "/dev/loop0"
# mount -t ext2 /dev/loop0 /mnt ←Mounting "/dev/loop0"

To detach the association between losetup and an image file, you can use the "-d" option. For instance, losetup -d /dev/loop0 would disconnect the association between "/dev/loop0" and the image file.

The "-v" option displays whether a loop device is associated with an image file or not. For example, using losetup -v /dev/loop3 or losetup -a scans from" /dev/loop0" to "/dev/loop7" to show the current associations. Another option, "-f", finds the next available unused loop device automatically, eliminating the need to specify a loop device explicitly.

Example:

# losetup -a ←Scan for used loop devices
/dev/loop0: [0805]:261784 (disk-image) ←/dev/loop0 is in use
# losetup -fv part-image ←Automatically find an unused loop device for the file part-image
Loop device is /dev/loop1 ←Automatically found "/dev/loop1"

In the section about dd, there was a demonstration of creating an empty image file and then immediately formatting it, resulting in a "not a block special device" warning. This is because an empty image file created with dd pretends to be a storage device but is quickly recognized as a sham. If you use losetup to associate this empty image file, it will closely resemble a real block device, even appearing in the output of blkid. Here's an example:

Example:

# dd if=/dev/zero of=disk-image bs=100M count=1 ←Create a 100M empty image file
# losetup /dev/loop0 disk-image ←Associate the image file as a simulated hard disk partition

After using losetup on "/dev/loop0", it behaves just like a real hard disk. When you use mkfs /dev/loop0 to format, there will be no "not a block special device" warning as before when formatting the image file directly. You can also use fdisk /dev/loop0 to partition it (for image files larger than 2TB, you might need to use parted).

From these experiments, it's evident that losetup transforms virtual image files into actual devices. By combining dd and losetup, you can seemingly create an additional eight hard disks (from "/dev/loop0" to "/dev/loop7") without dealing with the hassle of handling physical disks. You can use these virtual devices to experiment with concepts like LVM and RAID without risking your system stability.

If you've used dd to backup a hard disk with multiple partitions, and you want to separate individual partitions, you can use the offset option "-o". Here's an example of a split image file with three partitions:

# fdisk -l disk-image ←List the partitions of the image file
Device Boot      Start         End      Blocks   Id System
disk-image1               1         320       20464    c W95 FAT32 (LBA)
disk-image2             321         800       30720   83 Linux
disk-image3             801        1600       51200   82 Linux swap / Solaris

When an image file contains more than one partition, and the "-o" (offset) option is not used, it can lead to incorrect usage and potential damage to the image file when accessing it.

Since the offset unit for the "-o" option is in bytes, you can use parted to automatically convert the offset units to bytes for use with the losetup -o option. Here's an example of how to do this:

# parted disk-image unit B print ← List the starting bytes of partitions

Number Start      End         Size       Type     File system Flags
1      16384B     20971519B   20955136B primary               lba
2      20971520B 52428799B   31457280B primary
3      52428800B 104857599B 52428800B primary

In the example above, the starting bytes for partitions #1 to #3 are 16384, 20971520, and 52428800 respectively. These are the values to be used as the offset for the loop devices. You can set up the loop devices to correspond to these partitions as shown below:

Example:

# losetup -o 16384 /dev/loop0 disk-image ←/dev/loop0 corresponds to partition 1
# losetup -o 20971520 /dev/loop1 disk-image ←/dev/loop1 corresponds to partition 2
# losetup -o 52428800 /dev/loop2 disk-image ←/dev/loop2 corresponds to partition 3

Alternatively, you can set up the loop devices based on another loop device and offset, as shown here:

# losetup /dev/loop0 disk-image ←Use "/dev/loop0" as the offset base
# losetup -o 16384 /dev/loop1 /dev/loop0 ←"/dev/loop1" corresponds to partition 1
# losetup -o 20971520 /dev/loop2 /dev/loop0 ←"/dev/loop2" corresponds to partition 2
# losetup -o 52428800 /dev/loop3 /dev/loop0 ←"/dev/loop3" corresponds to partition 3

Another approach is to apply the offset during the mounting process using the "-o" option, as shown in the following example:

# losetup /dev/loop0 disk-image ←Use /dev/loop0 as the offset base
# mount -o loop,offset=16384 /dev/loop0 /mnt/disk1 ←Mount with offset
# mount -o loop,offset=20971520 /dev/loop0 /mnt/disk2
# mount -o loop,offset=52428800 /dev/loop0 /mnt/disk3

Because mount -o loop essentially executes losetup in the background, additional loop devices may be used (you can scan the loop devices with losetup -a). After applying the correct offset handling, "/dev/loop0" to "/dev/loop2" will virtually represent the three partitions of the image file.

The offset values for partitions can also be calculated using sector values provided by fdisk, as shown in the following example:

# fdisk -lu disk-image ←List the starting sectors of partitions
Units = sectors of 1 * 512 = 512 bytes ←One sector = 512 bytes

Device Boot      Start         End      Blocks   Id System
disk-image1              32       40959       20464    c W95 FAT32 (LBA)
disk-image2           40960      102399       30720   83 Linux
disk-image3          102400      204799       51200   82 Linux swap / Solaris

The output from fdisk -lu is in sectors (with one sector being 512 bytes for hard disks, but possibly 2048 or 512 bytes for optical discs). The "Units" information in the fdisk -lu output specifies this sector size. In the example above, partition #1 starts at sector 32, which translates to a starting byte of 32 x 512 = 16384. This result should align with the output from parted DEVICE unit B print.

kpartx : Mapping partitions in disk images to loop devices
When using dd to back up a hard disk with multiple partitions and using losetup -o to separate the partitions, calculating the offset values and handling unformatted partitions can be quite cumbersome. An alternative tool called kpartx can simplify this process.

The usage of kpartx is similar to losetup, but you don't need to worry about specifying offset values or loop devices. The usage is: kpartx [-option] image-file. The "-a" option creates links between the image file and loop devices, the "-d" option removes these links, and the "-v" option displays detailed information about loop device mappings.

In the following example, we have an image file with four partitions, and we use kpartx to automatically map the image file to loop devices:

Example:

# kpartx -av disk-image
add map loop0p1 : 0 30688 linear /dev/loop0 32 ←Mapping partition 1 to loop0p1
add map loop0p2 : 0 51200 linear /dev/loop0 30720 ←Mapping partition 2 to loop0p2
add map loop0p5 : 0 81888 linear /dev/loop0 81952 ←Mapping partition 5 to loop0p5
add map loop0p6 : 0 40928 linear /dev/loop0 163872 ←Mapping partition 6 to loop0p6

After using kpartx -a image-file as shown above, if you use losetup -a, you'll notice that one loop device is occupied. However, even though the image file contains four partitions, only one loop device is used. This is because the partitions are mapped to files in the directory "/dev/mapper", where you'll find files named" loop[0-7]p[0-9]".

These mapped files "/dev/mapper/loop[0-7]p[0-9]" can be used as if they were physical hard disks. You can format and mount them just like real hard disks. Here's an example:

Example:

# kpartx -av disk-image
# ls /dev/mapper/loop* ←Check the mapped file names
/dev/mapper/loop0p1 /dev/mapper/loop0p5
/dev/mapper/loop0p2 /dev/mapper/loop0p6
# mkfs -j /dev/mapper/loop0p1 ←format partition #1
# mount /dev/mapper/loop0p5 /media ←mount partition #5

To remove the mappings created in the above example, us kpart -dv disk-image.

^ back on top ^

1.1 Swap : Auxiliary Memory

We've all experienced situations in Windows where running too many applications or using a resource-intensive software slows down the system, and the disk LED keeps flashing. This is often due to insufficient memory, and Windows resorts to using the hard disk as virtual memory. In Windows, this technique is known as "paging." However, traditional mechanical hard disks are significantly slower than memory, causing the system to become sluggish when using the hard disk as auxiliary memory. While it's not ideal, this approach prevents the system from crashing or becoming unresponsive.

Unix/Linux uses a similar concept called "swap." Swap involves utilizing the hard disk space as temporary memory, essentially swapping space for time. However, the question of whether to use swap, how much to allocate for swap, or when to use it remains a debated topic. Generally, if the system has ample physical memory, the need to use swap is minimized. Consequently, a smaller swap size can be set. Conversely, a larger swap size is advisable if physical memory is limited. If the system uses hibernation, where memory data is written to the disk, the swap space must be at least as large as the system's memory.

Linux industry leader Red Hat recommends a minimum swap size of 256MB or more. Here are their recommended swap sizes based on system memory, along with considerations for hibernation:

System Memory:	Recommended Swap Size	Recommended Swap Size with Hibernation
< 2GB	Twice the size of memory	Three times the size of memory
2GB-8GB	Equal to the size of memory	Twice the size of memory
8GB-64GB	Equal to the size of memory	1.5 times the size of memory
>64GB	4G	4G

[Note 1.1]

Traditionally, swap space was created using partitions. However, modern Linux systems can also use swap files, similar to Windows' paging file. Each approach has its advantages and disadvantages. Swap partitions generally offer better performance, but resizing them can be more complicated. On the other hand, swap files might have slightly lower performance (due to potential non-contiguous allocation) but are easier to resize. To check if swap is enabled on your system and its size, you can use the command swapon -s (equivalent to cat /proc/swaps).

Example:

# swapon -s ←Display swap usage information
Filename Type Size Used Priority
/dev/sda3 partition 1020116 1088 -1

In the above example, the output indicates that a partition-based swap space of 1GB is being used. The "Used" column shows the current usage of the swap space. If it's 0, that means the swap space hasn't been utilized yet. To observe swap usage, you might need to have multiple users logged in or run memory-intensive software such as office applications, web browsers, or graphics editors.

swap-partition
If a partition serves as a swap partition and has corresponding settings in the "/etc/fstab" file, it will automatically be enabled during system startup.

Example:

# cat /etc/fstab | grep 'swap'
LABEL=SWAP-sda3 swap swap defaults 0 0

swapon : Enabling swap
If you're observant and use the mount or df commands, you'll notice that a swap partition is not actually mounted. Enabling a swap partition is not done using mount, but rather a separate command called swapon. The "-a" option enables swaps according to the settings in "/etc/fstab."

If a swap partition is not configured in "/etc/fstab," you can use swapon later to enable it. The syntax for swapon is either swapon DEVICE (where "DEVICE" is the device file) or swapon -L LABEL (where "LABEL" is the filesystem label).

Example:

# swapon -a ←Enable swap according to settings in "/etc/fstab"
# swapon /dev/sdb3 ←Enable swap on the device "/dev/sdb3"
# swapon -L SWAP-sda3 ←Enable swap with the label "SWAP-sda3"

swapoff : Disabling swap
To disable swap, you can use the swapoff command. The usage and options of swapoff are similar to swapon, but it reverses the action. For instance, swapoff -a disables swaps according to the settings in "/etc/fstab."

Example:

# swapoff /dev/sdb3 ←Disable swap on the device "/dev/sdb3"

If you didn't configure a swap partition during Linux installation or want to adjust its size, follow these steps:

If you already have a swap partition and want to adjust its size, you need to delete the existing partition and recreate it. Deleting a partition can be done using the d command in fdisk or the rm command in parted (Note: Linux also supports using a single dedicated swap partition, so you don't necessarily need to delete the existing one.)
Create the new swap partition. For fdisk, change the partition type to 82_(HEX) using the t command. For parted, use mkpart with fs-type "linux-swap."
After creating the swap partition, format it using the mkswap command (e.g., mkswap /dev/sdb1).
Update the "/etc/fstab" file with the new swap partition settings.
Run the swapon -a command, or restart the system. Then use either cat /proc/swaps or swapon -s to check if the swap partition has been enabled properly.
Using a dedicated hard disk for a swap partition will be faster than having multiple partitions, each with a swap partition. If speed is a priority, consider using a solid-state drive (SSD).

swap-file : Using a file for swap
Linux also allows you to use a file as swap space, similar to how it's done in Windows XP or Windows 7, where the file is typically named "pagefile.sys" and located in the root directory.

mkswap : Formatting swap
If you didn't reserve a swap partition during the installation of the system and later find the need for swap space, you can create a swap file using dd and then format it using mkswap.

Here's how you can create a 512 MiB swap file using dd:

Example:

# dd if=/dev/zero of=/mnt/512MiB_swap bs=512M count=1 ←Create a 512M swap file

Once the swap file is generated, you're halfway there. Next, you need to set permissions on the file so that others can't read or write to it. Then, format it as swap using mkswap and finally, enable it using swapon.

Example:

# chmod 600 /mnt/512MiB_swap ←←Change permissions to "rw-------"
# mkswap /mnt/512MiB_swap ←Format as swap
# swapon /mnt/512MiB_swap ←Enable swap
# cat /proc/swaps ←erify that swap is enabled
Filename Type Size Used Priority
/mnt/512MiB_swap file 524280 0 -1

If you want to enable swap at boot time, add the following configuration to "/etc/fstab":

Example:

# cat /etc/fstab | grep 'swap'
/mnt/512MiB_swap swap swap defaults 0 0

swappiness : Adjusting the aggressiveness of swap usage
"Swappiness" controls how aggressively the system uses swap space. You can adjust the swappiness value to match your system's behavior. If you have a system with limited RAM, you might want it to use swap more often, while if you have ample RAM, you might prefer it to use swap less often.

The swappiness value ranges from 0 to 100, where 0 means the system will try to use RAM as much as possible, and 100 means it will aggressively use swap. To check the current swappiness value, you can use:
:
Example:

# cat /proc/sys/vm/swappiness ←Display the swappiness value
60

The default swappiness value is typically set to 60. If you want to change it, you can edit "/etc/sysctl.con"" and add the line "vm.swappiness = n" (where n is a value between 0 and 100). After rebooting, the new swappiness value will take effect. Here's an example of setting swappiness to 30:

# cat /etc/sysctl.conf | grep 'swappiness'
vm.swappiness=30 ←his line sets swappiness to 30

^ back on top ^

[Note1.0] Reference source: http://en.wikipedia.org/wiki/GUID_Partition_Table

[Notge1.1] Reference source: https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Installation_Guide/s2-diskpartrecommend-x86.html