How To Use Tr Command.
In this tutorial, we will show you how to use the tr command through practical examples and detailed explanations of the most common options.
tr is a command-line utility in Linux and Unix systems that translates, deletes, and squeezes characters from the standard input and writes the result to the standard output.
The tr command can perform operations like removing repeated characters, converting uppercase to lowercase, and basic character replacing and removing. Typically, it is used in combination with other commands through piping.
How to Use the tr Command
The syntax for the tr command is as follows:
tr accepts two sets of characters, usually with the same length, and replaces the characters of the first sets with the corresponding characters from the second set.
A SET is basically a string of characters, including the special backslash-escaped characters.
In the following example, tr will replace all characters from the standard input (nexonhost), by mapping the characters from the first set with the matching ones from the second set.
Each occurrence of l is replaced with r, i with e, and n with d:
The character sets can also be defined using character ranges. For example, instead of writing:
you can use:
When -c (–complement) option is used, tr replaces all characters that are not in SET1.
In the example below all characters except “li” will be replaced with the last character from the second set:
As you may have noticed, the output above has one more visible character than the input. This is because the echo command prints an invisible newline character \n that is also replaced with y. To echo a string without a new line, use the -n option.
The -d (–delete) option tells tr to delete characters specified in SET1. When deleting characters without squeezing, specify only one set.
The command below will remove l, i and z characters:
The L character is not deleted because the input includes an uppercase L while the l character in the SET is lowercase.
The -s (–squeeze-repeats) option replaces a sequence of repeated occurrences with the character set in the last SET.
In the following example, tr removes the repeated space characters:
When SET2 is used the sequence of the character specified in SET1 is replaced with SET2.
The -t (–truncate-set1) option forces tr to truncate SET1 to the length of SET2 before doing further processing.
By default, if SET1 is larger than SET2 tr will reuse the last character of SET2. Here is an example:
The output shows that the character e from SET1 is matched with the latest character of SET2, which is 2:
Now, use the same command with the -t option:
You can see that the last three characters of the SET1 are removed. SET1 becomes ‘ab’, the same length as SET2, and no replacement is made.
The tr command also allows you to combine its options. For example, the following command first replaces all characters except i with 0 and then squeezes the repeated 0 characters:
Tr Command Examples
In this section, we’ll cover a few examples of common uses of the tr command.
Convert lower case to upper case
Converting lower case to upper case or reverse is one of the typical use cases of the tr command. [:lower:] matches all lower case characters and [:upper:] matches all uppercase characters.
Instead of character classes, you can also use ranges:
To convert upper case to lower case, simply switch the places of the sets.
Remove all non-numeric characters
The following command removes all non-numeric characters:
[:digit:] stands for all digit characters, and by using the -c option, the command removes all non-digit characters. The output will look like this:
Put each word in a new line
To put each word in a new line, we need to match all non-alphanumerical characters and replace them with a new line:
Remove blank lines
To delete the blank lines simply squeeze the repetitive newline characters:
In the command above we are using the redirection symbol < to pass the content of the file.txt to the tr command. The redirection > writes the output of the command to new_file.txt.
Print $PATH directories on a separate line
The $PATH environmental variable is a colon-delimited list of directories that tells the shell which directories to search for executable files when you type a command.
To print each directory on a separate line we need to match the colon (:) and replace it with the new line:
tr is a command for translating or deleting characters.
Although very useful, tr can work only with single characters. For more complex pattern matching and string manipulation, you should use sed or awk .