Text processing

Convert file encoding

Get available encodings.

iconv -l

Convert text from the ISO 8859-15 character encoding to UTF-8.

iconv -f ISO-8859-15 -t UTF-8 < input.txt > output.txt

wc -l targetFile | grep -Eo '[0-9]+'

awk -F ',' '{print $3 "," $1}' a1.csv > b2.csv

note

awk '!seen[$0]++' target.csv

sort -u target.csv

tip

-f - Case insensitive comparisons.

sed '/regex/d' file

sed 's/regex/string/g' file

sed '/lineregex/s/regex/string/g' file

Remove g from any of the expressions above to replace only the first occurrence on each line.

-i - Make changes overwriting the file.
--in-place=.bkp - Also update in-place but create a backup of the original file with .bkp extension.
-e - Apply multiple expressions (i.e. sed -e 's/regex0/string0/' -e 's/regex1/string1/' file).
-r - Allow extended regular expressions.

sort -k1,1 -k2,2nr

note