How to Clean Data at the Command Line