Add argparse support to corpus script

completed by: Daniel Huang

mentors: Francis Tyers, Jonathan

Add argparse support to the script. This should include sensible default handling of input and output files, as well as an option to output content split by sentence (as opposed to the default, by paragraph).
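
As a rough illustration of what is being asked for, the sketch below shows one possible argparse setup. It is not the script's actual interface: the argument names (input, -o/--output, -s/--sentences) and the convention that "-" means stdin or stdout are assumptions made for this example only.

```python
import argparse


def build_parser():
    """Build a command-line parser (hypothetical argument names)."""
    parser = argparse.ArgumentParser(
        description="Extract text from a corpus."
    )
    # Assumption: the input file is an optional positional argument;
    # omitting it (or passing "-") means "read from stdin".
    parser.add_argument(
        "input",
        nargs="?",
        default="-",
        help="input file (default: stdin)",
    )
    # Assumption: the output file is given with -o/--output;
    # omitting it (or passing "-") means "write to stdout".
    parser.add_argument(
        "-o",
        "--output",
        default="-",
        help="output file (default: stdout)",
    )
    # Off by default, so output stays split by paragraph
    # unless the flag is given.
    parser.add_argument(
        "-s",
        "--sentences",
        action="store_true",
        help="split output by sentence instead of by paragraph",
    )
    return parser
```

With this sketch, build_parser().parse_args(["in.txt", "-o", "out.txt", "-s"]) would give input="in.txt", output="out.txt" and sentences=True, while parse_args([]) falls back to "-" for both files and leaves sentences as False. The script itself would still need to open the files and do the actual sentence splitting.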