To fully understand Hadoop/Spark I/O, it helps to first understand "Sequence File" and "Serializable".
I was confused by both for a while, though, so I decided to write this post.