If you are using Apache Spark, check out DDQ. The code base is rather simple and very extensible. We also have 100% test coverage so you can get your hands dirty without fear :P
We're a place where coders share, stay up-to-date and grow their careers.
We strive for transparency and don't collect excess data.