Hortonworks DataFlow (HDF) is the first integrated platform that solves the real time challenges of collecting and transporting data from many sources and provides interactive command and control of live flows with full and automated data provenance.