0.3.150326
- Added support for accessing
PipelineOptionsin the Dataflow worker. - Removed one of the type parameters in
PCollectionView, which may require simple changes to user's code that usesPCollectionView. - Changed side input API to apply per window. Calls to
sideInput()now return values only in the specific window corresponding to the window of the main input element, and not the whole side inputPCollectionView. Consequently,sideInput()can no longer be called fromstartBundleandfinishBundleof aDoFn. - Added support for viewing a
PCollectionas aMapwhen used as a side input. SeeView.asMap(). - Renamed custom source API to use term "bundle" instead of "shard" in all names. Additionally, term "fork" is replaced with "dynamic split".
- Custom source
Readernow requires implementing new methodstart(). Existing code can be fixed by simply adding this method that just callsadvance()and returns its value. Additionally, code that uses theReadershould be updated to use bothstart()andadvance(), instead ofadvance()only.
0.3.150227
- Initial Alpha version of the Dataflow SDK for Java with support for streaming pipelines.
- Added determinism checker in
AvroCoderto make it easier to interoperate withGroupByKey. - Added support for accessing
PipelineOptionsin the worker. - Added support for compressed sources.
0.3.150211
- Removed the dependency on the
gcloud corecomponent version 2015.02.05 or newer.
0.3.150210
Caution: depends on thegcloud core component version 2015.02.05 or
newer.
- Included streaming pipeline runner, which, for now, requires additional whitelisting.
- Renamed several windowing-related APIs in a non-backward-compatible way.
- Added support for custom sources, which you can use to read from your own input formats.
- Introduced worker parallelism: one task per processor.
0.3.150109
- Fixed several platform-specific issues for Microsoft Windows.
- Fixed several Java 8-specific issues.
- Added a few new examples.
0.3.141216
- Initial Alpha version of the Dataflow SDK for Java.