12.11.2014

Java 8 Collectors for Guava Collections

Java 8 comes with a streaming API that divides data processing into two phases: intermediate operations and a terminal operation. The terminal operation can take a few different forms; I would like to concentrate on reduction to collections, especially to Guava immutable collections. Such a terminal operation requires a collector that gathers the data and returns it in the required structure, but Guava does not provide such a collector. To create a Guava collection out of a stream, we first have to reduce the stream result into a temporary collection and then transfer it:

github:5f5da327891f8a50951a

The reduction of our stream stores the results in a temporary List (Collectors.toList()). Once stream processing is done, the finisher function converts the content of this List into a Guava collection (ImmutableSortedSet::copyOf).
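
To make the two steps visible, here is a minimal sketch of this approach. It is not the gist itself: so that it compiles without Guava on the classpath, the hypothetical helper copyToSortedSet stands in for ImmutableSortedSet.copyOf, and the class and method names are made up for illustration.

```java
import java.util.Collections;
import java.util.List;
import java.util.SortedSet;
import java.util.TreeSet;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class TwoStepReduction {

    // Hypothetical stand-in for ImmutableSortedSet.copyOf(list):
    // copies the temporary List into a sorted, read-only set.
    static <T extends Comparable<T>> SortedSet<T> copyToSortedSet(List<T> list) {
        return Collections.unmodifiableSortedSet(new TreeSet<>(list));
    }

    static SortedSet<String> collect(Stream<String> stream) {
        return stream.collect(
                Collectors.collectingAndThen(
                        Collectors.toList(),                  // step 1: reduce into a temporary List
                        TwoStepReduction::copyToSortedSet));  // step 2: finisher copies the List
    }

    public static void main(String[] args) {
        // prints [apple, fig, pear]
        System.out.println(collect(Stream.of("pear", "apple", "fig", "apple")));
    }
}
```

With Guava available, the finisher would simply be ImmutableSortedSet::copyOf, as described above.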

The problem with this approach is that we have an extra conversion loop and two arrays in memory (the List and the Builder). This could be avoided if we had a collector based on Guava’s Builder. So I’ve implemented one; once we use it, the code above can be simplified to:

github:cfff2ffa9859546122bf

The code is straightforward, so let’s concentrate on the implementation of #toNaturalImmutableSortedSet():

github:e47872f914cb005ab71a

Our collector is created by the factory method Collector#of, which takes four arguments:

  • #supplier – this function creates the structure that will collect the stream results; for a sequential stream it is called only once. In our case it’s the Builder from ImmutableSortedSet
  • #accumulator – provides the function that is executed for each element that reaches the terminal operation, meaning each element that went through the stream and should be collected for returning. In our case we provide a function that calls #add(v) on the Builder created by the first argument (#supplier)
  • #combiner – this one is not used in our example, but it’s necessary for processing parallel streams, where it merges the partial results
  • #finisher – this is the final step, executed after stream processing is done. The elements returned by the stream are contained in the Builder (#supplier), and in this last phase we call the #build() method on it, which results in an ImmutableSortedSet!
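
The four arguments can be sketched as follows. This is only an illustration under assumptions, not the code from the gist: SortedSetBuilder is a hypothetical stand-in for ImmutableSortedSet.Builder (so the example runs with the JDK alone), and toSortedSet is a made-up name.

```java
import java.util.Collections;
import java.util.SortedSet;
import java.util.TreeSet;
import java.util.stream.Collector;
import java.util.stream.Stream;

public class BuilderCollectorSketch {

    // Hypothetical stand-in for ImmutableSortedSet.Builder: add elements, then build.
    static final class SortedSetBuilder<T extends Comparable<T>> {
        private final TreeSet<T> elements = new TreeSet<>();

        SortedSetBuilder<T> add(T element) {
            elements.add(element);
            return this;
        }

        // Merges the elements of another builder into this one.
        SortedSetBuilder<T> addAll(SortedSetBuilder<T> other) {
            elements.addAll(other.elements);
            return this;
        }

        SortedSet<T> build() {
            return Collections.unmodifiableSortedSet(new TreeSet<>(elements));
        }
    }

    static <T extends Comparable<T>> Collector<T, SortedSetBuilder<T>, SortedSet<T>> toSortedSet() {
        return Collector.of(
                SortedSetBuilder::new,     // supplier: creates the builder
                SortedSetBuilder::add,     // accumulator: called for each element
                SortedSetBuilder::addAll,  // combiner: merges partial builders (parallel streams)
                SortedSetBuilder::build);  // finisher: turns the builder into the result
    }

    public static void main(String[] args) {
        SortedSet<Integer> sequential = Stream.of(3, 1, 2, 3).collect(toSortedSet());
        SortedSet<Integer> parallel = Stream.of(3, 1, 2, 3).parallel().collect(toSortedSet());
        System.out.println(sequential); // prints [1, 2, 3]
        System.out.println(parallel);   // prints [1, 2, 3]
    }
}
```

With Guava, the same four slots could presumably be filled with ImmutableSortedSet::naturalOrder, ImmutableSortedSet.Builder::add, a lambda such as (left, right) -> left.addAll(right.build()), and ImmutableSortedSet.Builder::build.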

Based on this pattern we can implement other collectors:

github:26282bfa98bad31bbc94
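
As one illustration of how the pattern carries over, here is a sketch of a map collector. Which collectors the gist actually contains is not assumed here; the name toUnmodifiableMapSketch is hypothetical, and a LinkedHashMap plus Collections.unmodifiableMap stand in for ImmutableMap.Builder so that the example needs only the JDK.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;
import java.util.stream.Collector;
import java.util.stream.Stream;

public class MapCollectorSketch {

    static <T, K, V> Collector<T, Map<K, V>, Map<K, V>> toUnmodifiableMapSketch(
            Function<? super T, ? extends K> keyMapper,
            Function<? super T, ? extends V> valueMapper) {
        return Collector.of(
                // supplier: the mutable container playing the Builder role
                LinkedHashMap::new,
                // accumulator: derive key and value from each element
                // (this stand-in overwrites duplicate keys; ImmutableMap.Builder rejects them on build)
                (map, element) -> map.put(keyMapper.apply(element), valueMapper.apply(element)),
                // combiner: merge partial maps of a parallel stream
                (left, right) -> { left.putAll(right); return left; },
                // finisher: wrap the result as read-only
                Collections::unmodifiableMap);
    }

    public static void main(String[] args) {
        Map<String, Integer> lengths = Stream.of("a", "bb", "ccc")
                .collect(toUnmodifiableMapSketch(s -> s, String::length));
        System.out.println(lengths); // prints {a=1, bb=2, ccc=3}
    }
}
```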

Finally, here is the source code: Gullectors.java, and the unit tests: TestGullectors.java

Happy collecting!

Maciej
Lean Java Expert