org.apache.pig.builtin
Class VALUELIST
java.lang.Object
org.apache.pig.EvalFunc<DataBag>
org.apache.pig.builtin.VALUELIST
public class VALUELIST
extends EvalFunc<DataBag>
This UDF takes a Map and returns a Bag containing the values from the map.
Note that the output bag contains all values, not just unique ones.
To obtain only the unique values from a map, use VALUESET instead.
grunt> cat data
[open#apache,1#2,11#2]
[apache#hadoop,3#4,12#hadoop]
grunt> a = load 'data' as (M:[]);
grunt> b = foreach a generate VALUELIST($0);
grunt> dump b;
({(apache),(2),(2)})
({(4),(hadoop),(hadoop)})
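The difference between keeping all values and keeping only unique ones can be illustrated outside Pig. The following is a minimal plain-Java sketch of the semantics only, not Pig's implementation: the class `ValueListSketch` and the helpers `valueList` and `valueSet` are hypothetical names introduced for illustration, and a `LinkedHashMap` is assumed so that the printed order is predictable. In the grunt output above, the second row lists its values in a different order than the input, so ordering of the real bag should not be relied upon.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class ValueListSketch {

    // Hypothetical helper: collects every map value, duplicates included
    // (the behaviour this page describes for VALUELIST).
    static List<Object> valueList(Map<String, Object> map) {
        return new ArrayList<>(map.values());
    }

    // Hypothetical helper: collects distinct values only
    // (the behaviour this page attributes to VALUESET).
    static Set<Object> valueSet(Map<String, Object> map) {
        return new LinkedHashSet<>(map.values());
    }

    public static void main(String[] args) {
        // Mirrors the first input row: [open#apache,1#2,11#2]
        Map<String, Object> m = new LinkedHashMap<>();
        m.put("open", "apache");
        m.put("1", "2");
        m.put("11", "2");

        System.out.println(valueList(m)); // [apache, 2, 2]
        System.out.println(valueSet(m));  // [apache, 2]
    }
}
```

The duplicate value 2 appears twice in the list and once in the set, matching the first row of the dump output above.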
Methods inherited from class org.apache.pig.EvalFunc
finish, getArgToFuncMapping, getCacheFiles, getInputSchema, getLogger, getPigLogger, getReporter, getReturnType, getSchemaName, getSchemaType, isAsynchronous, progress, setInputSchema, setPigLogger, setReporter, setUDFContextSignature, warn
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
VALUELIST
public VALUELIST()
exec
public DataBag exec(Tuple input)
throws IOException
- Description copied from class: EvalFunc
- This callback method must be implemented by all subclasses. This
is the method that will be invoked on every Tuple of a given dataset.
Since the dataset may be divided up in a variety of ways the programmer
should not make assumptions about state that is maintained between
invocations of this method.
- Specified by: exec in class EvalFunc<DataBag>
- Parameters: input - the Tuple to be processed.
- Returns: result, of type T (here, DataBag).
- Throws: IOException
outputSchema
public Schema outputSchema(Schema input)
- Description copied from class: EvalFunc
- Report the schema of the output of this UDF. Pig will make use of
this in error checking, optimization, and planning. The schema
of input data to this UDF is provided.
The default implementation interprets the OutputSchema annotation,
if one is present. Otherwise, it returns null (no known output schema).
- Overrides: outputSchema in class EvalFunc<DataBag>
- Parameters: input - Schema of the input
- Returns: Schema of the output
Copyright © 2007-2012 The Apache Software Foundation