public abstract class AbstractColumnProcessor<T extends Context> extends java.lang.Object implements Processor<T>, ColumnReader<java.lang.String>
Processor
implementation that stores values of columns.
Values parsed in each row will be split into columns of Strings. Each column has its own list of values.
At the end of the process, the user can access the lists with values parsed for all columns using the methods getColumnValuesAsList()
,
getColumnValuesAsMapOfIndexes()
and getColumnValuesAsMapOfNames()
.
Note: Storing the values of all columns may be memory intensive. For large inputs, use a AbstractBatchedColumnProcessor
instead
AbstractParser
,
Processor
,
ColumnReader
Modifier and Type | Field and Description |
---|---|
private ColumnSplitter<java.lang.String> |
splitter |
Constructor and Description |
---|
AbstractColumnProcessor()
Constructs a column processor, pre-allocating room for 1000 rows.
|
AbstractColumnProcessor(int expectedRowCount)
Constructs a column processor pre-allocating room for the expected number of rows to be processed
|
Modifier and Type | Method and Description |
---|---|
java.util.List<java.lang.String> |
getColumn(int columnIndex)
Returns the values of a given column.
|
java.util.List<java.lang.String> |
getColumn(java.lang.String columnName)
Returns the values of a given column.
|
java.util.List<java.util.List<java.lang.String>> |
getColumnValuesAsList()
Returns the values processed for each column
|
java.util.Map<java.lang.Integer,java.util.List<java.lang.String>> |
getColumnValuesAsMapOfIndexes()
Returns a map of column indexes and their respective list of values parsed from the input.
|
java.util.Map<java.lang.String,java.util.List<java.lang.String>> |
getColumnValuesAsMapOfNames()
Returns a map of column names and their respective list of values parsed from the input.
|
java.lang.String[] |
getHeaders()
Returns the column headers.
|
void |
processEnded(T context)
This method will by invoked by the parser once, after the parsing process stopped and all resources were closed.
|
void |
processStarted(T context)
This method will by invoked by the parser once, when it is ready to start processing the input.
|
void |
putColumnValuesInMapOfIndexes(java.util.Map<java.lang.Integer,java.util.List<java.lang.String>> map)
Fills a given map associating each column index to its list of values
|
void |
putColumnValuesInMapOfNames(java.util.Map<java.lang.String,java.util.List<java.lang.String>> map)
Fills a given map associating each column name to its list o values
|
void |
rowProcessed(java.lang.String[] row,
T context)
Invoked by the parser after all values of a valid record have been processed.
|
private final ColumnSplitter<java.lang.String> splitter
public AbstractColumnProcessor()
public AbstractColumnProcessor(int expectedRowCount)
expectedRowCount
- the expected number of rows to be processedpublic void processStarted(T context)
Processor
processStarted
in interface Processor<T extends Context>
context
- A contextual object with information and controls over the current state of the parsing processpublic void rowProcessed(java.lang.String[] row, T context)
Processor
rowProcessed
in interface Processor<T extends Context>
row
- the data extracted by the parser for an individual record. Note that:
CommonSettings.setSkipEmptyLines(boolean)
Format.setComment(char)
to '\0'context
- A contextual object with information and controls over the current state of the parsing processpublic void processEnded(T context)
Processor
It will always be called by the parser: in case of errors, if the end of the input us reached, or if the user stopped the process manually using Context.stop()
.
processEnded
in interface Processor<T extends Context>
context
- A contextual object with information and controls over the state of the parsing processpublic final java.lang.String[] getHeaders()
ColumnReader
CommonSettings.getHeaders()
or the headers parsed in
the input when CommonSettings.getHeaders()
equals to true
getHeaders
in interface ColumnReader<java.lang.String>
public final java.util.List<java.util.List<java.lang.String>> getColumnValuesAsList()
ColumnReader
getColumnValuesAsList
in interface ColumnReader<java.lang.String>
public final void putColumnValuesInMapOfNames(java.util.Map<java.lang.String,java.util.List<java.lang.String>> map)
ColumnReader
putColumnValuesInMapOfNames
in interface ColumnReader<java.lang.String>
map
- the map to hold the values of each columnpublic final void putColumnValuesInMapOfIndexes(java.util.Map<java.lang.Integer,java.util.List<java.lang.String>> map)
ColumnReader
putColumnValuesInMapOfIndexes
in interface ColumnReader<java.lang.String>
map
- the map to hold the values of each columnpublic final java.util.Map<java.lang.String,java.util.List<java.lang.String>> getColumnValuesAsMapOfNames()
ColumnReader
getColumnValuesAsMapOfNames
in interface ColumnReader<java.lang.String>
public final java.util.Map<java.lang.Integer,java.util.List<java.lang.String>> getColumnValuesAsMapOfIndexes()
ColumnReader
getColumnValuesAsMapOfIndexes
in interface ColumnReader<java.lang.String>
public java.util.List<java.lang.String> getColumn(java.lang.String columnName)
ColumnReader
getColumn
in interface ColumnReader<java.lang.String>
columnName
- the name of the column in the input.public java.util.List<java.lang.String> getColumn(int columnIndex)
ColumnReader
getColumn
in interface ColumnReader<java.lang.String>
columnIndex
- the position of the column in the input (0-based).