Data manipulation

Basic
Grouping
Sort & Select
Table Manipulation
Statistics
Statistical tests
Position
Processing
Curve Fitting
Detect
Sequences
Python
- Create Column
- Create Table
JavaScript
- JS Create Column
- JS Create Table

Data manipulation > Basic > Accumulate Records

Measurement on the whole image file is performed frame-by-frame (volume-by-volume). This action can be used once or multiple times to obtain records (table rows) from all frames (or volumes) in the selected loop. By default (without accumulate), records are generated per frame (or per volume) for performance reasons. To work on a larger set of records (sorting, filtering, calculating statistics, ...), use accumulate.

Accumulate loop

Which loop to accumulate. Select All Loops to process all data at once.

Apply only on current frame in preview

Action is applied just on the current frame.

Data manipulation > Basic > Append Columns

Columns from the input tables are appended in the order in which they appear. This is useful when joining several features measured on the same number of objects. The number of records is expected to be the same across all input tables.

Data manipulation > Basic > Calculated Column

This node simply works as a calculator with pre-defined operators. Existing columns or their statistics (see Data manipulation > Grouping > Aggregate Rows ) can be used as variables. Type of the calculated column and Unit should be set first. Once the calculation is written, confirm it by clicking Apply.

Data manipulation > Basic > Modify Columns

Selects which features are displayed in the resulting table. You can change their order using the arrows and change their name (New Title), number format (New Format) and Precision.

Note

The Modify Table Columns dialog having the same functionality as this node can be opened directly from Analysis Results by clicking .

Data manipulation > Basic > Reduce Records

Groups records and computes the selected statistic of rows. If the table is grouped the statistic is calculated for each group. Each group of rows becomes one row in the output table.

For more information about grouping see Data manipulation > Grouping > Group Records , for more information about statistics see Data manipulation > Grouping > Aggregate Rows .

Data manipulation > Basic > Scale Column Data

This node is used for changing units, scaling values and similar tasks. Select the Column to be changed, define the new Unit (NewVal example is shown below) and set the Offset or Gain value. No new column is created; only the values in the source column are scaled.
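The effect of Offset and Gain can be pictured as a linear rescaling of every value. The following is only a sketch: the function name `scale_column` is hypothetical, and the order of operations (multiply by the gain, then add the offset) is an assumption, not something this page states.

```python
def scale_column(values, gain=1.0, offset=0.0):
    """Return gain * value + offset for every non-empty value (assumed order)."""
    return [None if v is None else gain * v + offset for v in values]

# Unit change from degrees Celsius to degrees Fahrenheit
celsius = [0.0, 100.0, None]
fahrenheit = scale_column(celsius, gain=1.8, offset=32.0)
print(fahrenheit)
```

Empty (None) values are passed through unchanged in this sketch.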

Data manipulation > Grouping > Aggregate Rows

Aggregate Rows computes the selected statistic of rows. If the table is grouped the statistic is calculated for each group. Each group of rows becomes one row in the output table.

Table 27. Aggregation Statistics

None	Any value (typically first).
Total	Number of values including empty (null).
Count	Number of values not including empty (nulls).*
Distinct	Number of distinct (unique) values.
Mean	Mean value. SUM(values)/N.*
Median	Middle value of all ordered values.*
Sum	All values added.*
Min	Minimum value.*
Max	Maximum value.*
StDev.P	Population Standard deviation.*
StDev.S	Sample Standard deviation.*
Var.P	Population Variance.*
Var.S	Sample Variance.*
VarCoef.P	Population Coefficient of Variation.*
VarCoef.S	Sample Coefficient of Variation.*
StErr.P	Population Standard Error.*
StErr.S	Sample Standard Error.*
Skewness.P	Population Skewness.*
Skewness.S	Sample Skewness.*
Kurtosis.P	Population Kurtosis.*
Kurtosis.S	Sample Kurtosis.*
RootMeanSquare.P	Population Root Mean Square.*
RootMeanSquare.S	Sample Root Mean Square.*
Coalesce	First not empty (non-null) value.
LastMinusFirst	Last value minus first (values[N-1] - values[0])
First	First value (values[0])
Last	Last value. (values[N-1])
Mode	Mode.*
Entropy	Entropy.*
HistoUniformity	Uniformity of histogram (link to Measurement - Uniformity).*
QuartileQ1	First Quartile.*
QuartileQ3	Third Quartile.*
PercentileP01	1st Percentile.*
PercentileP05	5th Percentile.*
PercentileP10	10th Percentile.*
PercentileP90	90th Percentile.*
PercentileP95	95th Percentile.*
PercentileP99	99th Percentile.*

* Does not include empty (null) values.
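A few of the statistics above can be illustrated with the Python standard library. This is a sketch on made-up data, with None standing in for an empty (null) value; it shows the difference between Total and Count and between the population (.P) and sample (.S) variants.

```python
import statistics

values = [2.0, 4.0, None, 4.0, 6.0]
valid = [v for v in values if v is not None]  # statistics marked * ignore nulls

stats = {
    "Total": len(values),                    # includes the empty value
    "Count": len(valid),                     # excludes the empty value
    "Mean": statistics.fmean(valid),
    "Median": statistics.median(valid),
    "StDev.P": statistics.pstdev(valid),     # divides by N
    "StDev.S": statistics.stdev(valid),      # divides by N - 1
    "Coalesce": next(v for v in values if v is not None),
}
print(stats)
```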

Data manipulation > Grouping > Filter Groups

Filters whole groups (selected in the Column) based on the selected Statistics per group. Define the filtering using the Comparator drop-down menu and the Value edit box.

This node uses aggregation statistics. Please see Data manipulation > Grouping > Aggregate Rows .

Data manipulation > Grouping > Group Records

All rows are in one group by default. Rows with equal values in the given column (selected in this node) form a group, which is visualized in the table. Select the feature(s) in the Group by drop-down menu. To remove a grouping feature, switch it back to “---”.

Grouping is used with statistics (Aggregate Rows) for calculating per group (BinaryLayer, ZStack, Object...) aggregates. The only limitation is that the result of the grouping is currently not visible in the preview and in the final result.
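Grouping followed by an aggregation can be sketched in plain Python as follows. The helper `aggregate_by` and the sample rows are hypothetical; the point is only that each group of rows collapses into one output row.

```python
from collections import defaultdict
from statistics import fmean

rows = [
    {"ZStackIndex": 1, "EqDiameter": 10.0},
    {"ZStackIndex": 1, "EqDiameter": 14.0},
    {"ZStackIndex": 2, "EqDiameter": 8.0},
]

def aggregate_by(rows, group_by, column, statistic):
    """Group rows by one column and reduce another column per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_by]].append(row[column])
    # One output row per group, in order of first appearance
    return [{group_by: key, column: statistic(vals)} for key, vals in groups.items()]

result = aggregate_by(rows, "ZStackIndex", "EqDiameter", fmean)
print(result)
```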

Data manipulation > Grouping > Ungroup Records

Previously grouped records are ungrouped by this action.

Data manipulation > Sort & Select > Current Records

Filters records associated with the current frame. This node is useful only when the connected table is accumulated.

Data manipulation > Sort & Select > Filter Records

This action can be used to filter table records (rows) using a selected column. Resulting table contains only the rows satisfying the filter condition applied to the given column.

Column

Column to be filtered.

Comparator

Comparison operator.

Value

Comparison operand.

Example 10. Filtering out Spots not having any Cell as a parent.

Example 11. Filter Records action used in series to act as an intersection. Both conditions (10 <= EqDiameter AND 0.9 <= Circularity) must be met at the same time.

Example 12. Filter Records action used in parallel to act as a union. One or both conditions (10 <= EqDiameter OR 0.9 <= Circularity) must be true.
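The logic of the series and parallel arrangements can be sketched in plain Python. The rows and the helper `filter_records` are hypothetical; removing duplicate objects when the two parallel branches are merged is an assumption made here for clarity.

```python
rows = [
    {"ObjectId": 1, "EqDiameter": 12.0, "Circularity": 0.95},
    {"ObjectId": 2, "EqDiameter": 12.0, "Circularity": 0.50},
    {"ObjectId": 3, "EqDiameter": 5.0,  "Circularity": 0.95},
    {"ObjectId": 4, "EqDiameter": 5.0,  "Circularity": 0.50},
]

def filter_records(rows, column, threshold):
    """Keep rows where threshold <= row[column]."""
    return [r for r in rows if threshold <= r[column]]

# In series: the second filter sees only the output of the first (AND)
series = filter_records(filter_records(rows, "EqDiameter", 10), "Circularity", 0.9)

# In parallel: both filters see the full table, results are merged (OR)
branch_a = filter_records(rows, "EqDiameter", 10)
branch_b = filter_records(rows, "Circularity", 0.9)
seen, parallel = set(), []
for r in branch_a + branch_b:
    if r["ObjectId"] not in seen:
        seen.add(r["ObjectId"])
        parallel.append(r)

print([r["ObjectId"] for r in series])
print([r["ObjectId"] for r in parallel])
```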

Data manipulation > Sort & Select > Pivot Table

This node “pivots” the input table by the specified Pivot Column (“ObjectId” in the example below), thus it creates a new column for each “ObjectId” value. The new column will contain values of columns having a column Role in the bottom definition table (“Entity” and “EqDiameter” in the example below). E.g. in the second column of the results table, the “EqDiameter” values for objects with ID==1 are arranged in rows by the “ZStackIndex”. The number of rows depends on the possible combinations of all values of columns marked with the role Row. Each column will be present for the first number of values specified in the Column Count edit box. The order of the columns is specified by the Column Order switch. In our example below, Append would add 10x Entity whereas Mix orders them: Entity, EqDia, Entity, EqDia, etc. Column Suffix sets the name of the created column. PIVOT_NAME (“ObjectId” in our example) and PIVOT_VALUE (“ObjectId” value) or any other text can be used here.
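As a rough analogy, the same reshaping can be written with pandas. This is a sketch only: the sample data and the column naming pattern are made up, and pandas `pivot` does not reproduce the Column Count or Column Order options of the node.

```python
import pandas as pd

table = pd.DataFrame({
    "ZStackIndex": [1, 1, 2, 2],     # role: Row
    "ObjectId":    [1, 2, 1, 2],     # pivot column
    "EqDiameter":  [10.0, 7.5, 11.0, 8.0],  # role: Column
})

# One new column per ObjectId value, one row per ZStackIndex value
pivoted = table.pivot(index="ZStackIndex", columns="ObjectId", values="EqDiameter")

# Hypothetical naming in the spirit of PIVOT_NAME / PIVOT_VALUE
pivoted.columns = [f"EqDiameter_ObjectId_{object_id}" for object_id in pivoted.columns]
print(pivoted)
```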

Data manipulation > Sort & Select > Select First & Last

Shows the first and last record from the connected table.

Data manipulation > Sort & Select > Select Records

Select Records action takes a specified number of records (Count) from a selected starting row (First Row ). Typical use-case is to select the first row of a sorted record set.

Data manipulation > Sort & Select > Select Top

Reorders the records based on a given column and then shows the defined number of records (either smallest or biggest). This node is a combination of Data manipulation > Sort & Select > Sort Records and Data manipulation > Sort & Select > Select Records .

Data manipulation > Sort & Select > Sort Records

Sort action reorders records so that the selected column is sorted in an ascending order.

Data manipulation > Table Manipulation > Append Records

Appends records (rows) from input tables one by one into the output table. If an input table contains a column already present (has the same column ID) in the output table (from the preceding input tables), it will use it (i.e. not append a new column).

The resulting table is ordered by the identification columns.

Column ID
Every column has an implicit ID (invisible to the user) given to it by the node that creates the column. The ID is used internally to reference columns. Therefore, even after a column is renamed it is still correctly pointed to by subsequent nodes. Consequently, columns are considered “same” if they have the same ID. Special identification columns like Loop indexes, Object Entity, Object IDs are given the same ID for each special column. Therefore the ObjectID column will be considered the same from all tables. This behavior is usually expected. If not, it can be altered with these nodes: Data manipulation > Table Manipulation > Copy Column ID , Data manipulation > Table Manipulation > New Column ID and Data manipulation > Table Manipulation > Compact Columns .

Appending records is useful when joining two tables with the same columns, for example when joining two disjoint filtrations or aggregations.

Data manipulation > Table Manipulation > Compact Columns

Aggregates all columns with the same title (or title different just in numerical suffix) to the first column using the First Valid rule.

This is useful when Data manipulation > Table Manipulation > Join Records or Data manipulation > Table Manipulation > Append Records produces duplicate columns because of their different IDs (please see Data manipulation > Table Manipulation > Append Records for more on Column IDs).

For more control on which columns will merge use Copy Column ID.

The following example demonstrates how to force two different columns to merge into single one using Data manipulation > Table Manipulation > Append Records .

After append records there are two columns “MeanObjectIntensity” and “MeanObjectIntensity2” that need to be merged into one.

Data manipulation > Table Manipulation > Copy Column ID

The resulting table contains all columns from table A with specified columns IDs replaced with IDs from reference table B.

This is useful when tables A and B are intended to be merged using Data manipulation > Table Manipulation > Join Records or Data manipulation > Table Manipulation > Append Records node and they contain columns with different IDs (please see Data manipulation > Table Manipulation > Append Records for more on Column IDs) that should be treated as same column.

There is a simpler automatic way using Data manipulation > Table Manipulation > Compact Columns .

The following example demonstrates how to force two different columns to merge into a single one using Data manipulation > Table Manipulation > Append Records .

Data manipulation > Table Manipulation > Duplicate Column

Duplicates the selected column.

Data manipulation > Table Manipulation > Join Records

This action is useful for merging incompatible tables:

frames and objects
cells (objects) and nuclei (objects)

It joins records from two or more unrelated (with different number of rows and columns) record sets. Resulting table contains a union of all distinct input table columns (loop indexes, entity and Object Ids are considered the same).

Select the type of join (see below) and click to add a table row where you specify the relation between the joined features taken from two different tables.

Inner Join

Inner Join outputs a Cartesian product of the related rows (rows where “using columns” have same value):

CountRows(R) = CountRows(A) × CountRows(B) × CountRows(C) × ...,

where A, B, C, ... are input tables and R is an output table.

A related row (e.g. ZStackIndex = 3) must be present in all tables.

Left Join

Left Join is an inner join plus all rows of the leftmost table that have no related rows in the tables to the right. All rows from the leftmost table (A) are in the result table (R).

Right Join

Right Join is the same as the Left Join but with reversed input cables (... C, B, A).

Outer Join

Outer join is a union of the left and right join.

Please see Join for more information.
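The four join types behave like the standard relational joins, which can be tried out with pandas `merge`. The tables below are hypothetical and the sketch only mirrors the row-count behaviour described above; it does not model Column IDs.

```python
import pandas as pd

# Table A: one row per frame; table B: one row per object
frames = pd.DataFrame({"ZStackIndex": [1, 2, 3],
                       "MeanIntensity": [101.0, 98.5, 110.2]})
objects = pd.DataFrame({"ZStackIndex": [2, 3, 3, 4],
                        "ObjectId": [1, 1, 2, 1]})

joined = {how: frames.merge(objects, on="ZStackIndex", how=how)
          for how in ("inner", "left", "right", "outer")}
row_counts = {how: len(table) for how, table in joined.items()}
print(row_counts)
```

For ZStackIndex = 3 the inner join yields 1 × 2 = 2 rows, which is the per-key Cartesian product. Index 1 survives only in the left and outer joins, index 4 only in the right and outer joins.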

Data manipulation > Table Manipulation > New Column ID

Makes the specified columns IDs unique (please see Data manipulation > Table Manipulation > Append Records for more on Column IDs).

It is useful when nodes like Data manipulation > Basic > Append Columns , Data manipulation > Table Manipulation > Join Records and others treat a column coming from two different tables as the same one and do not include it twice. This is typically because both columns were made by one node or it is a “system” column like “Loop Index Columns”, “Entity” or “Object ID”.

Without the Data manipulation > Table Manipulation > New Column ID node both source columns are merged (overwritten) into one.

Data manipulation > Table Manipulation > Shift Records

Shifts data in selected columns by given number of rows.

Fill

Empty

Rows which are empty after shift are left empty.

Original

Rows which are empty after shift are filled with original values.

Cycle

Rows which are empty after shift are filled with values of rows on the other end of columns.
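The three Fill modes can be sketched as follows. The function `shift_column` is hypothetical, and treating a positive shift as moving data towards later rows is an assumption.

```python
def shift_column(values, shift, fill="Empty"):
    """Shift values by `shift` rows; handle vacated rows according to `fill`."""
    n = len(values)
    shifted = []
    for i in range(n):
        source = i - shift  # row whose value lands on row i
        if 0 <= source < n:
            shifted.append(values[source])
        elif fill == "Original":
            shifted.append(values[i])           # keep the original value
        elif fill == "Cycle":
            shifted.append(values[source % n])  # wrap around the other end
        else:
            shifted.append(None)                # leave empty
    return shifted

data = [1, 2, 3, 4]
print(shift_column(data, 1, "Empty"))
print(shift_column(data, 1, "Original"))
print(shift_column(data, 1, "Cycle"))
```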

Data manipulation > Statistics > Aggregate Columns

Aggregate Columns action creates a new column with the given name (Title) in which the result of a calculation is placed (Aggregation). The calculation is based on the checked parameters.

Title

Name of the new column.

Aggregation

Statistics to be computed.

Column

Checked columns will be used for calculation.

This node uses aggregation statistics. Please see Data manipulation > Grouping > Aggregate Rows .

Data manipulation > Statistics > Binning

Creates new bin column and assigns each row to one bin based on the value of the selected column. Then the rows can be grouped based on the bins.

Source column

Column for binning.

New column

Name of the new bin column.

Column Type

Sets the column type either to a Number or a String (sequence of characters).

Unit

Unit of the new bin column.

Add bin

Adds a new table row (one bin).

Remove selected bin

Deletes a row record (one bin).

Remove all bins

Clears the table (deletes all bins).

Generate bins automatically

Opens the Generate Bins window used to calculate linear or logarithmic bins based on the bin width. Set the lower starting value of the bin set (Min), then the higher ending value (Max) and enter the width of the bin. Choose whether the generated bins will be Linear or Logarithmic with a specified base. Set the Label for each newly generated bin. Enter “lo” to start from the lowest generated value or “hi” to start from the lowest generated value + width. Any other mathematical combination of “lo” and “hi” is also possible (see the examples below).

Validate bins

If the bins are created improperly, this function automatically corrects the classes so that they follow each other.

Copy to clipboard

Copies the full table to clipboard.

Paste from clipboard

Inserts the table data from clipboard into the frequency table.

lo, hi, value

Fill in the table so that lo and hi values represent the bin size and value sets the user defined label to each bin. When lo or hi is empty (null), it is interpreted as negative infinity or positive infinity respectively. Bins should be exclusive otherwise the behavior is not defined.
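The lookup described by the lo, hi, value table can be sketched like this. The bins and the helper `assign_bin` are hypothetical; treating lo as inclusive and hi as exclusive is an assumption borrowed from the Min (incl.) / Max (excl.) convention of the simple binning node.

```python
import math

# (lo, hi, value); None means "open ended"
bins = [
    (None, 10.0, "small"),
    (10.0, 20.0, "medium"),
    (20.0, None, "large"),
]

def assign_bin(x, bins):
    """Return the label of the first bin with lo <= x < hi, or None."""
    for lo, hi, label in bins:
        lower = -math.inf if lo is None else lo
        upper = math.inf if hi is None else hi
        if lower <= x < upper:
            return label
    return None

print([assign_bin(x, bins) for x in (5.0, 10.0, 25.0)])
```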

Data manipulation > Statistics > Binning (simple)

Creates a new column and fills it with bin labels to which the value in the source column falls into. Bins are equidistant intervals between min and max.

New column name

Name of the new column.

Source column

Column for binning.

Min (incl.)

Minimal bin value.

Max (excl.)

Maximal bin value.

Count of bins

Number of bins.

Class label

Sets how each bin is labeled (Start point: min of each bin, Middle point: middle of each bin, Bin class Id: bin index starting from 1).
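Equidistant binning with the three labeling options can be sketched as below. The function `simple_bin` is hypothetical, and returning an empty value for inputs outside the range from Min to Max is an assumption.

```python
def simple_bin(x, min_value, max_value, bin_count, label="Bin class Id"):
    """Label x by its equidistant bin in [min_value, max_value)."""
    if not (min_value <= x < max_value):
        return None  # assumed behaviour for out-of-range values
    width = (max_value - min_value) / bin_count
    # min() guards against floating-point rounding at the upper edge
    index = min(int((x - min_value) / width), bin_count - 1)
    if label == "Start point":
        return min_value + index * width
    if label == "Middle point":
        return min_value + (index + 0.5) * width
    return index + 1  # bin index starting from 1

print(simple_bin(25, 0, 100, 10))
print(simple_bin(25, 0, 100, 10, "Start point"))
print(simple_bin(25, 0, 100, 10, "Middle point"))
```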

Data manipulation > Statistics > Frequency Table

This table can be used to classify the results data to see the number of elements in each defined class. Select the Source column, name the New column and add units (Unit). Fill in the table so that From and To values represent the bin size (range of source values) which will be substituted by the new value specified in the third column.

Other tools are similar to the Data manipulation > Statistics > Binning node.

Data manipulation > Statistics > Generate Distribution

(requires: Local Option)

Creates a table with points of the Probability density function or the Cumulative distribution function. Select the Distribution, its Parameters, left and right interval endpoints and Step. If checked, y-values of the Critical Region are generated to a separate column. Tested Value is generated to a separate column as well.

Data manipulation > Statistics > Statistics

Creates a summary statistics table, as seen in the example below.

This node uses aggregation statistics. Please see Data manipulation > Grouping > Aggregate Rows .

Data manipulation > Statistical tests > ANOVA One-way

(requires: Local Option)

Performs One-way ANOVA (analysis of variance). Please see one-way ANOVA.

Select a column with Sample Data and parameters of the test. The table must be grouped. Every group is one factor (i.e. treatment group). The output results of the analysis are displayed in a one row table.

Working example from a Wikipedia page:

Create a table with the example data and group it by the “group” column.
Figure 916. Analysis definition.
Figure 917. JS Create Table node.
```
return [
[ 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 3 ],
[ 6, 8, 4, 5, 3, 4, 8, 12, 9, 11, 6, 8, 13, 9, 11, 8, 7, 12 ]
];
```
Figure 918. Grouped data.
Set up the ANOVA and get the results.
Figure 919. ANOVA One-way node.
Figure 920. Results.
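To cross-check the F statistic for the example data outside the application, the computation can be reproduced with the Python standard library. This sketch covers only the F statistic and the degrees of freedom; it does not compute the p-value or model the node's other outputs.

```python
from statistics import fmean

# Same data as in the JS Create Table node, keyed by the "group" column
groups = {
    1: [6, 8, 4, 5, 3, 4],
    2: [8, 12, 9, 11, 6, 8],
    3: [13, 9, 11, 8, 7, 12],
}

all_values = [v for vals in groups.values() for v in vals]
grand_mean = fmean(all_values)

# Between-group and within-group sums of squares
ss_between = sum(len(v) * (fmean(v) - grand_mean) ** 2 for v in groups.values())
ss_within = sum((x - fmean(v)) ** 2 for v in groups.values() for x in v)

df_between = len(groups) - 1
df_within = len(all_values) - len(groups)
f_statistic = (ss_between / df_between) / (ss_within / df_within)
print(df_between, df_within, round(f_statistic, 3))
```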

Data manipulation > Statistical tests > F-test

(requires: Local Option)

This test can be used to test:

the hypothesis that the means of a given set of normally distributed populations, all having the same standard deviation, are equal.
the hypothesis that a proposed regression model fits the data well.
the hypothesis that two normal populations have the same variance.
Please see F-test.

Select Sample A from table A, Sample B from table B and the parameters of the test. If the tables are grouped, the count of groups in each table must be the same. The F-test is done for each pair of groups (i-th group in table A and i-th group in table B). The output is a table with one row of data for each group pair.

Data manipulation > Statistical tests > t-test 1s

(requires: Local Option)

This is a location test of whether the mean of a population has a value specified in a null hypothesis (please see t-test).

Select Sample and parameters of test. If table is grouped, then the t-test is done for each group. Output is a table with one row of data for each group.

Data manipulation > Statistical tests > t-test 2s pair

(requires: Local Option)

Paired samples t-tests typically consist of a sample of matched pairs of similar units, or one group of units that has been tested twice (a "repeated measures" t-test). A typical example of the repeated measures t-test would be where subjects are tested prior to a treatment, say for high blood pressure, and the same subjects are tested again after treatment with a blood-pressure-lowering medication. By comparing the same patient's numbers before and after treatment, we are effectively using each patient as their own control (please see t-test).

Select Sample A, Sample B and parameters of test. If table is grouped, then the t-test is done for each group. Output is a table with one row of data for each group.

Data manipulation > Statistical tests > t-test 2s unpair

(requires: Local Option)

The independent (unpaired) samples t-test is used when two separate sets of independent and identically distributed samples are obtained, one from each of the two populations being compared. For example, suppose we are evaluating the effect of a medical treatment, and we enroll 100 subjects into our study, then randomly assign 50 subjects to the treatment group and 50 subjects to the control group. In this case, we have two independent samples and would use the unpaired form of the t-test (please see t-test).

Select Sample A from table A, Sample B from table B and parameters of test. If tables are grouped, then count of the groups in each table must be the same. t-test is done for each pair of groups (i-th group in table A and i-th group in table B). Output is a table with one row of data for each group pair.

Data manipulation > Statistical tests > Z-factor

Calculates Z-factor for positive and negative control groups.

Labels

Select column containing labels (positive, negative and others).

Values

Select column containing measured control values.

Negative Label

Enter label of rows with negative control (in Labels column).

Positive Label

Enter label of rows with positive control (in Labels column).

Z-factor Column

Check to create column with Z-factor value and enter its name.
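The commonly used definition of the Z-factor is 1 - 3(σp + σn) / |μp - μn|, where μ and σ are the mean and standard deviation of the positive and negative controls. The sketch below applies it to made-up data; the function name and the use of the sample standard deviation are assumptions, as this page does not state which variant the node uses.

```python
from statistics import fmean, stdev

def z_factor(labels, values, positive_label, negative_label):
    """Z-factor from rows labeled as positive and negative controls."""
    pos = [v for l, v in zip(labels, values) if l == positive_label]
    neg = [v for l, v in zip(labels, values) if l == negative_label]
    separation = abs(fmean(pos) - fmean(neg))
    return 1.0 - 3.0 * (stdev(pos) + stdev(neg)) / separation

labels = ["pos", "pos", "pos", "neg", "neg", "neg", "sample"]
values = [100.0, 102.0, 98.0, 10.0, 12.0, 8.0, 55.0]
z = z_factor(labels, values, "pos", "neg")
print(round(z, 4))
```

Rows with any other label ("sample" here) are ignored.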

Data manipulation > Position > Addition

Vectorially adds two positions. Z positions are optional. The result is two or three new columns (X, Y, (Z)).

ResX = X0 + X1, ...

Data manipulation > Position > Difference (vector)

Vectorially subtracts two positions in time. Z positions are optional. The result is two or three new columns (X, Y, (Z)).

ResX = X0 - X1, ...

Data manipulation > Position > Distance

Calculates the distance between two points in time. Z positions are optional. The result is a new column with the distance.

Dist = SQRT( (X0 - X1)^2 + (Y0 - Y1)^2 + (Z0 - Z1)^2 )
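The distance formula translates directly into Python. In this sketch positions are given as tuples of two or three coordinates, so the Z term is simply absent in the 2D case; the function name is hypothetical.

```python
import math

def distance(p0, p1):
    """Euclidean distance between two positions (2D or 3D tuples)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p0, p1)))

print(distance((0.0, 0.0), (3.0, 4.0)))
print(distance((1.0, 2.0, 2.0), (0.0, 0.0, 0.0)))
```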

Data manipulation > Position > Optimal path

Sorts given points represented by their positions in the most effective order for scanning.

Data manipulation > Position > Vector Length

Calculates the length of the vector. If the Z (Vector Z) column is left blank, the 2D length is calculated:

Length = SQRT( X^2 + Y^2 )

Data manipulation > Position > Vector Orientation

Calculates the heading of the vector. Heading is the angle in degrees between the positive X axis and the vector, measured counterclockwise, in range <0; 360>. Elevation is calculated as well when the Z (Vector Z) column is given. Elevation is the angle in degrees between the XY plane and the vector, in range <-90; 90>.
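Heading and elevation can be computed with `math.atan2`, as in the sketch below. It assumes mathematical axes (Y pointing up), so the sense of rotation may appear flipped in image coordinates; the function name is hypothetical.

```python
import math

def vector_orientation(x, y, z=None):
    """Return (heading, elevation) in degrees; elevation is None without Z."""
    # atan2 yields (-180, 180]; the modulo maps it to [0, 360)
    heading = math.degrees(math.atan2(y, x)) % 360.0
    if z is None:
        return heading, None
    # Angle between the vector and its projection onto the XY plane
    elevation = math.degrees(math.atan2(z, math.hypot(x, y)))
    return heading, elevation

print(vector_orientation(0.0, -1.0))
print(vector_orientation(1.0, 0.0, 1.0))
```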

Data manipulation > Position > Stage Transformation

Transforms (recalculates) the position coordinates between the stage (absolute) and the image (relative) system.

Position X, Position Y, Position Z

Defines the X, Y, Z positions.

Transformation

Defines the source and destination format of the position.

New Column

Name of the transformed features.

Unit

Unit of the transformed features.

Data manipulation > Processing > Find Local Extrema

Only the local extreme values found in the source Column are copied into the new column. Name the New Column and specify its units (Unit). Select which Extrema are taken into account and adjust the Threshold to filter out subtle changes.

Data manipulation > Processing > Rolling Average

Calculates the rolling average value as an average of three consecutive values (current value and the value before and after the current one). Select the Column from which the rolling average will be calculated, name the New Column and specify its units (Unit). Use Window Radius to expand the number of consecutive values from which the average is calculated.

Data manipulation > Processing > Rolling Median

Calculates the rolling median value as a median of three consecutive values (current value and the value before and after the current one).

Data manipulation > Processing > Rolling Minimum

Calculates the rolling minimum value as a minimum of three consecutive values (current value and the value before and after the current one).

Data manipulation > Processing > Rolling Maximum

Calculates the rolling maximum value as a maximum of three consecutive values (current value and the value before and after the current one).
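The four rolling nodes differ only in the statistic applied to the window. The sketch below uses a window of 2 × radius + 1 values; shrinking the window at the first and last rows is an assumption, since the edge behaviour is not described here.

```python
from statistics import fmean, median

def rolling(values, radius=1, statistic=fmean):
    """Apply `statistic` to a centered window around each value."""
    result = []
    for i in range(len(values)):
        # Window is truncated at the table edges (assumed behaviour)
        window = values[max(0, i - radius): i + radius + 1]
        result.append(statistic(window))
    return result

data = [1, 2, 6, 4, 5]
print(rolling(data, 1, fmean))
print(rolling(data, 1, median))
print(rolling(data, 1, min))
print(rolling(data, 1, max))
```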

Data manipulation > Curve Fitting > Curve Fitting

Fits a curve to data using the method of least squares. Fit values are placed into a new column. Select the Dependent Column and Independent Column, choose the Curve type and specify the Outputs.

Data

Independent Column

Column with independent data, [x]

Dependent Column

Column with dependent data, [y=f(x)]

Model

Curve

Type of the curve. For a higher polynomial curve, also set its degree n.

Table 28. Curve type

Curve	Formula	Parameters
Mean (constant)		a₀
Linear		a₁, a₀
Quadratic		a₂, a₁, a₀
Higher Polynomial		a_n, ... , a₂, a₁, a₀
Simple Exponential		A, B
Gaussian		A, µ, σ

Note

Gaussian curve is calculated using weighted linear least squares, if it cannot be calculated this way, nonlinear least square method is used instead and is 0.

use significant parameters only

A p-value lies between 0 and 1; a higher value shows that the parameter is less significant. You can filter out all parameters with a p-value higher than the given value. This is done iteratively until all p-values are lower than the given value.

Outputs

Fitted value

Approximated value given by the resulting equation.

Equation of Trend Line

Resulting equation which can be used in a linechart.

R²

Displays goodness of fit, between 0 and 1, higher means better.

Coefficients

Appends columns with parameters.

p-values

Appends columns with p-values.
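For the Linear curve, the least-squares coefficients, fitted values and R² can be computed by hand as sketched below. The form y = a₁·x + a₀ is assumed from the parameter names in the curve type table, and the function name is hypothetical; p-values are not covered.

```python
def fit_line(x, y):
    """Least-squares fit of y = a1 * x + a0; returns (a1, a0, fitted, r_squared)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    a1 = sxy / sxx
    a0 = mean_y - a1 * mean_x
    fitted = [a1 * xi + a0 for xi in x]
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))  # residual
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)               # total
    return a1, a0, fitted, 1.0 - ss_res / ss_tot

a1, a0, fitted, r_squared = fit_line([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
print(a1, a0, r_squared)
```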

Data manipulation > Curve Fitting > Dose Response

Fits a Dose-Response curve to data using the method of non-linear least squares. Fitted values are placed into a new column.

Data

Dose Column

Column with independent data, [x] i.e. Dose.

Zero Handling

Substitute (with a value entered in the blank field) or discard zero.

Response Column

Column with dependent data, [y=f(x)] i.e. Response.

Model

Curve

Type of the curve.

Table 29. Curve type

Curve	Formula	Parameters
4PL (Symmetrical)		B, T, E, H
5PL (Asymmetrical)		B, T, E, H, S

Note

You can constrain (fix) any of the parameters.

Table 30.

Parameter	Meaning
B	Bottom (minimum of the function).
T	Top (maximum of the function).
E	Inflection point of the curve.
H	Hill (hill coefficient), gives direction and how steep the response curve is.
S	Gives asymmetry around the inflection point.

EC50, IC50, LD50, ... are the same as E for the 4PL model; for 5PL they are calculated as follows:

Output columns

Adjust the naming of the output columns.

Data manipulation > Curve Fitting > Gauss Mixture

Fits multiple Gaussian curves to data using the method of nonlinear least squares.

Data

Dependent Column

Column with dependent data, [y=f(x)].

Independent Column

Column with independent data, [x].

Model

Number of Peaks

Number of Gaussian curves n; the number of output curves can be smaller.

Table 31. Curve type

Curve	Formula	Parameters
Multiple Gaussians		A_i, μ_i, σ_i

Outputs

Fitted value

Approximated value given by the resulting equation.

Equation of mixture

Resulting equation containing all curves, can be used in a linechart.

Separate equation for each peak

Appends column for each fitted curve.

Coefficients

Appends columns with parameters.

Data manipulation > Curve Fitting > Growth

This node is similar to the Data manipulation > Curve Fitting > Dose Response node. It fits a growth curve to data and is used by the Results & Graphs > Graphs > Fitplot and Results & Graphs > Reports > HTML Report .

Data manipulation > Detect > DBSCAN

This Density-Based Spatial Clustering of Applications with Noise (DBSCAN) action clusters 2D points. Cluster IDs are inserted into a new column. Name the new column (New Column Name), select the Position X Column and Position Y Column, set the Max. Distance in Cluster and the Min. Number of Neighbours. Two points are neighbors if their distance from each other is not greater than the Max. Distance in Cluster. If any point has at least the Min. Number of Neighbours, this point and all its neighbors are added to the same cluster. If any point is part of a cluster, all of its neighbors are also part of this cluster.

For details about this action, please see the DBSCAN method.
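A compact version of the textbook DBSCAN algorithm is sketched below for experimenting with the two parameters. It is not the node's implementation: the label 0 for unclustered points, the cluster numbering from 1, and the rule that only points with enough neighbors expand a cluster are assumptions of this sketch.

```python
import math

NOISE = 0  # hypothetical label for points that belong to no cluster

def dbscan(points, max_distance, min_neighbours):
    """Return one cluster ID per point (1, 2, ...) or NOISE."""
    def neighbours(i):
        return [j for j in range(len(points))
                if j != i and math.dist(points[i], points[j]) <= max_distance]

    labels = [None] * len(points)
    cluster_id = 0
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_neighbours:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster_id += 1
        labels[i] = cluster_id
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point, not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster_id
            more = neighbours(j)
            if len(more) >= min_neighbours:
                queue.extend(more)  # j has enough neighbors, keep growing
    return labels

points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (50, 50)]
cluster_ids = dbscan(points, max_distance=1.5, min_neighbours=2)
print(cluster_ids)
```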

Data manipulation > Detect > Grid Points

Detects a square grid and appends rows with coordinates of the missing points. Select columns containing coordinates of the grid points and select what to ignore during the detecting of the missing points.

Data manipulation > Detect > Parse Well Name

Recognizes and parses well names to row and column.

Table 32.

Input Column	WellRow	WellColumn
AA01	AA	01
01AA	01	AA
AB	A	B
01	0	1

Data manipulation > Detect > Object Classifier

The node classifies objects in a table based on their measured features.

Column name

Name of the table column.

Classifier

Select a classifier best working with your image.

Object classes

Enter a name for the first class, select its color, then click the Select binary objects in image tool, click on the object(s) in the image which you want to add to the first class and then click the right mouse button to train the model. Click Esc to stop the object selection.

Add class

Adds another class.

Train

(Re)trains the model.

Data manipulation > Detect > TMA Dearraying

Inserts a new column containing new indexes of binary objects of detected TMA (Tissue microarray) cores. Choose the Position X Column and Position Y Column with position of core centers, select the Orientation and then set the indexing start (Index from) or choose the Meander indexing. The settings are shown in the preview scheme below. Select Continuous indexing to not include numbers of missing cores in indexing (so binaries will be numbered from 1 to N, where N is the count of binary objects).

This node automatically detects the columns and rows of the grid of TMA cores. Then it assigns new indexes so that cores in an upper row have smaller indexes than cores in a lower row. In each row, cores are indexed from left to right. Indexes of missing cores are skipped (for example, if you have 10 cores in a 4x3 grid, the core in the bottom right corner, if not missing, will always have an index of 12).

For good results, the cores should lie in a square grid which is not rotated.

Note

This node can be used on any binary objects lying in a square grid (not just TMA cores).

Data manipulation > Detect > Values Run

(requires: Local Option)

Detects runs of the same values and returns a column with IDs. The same ID indicates the same value in the Column for consecutive indexes in the Index Column. New Column sets the name of the new column.
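Run detection can be sketched as follows. The function name is hypothetical; the sketch assumes the values are already ordered by the Index Column and that IDs start at 1.

```python
def run_ids(values):
    """Assign the same ID to consecutive equal values."""
    ids = []
    current_id = 0
    previous = object()  # sentinel that equals no real value
    for value in values:
        if value != previous:
            current_id += 1  # a new run starts here
            previous = value
        ids.append(current_id)
    return ids

print(run_ids([5, 5, 7, 7, 7, 5]))
```

Note that the last value 5 starts a new run even though the same value occurred earlier, because the runs are not adjacent.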

Data manipulation > Sequences > Difference

Calculates a new column as a difference (subtraction) between the second and first value, third and second, etc. Select the Column from which the difference will be calculated, choose the Loop, name the New Column and specify its units (Unit). First record is blank when First item is selected.
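The difference of consecutive values can be sketched as below for the case where the first record is left blank. The function name is hypothetical and the Loop option is not modeled.

```python
def difference(values):
    """Row i holds values[i] - values[i - 1]; the first row is empty."""
    if not values:
        return []
    return [None] + [b - a for a, b in zip(values, values[1:])]

print(difference([1, 4, 9, 16]))
```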

Data manipulation > Sequences > Integrate

Calculates a new column as a sum of the first and the second value, second and the third value, etc. First row stays the same. Select the Column from which the integration will be calculated, choose the Loop, name the New Column and specify its units (Unit). Enter a Constant if you want to add a number to each sum of the two values.

Data manipulation > Sequences > Position Difference

Computes difference (vector) X, Y, Z columns in every two consecutive rows.

Data manipulation > Sequences > Position Integrate

Adds up X, Y, Z columns in every two consecutive rows.

Data manipulation > Sequences > High Pass Filter

ΔT is a time interval [s] and fc is a cut-off frequency [Hz].

Please see High Pass Filter for more details.

Data manipulation > Sequences > Low Pass Filter

ΔT is a time interval [s] and fc is a cut-off frequency [Hz].

Please see Low Pass Filter for more details.

Data manipulation > Sequences > Sequence (Int)

Generates a new column with an integer sequence. Name the New Column and set the Start and Step value of the sequence. Optionally the sequence can respect a selected Loop over column value if switched from the default “<rows>” value.

Data manipulation > Sequences > Sequence (Exp)

Generates a new column with an exponential sequence. Name the New Column and set the Start and Factor (exponent) value of the sequence. To create an inverted exponential sequence, click Invert factor and the Factor value is automatically changed. Optionally the sequence can respect a selected Loop over column value if switched from the default “<rows>” value.
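
A sketch of one plausible reading of the description, for the default “<rows>” setting: every row multiplies the previous value by Factor, and Invert factor replaces Factor by its reciprocal. Both the formula and the helper name exp_sequence are assumptions, not taken from the node itself.

```python
def exp_sequence(row_count, start=1.0, factor=2.0, invert_factor=False):
    """Assumed form: start * factor ** row, with an optional reciprocal factor."""
    if invert_factor:
        factor = 1.0 / factor
    return [start * factor ** row for row in range(row_count)]


print(exp_sequence(4, start=1.0, factor=2.0))                      # [1.0, 2.0, 4.0, 8.0]
print(exp_sequence(4, start=1.0, factor=2.0, invert_factor=True))  # [1.0, 0.5, 0.25, 0.125]
```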

Data manipulation > Python > Create Column

Define new columns in the upper table by specifying their name, type, and unit.

Copy prompt for a LLM

Large language models (LLMs) such as ChatGPT, Gemini, or Copilot can be asked to generate Python code that calculates the content of the column. To simplify the interaction, this button prepares a prompt ready to be pasted into the LLM. Replace the <USER TASK HERE> placeholder at the end of the prompt with your own task.

It is good practice to start a new chat for the prompt so that it does not share context with unrelated conversations.

The answer should contain two parts, as shown in the example:

JSON with the new column definitions: copy it and click Paste columns from LLM.
Python code: copy it and paste it over the existing code (Ctrl+A, Ctrl+V).

Paste columns from LLM

Pastes the columns from an LLM into the node columns.

Add new column

Adds a new column (row entry) to the table.

Remove current column

Removes the current column (row entry) from the table.

Title

Name of the output column.

Unit

Unit of the output column.

Type

Type of the output column.

Example 13. The example shows k-means object classification using scikit-learn.

import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# take only the float columns (copy so that the input table is not modified)
df_float64 = df.select_dtypes(include=["Float64"]).copy()

# standardize every feature first (zero mean, unit variance)
scaler = StandardScaler()
scaled = pd.DataFrame(
    scaler.fit_transform(df_float64.values),
    columns=df_float64.columns,
    index=df_float64.index,
)

# then weight the colour features; standardizing after the weighting would cancel it
scaled[["BinMeanOfBlue", "BinMeanOfGreen", "BinMeanOfRed"]] *= 5.0

print(scaled.head())

# apply K-means clustering on the standardized and weighted features
kmeans = KMeans(n_clusters=4, random_state=42, n_init=10)

# set the output dataset
out["Pollen_Class"] = kmeans.fit_predict(scaled.values)

When the following question is added to the generated prompt:

Task (provided by the user):
Classify the objects into two classes.

ChatGPT 4o outputs the following column definition and code:

[
  {"name": "ClassLabel", "type": "int", "unit": null}
]

import pandas as pd
import numpy as np
from sklearn.cluster import KMeans

# Start from an empty frame with the same index
out = pd.DataFrame(index=df.index)

# Select numeric features for clustering (excluding IDs and categorical fields)
features = [
    "Area", "EqDiameter", "Perimeter", "Length", "Width", "MaxFeret", "MinFeret",
    "Circularity", "Elongation", "MeanIntensityEGFP", "MeanIntensityTD",
    "SumIntensityEGFP", "SumIntensityTD"
]

# Prepare data: drop rows with missing values in the selected features
valid_mask = df[features].notna().all(axis=1)
X = df.loc[valid_mask, features].astype("float64")

# Perform KMeans clustering into 2 classes
kmeans = KMeans(n_clusters=2, n_init="auto", random_state=42)
labels = pd.Series(kmeans.fit_predict(X), index=X.index).astype("Int64")

# Create the output column and fill with cluster labels, keeping <NA> where data was insufficient
out["ClassLabel"] = pd.Series(pd.NA, index=df.index, dtype="Int64")
out.loc[labels.index, "ClassLabel"] = labels

Data manipulation > Python > Create Table

Define new columns or copy them from the source tables in the upper table by specifying their name, type, and unit. Copying columns keeps their ID and Metadata.

Copy prompt for a LLM

It is good practice to start a new chat for the prompt so that it does not share context with unrelated conversations.

The answer should contain two parts, as shown in the example:

JSON with the new column definitions: copy it and click Paste columns from LLM.
Python code: copy it and paste it over the existing code (Ctrl+A, Ctrl+V).

Paste columns from LLM

Pastes the columns from an LLM into the node columns.

Add new column

Adds a new column (row entry) to the table.

Remove current column

Removes the current column (row entry) from the table.

Move current column forward

Moves the selected item up in the list.

Move current column backward

Moves the selected item down in the list.

Title

Name of the output column.

Unit

Unit of the output column.

Type

Type of the output column.

Example 14. The example shows an inner join of two object tables:

Table A: all columns from the results of object measurements, with one row per object, and
Table B: “Value” and “Frequency” from the results of the object pixel histogram, with 16 rows per object,

joined on the columns (“TimeLapseIndex”, “Entity”, “ObjectId”).

Columns with the same name in both tables are taken from the left table (“Table A”) by means of the suffixes parameter.

See Pandas documentation for the merge() function.

import pandas as pd
import numpy as np

# predefined globals:
# df: list of pd.DataFrame with the input tables (df[0] is input "a", df[1] is input "b")
# mdf: pd.DataFrame with the input column metadata table of input "a"
# out: pd.DataFrame with the new columns to be filled

print("Table A:", df[0].columns)
print("Table B:", df[1].columns)

out = pd.merge(*df, on=("TimeLapseIndex", "Entity", "ObjectId"), suffixes=("", None))
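
To see what such an inner join does to the row count, here is a library-free sketch of the same idea on small, made-up data (3 histogram bins per object instead of 16). The helper inner_join and the sample values are hypothetical and serve only as illustration; the node itself relies on the pandas merge() call above.

```python
KEYS = ("TimeLapseIndex", "Entity", "ObjectId")


def inner_join(left_rows, right_rows, keys=KEYS):
    """Inner join of two lists of dicts; only rows with matching keys survive."""
    index = {}
    for row in right_rows:
        index.setdefault(tuple(row[k] for k in keys), []).append(row)
    joined = []
    for left in left_rows:
        for right in index.get(tuple(left[k] for k in keys), []):
            merged = dict(left)
            for name, value in right.items():
                merged.setdefault(name, value)  # on a name clash the left value is kept
            joined.append(merged)
    return joined


# Table A: one row per object (made-up values)
table_a = [
    {"TimeLapseIndex": 0, "Entity": "Cells", "ObjectId": 1, "Area": 12.5},
    {"TimeLapseIndex": 0, "Entity": "Cells", "ObjectId": 2, "Area": 30.0},
]
# Table B: several histogram rows per object; object 3 has no partner in Table A
table_b = [
    {"TimeLapseIndex": 0, "Entity": "Cells", "ObjectId": obj, "Value": bin_, "Frequency": 1}
    for obj in (1, 2, 3)
    for bin_ in (0, 1, 2)
]

result = inner_join(table_a, table_b)
print(len(result))  # 6 rows: 2 matched objects x 3 histogram rows each
```

Each object row from Table A is repeated once per matching histogram row, and histogram rows without a matching object are dropped.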

For an even more programmatic approach, without having to define the columns in the dialog, use the ND Processing & Conversions > Python Scripting > Python generic node.

Example 15. LLM Example

When the following question was given:

Task: I want to find out if there is one or more classes in the input table based on the coefficient of diffusion (CD). Calculate the CD from MSDs. Note that one TrackId has more than one row. Can you help me with that?

The Python Create Table node is connected to the MSD table in the Single particle tracking example.

ChatGPT 5 produced the following answer:

[
  {"name": "Entity", "type": "text", "unit": null},
  {"name": "TrackId", "type": "int", "unit": null},
  {"name": "PointsUsed", "type": "int", "unit": null},
  {"name": "CD", "type": "number", "unit": "µm²/s"},
  {"name": "CD_SE", "type": "number", "unit": "µm²/s"},
  {"name": "R2_Origin", "type": "number", "unit": null},
  {"name": "NumClasses", "type": "int", "unit": null},
  {"name": "ClassLabel", "type": "int", "unit": null},
  {"name": "ClassMeanCD", "type": "number", "unit": "µm²/s"},
  {"name": "ClassCount", "type": "int", "unit": null},
  {"name": "BIC_Best", "type": "number", "unit": null}
]

Python code is too long (150 lines) to reproduce here.

From the CD per track, it is clear that there is only one class.

Data manipulation > JavaScript > JS Create Column

Note

This node is deprecated in favor of the Data manipulation > Python > Create Column node.

Define one or more new columns and calculate their data using JavaScript.

In the upper table, you can define new columns by specifying their name, type, and unit.

To reference columns from the input table in your JavaScript code, simply click the button with the column name. This will insert a variable declaration with the appropriate column index. For example, clicking on the ZStackIndex column will insert:

var ZStackIndexIndex = tableA.colIndexById("_loopZStackIndex"); //ZStackIndex

This variable holds the index of the input column and can be used to access its values in your script, e.g.:

var tableA = new Table(context.tableForParameter(node.child("A")));
var a = tableA.data;
var ig = tableA.rowsToGroupRows;

var ZStackIndexIndex = tableA.colIndexById("_loopZStackIndex"); //ZStackIndex

function newItemForRow(i) {
   return a[ZStackIndexIndex][i];
}

The input table is already accessible as the tableA variable, an instance of the Table class.

The Table class wraps the input table data and metadata. It provides methods to access columns, rows, groupings, and statistics.

Methods and properties of the Table class

Properties

tableName: string Name of the table.
tableMetadata: object Full metadata of the table.
colIdList: string[] List of internal column IDs.
colTitleList: string[] Visible column titles.
colMetadataList: object[] Metadata for each column.
data: any[][] Column-major 2D array of values.
dataRowList: any[][] Transposed version for row-wise access.
colCount: number Total number of columns.
rowCount: number Total number of rows.

Column Lookup

colIndexById(id: string): number
colIndexByTitle(title: string): number
colIndexByFeature(feature: string): number
colIndexByFullText(text: string): number Looks up a column by ID, title, or both.
matchColsFulltext(param: string | RegExp | string[]): number[] Returns indices of matching columns.

Grouping & Sorting

groupedBy: number[] Indices of columns used for grouping.
orderedBy: number[] Indices of columns used for sorting.
groups: number[][] List of row indices grouped by column values.
rowsToGroupIds: number[] For each row, the corresponding group ID.
rowsToGroupRows: number[] For each row, its index within its group.

Column Info Access

colIdAt(index: number): string
colMetadataAt(index: number): object
colTitleAt(index: number): string
colUnitAt(index: number): string
colTitleAndUnitAt(index: number, sep = " "): string
colDecltypeAt(index: number): string
colIsNumericAt(index: number): boolean
listColIds(filter?: (id: string) => boolean): string[]
listIdentColIds(): string[]

Column Data & Stats

colDataAt(index: number): any[] Returns raw values from the specified column.
colDataStatsAt(index: number, stats: string[]): (number | null)[] Returns requested statistics:
- “total” – all values including nulls
- “count” – non-null values
- “sum”, “mean”, “min”, “max”
- “stdev” – standard deviation

For each table row, the function newItemForRow(i) is called. It is expected to return either a single value or an array. If multiple columns are defined in the upper table, it must return an array.

Creating column with indices of rows

function newItemForRow(i) {
   return i;
}

Creating column with indices of rows per group

var tableA = new Table(context.tableForParameter(node.child("A")));
var ig = tableA.rowsToGroupRows;

function newItemForRow(i) {
   return ig[i];
}

Creating column with time delta

var tableA = new Table(context.tableForParameter(node.child("A")));

//insert here your own time column index (by clicking column button above code editor)
var TimeIndex = tableA.colIndexById("ColumnId");

var times = tableA.data[TimeIndex]

function newItemForRow(i) {
   return (i==0) ? null : times[i] - times[i-1];
}

Creating column with cumulative sum

var tableA = new Table(context.tableForParameter(node.child("A")));

//insert here your own column index (by clicking column button above code editor)
var ColumnIndex = tableA.colIndexById("ColumnId");

var columnData = tableA.data[ColumnIndex];
var sum = 0;

function newItemForRow(i) {
   sum += columnData[i];
   return sum;
}

Creating column with mean of column data

var tableA = new Table(context.tableForParameter(node.child("A")));

//insert here your own column index (by clicking column button above code editor)
var ColumnIndex = tableA.colIndexById("ColumnId");

var columnData = tableA.data[ColumnIndex];
var mean = MathApi.mean(columnData);

function newItemForRow(i) {
   return mean;
}

Creating column with group mean of column data

var tableA = new Table(context.tableForParameter(node.child("A")));
var rowsToGroupIds = tableA.rowsToGroupIds;
var groups = tableA.groups;

//insert here your own column index (by clicking column button above code editor)
var ColumnIndex  = tableA.colIndexById("ColumnId");

var columnData = tableA.data[ColumnIndex ];

var means = [];
for(let i=0; i<groups.length; i++) {
    let groupData = groups[i].map(idx => columnData[idx]);
    means.push(MathApi.mean(groupData));
}

function newItemForRow(i) {
   return means[rowsToGroupIds[i]];
}

Parameters

Input

A (Table)

Output

R (Table)

Control

jsCode (Text)
newColumns (ListModel)

Data manipulation > JavaScript > JS Create Table

Note

This node is deprecated in favor of the Data manipulation > Python > Create Table node.

Creates a new table by combining one or more existing tables (table inputs A, B, …). All output table columns must be defined in the column list of the node dialog.

The code contains a quick reference of predefined variables:

GA3 JavaScript Editor

// a, b, ... - input tables (use a[col][row])
// ag, bg, ... - grouped input tables (use ag[group][col][rowingroup])

// _?T - time-lapse loop column index
// _?Z - z-stack loop column index
// _?M - multi-point loop column index
// _?0..._?N - column indexes after loop columns
// where ? should be replaced by a, b, ... for input table

The expected output is the data of the new table (an array of arrays, columns x rows). The number of columns must match the number of defined columns, and all columns must have the same number of rows.

Parameters

Input

A (Table)

Output

R (Table)

Control

javaScriptCode (Text)
tableDefinition (ListModel)