Data Wizard

The Data Wizard is designed to facilitate repeated data operations on the same dataset. It also allows changes in the sequence of the operations. It replaces the following standalone tools: Column Proper­ties, Filter Worksheet, and Column Transformation. (If a project created in an earlier version of Phoe­nix has those options, these objects are loaded and visible.) The Data Wizard provides the property computing, filtering, and transformation operations as well as generating an easy summary worksheet to keep the workflow organized.

Use one of the following to add a Data Wizard object to a Workflow:

Right-click menu for a Workflow object: New > Data > Data Wizard.
Or Main menu: Insert > Data > Data Wizard.
Or right-click menu for a worksheet: Send To > Data > Data Wizard.

The manipulation of data is defined in the Data Wizard as Steps. When the Data Wizard object is added, Step 1 is automatically listed in the Summary list of the Setup tab and is ready to be defined. From the Action menu in the Options tab, select Properties, Filter or Transformation and press Add to define the step.

 

Once that selection is made, the action is appended to the Step name in the Summary list and other controls on the Options tab are enabled appropriately.

A series of steps can easily be created (specifying column properties via Properties, filtering the data using Filter, and then doing a data transformation using Transformation). Each step in the Data Wiz­ard identifies its action in the Summary. Delete steps, rearrange steps, insert new steps, and specify when to execute steps in a sequence using the buttons in the Options tab.

The first step in the Data Wizard allows the user to map the dataset, whether it is for filtering, transfor­mation, or modifying column properties. After the dataset has been mapped in that first step, all sub­sequent steps use that source dataset in combination with the steps created in the Data Wizard.

The Retain Intermediate Results checkbox preserves all results for each step set up in the Data Wizard. It will add significantly to the amount of output (and requires more memory). This options is only recommended when the intermediate output is truly necessary.

This section contains information about the following:

Properties operation

Transformation operation

Filter operation

Results

See also “Ratios and baseline adjustments example”.

Note:To view the object in its own window, select it in the Object Browser and double-click it or press ENTER. All instructions for setting up and execution are the same whether the object is viewed in its own window or in Phoenix view.

Propertieoperation

The Properties operation in the Data Wizard allows users to set column header names, units, and data types (text or numeric) for worksheets.

The interface consists of three main parts:

Main Mappings panel
Options tab
Defaults tab

Main Mappings panel

Users must map a dataset to the Main Mappings panel when the first step is set up. All subsequent steps will automatically be mapped to the same dataset.

None: Columns mapped to this context are simply included in the output.

Sort: Sort variables used to define individual profiles.

Options tab

The Options tab lists all the column headers in a worksheet and any associated units. The column headers and units are displayed in the columns list in the Options tab.

Properties_Options_tab.png 

To change column header properties:

Caution:If it is possible, units are automatically converted when the new column unit does not match the old column unit. Make sure the old and new column units match if unit conversion is not necessary or wanted.

Unit types

Unit Type

Unit

Abbreviation

Time

day

day

hour

hr, h

minute

min

second

sec, s

millisecond

ms

microsecond

us

nanosecond

ns

week

week

month

mon

year

year

Mass

gram

g

mole

mol

pound

lb

IU

IU

Volume

Liter

L

Unit Prefixes

Prefix

Abbreviation

 

Prefix

Abbreviation

femto

f

 

milli

m

pico

p

centi

c

nano

n

deci

d

micro

u

deca

dk

 

 

kilo

k

Defaults tab

The Defaults tab is used to automatically change column properties. Users can specify new column names and units for common columns like Time or Concentration.

Caution:Defaults must be specified before a worksheet is mapped in the Properties operation.

Properties_Defaults_tab.png 

Properties Defaults tab

When a worksheet is mapped to a Properties operation that has defaults set, any matching column headers and units are automatically changed or converted.

Defaults are listed on the default columns box.

To add new defaults

Type the column header name to be automatically converted in the Old Column Name field.

Select a unit, if applicable, in the Old Column Unit menu.

Type a new column name in the New Column Name field.

Select a new unit, if applicable, in the New Column Unit menu.

Caution:If it is possible, units are automatically converted if the new column unit does not match the old column unit. Make sure the old and new column units match if unit conversion is not necessary or wanted.

Click Add to add the new column header default to the default columns box.

The default column name can be edited after it is added.

Make any changes in the column name field and column units menus, and click the Apply button to apply the changes.

In the Match on menu, select whether or not to match the default column to input worksheet columns based on column name or column name and unit.

Select a default in the default columns box and click Remove to remove the column default.

Add another default by typing another column header name in the Old Column Name field and repeating the previous instructions.

Transformation operation

The Transformation operation creates a new column in a worksheet that is the result of a transfor­mation. Phoenix supports arithmetic, baseline, custom, and function transformations. All data in the source column are transformed regardless of exclusions, which are carried over to the new column.

The Transformation user interface consists of two main parts, the Setup tab and the Options tab.

Main Mappings panel
Options tab

Main Mappings panel

Users must map a dataset to the Main Mappings panel when the first step is set up. All subsequent steps will automatically be mapped to the same dataset. Required input will be highlighted orange in the interface.

Users can select to assign the units of an input (mapped) column to a new column created via a trans­formation. If the Output Column Unit selection is mapped, the column mapped to that context will have its unit used in the new output column.

Context associations for a Transformation object change depending on which transformation type is selected in the Options tab.

Arithmetic transformation mappings 

None: Columns mapped to this context are simply included in the output.

x Column and y Column: Columns in a worksheet used in an arithmetic transformation.

Output Columns:
None: Units of columns mapped to this context are not used in the new output column.
Unit: The column whose units are to be used in the new output column.

Baseline transformation mappings 

None: Columns mapped to this context are simply included in the output.

x Column: The independent variable that is used to compute the baseline values.

Sort: Sort variable(s) used to define individual profiles.

Time: Time values that are used to compute the baseline values.

Output Column:
None: Units of columns mapped to this context are not used in the new output column.
Unit: The column whose units are to be used in the new output column

Custom transformation mappings 

Output Column:
None: Units of columns mapped to this context are not used in the new output column.
Unit: The column whose units are to be used in the new output column.

Other mapping contexts are defined in the Formula field.

Function transformation mappings 

None: Columns mapped to this context are simply included in the output.

x Column: The independent variable that is used to provide the values for the function transfor­mations.

Output Column:
None: Units of columns mapped to this context are not used in the new output column.
Unit: The column whose units are to be used in the new output column.

Options tab

Transformation_Options_tab.png 

The Options tab is used to select the transformation type and specify options for each transformation type. The Options tab also allows users to specify the sequence of the operations in the Data Wizard.

Transformation types supported by Phoenix

Arithmetic

Baseline

Custom

Functions

(x – y)/y

% Change from Baseline

Custom Function
See “Example cus­tom functions”.

1/x

(x + y)/y*100

Change from Base­line

Absolute value(x)

x – n

Ratio from Baseline

e^x

x – y

 

LN(x)

x*n

Log10(x)

x*y

Square root(x)

x/n

 

x/y

x^n

x^y

x + n

x + y

In the Arithmetic and Functions options, x and y are columns in a worksheet, in the Arithmetic options, n is a number, and in the Functions options e is the base of the natural logarithm (2.718281828).

Note:Blank cells are treated as zero when operated on with cells that contain numbers.

In the Transformation Type menu, select one of four transformations: Arithmetic, Baseline, Cus­tom, Functions.

Arithmetic options

In the Transformation Type menu, select Arithmetic.

In the Transformation menu, select the arithmetic transformation type.

If one of the x and y transformations are selected, the x Column and y Column mapping con­texts are made available in the Main Mappings panel.

Use the option buttons to map data types in the worksheet to the x Column and y Column contexts. If one of the x and n transformations are selected, then a value for n must be specified.

In the Arithmetic Options field, type a value for n.

In the New Column Name field, type a name for the column used to contain the transformed values.

In the Destination Area, select one of the option buttons.

Append places the new column after the last column in the dataset.

Adjacent places the new column to the right of the data type mapped to the x Column context.

Baseline options

Users can choose whether to specify sort variables for baseline transformations. Sort variables define unique profiles that are used to compute change from baseline separately for each profile. Typically, a single variable representing subject IDs is sufficient. Other data may require more than one variable, such as Subject and Period. The baseline transformation sorts the output worksheet in alphabetical or numerical order, based on the sort variables.

Rules for using datasets in a baseline transformation:

By default, Phoenix uses the value corresponding to time=0 as the baseline for each unique pro­file. If a profile has more than one value at time zero, Phoenix uses the first value as the baseline.

If a zero time value is not in the dataset, the time value to use for the baseline value (normally the lowest time) must be specified, or the user must choose to use the minimum time for each group, in order to generate the baseline transformations.
Enter the lowest (or other) time value in the Fixed Value field or
Select MinimumPerGroup in the Starting Point dropdown menu.

If the columns mapped to x Column has a zero or missing value at the time specified for the baseline, then the % Change from Baseline and Ratio from Baseline options will produce empty results. An alternate baseline time value or the MinimumPerGroup option may be used.

To create a baseline transformation

In the Transformation Type menu, select Baseline.

In the Transformation menu, select the baseline transformation type.

If % Change from Baseline is selected, Phoenix performs a transformation on a column by cal­culating the percent change between each value and either a fixed baseline value or the mini­mum value for each distinct profile.
result=(x – baseline)/baseline*100

If Change from Baseline is selected, Phoenix performs a transformation on a column by calcu­lating the change between each value and either a fixed baseline value or the minimum value for each distinct profile.
result=x – baseline

If Ratio from Baseline is selected, Phoenix performs a transformation on a column by determin­ing the ratio to either a fixed baseline value or the minimum value from each distinct profile.
result=x/baseline

In the Main Mappings panel, click the option buttons to specify the x Column, the Sort column(s), and the Time column.

Users can map multiple or no columns in a worksheet to the Sort context. Each column mapped to the Sort context creates a unique profile. Baselines are calculated on a per profile basis.

In the New Column Name field, type a name for the column that will contain the transformed values.

In the Starting Point menu, select Fixed Point or Minimum per Group.

If Minimum per Group is selected, the lowest time point in each profile is used to generate the baseline transformation. Selecting this option means that the baseline transformation uses a dif­ferent starting point for each profile defined by the sort key(s).

If Fixed Point is selected, users can enter custom starting time value in the Fixed Value field.

In the Fixed Value field, type a time value for a time column in Numeric format or in Text format.

If a time column is in Text format, users must type fixed time value in the same format used to enter the time values in the worksheet. For example, if the first time point used as the baseline is 1:00, then 1:00 must be typed in the Fixed Value column. For acceptable dates and times in Text format, see “Date and time formats”.

The default is zero, because Phoenix, by default, evaluates the change from baseline at time=0. If a value is entered that is higher than the highest time value, then no output values are produced in the new column.

In the Treat Time Values as menu, select the format used by the time column.

Numeric Format means the Time column is in Numeric format.

Date\Time Format means the Time column is in Text format. If selected, the Time Units menu is made available. The selected units are applied to the transformed time column, and are con­verted if they do not match the source time column units.

In the Time Units menu, select the units to use in the transformed time column.

If Date\Time Format is selected, the Create Transformed Time Column checkbox is made available.

Check the Create Transformed Time Column box to create a Numeric time column in the output that is based on the values in a Text time column or clear the checkbox to not create a transformed time column.

In the Destination Area, select one of the option buttons.

Append places the new column after the last column in the dataset.

Adjacent places the new column to the right of the data type mapped to the x Column context.

Custom options

Users can create a custom transformation that transforms one or more columns in a worksheet using the functions provided by Phoenix.

To create a custom transformation

In the Transformation Type menu, select Custom.

Custom Function is selected by default in the Transformation menu. There are no other options in the menu for custom functions.

In the New Column Name field, type a name for the column that will contain the transformed values.

In the Formula field, type a custom function or use the pointer to select a function name in the Func­tion List and drag the function name to the Formula field.

Custom functions can be defined, edited, and combined in the Formula field. The if custom func­tion allows users to create a compound function using the logical operators & (and) and | (or). For example, if((col1=1) & (col2=2), 1, 0). Each condition in the if statement must be placed in parentheses. Be careful with spaces when entering the formulas, as unexpected spaces can cause the mapping panel to not display properly.

See “Example custom functions” for a list of all possible functions and examples of how to use them. The same information is also listed in the Function List in the Options tab.

In the Main Mappings panel, click the option buttons to specify the mapping contexts created by the custom function.

Functions options

To find the absolute value, logarithmic values, or square root

In the Transformation Type menu, select Functions.

In the Transformation menu, select the function transformation type.

Use the option buttons to map a column in the worksheet to the x Column context.

In the New Column Name field, type a name for the column used to contain the transformed values.

In the Destination Area, select one of the option buttons.

Append places the new column after the last column in the dataset.

Adjacent places the new column to the right of the data type mapped to the x Column context.

Note:Using the Data Wizard, it is easy to create a sequence of operations, whether transformations or combinations of the other Wizard operations.

Example custom functions

Below are some common functions that may be helpful.

Formula

Description

substitute(conc,”BQL”,0.1,0)

Look in the mapped column conc and, if you find a value of a cell equal to BQL, substitute it for a 0.1 in the new column.
Otherwise do no substitution.

substitute(CobsPop, 0.1,"BQL",0)

Look in the mapped column CobsPop and, if you find a value of a cell equal to 0.1, substitute it for a BQL in the new column.
Otherwise do no substitution.

if(CobsPop==0.01, 1, 0) or;
Note: There are 2 equal signs.

If any cell in the mapped column CobsPop is equal to 0.1, then enter a 1 in the new column, otherwise enter a 0.

if(CobsPop = 0.01, 1, 0)
Note: When using a single equal sign, spaces are needed before and after the equal sign.

If any cell in the mapped column CobsPop is equal to 0.1, then enter a 1 in the new column.
Otherwise enter a 0. (Same exact function as above.)

if(CobsPop = 0.1, 1, if(Cob­sPop < 1, 2, 0))
Note: You need spaces before and after the < sign or use two << signs without spaces.

Embedded ifs.
If any cell in the mapped column CobsPop is equal to 0.1, then enter a 1 in the new column.
If a cell in column CobsPop is less than 1, enter a 2.
Otherwise enter a 0.

if(CobsPop = 0.1, 1, if(id==1, 2, 0))

Embedded ifs with different vars.
If any cell in the mapped column CobsPop is equal to 0.1, then enter a 1 in the new column.
If the mapped column id is equal to 1, enter a 2.
If none of these conditions are met, enter a 0.

replace(conc,2,2,"LQ")

Look in the mapped column conc and replace starting at number 2 position string with 2 letters LQ.
This is useful for converting BQL to BLQ, but notice that it does not discriminate, so a number 1.537753 ends up being 1LQ37753 (position 2 and 3 replaced).
Use with caution!

Ln(conc)

Write the natural log of the value in the mapped column conc in the new column.

Custom functions list

Name

Description

Example

abs

Returns the absolute value of the supplied parameter.

abs(–12)=12;
abs(ColumnA)

acos

Returns the inverse cosine of parameter1.

acos(–0.5)=2.094395

acosh

Calculates the inverse hyperbolic cosine of parameter1.

acosh(10)=2.99322

asin

Calculates the inverse sine of parameter1.

asin(–0.5)= –0.5236

asinh

Calculates the inverse hyperbolic sine of parameter1.

asinh(–2.5)= –1.6472311

atan

Calculates the inverse tangent of parame­ter1.

atan(1)=0.785398

atan2

Calculates the arc tangent of two coordi­nates.

atan2(x coord, y coord);
atan2(–1,1)= –2.3561945

atanh

Calculates the inverse hyperbolic tangent of a value.

atanh(0.76159416)=1

average

Returns the average of the supplied parameters.

average(A1:A10);
average(12, 5, 7, 8, 11.4)

ceiling

Returns the smallest integer value greater than, but not equal to, the value supplied.

ceiling(12.5)

chiinv

Calculates the inverse chi-square distribu­tion

chiinv(p, df)

clean

Cleans text of non-printable characters.

clean(“string with non-printable chars”);
clean(A1)

concate­nate

Combines two or more strings into one string.

concatenate(A1:A5);
concatenate(ColumnA, “& “, ColumnB)

cos

Calculates the cosine of parameter1.

cos(1.047)=0.50017107

cosh

Calculates the hyperbolic cosine of param­eter1.

cosh(4)=27.30823

count

Returns the number of values in a list, including empty cells.

count(A1:A5)

counta

Returns the number of non-empty cells in a given range.

counta(A1:A5)

date

Returns the sequential serial number that represents a particular date.

date(2007, 11, 27)=39411

datevalue

Uses datetime to return the serial number of the date represented by date_text.
Use this function to convert a date repre­sented by text to a serial number.
If time is associated with the date, then the serial date will be a decimal and not a whole number.

datevalue(“11/27/2007”)=39411

day

Returns the day of a date, represented by a serial number. The day is given as an inte­ger ranging from 1 to 31.

day(serial_time);
day(39411)=27

even

Returns the smallest even integer greater than the argument.

even(12.1)

exp

Returns e raised to the power of the sup­plied parameter.

exp(3);
exp(ColumnA)

fact

Returns the factorial of the supplied param­eter.

fact(3)

floor

Returns the largest integer less than the supplied parameter.

floor(12.9)

format

Formats a string using .NET string format­ting commands.

format(ColumnA, “G8”);
format(“.12345”, “G8”)

geomean

Returns the geometric mean of a range of positive data.

geomean(A1:A15)

geovar

Returns the geometric variance of a range of positive data.

geovar(A1:A15)

harmean

Returns the harmonic mean of a range of data.

harmean(A1:B32)

harmvar

Returns the harmonic variance of a range of data.

harvar(A1:C33)

hour

Returns the hour of a time value. The hour is given as an integer, ranging from zero (12:00 A.M.) to 23 (11:00 P.M.).

hour(serial_time);
hour(38711.06302)=1;
hour(37811.27135)=6

if

Returns true if conditional argument is true; false otherwise.
if function also supports using & (and) and | (or) in statements, and supports nested if statements.

if((A1=0.25), 1, 0)
Note: parameter 1 (logical test) must return true or false;
place a space between cell reference and '='
if((Time=1) & (Conc>2),...);
if((Time=1) | (Conc>2),...);
if(trt=”Reference”, “R”, “T”);
if(CVWR<=30, ‘125.00’,
if(CVWR<=50, 100*exp(+0.76*sqrt(Estimate)), 143.19))

IsNull

Returns true if value of cell is NULL (empty).

IsNull(ColumnA)

left

Returns the specified number of characters (from left to right) of a string.

left(“This is a test”, 1)=T;
left(“This is a test”, 3)=Thi; left(A1, 3)

length

Return the length of a string.

length(“this is a test”)=14;
length(A1)

ln

Returns the natural log of the supplied parameter.

ln(2.3);
ln(ColumnA)

log

Returns the log of the first parameter to the base of the second parameter.

log(number, base);
log(5,3)

log10

Returns the log base 10 of the parameter.

log10(5);
log10(ColumnA)

lower

Returns the supplied string as lower case.

lower(“This Is A Test”)=“this is a test”;
lower(A1)

mid

Returns a specific number of characters from a string from a given position and length parameters.

mid(text, start pos, distance);
mid(“This is a test”, 2, 3)=his

minute

Returns the minutes of a time value. The minute is given as an integer, ranging from zero to 59.

minute(serial_time);
minute(38711.06302)=30

mod

Returns the remainder after parameter1 is divided by parameter2.

mod(number, divisor);
mod(12,2)

month

Returns the month of a date represented by a serial number.
The month is given as an integer, ranging from 1 (January) to 12 (December).

month(serial_time);
month(39411)=11

now

Returns the current date and time as a double (serial date/time).

now()=current date

odd

Returns the smallest odd integer greater than the supplied parameter.

odd(13.1)

pi

Returns the value of pi.

pi()

product

Returns the product of a list of real num­bers.

product(A1:Z88)

rand

Returns a random number based on a lower limit, an upper limit, and a seed.

rand(,);
rand(0,1,92756)

replace

Replaces a sub-string with another string.

replace(text, start pos, distance, replace with);
replace(“T_%_025”, 2, 3, “–”)=T–025;
replace(ColumnA,1,4,””)

right

Returns a specific number of characters from a string starting from the right and moving left.

right(“This is a test”, 3)=“est”

root

Returns the root of a parameter1, using parameter2/parameter3.

x^(a/b);
root(x, a, b);
root(2, 3, 4)

round

Rounds parameter1 to the number of deci­mal places defined by parameter2.

round(12.345, 2)=12.35;
round(ColumnA,1)

roundsig

Rounds parameter1 to the number of sig­nificant digits defined by parameter2.
Note, however, that Phoenix worksheets do not support displaying trailing zeros in deci­mals, so not all significant digits will be dis­played in this case.

roundsig(12.234, 3)=12.3

search

Returns the one-based starting position of a substring within a larger string.
The third argument is the one-based start­ing position for the search.
The function returns –1 if there is no match.

search(“Phoenix”, “oe”, 1)=3

second

Returns the seconds of a time value.
The second is given as an integer in the range zero to 59.

second(38711.06302)=44

sign

Determines the sign of a number.
Returns 1 if the number is positive, 0 if the number is zero, or –1 if the number is neg­ative.

sign(–12);
sign(0);
sign(200)

sin

Calculates the sine of parameter1.

sin(1)=0.84147098

sinh

Calculates the hyperbolic sine of parame­ter1.

sinh(1)=1.175201194

sqrt

Returns the square root of parameter1.

sqrt(9)

stdev

Estimates standard deviation based on a range of data.

stdev(A1:C1)

strcmp

An integer indicating the case sensitive lex­ical relationship between the two compara­nds.
Value condition < zero, stringA is < stringB
Value condition = zero, stringA = stringB
Value condition > zero, stringA is > stringB

strcmp(stringA, stringB)

substitute

Substitutes new text for old text in a text string and optionally [for a given instance].

substitute(text, target text, replace­ment text, [instance]);
[] is optional but the preceding comma is required;
substitute(ColumnA,”_%_”,””,)

sum

Returns the sum of the specified values.

sum(A1:A12);
sum(1,2,3,4,5)

sumsq

Returns the sum of the squares of the arguments.

sumsq(C1:C33)

tan

Calculates the tangent of parameter1.

tan(0.785)=0.99920399

tanh

Calculates the hyperbolic tangent of parameter1.

tanh(–2)= –0.96403

time

Returns the decimal number for a particular time.

time(hour, min, sec);
time(12,2,3)=0.50142361

timevalue

Uses datetimes to return the decimal num­ber of the time represented by a text string, with precision up to the second.

timevalue(“2:24 AM”)=0.1

tinv

Calculates the inverse t distribution and returns absolute value.

tinv(p, df);
tinv(0.7, 10)=tinv(0.3, 10)=0.541528
Note: Entering an invalid df value (e.g., a negative or zero value) results in an incorrect output of 1, when it should be blank.

today

Returns the serial number of the current date.

today()=current date as serial time

trim

Removes all spaces from text except for single spaces between words.

trim(“      this is     a test”)=“this is a test”

trunc

Truncates a real number to an integer by removing the decimal places.

trunc(number);
trunc(12.1234)

upper

Returns parameter1 in upper case.

upper(“this is a test”)=“THIS IS A TEST”

var

Returns the variance of a set of real num­bers.

var(A1:A5)

weekday

Returns the day of the week corresponding to a date.
The day is given as an integer, ranging from 1 (Sunday) to 7 (Saturday).

weekday(serial_time);
weekday(39411)=Monday (Sunday is the first day)

year

Returns the year corresponding to a date.
The year is returned as an integer in the range 1900–9999.

year(39411)=2007

Filter operation

The Filter operation allows users to search for values in a dataset, specify if only matching data are returned (include), if the matches are filtered out (exclude), or replaced with another value. Additional options allow users to choose if the entire dataset is to be searched or just a particular column and if the entire row containing a matching cell is filtered or just the matching cell.

Main Mappings panel
Options tab

Main Mappings panel

Users must map a dataset to the Main Mappings panel when the first step is set up. All subsequent steps will automatically be mapped to the same dataset.

Use the Main Mappings panel to specify which input variables are filtered. Main Mapping input vari­ables to context associations is optional.

None: Columns mapped to this context are simply included in the output.

Exclude: Columns mapped to this context are excluded in the result worksheet.

Options tab

Filter_Options_tab.png 

To specify the type of filter

Select one of the option buttons to choose the filter type.

Built In – Use the Filter Specification dialog to add a filter.

Selection (Exclude) – Use the Filter Selection dialog to exclude part of a dataset.

Custom (Include) – Use the Custom Filter Entry dialog to specify a custom filter.

Click Add to add a filter.

The Specify Filter field lists all added filters. Select a filter to remove, edit, or move it.

Filters are executed in the order they are listed.

Click Undo to remove the last filter created from the list.

Click Edit to edit a filter.

Click Move Up to move a filter to top of the list.

Click Move Down to move a filter to the bottom of the list.

Note:If a User Selection filter is defined and listed in the Specify Filter field, the Move Up/Move Down buttons in the Data Wizard are disabled.

Filter Specification dialog

When the Built In option is selected in the Options tab, clicking Add displays the Filter Specification dialog.

Filter_Specification_dialog.png 

In the Action menu, select a filter action.

Exclude: Filter out cells/rows that match the filter criteria.

Include: Keep only the cells/rows that match the filter criteria.

Replace: For each cell that matches the filter criteria, replace it with a specified value.

From the Column menu, select the column to which the filter criteria is to be applied.

From the Operator menu, select an operator.

Operator(s)

Description

=

equal to the search string

<

less than the search string

<=

less than or equal to the search string

>

greater than the search string

>=

greater than or equal to the search string

Note:To exclude (include) “not equal” values, instead include (exclude) equal values.

Select the name of an existing column from the Select Column or Enter Value pull-down menu to compare its cell values with those in the column chosen in the Column pull-down menu.
Or
Type a value directly into the field to search for that value in the column selected in the Column pull-down menu.

If Exclude or Include is the action, check the Apply to entire row box to exclude/include the entire row when the filter criteria is matched. or clear the checkbox (the default) to exclude or include a cell that matches the criteria.

If Replace is the action:

In the Replace with field, type the value to place in a cell that matches the filter criteria.

Check the Replace All box to replace all instances where the criteria is matched in the specified column (or in all columns if Any is specified). Clear the checkbox to only replace the first match in the specified column (or first match in each column if Any is specified).

Type a value in the Tolerance field to search for values within a range of a numeric search value.

The Tolerance field is only available when a numerical value is entered in the Value field and is used to find values not different from the numeric search value by more than the tolerance value. This function is optional.

Numeric values that are +/- the tolerance of the search value are considered to match the search value. If a non-numerical value is entered in the Find field, then the Case Sensitive checkbox is made available.

Select the Case Sensitive checkbox to search for strings that match the capitalization used in the search string.

Built-in filter rules: 

Blank cells can be filtered by using the term ‘NULL’ in the filter expression to represent the blank cells.

Users can filter cells that contain values with leading single quotes by typing an additional single quote before the value in the Filter Specification dialog. For example, to filter a value like ‘000123’, users need to type “000123” in the Custom Filter Entry dialog.

Filter Selection dialog

When the Selection (Exclude) option is selected in the Options tab, clicking Add displays the Filter Selection dialog.

Filter_Selection_dialog.png 

Note:If this table is blank, click Cancel, check the Retain Intermediate Results checkbox in the Options tab, and then click Add. The table will be populated.

Use the pointer to select part or all of a worksheet.

Click OK to exit the Filter Selection dialog, or click Cancel to exit the dialog without excluding the selected cells.

Note:Adding a User Selection filter to the list in the Specify Filter field results in the Move Up/Move Down buttons in the Data Wizard becoming disabled.

Custom Filter Entry dialog

This section contains the syntax and some of the codes that can be used to create custom inclusion filters for a worksheet. Any standard SQL syntax can be used. Custom filters are defined in the Cus­tom Filter Entry dialog.

Custom_Filter_Entry_dialog.png 

In the Custom Filter Entry dialog, type the custom filter.

Click OK to add the filter or click Cancel to exit the dialog without adding the filter.

Only one filter can be added at a time in the Custom Filter Entry dialog.

Custom filter syntax: 

[Column name] [Expression] [Value] [Logical operator][Column name] [Expression] [Value] 

Example: Subject='GW' AND Time <= 12 

Logic operators: 

Statement grouping is done using the Boolean AND, OR, and NOT operators. Use parentheses to group clauses and force precedence. The AND operator has precedence over other operators.

Example: (Subject='JDW' OR Subject='LEJ') AND Conc > 1 

When creating comparison expressions, the following operators are allowed:

Operator

Description

=

Equals

<>

Not Equals

<

Less than

<=

Less than or equal

>

Greater than

>=

Greater than or equal

Custom filter rules: 

Results

Note:Any step in the Data Wizard can be revisited, revised, and re-executed. All of the steps performed are described carefully in the Summary part of the Output.

All of the steps in the Data Wizard generate output. The output forms specific to a particular kind of operation are specified below. In general, each step in the Data Wizard will create the following out­put:

Worksheet

Content

Result

The final result of all operations in the step.

Summary

Listing of all steps in the Data Wizard.

Settings

The settings used in all steps.

Additional output when intermediate results are retained

Final Results

The Results, Summary, and Settings that represent all steps.

Secondary

Non-empty intermediate results that are not used in the final results above.

Additional output when intermediate results are retained

Final Results

The Results, Summary, and Settings that represent all steps.

Intermediate Results

All non-empty intermediate results generated by the steps.

 


Last modified date:6/26/19
Certara USA, Inc.
Legal Notice | Contact Certara
© 2019 Certara USA, Inc. All rights reserved.