Categorizable Histograms are a special type of histogram that lets you create groups of respondents (denoted by the red bars) manually, by equal proportions, or by equal intervals. You can also generate new variables from these groups in your data set.
Technical details
Description
This is a legacy chart and is not R-based. If you only need histograms and do not need to create new variables from them, see Visualization - Distributions - Histogram for the R-based version, which can also show tests and has more customization options.
Inputs
A numeric variable set. You can also select a categorical variable set in the Columns to see how many cases from each column fall into each group for each row.
Properties tab
HISTOGRAM CATEGORIES
- Generate categories: whether or not to create new variables from the groups defined on the histogram(s). These groups are calculated using data across all histograms, not individually. If the visualization shows multiple histograms, a Nominal-Multi variable set is created with one variable for each histogram. Settings can be one of:
  - Do Not Generate - do not create new variables using this histogram.
  - With equal proportions - a starting point, where the data is split into 3 categories with equal proportions (e.g. 33% each, or as close as the data allows).
  - With equal intervals - an alternative starting point, where the 3 categories are equally spaced between the minimum and maximum.
- Number of categories: only available with the equal proportions or equal intervals options above. The number of groups that will be created for the new variable set.
- Categorize each numeric variable individually: for visualizations with multiple histograms, whether manual changes (you can select and drag the red lines) affect only the groups for that specific variable (checked) or all variables plotted (unchecked). If you are also using a variable set in the Columns, note that the red bars/categories will be the same across all columns, because only one new variable is created for each numeric variable in the Rows.
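As a rough illustration of the difference between the equal proportions and equal intervals starting points, the following Python sketch computes cut points for three categories. It is not the implementation used by Displayr or Q: the function names, the sample data, the choice of quantile method, and the way the two variables are pooled are all assumptions made for this example.

```python
from statistics import quantiles


def equal_interval_cutoffs(values, n_categories=3):
    """Cut points that split the range [min, max] into equally wide intervals."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_categories
    return [lo + width * i for i in range(1, n_categories)]


def equal_proportion_cutoffs(values, n_categories=3):
    """Cut points at quantiles, so each category holds a similar share of cases.

    Assumption: the "inclusive" quantile method; the real product may use another.
    """
    return quantiles(values, n=n_categories, method="inclusive")


# Made-up data for one numeric variable.
ages = [18, 19, 21, 22, 25, 30, 34, 41, 58, 90]

print(equal_interval_cutoffs(ages))    # [42.0, 66.0]
print(equal_proportion_cutoffs(ages))  # [22.0, 34.0]

# Hypothetical second variable, to mimic cut points calculated across all
# histograms (pooled) rather than for each variable individually.
incomes = [10, 20, 30, 40, 50, 60]
pooled_cutoffs = equal_interval_cutoffs(ages + incomes)
print(pooled_cutoffs)
```

With skewed data such as the ages above, the two options give very different cut points: equal intervals leaves most cases in the first category, while equal proportions spreads them roughly evenly.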
Chart tab
CATEGORY LABELS
- Font settings for the Group names at the very top above the histograms.
CATEGORY LINE
- Select the category line you'd like to edit.
- Category cutoff point: you can enter a specific number for the cutoff instead of dragging and dropping the line.
- Line color and Font settings affect only the red category lines.
- Delete Category Line will always delete the last category line shown on the graph, no matter which line is selected in the dropdown.
CATEGORY SIZES
- Font settings for the numbers shown on the plot that show the number of cases in each Group.
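To make the link between category cutoff points and category sizes concrete, here is a minimal Python sketch that assigns cases to categories from a list of cutoffs and counts the cases in each. The function names and data are hypothetical, and the rule that a value equal to a cutoff falls into the higher category is an assumption, not documented behavior.

```python
from bisect import bisect_right
from collections import Counter


def categorize(values, cutoffs):
    """Return a 1-based category number for each value.

    Assumption: a value equal to a cutoff falls into the higher category.
    """
    ordered = sorted(cutoffs)
    return [bisect_right(ordered, v) + 1 for v in values]


def category_sizes(values, cutoffs):
    """Count the cases in each category, keeping empty categories as 0."""
    counts = Counter(categorize(values, cutoffs))
    return [counts.get(i, 0) for i in range(1, len(cutoffs) + 2)]


# Made-up data for one numeric variable.
ages = [18, 19, 21, 22, 25, 30, 34, 41, 58, 90]

print(category_sizes(ages, [22, 34]))  # [3, 3, 4]
print(category_sizes(ages, [20, 50]))  # [2, 6, 2]

# Removing the last category line merges the top two categories.
print(category_sizes(ages, [22]))      # [3, 7]
```

Two cutoffs always produce three categories, so moving a cutoff changes the sizes on either side of it, and deleting a cutoff merges the two neighbouring categories.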
Output
Method
- In Displayr: How to Categorize Numeric Data
- In Q: Show Data as > Histograms