When uploading a dataset, you can select a subset of properties and configure each of them. The aim of this pre-configuration step is to define what will be shown by default in every new SAR Slides created for this dataset.
We review in details what can be done, why it can be useful to do it at the dataset import level, and provide detailed explanation of the available configurations.
Properties of interest
Upon uploading a dataset, you must define at least one, and optionally several properties using the Default Properties select box.
.png)
These properties are typically the main endpoints of interest for your current research project that you typically look at when assessing your target compound profile: primary activities, ADMET properties, ...
When selecting the first (mandatory) one, it will be flagged as being the main endpoint. This might typically be a primary IC50 activity, or any other primary endpoint of interest regarding your current compound design objectives. The property flagged as such will be used by default to sort fragments in SAR Galleries.
All other properties that will be optionally added during this step will also be loaded by default and available in SAR Slides. If you know that for most report you'll perform on a specific dataset, some properties will be required most of the time for your analysis, creating this list at the dataset import step will prevent you from redefining it over and over again for every new SAR Slide created.
It's interesting to have that pre-configuration step aiming at a particular state that would be relevant to your current objective, for example:
By default, every new MMP SAR Slide created will show the best transformations regarding our main pIC50 endpoint, highlight transformations that have a significant effect in green (good change) or red (bad change), and show additional endpoints that are of interest for this dataset.
Although optional, it's advised to take time to configure them, again, for the sake of not re-doing it for every new SAR Slide.
Individual Property configuration
This is the current attributes that you can define for every properties added upon importing a new dataset. Let's review each of them:
.png)
- Property Name & Type: Name of the property as defined in your dataset source. You can alias it using the edit button (pencil icon) next to the property name. Below is displayed the type of the property as automatically detected (that you can change as well) along with the number of valid and non-empty values.
- Direction: Are you trying to optimize this property towards high or low values ? This information is currently used with MMPs SAR Slides to (1) order transformations to show the best ones by default, (2) determine whether an activity change between two compound is good or bad, which, together with the threshold (see bellow), will translate into colored values reflecting the effect of a given transformation on the endpoint.
- Threshold Type: Thresholds are used to identify significant activity change between two molecules. Significant changes will be colored in green if the change is good, or red if the change is bad, depending on the directionality you defined.
Two types of thresholds can be used:- Absolute: Use absolute value of the difference between activities.
- Fold: Use ratio between the other compound value and the reference activity value. Suited for non-linear activity scales, typically, IC50s.
- Change Threshold: Value used to determine whether a change between two property values is significant or not.
- Main ?: Defines the main property, or primary endpoint. This property will be used by default to sort fragments in SAR Galleries, which translates into, show the "best" transformations first.
You have a pIC50 evaluating inhibition effect of your compound on your primary target. You would typically define the direction as Higher, the threshold type as "Absolute" and the change threshold = 1 (which corresponds to a 10-folds change), and toggle the Main radio button for this property.
Check out what would happen by taking compound 27 (pIC50 = 6.93) as a reference compound:
.png)
We defined that higher values are good, with a threshold of 1. Since compound 38 has a pIC50 of 8.05, and compound 27 (reference) has a pIC50 of 6.93, the difference between these two is +1.12, which is greater than the threshold. The value for compound 38 ends up being colored in green: this is a good, significant (according to the threshold) change.
Note that instead of displaying pIC50 values, we can display activity differences instead, which makes it easier to interpret the default coloring:
.png)
If you edit the threshold value and put 2 instead, you would end up with no coloring in this example, since there is no absolute activity change greater or equals that 2:
.png)
Note that all this information can then be changed once in the SAR Slides. Defining some relevant default configuration will however save you time, since once done, it will be applied to every new SAR Report created on the dataset.
