Performance Analysis
=====================
Python-enclined scientists do not use any Units of Measures (UoM) checking
library with the computation overhead they often induce during execution.
Python is a dynamic language, and most tools provide dynamic checking of
the UoM, every time a function is called with potentially different
parameters and different units of measures.
impunity statically checks the UoM of variables in the code based on
annotations and modifies the AST at definition time. We show in the
following that impunity is a lot quicker than four existing libraries,
chosen among other Python libraries dealing with UoM for their support,
regular updates and community.
- `numericalunits `_ uses
a complete set of independent base units (meters, kilograms, seconds,
coulombs, kelvins) that are defined as *randomly-chosen positive
floating-point* numbers, different for all executions. If units are consistent,
the randomness disappears; if not, two executions of the same code return
different values. All other units and constants are defined in terms of
those. In a dimensionally-correct calculation, the units all cancel out,
so the final answer is deterministic, not random. In a dimensionally-incorrect
calculations, there will be random factors causing a randomly-varying
final answer.
- `astropy `_
is a Python package offering functionalities aimed at astronomers
and astrophysicists. It also includes an implementation of the
quantity design pattern. astropy also implements a decorator to check
calls of functions with quantity parameters.
- `Pint `_
also provides an implementation of the quantity
design pattern as a standalone library. It is
flexible and provides good integration with other
scientific libraries like Pandas (through extension types) or NumPy.
- `Quantities `_
is designed to handle arithmetic and conversions of physical quantities,
which have a magnitude, dimensionality specified by various units,
and possibly an uncertainty. Quantities builds on the popular NumPy
library and is designed to work with NumPy’s standard ufuncs,
many of which are already supported.
For each library, two different use-cases are considered based on the example
*speed* function. One with variables annotated with the correct units
(meters and seconds), and one with different but commensurable units (meters
and hours). In both cases, two NumPy arrays of shape (10000,) are sent as
parameters. The computation time over 300 iterations is then averaged.
Execution times for both use-cases are displayed here :
.. figure:: images/impunity_performance.svg
:name: my-custom-label
As we can see, the overhead induced by impunity is minimal.
This is mainly due to the difference between the dynamic
checking of the other libraries and the static analysis
done by impunity. By changing the AST directly before execution,
impunity limits its overhead to the multiplications added to keep
the units coherent between each others. This is also why, when
UoM are identical, the overhead is non-existent.
.. Statically checking unit coherence and modifying the AST offers several
.. performance advantages over dynamic approaches. Let's explore these
.. advantages in detail:
.. 1. Early Detection of Unit Inconsistencies:
.. Static analysis of annotations allows for early detection of unit
.. inconsistencies during the compilation phase. This enables developers
.. to catch and address unit errors before executing the code. In contrast,
.. dynamic methods relying on if statements and Pint conversions only identify
.. unit inconsistencies at runtime, potentially causing errors during execution.
.. 2. Efficient and Optimized Execution:
.. By modifying the AST, **Impunity** ensures that the code executes with
.. coherent units, eliminating the need for runtime unit conversions. This
.. results in more efficient and optimized execution, as the conversions are
.. handled during the compilation process rather than repeatedly during runtime.
.. 3. Reduced Overhead and Computational Costs:
.. Statically checking the coherence of units and applying conversions at
.. compile-time significantly reduces the overhead and computational costs
.. associated with dynamic conversions. This can lead to improved performance,
.. especially in code segments that involve complex calculations or loops.
.. Examples
.. --------
.. To better understand the performance advantages of static unit coherence
.. checking and AST modification, let's consider a couple of examples:
.. Example 1: Loop with Dynamic Conversion using Pint
.. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.. .. code-block:: python
.. import pint
.. ureg = pint.UnitRegistry()
.. Q_ = ureg.Quantity
.. def calculate_velocity(distance: "feet", time: "minutes") -> "ft / mn":
.. converted_distance = Q_(distance, ureg.feet)
.. converted_time = Q_(time, ureg.minutes)
.. return converted_distance / converted_time
.. distance = Q_([1, 2, 3], ureg.meter)
.. time = Q_([2, 3, 4], ureg.second)
.. for d, t in zip(distance, time):
.. velocity = calculate_velocity(d, t)
.. print(velocity)
.. In this example, a loop iterates over lists of `distance` and `time`
.. values. To perform unit conversion, each value is multiplied by the
.. respective Pint unit (`ureg.feet` and `ureg.minutes`). The overhead of
.. repeatedly performing conversions within the loop can impact performance,
.. especially for large datasets as it needs to check each value one by one.
.. It is also prone to Dimensionality Errors if `calculate_velocity` is not given
.. a length quantity.
.. Example 2: Loop with Static Coherence Checking and AST Modification
.. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.. .. code-block:: python
.. from impunity import impunity
.. @impunity
.. def calculate_velocity(distance: "feet", time: "minutes") -> "ft / mn":
.. return distance / time
.. @impunity
.. def test_impunity():
.. distance: "meters" = [1, 2, 3]
.. time: "seconds" = [2, 3, 4]
.. for d, t in zip(distance, time):
.. velocity = calculate_velocity(d, t)
.. print(velocity)
.. test_impunity()
.. In this example, the `calculate_velocity` function is decorated with
.. `@impunity` to ensure unit coherence. The loop iterates over the lists
.. of `distance` and `time`, invoking the decorated function for each pair.
.. The static unit coherence checking and AST modification performed by
.. **Impunity** eliminate the need for explicit conversions within the
.. loop, resulting in improved performance.
.. Conclusion
.. ----------
.. Static unit coherence checking and AST modification provided by the
.. **Impunity** library offer significant performance advantages over
.. dynamic approaches. By detecting unit inconsistencies early, ensuring
.. efficient execution, and reducing overhead and computational costs,
.. **Impunity** enables developers to work with coherent units
.. seamlessly and achieve optimal performance.
.. By adopting static coherence checking and AST modification,
.. you can enhance the performance of your code, especially
.. in scenarios involving complex calculations, loops, and large datasets.