Allow unrecognized units by rprospero · Pull Request #188 · SasView/sasdata

rprospero · 2026-02-20T11:56:45Z

As mentioned in #172, users might want to track units that aren't necessarily SI measurements (e.g. Slices per Pizza, Pages per Book). This PR adds an ArbitraryUnits class which can track these sorts of units while still interactive with the regular Units and NamedUnits` classes.

It also contains a modernisation of the utest_units.py test file so that the tests can be properly searched and controlled through pytest.

The tests now properly appear in the pytest list of tests. Additionally, the tests are given readable parameterised names, but display the actual values on failure.

codescene-delta-analysis

No quality gates enabled for this code.

See analysis details in CodeScene

Quality Gate Profile: Custom Configuration
Install CodeScene MCP: safeguard and uplift AI-generated code. Catch issues early with our IDE extension and CLI tool.

krzywon · 2026-03-02T14:53:07Z

sasdata/quantities/_units_base.py

-#
-#

+class ArbitraryUnit(NamedUnit):


Arbitrary unit (a.u.) is used for SAS data not on absolute scale. I think UnknownUnit or similar would better match the current naming scheme and not cause confusion.

You're right about the confusion. I've switched to UnknownUnit

krzywon · 2026-03-02T14:56:22Z

sasdata/quantities/_units_base.py

+                 numerator: str | list[str] | dict[str, int],
+                 denominator: None | list[str] | dict[str, int]= None):


What should these strings and/or lists look like? Documentation on this would be helpful. Currently, the unit converter allows a*a, a**2, and a^2. What happens if a^2/b is passed as the numerator? Does it matter?

Likely it won't matter, since UnknownUnit should only be called after a failed parse that has already split on those characters. However, just in case, I've added validation that throws a runtime error if invalid characters are in any part of the unit.

This is contained in the _valid_name static method

krzywon · 2026-03-02T14:58:31Z

sasdata/quantities/_units_base.py

+            case str():
+                self._numerator = {numerator: 1}
+            case list():
+                self._numerator = {}
+                for key in numerator:
+                    if key in self._numerator:
+                        self._numerator[key] += 1
+                    else:
+                        self._numerator[key] = 1
+            case dict():
+                self._numerator = numerator
+            case _:
+                raise TypeError


This match/case block is almost exactly the same as the one for denominator. Could this be made into a separate function that returns the resulting dictionary?

Refactoring out the repetition also made the string validation simpler. The separate function is now the _parse_arg class method.

krzywon · 2026-03-02T15:04:06Z

sasdata/quantities/_units_base.py

+            case (_, []):
+                return " ".join(num)
+            case ([], _):
+                return "1 / " + " ".join(den)


This will give something like 1 / A B C which will create ambiguity for B and C. Maybe return "1 / (" + " ".join(den) + ")"? I would suggest something similar in the default case as well.

I've added parentheses, but only in the case where there are multiple terms in the denominator. Thus, it will appear as "1 / (A B C)", but the single term will still appear as "1 / A"

krzywon · 2026-03-02T15:10:14Z

sasdata/quantities/units.py

+        return str(self)
+
+    @staticmethod
+    def parse(unit_string: str) -> "Unit":


standardize_units and _format_unit_structure in the existing sasdata.data_util.nxsunit module already has a basic version of this you might want to look at.

rprospero · 2026-03-03T13:49:49Z

This has NOT been merged, despite what GitHub claims. Rather, the discussion continues in #190.

rprospero and others added 10 commits February 20, 2026 11:51

Implement ArbitraryUnit class

72bd692

Refactor utest_units.py

abc78d6

The tests now properly appear in the pytest list of tests. Additionally, the tests are given readable parameterised names, but display the actual values on failure.

Add multiplication support for arbitrary units

2225c54

Add power support for arbitrary units

fbb1d28

Refactor arbitrary unit representations

48722d4

Enable arbitrary division

a236fd4

Rework display of arbitrary units

1ed0a97

Properly reduce terms and add rdiv for arbitrary units

4d82590

Remove unneeded function stubs

f69e363

[pre-commit.ci lite] apply automatic fixes for ruff linting errors

abc2f11

This comment was marked as outdated.

Sign in to view

Fix windows unicode printing issue in test

6e85365

rprospero force-pushed the 172_unrecognized_units branch from 662433f to 6e85365 Compare February 20, 2026 12:43

codescene-delta-analysis bot approved these changes Feb 20, 2026

View reviewed changes

rprospero requested a review from DrPaulSharp February 20, 2026 12:51

rprospero marked this pull request as ready for review February 20, 2026 13:14

krzywon reviewed Mar 2, 2026

View reviewed changes

rprospero merged commit 6e85365 into refactor_24 Mar 3, 2026
11 checks passed

rprospero deleted the 172_unrecognized_units branch March 3, 2026 13:26

rprospero mentioned this pull request Mar 3, 2026

172 unrecognized units (Again) #190

Merged

		numerator: str \| list[str] \| dict[str, int],
		denominator: None \| list[str] \| dict[str, int]= None):

Conversation

rprospero commented Feb 20, 2026

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

codescene-delta-analysis bot left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rprospero commented Mar 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants