Use last element as end time #58

magnusuMET · 2024-11-01T13:15:52Z

Added an additional fix for type comparisons

avaldebe · 2024-11-01T13:41:53Z

src/pyaro/timeseries/Filter.py

        end = datetime.min
        for s, e in self._start_include + self._startend_include + self._end_include:
            start = min(start, s)
-            end = max(end, s)
+            end = max(end, e)


Is this the cleanest way to set the start/end
Also, why is does start depends on self._end_include and end depends on self._start_include?

maybe itertools.chain can improve the readability

from itertools import chain [...] if self._start_include or self._startend_include: start = min(s for s, e in chain(self._start_include, self._startend_include)) else: start = datetime.max if self._end_include or self._startend_include: end = max(e for s, e in chain(self._end_include, self._startend_include)) else: end = datetime.min

I don't think the chain-version is very readable here.

I don't think the chain-version is very readable here.

Yes, at the cost of creating new lists this could be written on a simpler way

if self._start_include or self._startend_include: start = min(s for s, e in self._start_include + self._startend_include) else: start = datetime.max if self._end_include or self._startend_include: end = max(e for s, e in self._end_include + self._startend_include) else: end = datetime.min

In any case, my question still stands

Also, why is does start depends on self._end_include and end depends on self._start_include?

The tuple values of (start, end) are the time-ranges of the filter, while start_include/end_include refer to measurements starttime/endtime.

avaldebe · 2024-11-01T14:21:27Z

src/pyaro/timeseries/Filter.py

@@ -579,16 +579,15 @@ def init_kwargs(self):
        }

    def _index_from_include_exclude(self, times1, times2, includes, excludes):
-        idx = times1.astype("bool")
        if len(includes) == 0:


Looks like includes is assumed to be an Iterable, in which case
if len(includes) == 0: is equivalent to if not includes:.

With this in mind, this section can be written as:

idx = np.repeat(not includes, len(times1)) for start, end in includes: idx |= (np.datetime64(start) <= times1) & (times2 <= np.datetime64(end))

While idx = np.repeat(not includes, len(times1)) is written using very few characters, the boolean value of includes is not related to the boolean value of idx except for a chain of action, e.g. we could also end up with idx = np.repeat(includes, ...) if we would use the idx differently.

Rather than having to add long comments, please use rather more code.

heikoklein

Thanks, looks good.
Please check if we could improve the code by internally storing all datetimes as np.datetime64[s] directly?

heikoklein · 2024-11-01T14:17:36Z

src/pyaro/timeseries/Filter.py

@@ -614,7 +613,7 @@ def envelope(self) -> tuple[datetime, datetime]:
        end = datetime.min
        for s, e in self._start_include + self._startend_include + self._end_include:
            start = min(start, s)
-            end = max(end, s)
+            end = max(end, e)


Thanks, very well that you also included a test.

heikoklein · 2024-11-01T14:25:26Z

src/pyaro/timeseries/Filter.py

            for start, end in includes:
-                idx |= (start <= times1) & (times2 <= end)
+                idx |= (np.datetime64(start) <= times1) & (times2 <= np.datetime64(end))



I'm not sure why the conversion to np.datetime64 is needed here, or, more specific, if operations need to be done as np.datetime64 then maybe _str_list_to_datetime_list should do the conversion already?

Probably should store dt's as numpy format, opened issue #59 for this

heikoklein · 2024-11-01T14:28:26Z

src/pyaro/timeseries/Filter.py

        end = datetime.min
        for s, e in self._start_include + self._startend_include + self._end_include:
            start = min(start, s)
-            end = max(end, s)
+            end = max(end, e)


I don't think the chain-version is very readable here.

Use last element as end time

ca4f826

magnusuMET requested a review from heikoklein November 1, 2024 13:15

avaldebe reviewed Nov 1, 2024

View reviewed changes

Fix type comparison error

d14a17f

avaldebe reviewed Nov 1, 2024

View reviewed changes

heikoklein approved these changes Nov 1, 2024

View reviewed changes

magnusuMET mentioned this pull request Nov 1, 2024

Store times as np.datetime64 #59

Closed

magnusuMET merged commit 923faeb into metno:main Nov 1, 2024
2 checks passed

magnusuMET deleted the bugfix/endtime branch November 1, 2024 14:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use last element as end time #58

Use last element as end time #58

magnusuMET commented Nov 1, 2024 •

edited

Loading

avaldebe Nov 1, 2024 •

edited

Loading

heikoklein Nov 1, 2024

avaldebe Nov 1, 2024

heikoklein Nov 1, 2024

avaldebe Nov 1, 2024 •

edited

Loading

heikoklein Nov 1, 2024

heikoklein left a comment

heikoklein Nov 1, 2024

heikoklein Nov 1, 2024

magnusuMET Nov 1, 2024

heikoklein Nov 1, 2024

Use last element as end time #58

Use last element as end time #58

Conversation

magnusuMET commented Nov 1, 2024 • edited Loading

avaldebe Nov 1, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

avaldebe Nov 1, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

heikoklein left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

magnusuMET commented Nov 1, 2024 •

edited

Loading

avaldebe Nov 1, 2024 •

edited

Loading

avaldebe Nov 1, 2024 •

edited

Loading