
Conversation

@qian-chu
Collaborator

Developers and users have found the old Epochs class quite unintuitive and hard to work with (e.g., the nested DataFrame). This PR reorganizes the class.

Current features:

  • Slight change to the Events class: event IDs (e.g., "fixation id") are now used as the index. For messages and custom events, the index is named "event id"
  • event_times is renamed to epochs_info for easier comprehension
  • Creating an Epochs instance will not compute the epochs or annotated data automatically.
  • epochs is now a cached property in the form of a dictionary: keys are indices of epochs_info and values are Stream/Events instances (see the usage sketch after this list).
  • Annotated data are now available through annotate_epochs.
  • (in progress) Added a pupillary light reflex (PLR) dataset (2 participants, data in cloud and native formats). Currently exploring options for hosting the data on FigShare vs. OSF
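A minimal usage sketch of the reworked API as described above (the constructor arguments and variable names are illustrative, not the final signature):

```python
# Hedged sketch of the reworked Epochs API; constructor arguments and
# variable names are illustrative.
epochs = Epochs(gaze, epochs_info)  # nothing is computed at construction time

epochs.epochs_info   # DataFrame with one row per epoch (t_ref, t_before, t_after, ...)
epochs.epochs        # cached dict: {epochs_info index -> Stream | Events instance}
epochs.epochs[0]     # data of the first epoch

annotated = epochs.annotate_epochs()  # source data annotated with epoch membership
```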

qian-chu and others added 15 commits December 19, 2025 15:34
Refactored the Epochs class to use a dictionary of epochs indexed by epoch index, added properties for empty epochs, and improved overlap checking and warnings. Updated the to_numpy method to support flexible sampling rates and interpolation, and improved baseline correction error handling. Modified plot_epochs and its helpers to work with the new Epochs API, and updated the pupil_size_and_epoching tutorial to use the new sample data and API.
Expanded and clarified docstrings for epoching functions and the Epochs class, including detailed parameter and return value descriptions. Refactored Dataset to automatically handle native data without requiring a 'custom' flag, and improved section construction for native recordings. Updated tutorials to use consistent event naming and native data examples. Minor bugfixes and doc improvements in export and utility modules.
@qian-chu requested a review from JGHartel December 29, 2025 13:54
@qian-chu
Collaborator Author

@JGHartel The baseline correction method has not been updated yet. What do you think would be the best API/code for it? For interoperability with other methods such as plot(), I think it makes the most sense to modify the data in the epochs property
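To make the suggestion concrete, here is a hedged sketch of baseline correction that mutates the data behind the cached epochs dict so that plot() and friends see corrected values (all names, the baseline window convention, and the per-epoch DataFrame layout are assumptions, not the final API):

```python
import pandas as pd

def baseline_correct(epochs: dict[int, pd.DataFrame],
                     baseline: tuple[int, int] = (-100_000_000, 0)) -> None:
    # Hypothetical sketch: assumes each epoch's DataFrame is indexed by
    # epoch time in ns relative to t_ref; window bounds are inclusive.
    for i, df in list(epochs.items()):
        if df is None or df.empty:
            continue
        window = df.loc[baseline[0]:baseline[1]]  # samples inside the baseline window
        epochs[i] = df - window.mean()            # subtract per-column baseline mean
```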

Collaborator

@JGHartel left a comment


Overall, I welcome the suggested changes. In particular, I appreciate the rename to epochs_info and the unified access to epoch data as epochs.epochs, rather than the previous mixed access. Still, I would like to invite some discussion of the assumed datatypes, e.g. epochs.epochs and the output of annotate_epochs being dictionaries. This would especially future-proof development if dataset-level operations are introduced.

``t_after``: Time after the reference time to end the epoch.
``description``: Description or label associated with the epoch.
"""

Collaborator


I will defer to you on this, but I personally think it is a bit over the top to use decorators for the docstrings. Besides generating the API docs, sometimes one just wants to look at the functions/methods themselves, and I would not want to have to dig through a dictionary of docstring defaults for that.
That being said, the current ones do seem simple enough to be redundant in code.
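For reference, the kind of decorator pattern under discussion looks roughly like the following (hypothetical names, loosely modeled on MNE-Python's fill_doc; not PyNeon's actual code):

```python
# Hypothetical illustration of decorator-filled docstrings; not PyNeon code.
_docdict = {
    "t_after": "t_after : Number\n        Time after the reference time to end the epoch.",
}

def fill_doc(func):
    # Substitute shared parameter descriptions into the docstring at import time.
    func.__doc__ = func.__doc__.format(**_docdict)
    return func

@fill_doc
def create_epochs(t_after):
    """Create epochs around reference times.

    Parameters
    ----------
    {t_after}
    """
```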

pyneon/epochs.py Outdated
-        # Create epochs
-        self.epochs, self.data = _create_epochs(source, times_df)
+    @cached_property
+    def epochs(self) -> dict[int, Stream | Events | None]:
Collaborator


Flagging this in case we ever want to support multi-recording functionality. It would be good to have a unique ID for each event, so that one can easily concat these dicts between recordings. Alternatively, we could construct an implicit MultiIndex (recording, epoch_number), which would require a custom concat function; see the sketch below.
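For illustration, a minimal version of the (recording, epoch) MultiIndex idea with a custom concat (all names are hypothetical):

```python
import pandas as pd

def concat_epochs_info(infos: dict[str, pd.DataFrame]) -> pd.DataFrame:
    # Stack per-recording epochs_info frames under a (recording, epoch) MultiIndex.
    return pd.concat(infos, names=["recording", "epoch index"])

# Toy usage with two made-up recordings:
info_a = pd.DataFrame({"t_ref": [100, 200]})
info_b = pd.DataFrame({"t_ref": [150]})
combined = concat_epochs_info({"rec-001": info_a, "rec-002": info_b})
print(combined.loc[("rec-001", 0)])  # epoch 0 of recording rec-001
```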

Collaborator Author


This seems like a rather advanced use case, and in principle it should also apply to other classes (e.g. streams and events). My evaluation is that it would require another PR if we see value in supporting multi-recording concatenation.

pyneon/epochs.py Outdated
Comment on lines 437 to 460
     ts = source.ts if isinstance(source, Stream) else source.start_ts
+    source_index = source.data.index
+    annot = {i: [] for i in source_index}  # Initialize empty lists for each index

-    # Iterate over each event time to create epochs
-    for i, row in times_df.iterrows():
-        t_ref_i, t_before_i, t_after_i, description_i = row[
-            ["t_ref", "t_before", "t_after", "description"]
-        ].to_list()
+    empty_epochs = []
+    for i, row in epochs_info.iterrows():
+        t_ref_i, t_before_i, t_after_i = row[["t_ref", "t_before", "t_after"]].to_list()

         start_time = t_ref_i - t_before_i
         end_time = t_ref_i + t_after_i
         mask = np.logical_and(ts >= start_time, ts <= end_time)

         if not mask.any():
-            warnings.warn(f"No data found for epoch {i}.", RuntimeWarning)
-            epochs.at[i, "epoch data"] = pd.DataFrame()
+            empty_epochs.append(i)
             continue

-        data.loc[mask, "epoch index"] = i
-        data.loc[mask, "epoch description"] = str(description_i)
-        data.loc[mask, "epoch time"] = (
-            data.loc[mask].index.to_numpy() - t_ref_i
-        ).astype("int64")
+        # Annotate the data with the epoch index
+        for idx in source_index[mask]:
+            annot[idx].append(i)

-        local_data = data.loc[mask].copy()
-        local_data.drop(columns=["epoch index", "epoch description"], inplace=True)
-        epochs.at[i, "data"] = local_data
+    if empty_epochs:
+        warn(f"No data found for epoch(s): {empty_epochs}.", RuntimeWarning)

-    return epochs, data
+    return annot
Collaborator


Why do you want to return a dict here? Would it not be better to have a df?

Collaborator Author


You are right that a df seems intuitive in this case (with an "epoch_index" column of object type)
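For illustration, something like this (made-up values; the object-dtype column holds the list of epochs each sample belongs to):

```python
import pandas as pd

# Illustrative shape of the annotation as a df instead of a dict:
# one row per source sample, "epoch_index" holds a list of epoch indices.
annot = pd.DataFrame(
    {"epoch_index": [[0], [0, 1], [1], []]},
    index=pd.Index([1001, 1002, 1003, 1004], name="timestamp [ns]"),
)
```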

Comment on lines +597 to +623
def detect_events_from_derivative(
    self,
    column: str,
    threshold: Number,
    min_duration: Optional[Number] = None,
) -> "Events":
    """
    Detect events based on velocity thresholding of a specified data column.

    Parameters
    ----------
    column : str
        The data column on which to compute velocity for event detection.
    threshold : Number
        Threshold above which an event is detected.
        The units depend on the units of the specified column
        and the timestamps (e.g., px/s for gaze position in pixels).
    min_duration : Number, optional
        Minimum duration (in seconds) for an event to be valid.

    Returns
    -------
    Events
        An Events object containing the detected events with
        ``start timestamp [ns]`` and ``end timestamp [ns]`` columns.
    """
Collaborator


placeholder?

Collaborator Author


Yeah, this is an idea I have for detecting events like saccades or blinks from the derivative of stream data (gaze, pupil diameter).
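For concreteness, a self-contained sketch of that idea: threshold the first derivative of one column and merge contiguous supra-threshold samples into events. The function body and run-detection logic are assumptions; only the output column names come from the docstring above.

```python
import numpy as np
import pandas as pd

def detect_events_from_derivative(data: pd.DataFrame, column: str,
                                  threshold: float,
                                  min_duration: float | None = None) -> pd.DataFrame:
    # Sketch only: assumes ``data`` is indexed by timestamps in ns.
    ts = data.index.to_numpy().astype("int64")
    vel = np.gradient(data[column].to_numpy(), ts / 1e9)  # derivative per second
    above = np.abs(vel) > threshold                       # supra-threshold samples

    # Find contiguous runs of supra-threshold samples.
    edges = np.diff(above.astype(int))
    starts = np.flatnonzero(edges == 1) + 1
    ends = np.flatnonzero(edges == -1)
    if above[0]:
        starts = np.r_[0, starts]
    if above[-1]:
        ends = np.r_[ends, above.size - 1]

    events = pd.DataFrame({"start timestamp [ns]": ts[starts],
                           "end timestamp [ns]": ts[ends]})
    if min_duration is not None:
        dur_s = (events["end timestamp [ns]"] - events["start timestamp [ns]"]) / 1e9
        events = events[dur_s >= min_duration].reset_index(drop=True)
    return events
```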
