Python code to verify if a specific condition is satisfied within a certain time period

I am trying to determine whether certain conditions are met over a period of time. The data is structured as follows:

Datetime             Valve1  Valve2
01/01/2020 11:00:01       1       0

The condition being evaluated is: (Valve1 = 1 for 1h) and (Valve2 = 0 for 1h). My attempt uses a rolling-sum technique:

data = data.set_index('Datetime', drop=True)
data.loc[((data.Valve1.rolling('1h').sum())==?) & ((data.Valve2.rolling('1h').sum())==0), 'alarm'] = 'Yes' 

The data should not be resampled or contain interpolated values. Note: missing datetimes inherit the Valve1 and Valve2 values from the previous available datetime.
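Since the timestamps are irregular, the number of rows falling inside any one-hour window varies, which is what makes the ? in the rolling sum hard to pin down. A count-independent sketch of the same check (assuming data is already indexed by Datetime and the valve columns hold strictly 0/1 values) could compare rolling min/max instead:

# Sketch only: Valve1 stayed at 1 for the whole window  <=>  its rolling minimum is 1,
# and Valve2 stayed at 0 for the whole window  <=>  its rolling maximum is 0.
cond = (data['Valve1'].rolling('1h').min() == 1) & (data['Valve2'].rolling('1h').max() == 0)
data.loc[cond, 'alarm'] = 'Yes'
# Caveat: rows whose window does not yet span a full hour may pass spuriously and need extra handling.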

The resulting table would look like this:

Datetime             Valve1  Valve2  Alarm
01/01/2020 11:00:01       1       0

Answer №1

Since you mentioned that resampling the data is not possible, here is a way to achieve the check without resampling.
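The session below assumes a DataFrame df whose Datetime column is already parsed to datetimes. Purely for reference, a frame matching the output shown further down could be reconstructed like this (a sketch, not part of the original answer):

import pandas as pd

# Sample data reconstructed from the output table below (sketch only).
df = pd.DataFrame({
    'Datetime': pd.to_datetime([
        '2020-01-01 11:00:01', '2020-01-01 11:00:15', '2020-01-01 11:30:00',
        '2020-01-01 11:30:45', '2020-01-01 12:00:10', '2020-01-01 12:15:00',
        '2020-01-01 12:15:30', '2020-01-01 12:16:45', '2020-01-01 13:17:00',
        '2020-01-01 13:20:15', '2020-01-01 13:21:30', '2020-01-01 13:45:08',
        '2020-01-01 14:00:00', '2020-01-01 14:01:15', '2020-01-01 14:30:00']),
    'Valve1': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0],
    'Valve2': [0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 1, 1, 1],
})

With such a frame in place, the rolling trick goes: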

# Prepend a copy of the first row so the very first rolling window has a seed value
>>> temp_df = pd.concat([df.copy().iloc[0, :].to_frame().T, df.copy()], axis=0, ignore_index=True)
# Force the prepended row to violate the condition, so no alarm fires before a full hour of data exists
>>> temp_df.loc[0, ['Valve1', 'Valve2']] = [0, 1]
# Row-wise flag: Valve1 == 1 and Valve2 == 0
>>> temp_df['alarm'] = temp_df.Valve1.eq(1) & temp_df.Valve2.eq(0)
# The rolling 1h product is 1 only if every row in the window satisfies the flag; drop the prepended row afterwards
>>> df['alarm'] = temp_df.set_index('Datetime').rolling('1h').agg({'alarm': pd.Series.product}).replace({1: 'Yes', 0: 'No'})[1:].values
>>> df

              Datetime  Valve1  Valve2 alarm
0  2020-01-01 11:00:01       1       0    No
1  2020-01-01 11:00:15       1       0    No
2  2020-01-01 11:30:00       1       0    No
3  2020-01-01 11:30:45       1       0    No
4  2020-01-01 12:00:10       1       0   Yes
5  2020-01-01 12:15:00       1       1    No
6  2020-01-01 12:15:30       1       0    No
7  2020-01-01 12:16:45       1       0    No
8  2020-01-01 13:17:00       1       0   Yes
9  2020-01-01 13:20:15       1       0   Yes
10 2020-01-01 13:21:30       1       0   Yes
11 2020-01-01 13:45:08       1       0   Yes
12 2020-01-01 14:00:00       0       1    No
13 2020-01-01 14:01:15       0       1    No
14 2020-01-01 14:30:00       0       1    No

Give this method a try.

Answer №2

  • Group with groupby() on the date and hour of the Datetime column
  • Apply the logical condition within each group to get one boolean per hour
  • merge() the result back onto your dataframe

import io
import pandas as pd

df = pd.read_csv(io.StringIO("""Datetime    Valve1  Valve2
01/01/2020 11:00:01 1   0
01/01/2020 11:00:15 1   0
01/01/2020 11:30:00 1   0
01/01/2020 11:30:45 1   0
01/01/2020 12:00:10 1   1
01/01/2020 12:15:00 1   1
01/01/2020 12:15:30 1   1
01/01/2020 12:16:45 0   1
01/01/2020 13:17:00 1   0
01/01/2020 13:20:15 1   0
01/01/2020 13:21:30 1   0
01/01/2020 13:45:08 1   0
01/01/2020 14:00:00 0   1
01/01/2020 14:01:15 0   1
01/01/2020 14:30:00 0   1
"""), sep="\t")

df.Datetime = pd.to_datetime(df.Datetime)

# Flag each (date, hour) group where Valve1 stayed at 1 and Valve2 stayed at 0 for every row
dfr = df.groupby([df.Datetime.dt.date, df.Datetime.dt.hour]).apply(lambda dfa: ((dfa.Valve1==1) & (dfa.Valve2==0)).all())

df = (df.merge(dfr.to_frame(), left_on=[df.Datetime.dt.date, df.Datetime.dt.hour], right_index=True)
 .drop(columns=["key_0","key_1"])
 .rename(columns={0:"Cond"})
)

              Datetime  Valve1  Valve2   Cond
0  2020-01-01 11:00:01       1       0   True
1  2020-01-01 11:00:15       1       0   True
2  2020-01-01 11:30:00       1       0   True
3  2020-01-01 11:30:45       1       0   True
4  2020-01-01 12:00:10       1       1  False
5  2020-01-01 12:15:00       1       1  False
6  2020-01-01 12:15:30       1       1  False
7  2020-01-01 12:16:45       0       1  False
8  2020-01-01 13:17:00       1       0   True
9  2020-01-01 13:20:15       1       0   True
10 2020-01-01 13:21:30       1       0   True
11 2020-01-01 13:45:08       1       0   True
12 2020-01-01 14:00:00       0       1  False
13 2020-01-01 14:01:15       0       1  False
14 2020-01-01 14:30:00       0       1  False
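
Note that this flags whole calendar hours (every row in the 11 o'clock hour, the 12 o'clock hour, and so on) rather than a true rolling one-hour window as in Answer №1. Under the same calendar-hour grouping, the merge step can be skipped by broadcasting a grouped all() back onto the rows; a minimal sketch, assuming the df parsed above:

# Sketch: same per-calendar-hour flag, computed without the intermediate frame and merge
hour_keys = [df.Datetime.dt.date, df.Datetime.dt.hour]
row_ok = df.Valve1.eq(1) & df.Valve2.eq(0)
df["Cond"] = row_ok.groupby(hour_keys).transform(lambda s: s.all())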
