The soft time limit for celery was not activated

Question

The soft time limit for celery was not activated

I am facing an issue with a celery task where the soft limit is set at 10 and the hard limit at 32:

from celery.exceptions import SoftTimeLimitExceeded
from scrapy.crawler import CrawlerProcess
from scrapy.utils.project import get_project_settings

@app.task(bind=True, acks_late=False, time_limit=32, soft_time_limit=10)
def my_task(self, **kwargs):
    try:
       if 'twisted.internet.reactor' in sys.modules:
            del sys.modules['twisted.internet.reactor']
        settings = get_project_settings()
        process = CrawlerProcess(settings)
        process.crawl(**kwargs)
        process.start()

    except SoftTimeLimitExceeded as te:

        print('Time Exceeded...')

The code mentioned above runs correctly. However, when the crawl operation exceeds the soft limit, no exception is triggered. The crawl operation continues until the hard limit is reached, causing this error to be displayed:

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/site-packages/billiard/pool.py", line 684, in on_hard_timeout
    raise TimeLimitExceeded(job._timeout)
billiard.exceptions.TimeLimitExceeded: TimeLimitExceeded(32,)

I have tried catching this error within the task but was unsuccessful. To test, I replaced the process.start() command with time.sleep(50) to create a delay without starting any crawl operation:

@app.task(bind=True, acks_late=False, time_limit=32, soft_time_limit=10)
def my_task(self, **kwargs):
    try:
       if 'twisted.internet.reactor' in sys.modules:
            del sys.modules['twisted.internet.reactor']
        settings = get_project_settings()
        process = CrawlerProcess(settings)
        process.crawl(**kwargs)
        time.sleep(50)

    except SoftTimeLimitExceeded as te:
        print('Time Exceeded...')

The catch occurs for SoftTimeLimitExceeded. What could be the reason for this?

Versions

celery==5.2.7

Scrapy==2.6.1

python exception scrapy celery taskmanager

Answer 1

Answer №1

Experiencing the same issue on my end.

I suspect that the error "SoftTimeLimitExceeded" is being caught in your script, preventing it from being raised externally.

You should review your script for any expected Exceptions and either remove them or limit their scope.

 settings = get_project_settings()
 process = CrawlerProcess(settings)
 process.crawl(**kwargs)

This is just my suggestion. I am testing it out on my end, and will continue to provide updates here.

Answer 2

Experiencing the same issue on my end.

I suspect that the error "SoftTimeLimitExceeded" is being caught in your script, preventing it from being raised externally.

You should review your script for any expected Exceptions and either remove them or limit their scope.

 settings = get_project_settings()
 process = CrawlerProcess(settings)
 process.crawl(**kwargs)

This is just my suggestion. I am testing it out on my end, and will continue to provide updates here.

The soft time limit for celery was not activated

Versions

Answer №1

Similar questions

Exploring the dynamic capabilities of Pandas with the use of .cut and

Python encountered an ibm_db exception: [IBM][CLI Driver] SQL4917N The element "SQLE_CLIENT_INFO_WRKSTNNAME" in the option array is invalid. SQLCODE=-4917

Modifying the index value in a list within a Tower of Lists

Having difficulty modifying the custom_field in Jira using Python

Navigate through URLs without using wildcards

Encountering a 403 error while trying to access the G Suite Admin SDK through the google-api-python-client

Error encountered when trying to import a file from a specific directory in Python: `ModuleNotFoundError`

Algorithm making inaccurate predictions due to flawed machine learning model

Bringing in text using pandas in Python

Is it possible to scrape using Python Beautiful Soup only when the text matches?

Using Python and Selenium to automate searches in the Facebook search bar

What could be the reason for Python not handling errors in list comprehension?

The `get_attribute` function in Python's Selenium module

Combining Starlette and pydantic for optimal performance

The Iterative Minimax Algorithm for Tic Tac Toe

Ways to spin characters in a python text

Tips on preventing built-in functions from appearing when using the dir function on my modules

Default tags in Django Sentry are configured to categorize and label specific

Unable to assign a value to a variable

Showing the heading again after a new page starts