Very Slow #27

jhonathas · 2018-04-23T13:42:56Z

The time between the file upload and the end of the scan is more than 10 seconds. Is this normal?

Is there a faster way? Local scanning takes no more than 200ms.

AndrewLane · 2019-01-09T20:43:03Z

I've run 5 tests so far with very small files and all my times are over 20 seconds. I guess this is expected?

Time: 24.584 sec (0 m 24 s)
Time: 34.061 sec (0 m 34 s)
Time: 33.114 sec (0 m 33 s)
Time: 24.806 sec (0 m 24 s)
Time: 25.074 sec (0 m 25 s)
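
For reference, the five runs above can be summarized with a few lines of Python. This is only arithmetic on the numbers already listed, not a new measurement.

```python
from statistics import mean

# Wall-clock times in seconds, copied from the five runs listed above.
times = [24.584, 34.061, 33.114, 24.806, 25.074]

avg = mean(times)
spread = max(times) - min(times)

print(f"mean: {avg:.3f} s, min: {min(times):.3f} s, "
      f"max: {max(times):.3f} s, spread: {spread:.3f} s")
```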

AndrewLane · 2019-01-21T04:00:43Z

I found that if you crank the lambda function settings all the way up to 3GB (as high as it will go, as of now), the timing on the scan goes down to about 13s. Still not great, but improved.
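
If you would rather script the memory change than click through the console, a minimal sketch with boto3 might look like the following. It assumes a boto3 Lambda client and its `update_function_configuration` call; the function name `example-scanner` is hypothetical, and the 3008 MB default is simply the roughly-3GB ceiling discussed in this thread (Lambda's limits have changed over time, so check the current maximum).

```python
def set_lambda_memory(client, function_name, memory_mb=3008):
    """Set the memory size (in MB) of an existing Lambda function.

    `client` is expected to be a boto3 Lambda client, for example
    boto3.client("lambda"). It is passed in so the helper can be
    exercised without AWS credentials.
    """
    if memory_mb < 128:
        # 128 MB is the smallest memory setting Lambda accepts.
        raise ValueError("Lambda memory must be at least 128 MB")
    return client.update_function_configuration(
        FunctionName=function_name,
        MemorySize=memory_mb,
    )

# Usage (requires boto3 and AWS credentials; the name is hypothetical):
#   import boto3
#   set_lambda_memory(boto3.client("lambda"), "example-scanner", 3008)
```

Lambda allocates CPU in proportion to memory, which is presumably why raising the setting shortens the scan even for tiny files.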

theblockent · 2019-03-05T23:17:06Z

Has anyone had any luck improving speeds? When scanning 2 files (500 bytes each), it takes 15 seconds per file at 3008 MB of memory, and the time is spent in the actual clamscan call (from print to print):
    print("Starting clamscan of %s." % path)
    av_proc = Popen(
        [CLAMSCAN_PATH, "-v", "-a", "--stdout", "-d", AV_DEFINITION_PATH, path],
        stderr=STDOUT,
        stdout=PIPE,
        env=av_env,
    )
    output = av_proc.communicate()[0]
    print("clamscan output:\n%s" % output)

Any way to speed this up? Would it be more valuable to put any of this inside of a lambda layer?
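
One way to confirm that the subprocess itself is where the time goes, rather than eyeballing the gap between the two prints, is to time the call directly. The sketch below is not code from this project: `timed_run` is a hypothetical helper that runs a command the same way (`Popen` plus `communicate()`), and the demo invokes the Python interpreter as a stand-in so it runs without ClamAV installed.

```python
import sys
import time
from subprocess import PIPE, STDOUT, Popen


def timed_run(cmd, env=None):
    """Run cmd and return (output_bytes, return_code, elapsed_seconds)."""
    start = time.monotonic()
    proc = Popen(cmd, stderr=STDOUT, stdout=PIPE, env=env)
    output = proc.communicate()[0]  # blocks until the process exits
    elapsed = time.monotonic() - start
    return output, proc.returncode, elapsed


# Stand-in command so the sketch is runnable anywhere; in the Lambda you
# would pass the same clamscan argument list shown above instead.
out, rc, secs = timed_run([sys.executable, "-c", "print('ok')"])
print("exit code %d after %.3f s: %r" % (rc, secs, out))
```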

I am unsure what 'local' scan means in the above message, how do I enable/perform that?

vrabeshko · 2019-03-20T15:38:26Z

Same issue. No matter how large the file is, 5KB or 300MB, the execution time is the same.

j1mmie · 2019-07-01T19:10:01Z

I'm scanning the 68 byte EICAR test file and it's taking around 70 seconds. Haven't cranked the memory, but it sounds like that has diminishing returns anyway.

The majority of time is spent scanning, not checking / downloading definitions, or even transferring the file to /tmp/. See below

18:24:51 START RequestId: 0b015f5f-97e9-46ef-a24b-a6186ee00aea Version: $LATEST
18:24:51 Script starting at 2019/07/01 18:24:51 UTC
18:24:51 Attempting to create directory /tmp/***.
18:24:52 Attempting to create directory /tmp/clamav_defs.
18:24:52 Downloading definition file main.cvd from s3://***/clamav_defs
18:24:54 Downloading definition file daily.cvd from s3://***/clamav_defs
18:24:55 Downloading definition file bytecode.cvd from s3://***/clamav_defs
18:24:55 Starting clamscan of /tmp/***/totally_not_a_virus.png.
18:26:02 clamscan output:
18:26:02 Scanning /tmp/***/totally_not_a_virus.png
18:26:02 /tmp/***/totally_not_a_virus.png: OK
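
To put numbers on that, the timestamps in the log can be subtracted directly. The sketch below copies four lines verbatim from the log above and assumes one-second resolution and no midnight rollover; the phase names are mine, not the function's.

```python
from datetime import datetime

# Lines copied from the log above; each marks the start of a phase.
log_lines = [
    "18:24:51 Script starting at 2019/07/01 18:24:51 UTC",
    "18:24:52 Downloading definition file main.cvd from s3://***/clamav_defs",
    "18:24:55 Starting clamscan of /tmp/***/totally_not_a_virus.png.",
    "18:26:02 clamscan output:",
]


def stamp(line):
    """Parse the leading HH:MM:SS timestamp of a log line."""
    return datetime.strptime(line.split(" ", 1)[0], "%H:%M:%S")


def elapsed(start_line, end_line):
    """Seconds between the timestamps of two log lines."""
    return (stamp(end_line) - stamp(start_line)).total_seconds()


setup = elapsed(log_lines[0], log_lines[1])     # script start -> first download
download = elapsed(log_lines[1], log_lines[2])  # definition downloads
scan = elapsed(log_lines[2], log_lines[3])      # clamscan itself
total = elapsed(log_lines[0], log_lines[3])

print("setup %ds, download %ds, scan %ds of %ds total (%.0f%%)"
      % (setup, download, scan, total, 100 * scan / total))
```

By this reading, clamscan accounts for 67 of the 71 seconds, and downloading all three definition files takes about 3 seconds.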

midnightcodr · 2019-09-12T19:17:40Z

Same here, we see average scan times north of 80 seconds per file. Most of our files are images and PDFs.

midnightcodr · 2019-09-20T12:34:41Z

The scanning has ballooned to almost 100 seconds per file. It's not practical to use Lambda any more. We've implemented a local scan solution for our document uploads using a clamav docker image from https://github.com/mko-x/docker-clamav.

JarmBlueOak · 2019-10-09T00:42:57Z

Increasing the AWS Lambda max memory helps with run time. We set it to 2048 MB and get 40-45 second run times.

j1mmie · 2019-10-09T02:30:07Z

FWIW I went with this implementation instead: https://github.com/widdix/aws-s3-virusscan

Autoscaling EC2 cluster that does the scanning. Was pretty easy to set up and is near instant.

chrisgilmerproj · 2019-11-05T17:04:04Z

I think the answers in this thread are correct. Update the memory allocated for the lambda and that will speed things up. Also, if AWS re-uses a lambda it can have a faster spin-up time but that's not guaranteed. If you need something faster then lambda may not be the best option for your workflow.
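
To see how often a function actually gets re-used, one common pattern is a module-level flag: module state survives between invocations that land in the same execution environment, so the flag is only true on a cold start. This is a generic sketch, not code from this project; the handler name and return shape are hypothetical.

```python
import time

# Module-level state is initialized once per execution environment,
# so it persists across invocations when the environment is re-used.
_COLD_START = True
_LOADED_AT = time.time()


def handler(event, context):
    """Report whether this invocation hit a fresh or a re-used environment."""
    global _COLD_START
    was_cold = _COLD_START
    _COLD_START = False
    return {
        "cold_start": was_cold,
        "environment_age_s": time.time() - _LOADED_AT,
    }


first = handler({}, None)
second = handler({}, None)
print(first["cold_start"], second["cold_start"])
```

Logging this flag next to the scan time would show whether the slow runs are all cold starts or whether warm invocations are slow too.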

sean-redmond · 2020-10-19T17:22:04Z

It looks to me like this PR will address this issue:

#112

jhonathas · 2020-10-19T17:30:15Z

Thanks!

sean-redmond · 2020-10-20T07:59:56Z

I tested this PR out, it works really well in my testing.

chrisgilmerproj closed this as completed Nov 5, 2019

mgzenitech mentioned this issue May 18, 2020

Increased lambda memory to 3GB trussworks/terraform-aws-s3-anti-virus#12

Closed
