New MultipartParser fails on files above 102400 bytes #1470

Open
SethurBlackcoat opened this issue Dec 13, 2024 · 4 comments

Comments

@SethurBlackcoat

Affected versions: 0.13+ (tested on 0.13.2; the issue is still present in the code for 0.14)
Works in 0.12.25

The new MultipartParser released in version 0.13 raises a MultipartError("Memory limit reached.") when the total file size exceeds 102400 bytes. This is due to MEMFILE_MAX being passed in (possibly mistakenly?) as the memory limit on line 1353.

[Screenshot: the parser call site around line 1353, where MEMFILE_MAX is passed as the memory limit]

The default value for this parameter would be 2 ** 20, i.e. ~1 MB, which is still fairly low but more reasonable than 100 KB. Ideally, though, this should either depend on the actual memory available or be configurable, perhaps via a kwarg to run(). (A possible stopgap is sketched below.)
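For anyone hitting this in the meantime: since the limit appears to be derived from MEMFILE_MAX, raising that attribute should raise the limit too. A minimal sketch (the /upload route is purely illustrative; verify the behavior against your Bottle version):

```python
import bottle

# Default is 102400 bytes (100 KB). Raising it to 1 MB keeps multi-file
# uploads totalling more than 100 KB from tripping the in-memory limit,
# assuming the parser's mem_limit really is derived from MEMFILE_MAX.
bottle.BaseRequest.MEMFILE_MAX = 1024 * 1024

app = bottle.Bottle()

@app.post("/upload")
def upload():
    # request.files maps multipart part names to FileUpload instances
    names = [f.filename for f in bottle.request.files.values()]
    return {"received": names}

app.run(host="localhost", port=8080)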

@SethurBlackcoat
Author

SethurBlackcoat commented Dec 13, 2024

Looking at the code in a bit more detail, I believe (but haven't tested this) that the error would not occur if the upload consisted of a single file exceeding MEMFILE_MAX: that file would become a NamedTemporaryFile and count against the much larger disk_limit instead. The error only occurs when you have multiple smaller files, each individually below memfile_limit, that together exceed mem_limit. (A quick check is sketched below.)
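Something along these lines should reproduce both cases against a stock 0.13.x app (using the requests library; the /upload URL is assumed from whatever test route you run):

```python
import requests

# Twenty 10 KB parts: each stays well below memfile_limit, but the
# 200 KB total exceeds the 102400-byte mem_limit described above.
files = [
    ("files", (f"part{i}.bin", b"x" * 10240, "application/octet-stream"))
    for i in range(20)
]
resp = requests.post("http://localhost:8080/upload", files=files)
print(resp.status_code)  # expect an error response on 0.13.x

# A single 200 KB part should spool to disk and succeed instead:
big = [("file", ("big.bin", b"x" * 204800, "application/octet-stream"))]
print(requests.post("http://localhost:8080/upload", files=big).status_code)
```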

@defnull
Member

defnull commented Dec 13, 2024

The MEMFILE_MAX limit was originally the maximum number of bytes Bottle would read from the body in order to populate in-memory data structures (e.g. json or form fields, minus file uploads). The intention is to protect apps from MemoryError. It was only loosely applied in 0.12 and had no effect on multipart, because cgi.FieldStorage had its own mechanism for that. There were many ways to bypass this safeguard :/

With the new parser, it's now enforced by the parser itself: the idea is that the sum of all in-memory buffers must not exceed this limit. This is not ideal yet, because the limit is used for two different things: as a soft limit on individual memory-buffered files before they are spooled to disk, and as the total hard limit on memory consumption for all fields and files. Multiple small files that each stay below the limit can, in total, trigger the hard limit and cause an error. (A simplified sketch of this accounting follows below.)
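To make the dual use concrete, here is illustrative pseudologic, not Bottle's actual parser code:

```python
import io
import tempfile

MEMFILE_MAX = 102400  # currently serves as both memfile_limit and mem_limit

def buffer_part(data: bytes, mem_used: int):
    """Return (buffer, new_mem_used) for one multipart part."""
    if len(data) > MEMFILE_MAX:
        # Soft per-part limit: big parts spool to disk and count
        # against the separate, much larger disk limit instead.
        spooled = tempfile.NamedTemporaryFile()
        spooled.write(data)
        return spooled, mem_used
    # Small parts stay in memory and share a single hard budget.
    if mem_used + len(data) > MEMFILE_MAX:
        raise ValueError("Memory limit reached.")  # MultipartError in Bottle
    return io.BytesIO(data), mem_used + len(data)
```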

We should probably introduce more fine-grained control here, expose those settings to app developers, and also improve the documentation. Limiting request parsers is important, but it should be possible to change the limits easily if needed.

@SethurBlackcoat
Author

Fully agree with everything above. As a stopgap measure until a longer-term fix is implemented, it would probably help to stick with the default value of 1 MB for mem_limit instead of passing in MEMFILE_MAX for it.

Long term, making mem_limit (and probably memfile_limit) configurable, and documenting the behavior for file uploads, would definitely be desirable. But thinking about it, I realized that even that would only delay the problem to a (now developer-selectable) point in the future. The most elegant solution I can come up with would be to not treat exceeding mem_limit as an instant abort of the parse, but to instead force all subsequent files in the parse to be stored on disk rather than memory-buffered. I think that's a practical fallback that prevents memory exhaustion while still allowing uploads with numerous smaller files. (A rough sketch follows below.)
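Something like this, purely illustrative and not a patch against the actual parser:

```python
import io
import tempfile

def buffer_file_part(data: bytes, mem_used: int,
                     mem_limit: int, memfile_limit: int):
    """Spool to disk instead of aborting once the memory budget is spent."""
    fits = len(data) <= memfile_limit and mem_used + len(data) <= mem_limit
    if fits:
        return io.BytesIO(data), mem_used + len(data)
    # Over either limit: fall back to a disk-backed buffer, which would
    # count against the separate (and much larger) disk_limit.
    spooled = tempfile.NamedTemporaryFile()
    spooled.write(data)
    return spooled, mem_used
```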

@defnull
Member

defnull commented Dec 13, 2024

> but instead just force all subsequent files in the parse to be stored on disk instead of being memory buffered.

Yes, that's an idea, but there is another problem. Bottle treats text fields that exceed MEMFILE_MAX as file uploads: they are not returned as strings (and are not contained in request.fields) but instead become FileUpload instances without a filename, stored in request.files. This is also a safeguard, and MEMFILE_MAX is big enough that 'normal' text fields should never exceed it. But with the proposed solution (treat all fields as file uploads once the limit is reached), it becomes possible for normal-sized text fields to end up in temporary files. That's very unpredictable, as it depends on the order of fields.

Hmm, maybe just scrap the total hard memory limit and rely on MEMFILE_MAX and the part limit? The part limit is 128 at the moment and hard-coded (which also has to change). Or default to a total memory limit of part_limit * memfile_limit? That would be ~12.8 MB at the moment, which sounds reasonable for modern servers and would never be reached, because the part limit hits first.
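For reference, the arithmetic behind that proposed default (values taken from this thread):

```python
part_limit = 128          # current hard-coded maximum number of parts
memfile_limit = 102400    # 100 KB soft limit per memory-buffered part
mem_limit = part_limit * memfile_limit
print(mem_limit)          # 13107200 bytes, the ~12.8 MB ballpark above
```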
