* [pmg-devel] [PATCH pmg-api] fix #4536: parse original filenames from gzip files
From: Leo Nunner @ 2023-03-03 10:04 UTC (permalink / raw)
To: pmg-devel
GZIP provides the possibility to store the original filename in the
optional FNAME header field, which we can use for 'Match Archive
Filename' rules.
IO::Uncompress::Gunzip is explicitly recommended for this purpose by the
documentation on Compress::Zlib, so an additional import was
introduced here.
Signed-off-by: Leo Nunner <l.nunner@proxmox.com>
---
src/PMG/Unpack.pm | 8 ++++++++
1 file changed, 8 insertions(+)
diff --git a/src/PMG/Unpack.pm b/src/PMG/Unpack.pm
index d4fe454..f79ed3f 100755
--- a/src/PMG/Unpack.pm
+++ b/src/PMG/Unpack.pm
@@ -7,6 +7,7 @@ use IO::Select;
use Xdgmime;
use Compress::Zlib qw(gzopen);
use Compress::Bzip2 qw(bzopen);
+use IO::Uncompress::Gunzip;
use File::Path;
use File::Temp qw(tempdir);
use File::Basename;
@@ -298,6 +299,13 @@ sub uncompress_file {
$maxsize = min2 ($maxsize, $filesize * $self->{maxratio});
}
+ if($app eq 'guzip' && (my $z = IO::Uncompress::Gunzip->new($filename))) {
+ # the name (FNAME) field is optional in GZIP archives, so we won't
+ # always have a value here
+ my $header = $z->getHeaderInfo();
+ $newname = $header->{Name} if $header->{Name};
+ }
+
$self->add_glob_mime_type ($newname);
my $outfd;
--
2.30.2
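For illustration, a minimal standalone sketch of the header lookup the
patch relies on. It is not part of the patch: the helper name
gzip_original_name and the demo file names are assumptions made up for
this example, and the two test archives are created with
IO::Compress::Gzip only so that the script is self-contained.

```perl
use strict;
use warnings;
use File::Temp qw(tempdir);
use IO::Compress::Gzip qw(gzip $GzipError);
use IO::Uncompress::Gunzip;

# Hypothetical helper: return the FNAME header field of a gzip file, or
# undef if the field is absent or the file cannot be opened as gzip.
sub gzip_original_name {
    my ($filename) = @_;

    my $z = IO::Uncompress::Gunzip->new($filename)
        or return;

    # In scalar context getHeaderInfo() returns a hash reference.
    my $header = $z->getHeaderInfo();
    $z->close();

    return if !$header;
    return if !defined($header->{Name}) || $header->{Name} eq '';
    return $header->{Name};
}

# Demo: write two gzip files to a temporary directory, one with FNAME set
# and one with a minimal header (no FNAME).
my $dir = tempdir(CLEANUP => 1);
my $data = "hello\n";

gzip(\$data => "$dir/named.gz", Name => 'report.txt')
    or die "gzip failed: $GzipError\n";
gzip(\$data => "$dir/anon.gz", Minimal => 1)
    or die "gzip failed: $GzipError\n";

my $named = gzip_original_name("$dir/named.gz");
my $anon = gzip_original_name("$dir/anon.gz");

print "named.gz: ", $named // '(none)', "\n";
print "anon.gz: ", $anon // '(none)', "\n";
```

Because FNAME is optional, a caller has to handle the undef case, which
is what the `if $header->{Name}` guard in the patch does.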
* Re: [pmg-devel] [PATCH pmg-api] fix #4536: parse original filenames from gzip files
From: Stoiko Ivanov @ 2023-03-03 11:43 UTC (permalink / raw)
To: Leo Nunner; +Cc: pmg-devel
Thank you very much for addressing the issue and providing the patch!
Technically, this seems like a good approach.
One thing I'm not 100% sure about: why is the filename looked up from the
header in uncompress_file and not in __unpack_archive (where the
$newname variable is originally assigned)?
https://git.proxmox.com/?p=pmg-api.git;a=blob;f=src/PMG/Unpack.pm;h=d4fe4547ba9341a23a6841f9e24b756ed65605df;hb=HEAD#l1014
IIUC, $newname is also an absolute path to a tempdir (which conveniently
also gets cleaned up on exit).
I did not run any explicit tests yet, but thought I'd ask anyway.
* Re: [pmg-devel] [PATCH pmg-api] fix #4536: parse original filenames from gzip files
From: Leo Nunner @ 2023-03-03 12:18 UTC (permalink / raw)
To: Stoiko Ivanov; +Cc: pmg-devel
On 2023-03-03 12:43, Stoiko Ivanov wrote:
> Thank you very much for addressing the issue and providing the patch!
> Technically, this seems like a good approach.
>
> One thing I'm not 100% sure about: why is the filename looked up from the
> header in uncompress_file and not in __unpack_archive (where the
> $newname variable is originally assigned)?
> https://git.proxmox.com/?p=pmg-api.git;a=blob;f=src/PMG/Unpack.pm;h=d4fe4547ba9341a23a6841f9e24b756ed65605df;hb=HEAD#l1014
Something that Dominik pointed out to me off-list: we might want to push
both the original name *and* the generated name (using
add_glob_mime_type), since we only need to store the original name so
that the rules can iterate over it, and must not actually save anything
there (otherwise, someone could just write *any* path in the header and
we would save the attachment there…).
If we do that (i.e., just add another add_glob_mime_type call in the if
block), I feel like uncompress_file might still be the better place for
it, but that's more of a personal preference.
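To make the idea concrete, here is a rough sketch under stated
assumptions. It does not use the real PMG code: record_name_for_matching
is a hypothetical stand-in for add_glob_mime_type (whose implementation
is not shown in this thread), and plan_unpack is a made-up helper. The
only point is that the header name is recorded for matching while the
output path stays the generated one.

```perl
use strict;
use warnings;

# Hypothetical stand-in for add_glob_mime_type: it only collects names
# that later rule matching would look at.
my @names_to_match;

sub record_name_for_matching {
    my ($name) = @_;
    push @names_to_match, $name;
    return;
}

# Hypothetical helper: record both names for matching, but always return
# the generated path as the place to write the uncompressed data.
sub plan_unpack {
    my ($generated_path, $header_name) = @_;

    # The header name comes from the attachment and is untrusted, so it
    # is used for matching only and never as a file system path.
    record_name_for_matching($header_name)
        if defined($header_name) && length($header_name);

    record_name_for_matching($generated_path);

    return $generated_path;
}

# Demo with a hostile FNAME value; the paths are made up for the example.
my $generated = '/tmp/unpack-demo/part-1';
my $outpath = plan_unpack($generated, '../../etc/cron.d/evil');

print "write to: $outpath\n";
print "match against: $_\n" for @names_to_match;
```

With this split, a crafted FNAME can at most influence which rules
match; it cannot redirect where the attachment is saved.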
> IIUC, $newname is also an absolute path to a tempdir (which conveniently
> also gets cleaned up on exit).
>
> I did not run any explicit tests yet, but thought I'd ask anyway.
FWIW, the regression tests all passed for me. I'll send a v2 either way,
because we should also update the Debian control file to explicitly
include libio-compress-perl for IO::Uncompress::Gunzip.