public inbox for pve-devel@lists.proxmox.com
 help / color / mirror / Atom feed
* [pve-devel] [PATCH proxmox-i18n] js generator: try to extract comments from source
@ 2023-08-30 12:48 Maximiliano Sandoval
  2023-08-30 13:23 ` Thomas Lamprecht
  0 siblings, 1 reply; 4+ messages in thread
From: Maximiliano Sandoval @ 2023-08-30 12:48 UTC (permalink / raw)
  To: pve-devel

Adds a way to extract comments to the resulting .po files matching
xgettext's behaviour. Useful for instances like

```js
   // Translators: This is the verb, not the noun
   gettext("Profile");

   // Translators: This would read 'Manage OSD'
   Ext.String.format(gettext('Manage {0}'), 'OSD');
```

where the string is not enough to guarantee is a satisfactory
translation.

Do note that two identical messages with different comments will count
as the same message (same msgid) from the point of view of gettext. To
truly differentiate them one would need to support Context, see [1].

Caveats:
  - Cannot extract multiline comments
  - Does not understand comments in the /* comment */ form

[1] https://www.gnu.org/software/gettext/manual/html_node/Contexts.html

Signed-off-by: Maximiliano Sandoval <m.sandoval@proxmox.com>
---
 jsgettext.pl | 9 ++++++---
 1 file changed, 6 insertions(+), 3 deletions(-)

diff --git a/jsgettext.pl b/jsgettext.pl
index 7f758fd..d0bf7a9 100755
--- a/jsgettext.pl
+++ b/jsgettext.pl
@@ -93,7 +93,7 @@ my $href = {
 };
 
 sub extract_msg {
-    my ($filename, $linenr, $line) = @_;
+    my ($filename, $linenr, $line, $comment) = @_;
 
     my $count = 0;
 
@@ -111,7 +111,7 @@ sub extract_msg {
 	if (my $po = $href->{$text}) {
 	    $po->reference($po->reference() . " $ref");
 	} else {
-	    $href->{$text} = Locale::PO->new(-msgid=> $text, -reference=> $ref, -msgstr=> '');
+	    $href->{$text} = Locale::PO->new(-msgid=> $text, -reference=> $ref, -msgstr=> '', -automatic=> $comment);
 	}
     }
     die "can't extract gettext message in '$filename' line $linenr\n" if !$count;
@@ -122,10 +122,13 @@ my $sources = find_js_sources($dirs);
 
 foreach my $s (@$sources) {
     open(my $SRC_FH, '<', $s) || die "unable to open file '$s' - $!\n";
+    my $prev_line;
     while(defined(my $line = <$SRC_FH>)) {
 	if ($line =~ m/gettext\s*\(/ && $line !~ m/^\s*function gettext/) {
-	    extract_msg($s, $., $line);
+	    my ($comment) = $prev_line =~ /\/\/\s+(.*)/;
+	    extract_msg($s, $., $line, $comment);
 	}
+	$prev_line = $line;
     }
     close($SRC_FH);
 }
-- 
2.39.2





^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [pve-devel] [PATCH proxmox-i18n] js generator: try to extract comments from source
  2023-08-30 12:48 [pve-devel] [PATCH proxmox-i18n] js generator: try to extract comments from source Maximiliano Sandoval
@ 2023-08-30 13:23 ` Thomas Lamprecht
  2023-08-30 13:49   ` Maximiliano Sandoval
  2023-08-30 15:11   ` Maximiliano Sandoval
  0 siblings, 2 replies; 4+ messages in thread
From: Thomas Lamprecht @ 2023-08-30 13:23 UTC (permalink / raw)
  To: Proxmox VE development discussion, Maximiliano Sandoval

Am 30/08/2023 um 14:48 schrieb Maximiliano Sandoval:
> Adds a way to extract comments to the resulting .po files matching
> xgettext's behaviour. Useful for instances like
> 
> ```js
>    // Translators: This is the verb, not the noun

hmm, but is then grouped with the reference comment in the resulting
PO file?

And even if, what use is the comment if there are then a usage as noun
and one as verb? For that we really would need the msgcontext stuff and
a pgettext implementation so that one could then actually do different
translations for either variant (e.g., in German then Noun -> "Profil"
and verb -> "Profilieren")

IMO the comments mostly make sense for describing string formats, but
support for them can be still fine as it doesn't cost much.

Oh, and while at it, I pondered over adding a wrapper for the common
Ext.String.format(gettext(MSG), PARAMS)) combi, something like
gettextf(MSG, PARAMS) or even shorter i18n(MSG, PARAMS) (with params
optional). This way we might even parse the passed params and add those
as comment automatically.

>    gettext("Profile");
> 
>    // Translators: This would read 'Manage OSD'
>    Ext.String.format(gettext('Manage {0}'), 'OSD');
> ```
> 
> where the string is not enough to guarantee is a satisfactory
> translation.
> 
> Do note that two identical messages with different comments will count
> as the same message (same msgid) from the point of view of gettext. To
> truly differentiate them one would need to support Context, see [1].
> 
> Caveats:
>   - Cannot extract multiline comments
>   - Does not understand comments in the /* comment */ form

those two are fine, but comments at the end of line won't be recognized either,
e.g.:

    fieldLabel: gettext('Foo'), // Translators: bar baz

No biggie, but IMO not to hard to parse, especially as we don't have
to bother that much for performance as this is only triggered manually
when updating the POT catalogue.

> 
> [1] https://www.gnu.org/software/gettext/manual/html_node/Contexts.html
> 
> Signed-off-by: Maximiliano Sandoval <m.sandoval@proxmox.com>
> ---
>  jsgettext.pl | 9 ++++++---
>  1 file changed, 6 insertions(+), 3 deletions(-)
> 
> diff --git a/jsgettext.pl b/jsgettext.pl
> index 7f758fd..d0bf7a9 100755
> --- a/jsgettext.pl
> +++ b/jsgettext.pl
> @@ -93,7 +93,7 @@ my $href = {
>  };
>  
>  sub extract_msg {
> -    my ($filename, $linenr, $line) = @_;
> +    my ($filename, $linenr, $line, $comment) = @_;
>  
>      my $count = 0;
>  
> @@ -111,7 +111,7 @@ sub extract_msg {
>  	if (my $po = $href->{$text}) {
>  	    $po->reference($po->reference() . " $ref");
>  	} else {
> -	    $href->{$text} = Locale::PO->new(-msgid=> $text, -reference=> $ref, -msgstr=> '');
> +	    $href->{$text} = Locale::PO->new(-msgid=> $text, -reference=> $ref, -msgstr=> '', -automatic=> $comment);
>  	}
>      }
>      die "can't extract gettext message in '$filename' line $linenr\n" if !$count;
> @@ -122,10 +122,13 @@ my $sources = find_js_sources($dirs);
>  
>  foreach my $s (@$sources) {
>      open(my $SRC_FH, '<', $s) || die "unable to open file '$s' - $!\n";
> +    my $prev_line;
>      while(defined(my $line = <$SRC_FH>)) {
>  	if ($line =~ m/gettext\s*\(/ && $line !~ m/^\s*function gettext/) {
> -	    extract_msg($s, $., $line);
> +	    my ($comment) = $prev_line =~ /\/\/\s+(.*)/;
> +	    extract_msg($s, $., $line, $comment);
>  	}
> +	$prev_line = $line;
>      }
>      close($SRC_FH);
>  }





^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [pve-devel] [PATCH proxmox-i18n] js generator: try to extract comments from source
  2023-08-30 13:23 ` Thomas Lamprecht
@ 2023-08-30 13:49   ` Maximiliano Sandoval
  2023-08-30 15:11   ` Maximiliano Sandoval
  1 sibling, 0 replies; 4+ messages in thread
From: Maximiliano Sandoval @ 2023-08-30 13:49 UTC (permalink / raw)
  To: Thomas Lamprecht, Proxmox VE development discussion

I did a quick test with

```js
     // Comment 1
     a: gettext('TEST 1'),
     // Comment 2
     b: gettext('TEST 1'),

     c: gettext('TEST 2'),
     // Comment 3
     d: gettext('TEST 2'),

     // Comment 4
     e: gettext('TEST 3'),
     f: gettext('TEST 3'),
```

and the resulting file contains a single instance of each msgid:

```

#. Comment 1
#: pve-manager/www/manager6/ceph/OSD.js:239
#: pve-manager/www/manager6/ceph/OSD.js:241
msgid "TEST 1"
msgstr ""

#: pve-manager/www/manager6/ceph/OSD.js:243
#: pve-manager/www/manager6/ceph/OSD.js:245
msgid "TEST 2"
msgstr ""

#. Comment 4
#: pve-manager/www/manager6/ceph/OSD.js:248
#: pve-manager/www/manager6/ceph/OSD.js:249
msgid "TEST 3"
msgstr ""
```

Compare with the output from xgettext (xgettext --language=JavaScript --add-comments OSD.js)
```
#. Comment 1
#. Comment 2
#: OSD.js:239 OSD.js:241
msgid "TEST 1"
msgstr ""

#. Comment 3
#: OSD.js:243 OSD.js:245
msgid "TEST 2"
msgstr ""

#. Comment 4
#: OSD.js:248 OSD.js:249
msgid "TEST 3"
msgstr ""
```

which means that at worst we will lose a comment depending on which msgid was
found first. Given that this is not using the pgettext method that takes into
account the to generate different translations, this seems fine to me, at least
as a first step into getting feature parity with (x)gettext.

On 8/30/23 15:23, Thomas Lamprecht wrote:
> Am 30/08/2023 um 14:48 schrieb Maximiliano Sandoval:
>> Adds a way to extract comments to the resulting .po files matching
>> xgettext's behaviour. Useful for instances like
>>
>> ```js
>>     // Translators: This is the verb, not the noun
> hmm, but is then grouped with the reference comment in the resulting
> PO file?
>
> And even if, what use is the comment if there are then a usage as noun
> and one as verb? For that we really would need the msgcontext stuff and
> a pgettext implementation so that one could then actually do different
> translations for either variant (e.g., in German then Noun -> "Profil"
> and verb -> "Profilieren")
>
> IMO the comments mostly make sense for describing string formats, but
> support for them can be still fine as it doesn't cost much.
>
> Oh, and while at it, I pondered over adding a wrapper for the common
> Ext.String.format(gettext(MSG), PARAMS)) combi, something like
> gettextf(MSG, PARAMS) or even shorter i18n(MSG, PARAMS) (with params
> optional). This way we might even parse the passed params and add those
> as comment automatically.
>
>>     gettext("Profile");
>>
>>     // Translators: This would read 'Manage OSD'
>>     Ext.String.format(gettext('Manage {0}'), 'OSD');
>> ```
>>
>> where the string is not enough to guarantee is a satisfactory
>> translation.
>>
>> Do note that two identical messages with different comments will count
>> as the same message (same msgid) from the point of view of gettext. To
>> truly differentiate them one would need to support Context, see [1].
>>
>> Caveats:
>>    - Cannot extract multiline comments
>>    - Does not understand comments in the /* comment */ form
> those two are fine, but comments at the end of line won't be recognized either,
> e.g.:
>
>      fieldLabel: gettext('Foo'), // Translators: bar baz
>
> No biggie, but IMO not to hard to parse, especially as we don't have
> to bother that much for performance as this is only triggered manually
> when updating the POT catalogue.
>
>> [1] https://www.gnu.org/software/gettext/manual/html_node/Contexts.html
>>
>> Signed-off-by: Maximiliano Sandoval <m.sandoval@proxmox.com>
>> ---
>>   jsgettext.pl | 9 ++++++---
>>   1 file changed, 6 insertions(+), 3 deletions(-)
>>
>> diff --git a/jsgettext.pl b/jsgettext.pl
>> index 7f758fd..d0bf7a9 100755
>> --- a/jsgettext.pl
>> +++ b/jsgettext.pl
>> @@ -93,7 +93,7 @@ my $href = {
>>   };
>>   
>>   sub extract_msg {
>> -    my ($filename, $linenr, $line) = @_;
>> +    my ($filename, $linenr, $line, $comment) = @_;
>>   
>>       my $count = 0;
>>   
>> @@ -111,7 +111,7 @@ sub extract_msg {
>>   	if (my $po = $href->{$text}) {
>>   	    $po->reference($po->reference() . " $ref");
>>   	} else {
>> -	    $href->{$text} = Locale::PO->new(-msgid=> $text, -reference=> $ref, -msgstr=> '');
>> +	    $href->{$text} = Locale::PO->new(-msgid=> $text, -reference=> $ref, -msgstr=> '', -automatic=> $comment);
>>   	}
>>       }
>>       die "can't extract gettext message in '$filename' line $linenr\n" if !$count;
>> @@ -122,10 +122,13 @@ my $sources = find_js_sources($dirs);
>>   
>>   foreach my $s (@$sources) {
>>       open(my $SRC_FH, '<', $s) || die "unable to open file '$s' - $!\n";
>> +    my $prev_line;
>>       while(defined(my $line = <$SRC_FH>)) {
>>   	if ($line =~ m/gettext\s*\(/ && $line !~ m/^\s*function gettext/) {
>> -	    extract_msg($s, $., $line);
>> +	    my ($comment) = $prev_line =~ /\/\/\s+(.*)/;
>> +	    extract_msg($s, $., $line, $comment);
>>   	}
>> +	$prev_line = $line;
>>       }
>>       close($SRC_FH);
>>   }




^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [pve-devel] [PATCH proxmox-i18n] js generator: try to extract comments from source
  2023-08-30 13:23 ` Thomas Lamprecht
  2023-08-30 13:49   ` Maximiliano Sandoval
@ 2023-08-30 15:11   ` Maximiliano Sandoval
  1 sibling, 0 replies; 4+ messages in thread
From: Maximiliano Sandoval @ 2023-08-30 15:11 UTC (permalink / raw)
  To: Thomas Lamprecht, Proxmox VE development discussion


On 8/30/23 15:23, Thomas Lamprecht wrote:
> Oh, and while at it, I pondered over adding a wrapper for the common Ext.String.format(gettext(MSG), PARAMS)) combi, something like gettextf(MSG, PARAMS) or even shorter i18n(MSG, PARAMS) (with params optional). This way we might even parse the passed params and add those as comment automatically.


Regarding this, sounds like a good idea for a future patch.





^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2023-08-30 15:11 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-08-30 12:48 [pve-devel] [PATCH proxmox-i18n] js generator: try to extract comments from source Maximiliano Sandoval
2023-08-30 13:23 ` Thomas Lamprecht
2023-08-30 13:49   ` Maximiliano Sandoval
2023-08-30 15:11   ` Maximiliano Sandoval

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox
Service provided by Proxmox Server Solutions GmbH | Privacy | Legal