[Dict-common-dev] Re: Bug#321040: fixed in bgoffice 3.0-5

Anton Zinoviev anton at lml.bas.bg
Fri Sep 30 11:47:03 UTC 2005


On Wed, Sep 28, 2005 at 11:02:38PM +0200, Agustin Martin wrote:
> 
> I have tested that in sarge and sid, and seemed to work in sid, but failed
> in sarge.

I upgraded my system to latest sid but I cannot see a difference.
"M-x ispell-buffer" still checks the ASCII Latin letters but skips the
Cyrillic (and probably the non-ASCII Latin letters also).

> My guess is that cyrillic had unification problems as
> iso-latin had, being the same character internally mule represented
> diferently depending on the font.

What has changed for iso-latin?  Is the change somehow connected with
the Emacs functions unify-8859-on-decoding-mode and
unify-8859-on-encoding-mode?

> I have done the tests with the ibulgarian README file, in utf-8 or
> recoded to cp1251. Cyrillic chars show properly in cp1251, but
> (after explicitely declaring buffer as utf-8) as squares in utf-8,
> when the screen chars should be the same. I am not sure if another
> installed package (mule-ucs?) could make a difference.

I don't think.  The squares mean the ISO10646-1 font from the fontset
Emacs uses does not contain Cyrillic letters.  However this is only a
display problem - squares or not, Emacs should work properly with
Cyrillic.

> > There is also another weird problem I'd like to ask for.  I found it
> > to be reproducible for all non-ISO-8859-1 dictionaries for aspell, for
> > example aspell-pl (Latin2) and aspell-bg (Cyrillic).  I have the
> > following setup in my ~/.emacs:
> > 
> > (custom-set-variables
> >   '(ispell-program-name "bulgarian") ; or "polish"
> >   '(ispell-dictionary "polish"))

A copy/paste error.  I had used

(custom-set-variables
  (ispell-program-name "aspell")
  (ispell-dictionary "bulgarian")) ; or "polish"

> Does
> 
> (custom-set-variables
>   '(ispell-program-name "ispell")      ; or "aspell"
>   '(ispell-local-dictionary "polish")) ; or "bulgarian"

There are no problems with ispell.  Only aspell doesn't work properly
with respect to this.

> > I think the best solution is  to insert somewhere the command
> > 
> > iconv -c -futf-8 -t`locale charmap`
> 
> Thanks for the suggestion,
> 
> That is desirable, but in a first approach not very straighforward
> considering the interaction with debconf and how we gather values from
> the different dicts/wordlists. I have thought a bit about that, but what I
> had in mind requires some debconf black magic, and I am still unsure that
> would preserve previous functionality and does not force debconf questions
> being prompted again over and over. But I would like to experiment with
> this.

I think the questions will be asked again only when the encoding on
the console changes.  Here is another solution (untested):

from_utf8 () {
    iconv -c -futf-8 -t`locale charmap`
}

to_utf8 () {
    iconv -c -f`locale charmap` -tutf-8 
}

db_get dictionaries/which_dictionary
db_set dictionaries/which_dictionary $(echo $RET | from_utf8)

db_subst dictionaries/which_dictionary variants $(echo $variants | from_utf8)
db_input high dictionaries/which_dictionary || true
db_go || true

variant=`echo $RET | to_utf8`
db_set dictionaries/which_dictionary "$variant"

Anton Zinoviev



More information about the Dict-common-dev mailing list