[Dict-common-dev] Re: Bug#321040: fixed in bgoffice 3.0-5
Anton Zinoviev
anton at lml.bas.bg
Fri Sep 30 11:47:03 UTC 2005
On Wed, Sep 28, 2005 at 11:02:38PM +0200, Agustin Martin wrote:
>
> I have tested that in sarge and sid, and seemed to work in sid, but failed
> in sarge.
I upgraded my system to latest sid but I cannot see a difference.
"M-x ispell-buffer" still checks the ASCII Latin letters but skips the
Cyrillic (and probably the non-ASCII Latin letters also).
> My guess is that cyrillic had unification problems as
> iso-latin had, being the same character internally mule represented
> diferently depending on the font.
What has changed for iso-latin? Is the change somehow connected with
the Emacs functions unify-8859-on-decoding-mode and
unify-8859-on-encoding-mode?
> I have done the tests with the ibulgarian README file, in utf-8 or
> recoded to cp1251. Cyrillic chars show properly in cp1251, but
> (after explicitely declaring buffer as utf-8) as squares in utf-8,
> when the screen chars should be the same. I am not sure if another
> installed package (mule-ucs?) could make a difference.
I don't think. The squares mean the ISO10646-1 font from the fontset
Emacs uses does not contain Cyrillic letters. However this is only a
display problem - squares or not, Emacs should work properly with
Cyrillic.
> > There is also another weird problem I'd like to ask for. I found it
> > to be reproducible for all non-ISO-8859-1 dictionaries for aspell, for
> > example aspell-pl (Latin2) and aspell-bg (Cyrillic). I have the
> > following setup in my ~/.emacs:
> >
> > (custom-set-variables
> > '(ispell-program-name "bulgarian") ; or "polish"
> > '(ispell-dictionary "polish"))
A copy/paste error. I had used
(custom-set-variables
(ispell-program-name "aspell")
(ispell-dictionary "bulgarian")) ; or "polish"
> Does
>
> (custom-set-variables
> '(ispell-program-name "ispell") ; or "aspell"
> '(ispell-local-dictionary "polish")) ; or "bulgarian"
There are no problems with ispell. Only aspell doesn't work properly
with respect to this.
> > I think the best solution is to insert somewhere the command
> >
> > iconv -c -futf-8 -t`locale charmap`
>
> Thanks for the suggestion,
>
> That is desirable, but in a first approach not very straighforward
> considering the interaction with debconf and how we gather values from
> the different dicts/wordlists. I have thought a bit about that, but what I
> had in mind requires some debconf black magic, and I am still unsure that
> would preserve previous functionality and does not force debconf questions
> being prompted again over and over. But I would like to experiment with
> this.
I think the questions will be asked again only when the encoding on
the console changes. Here is another solution (untested):
from_utf8 () {
iconv -c -futf-8 -t`locale charmap`
}
to_utf8 () {
iconv -c -f`locale charmap` -tutf-8
}
db_get dictionaries/which_dictionary
db_set dictionaries/which_dictionary $(echo $RET | from_utf8)
db_subst dictionaries/which_dictionary variants $(echo $variants | from_utf8)
db_input high dictionaries/which_dictionary || true
db_go || true
variant=`echo $RET | to_utf8`
db_set dictionaries/which_dictionary "$variant"
Anton Zinoviev
More information about the Dict-common-dev
mailing list