dwww Home | Manual pages | Find package

BIBTEXU(1)                       User Commands                       BIBTEXU(1)

NAME
       bibtexu - UTF-8 Big BibTeX

SYNOPSIS
       bibtexu [options] aux-file

DESCRIPTION
       BibTeXu is the Unicode-compliant version of BibTeX.  It is largely based
       on Niel Kempson's BibTeX8, and it provides a better support for UTF-8 by
       integrating  ICU  library.   Therefore,  BibTeXu  no longer requires the
       Codepage and Sort order ("CS") file; instead, the method of sorting  and
       case-changing can be controlled via command-line options.

OPTIONS
       -?  --help
              display some brief help text.

       -d  --debug TYPE
              report  debugging  information.  TYPE is one or more of all, csf,
              io, mem, misc, search.

       -s  --statistics
              report internal statistics.

       -t  --trace
              report execution tracing.

       -v  --version
              report BibTeX version.

       -l  --language LANG
              use language LANG to convert strings to low case.  This  argument
              is passed to ICU library.

       -o  --location LANG
              use  language  LANG  for sorting.  This argument is passed to ICU
              library.

       -B  --big
              set large BibTeX capacity.

       -H  --huge
              set huge BibTeX capacity.

       -W  --wolfgang
              set really huge BibTeX capacity for Wolfgang.

       -M  --min_crossrefs ##
              set min_crossrefs to ##.

       --mstrings ##
              allow ## unique strings.

UNICODE SUPPORT
       BibTeXu supports extended features to handle Unicode  characters.   Sev-
       eral built-in functions in bibliography styles are enhanced as follows.

       &      Pops the top two (integer) literals and pushes their bitwise AND.

       |      Pops the top two (integer) literals and pushes their bitwise OR.

       add.period$
              Pops  the  top  (string)  literal,  adds  a `.' to it if the last
              non`}' character isn't a `.', `?', `!' or a  Unicode  punctuation
              mark  and  pushes this resulting string.  The mark may be U+203C,
              U+203D, U+2047, U+2048, U+2049, U+3002, U+FF01, U+FF0E or U+FF1F.

       chr.to.int$
              Pops the top (string) literal, makes sure it's a multibyte string
              of a single Unicode code point, converts it to the  corresponding
              Unicode scalar value (integer), and pushes this integer.

       int.to.chr$
              Pops the top (integer) literal, interpreted as the Unicode scalar
              value  of  a  single code point, converts it to the corresponding
              single character multibyte string, and pushes this string.

       num.names$, format.name$
              The function  is  the  same  as  original  BibTeX  but  an  Ideo-
              graphic/Fullwidth  Comma (U+3001, U+FF0C) in addition to an " and
              " string is accepted as a separator  between  persons  and  Ideo-
              graphic  Space (U+3000) in addition to a space " " is accepted as
              a separator between a family name and a given name.

       substring$, text.length$, text.prefix$
              The function is the same as  original  BibTeX  but  the  unit  of
              operand numbers is Unicode code point.

       change.case$
              The  function  is the same as original BibTeX but letters of non-
              english Latin, Greek and Cyrillic are supported.

       width$ The function is the  same  as  original  BibTeX  but  letters  of
              Latin-1 and Latin Extended-A and CJK characters are supported.

       is.cjk.str$
              Pops the top (string) literal, set flag bits to an integer if CJK
              characters  are found in the string, and pushes the resulting in-
              teger, otherwise pushes 0.  Flags 0x001, 0x002, 0x004, 0x008  and
              0x800  are  corresponding  to Hanzi (Kanji, Hanja), Kana, Hangul,
              Bopomofo and other CJK characters, respectively.  For example, an
              integer 0x003 will be pushed if Hanzi  and  Kana  characters  are
              found in a poped string literal.

       is.kanji.str$
              Same as is.cjk.str$ for compatibility with (u)pBibTeX.

SEE ALSO
       More detailed description of BibTeXu is available at $TEXMFDIST/doc/bib-
       texu/README.

AUTHORS
       BibTeXu was written by Yannis Haralambous and his students.  It is main-
       tained as part of TeX Live.

       This manpage was written for TeX Live.

bibtexu 4.00                     30 August 2022                      BIBTEXU(1)

Generated by dwww version 1.16 on Tue Dec 16 05:55:39 CET 2025.