add unicode superscripts and subscripts to latex substitutions #6927

lstagner · 2014-05-22T22:51:21Z

Added Unicode superscripts and subscripts to latex substitutions.

lstagner · 2014-05-22T23:19:05Z

I noticed this caused the following weird error

julia> e¹=2
2

julia> e¹
ERROR: syntax: invalid character "�"

julia> e¹
2

julia> e¹
2

julia> e¹
ERROR: syntax: invalid character "�"

JeffBezanson · 2014-05-22T23:21:35Z

Looks like mystery issue #5712

stevengj · 2014-05-23T01:01:06Z

I'd prefer without the braces, i.e. \^2 and not \^{2}. The backslash should be enough to differentiate from exponentiation, and braces aren't needed in LaTeX either for single-character superscripts and subscripts.

stevengj · 2014-05-23T01:02:38Z

base/latex_symbols.jl

@@ -781,6 +781,51 @@ const latex_symbols = [
    "\\openbracketright" => "〛",
    "\\overbrace" => "︷",
    "\\underbrace" => "︸",
+    "\\^{0}" => "⁰",


Also, please put a comment or something to separate them from the auto-generated list.

lstagner · 2014-05-23T01:07:06Z

@stevengj That was my original plan but there were conflicts with \^( ,\^), \_(, \_), and \^i. The curly braces guaranteed a unique match and was still valid latex.

…EPL, and allow a wider range of chars (for things like JuliaLang#6927)

lstagner · 2014-05-23T01:39:25Z

I suppose we could have numeric super/subscripts be \^1 and \_1 since they will probably be the most common and have all the rest have the curly braces.

Any opinions?

stevengj · 2014-05-23T01:45:50Z

Why do you need braces for single letters like \^h and \_h?

I would just dump the schwa. In general, I wanted to avoid more and more LaTeX code creeping in, which is why I omitted things like \mathbb{A} (U+1D538) from my initial table, even though they were listed in the W3C unicode.xml. I suppose we could put those back in, but I would caution against anything that includes more than one backslash, as that will make parsing much more difficult.

lstagner · 2014-05-23T01:53:20Z

I think we should try to be as close to latex as possible. If there is a more common name, such as in the case of \hbar and \Elzxh there is no reason not to have both. We could also have \grad map to ∇ like it does for \nabla.

As for braces around \^h on my machine it list all functions that start with the letter h

julia> \^h
hankelh1  hash       help       hex        hist       homedir    hvcat
hankelh2  haskey     hessfact   hex2bytes  hist2d     htol       hypot
has       hcat       hessfact!  hex2num    histrange  hton

stevengj · 2014-05-23T01:55:45Z

I agree that we should pick the most common name when there are several to choose from; definitely \hbar and not \Elzxh or \xh, and have a couple of common synonyms. But the \Elz names are especially ugly (and in most cases it seems that there are more common versions names without the Elz prefix)... The autogenerated list is just a starting point.

The completion of \^h should be fixable.

stevengj · 2014-05-23T02:01:03Z

I don't see the problem. If I do:

Base.REPLCompletions.latex_symbols["\\_h"] = "ₕ"
Base.REPLCompletions.latex_symbols["\\^n"] = "ⁿ"

then completion of \_h and \^n work for me. Maybe you had a typo?

stevengj · 2014-05-23T02:07:01Z

It also looks like we are missing most of the IPA symbols (LaTeX wsuipa package), in case someone wants to spell their variables phonetically... ;-)

In general, beware that the W3C's unicode.xml file dates from 2003 (I couldn't find any more recent comprehensive table), so it may have many omissions.

lstagner · 2014-05-23T02:27:51Z

Hmm, seems like i confused h with n. It is hard to tell the difference when its a subscript. In anycase, the curly braces don't seem necessary. Must of fixed whatever issue I was having with it. However I still have an issue with "\\^(" => "⁽" not being substituted in.

julia> Base.REPLCompletions.latex_symbols["\\^("] = "⁽"
"⁽"

julia> \^(              ##Doesn't do anything when tab is hit

lstagner · 2014-05-23T02:41:43Z

Funnily enough when I just go through the list \^( works fine (all without curly braces)
Seems like so long as \^( is after another sub/superscript it works fine.

julia> ⁰¹²³⁴⁵⁶⁷⁸⁹⁺⁻⁼⁽⁾ⁿ₀₁₂₃₄₅₆₇₈₉₊₋₌₍₎ₐₑₒₓₔₕₖₗₘₙₚₛₜ

julia> a\^(        ##Nothing

julia> ω\^(       ##Nothing

julia> ₁⁽

stevengj · 2014-05-23T02:49:06Z

Probably something in REPLCompletions that handles parens specially. cc: @loladiro

Keno · 2014-05-23T05:20:45Z

I believe it's considered a word boundary and thus not completed.

…th ( (see #6927)

stevengj · 2014-05-23T15:14:09Z

\^( works now (commit d707cb8).

stevengj · 2014-05-23T15:17:00Z

Now, ⁽ is still not allowed in identifiers; you should probably include a patch to src/flisp/julia_extensions.c

JeffBezanson · 2014-05-23T15:32:47Z

Added.

stevengj · 2014-05-23T16:47:51Z

LGTM.

stevengj · 2014-05-23T17:14:23Z

base/latex_symbols.jl

+    "\\_p" => "ₚ",
+    "\\_s" => "ₛ",
+    "\\_t" => "ₜ",
+    "\\hbar" => "ħ",


It looks like you are using the wrong codepoint here. This is U+0127, but \hbar should be U+210F.

We also have \hslash for U+210F. U+0127 looks better to me in upright text, to be honest, so it's not completely clear to me what we should use here.

FWIW Wikipedia always uses U+210F for Planck's constant

U+0127 is also used in IPA, where it is called \textcrh or \crossh, depending upon the LaTeX package.

lstagner · 2014-05-23T18:12:53Z

Well \hbar ħ (U+0127), looks more like a bar and \hslash ℏ (U+210F) looks more like a slash. So I think the current names at least make logical sense. My preference is to keep it as is. Perhaps when this gets documented we can sort it meaningfully so it will be easier to find alternatives.

lstagner · 2014-05-23T23:33:47Z

I am ready for this to be merged if there are no other comments.

stevengj · 2014-05-24T02:42:30Z

Arguments in favor of U+0127 for \hbar:

In LaTeX (and Wikipedia), all letters in equations are italic by default, so U+210F makes sense there. Here, all of our letters are generally upright, so it makes sense to use an upright \hbar by default too.
We are calling it \hbar, not \planck. The old name for U+0127 in Unicode was, in fact LATIN SMALL LETTER H BAR.

Argument against: most people using \hbar will be using it for Planck's constant, and U+210F is defined as Planck's constant in Unicode.

On balance, I'm inclined to support U+0127. In this context, typographical consistency (upright vs. italic) is more important than code point definitions.

lstagner · 2014-05-24T02:55:10Z

+1 for U+0127 \hbar

…ad of U+2329/232A (angle bracket), as the former are recommended by Unicode for math & technical usage

add unicode superscripts and subscripts to latex substitutions

stevengj · 2014-05-28T17:12:00Z

Since there seem to be no further objections (and LaTeX abbreviations are fairly innocuous anyway), I went ahead and merged.

add unicode superscripts and subscripts to latex substitutions

d40d8fb

stevengj reviewed May 23, 2014
View reviewed changes

replace \\Elzxh with \\hbar to match latex usage

198cd02

stevengj added a commit to stevengj/julia that referenced this pull request May 23, 2014

more sensible sorting of substitutions; only search backwards as in R…

d2de8ee

…EPL, and allow a wider range of chars (for things like JuliaLang#6927)

moved super/subscripts and hbar to manually added section

f0e0339

get rid of curly braces in latex super/subscripts and add \del mapping

b42723e

stevengj added a commit that referenced this pull request May 23, 2014

allow LaTeX tab-substitutions in comments and substitutions ending wi…

d707cb8

…th ( (see #6927)

stevengj added the unicode label May 23, 2014

stevengj reviewed May 23, 2014
View reviewed changes

Merge branch 'master' into sub_super

8713b4c

stevengj referenced this pull request May 27, 2014

use U+27E8/27E9 (mathematical angle bracket) for langle/rangle, inste…

7f5e311

…ad of U+2329/232A (angle bracket), as the former are recommended by Unicode for math & technical usage

stevengj added a commit that referenced this pull request May 28, 2014

Merge pull request #6927 from lstagner/sub_super

39bc1bc

add unicode superscripts and subscripts to latex substitutions

stevengj merged commit 39bc1bc into JuliaLang:master May 28, 2014

lstagner mentioned this pull request May 28, 2014

Added more unicode sub/superscripts to Latex tab completions #7018

Merged

hayd mentioned this pull request Jul 18, 2014

Add some mathbb latex substitutions #7657

Closed

lstagner mentioned this pull request Aug 10, 2016

Add NCycle iterator #17935

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add unicode superscripts and subscripts to latex substitutions #6927

add unicode superscripts and subscripts to latex substitutions #6927

lstagner commented May 22, 2014

lstagner commented May 22, 2014

JeffBezanson commented May 22, 2014

stevengj commented May 23, 2014

stevengj May 23, 2014

lstagner commented May 23, 2014

lstagner commented May 23, 2014

stevengj commented May 23, 2014

lstagner commented May 23, 2014

stevengj commented May 23, 2014

stevengj commented May 23, 2014

stevengj commented May 23, 2014

lstagner commented May 23, 2014

lstagner commented May 23, 2014

stevengj commented May 23, 2014

Keno commented May 23, 2014

stevengj commented May 23, 2014

stevengj commented May 23, 2014

JeffBezanson commented May 23, 2014

stevengj commented May 23, 2014

stevengj May 23, 2014

stevengj May 23, 2014

Keno May 23, 2014

stevengj May 23, 2014

lstagner commented May 23, 2014

lstagner commented May 23, 2014

stevengj commented May 24, 2014

lstagner commented May 24, 2014

stevengj commented May 28, 2014

add unicode superscripts and subscripts to latex substitutions #6927

add unicode superscripts and subscripts to latex substitutions #6927

Conversation

lstagner commented May 22, 2014

lstagner commented May 22, 2014

JeffBezanson commented May 22, 2014

stevengj commented May 23, 2014

stevengj May 23, 2014

Choose a reason for hiding this comment

lstagner commented May 23, 2014

lstagner commented May 23, 2014

stevengj commented May 23, 2014

lstagner commented May 23, 2014

stevengj commented May 23, 2014

stevengj commented May 23, 2014

stevengj commented May 23, 2014

lstagner commented May 23, 2014

lstagner commented May 23, 2014

stevengj commented May 23, 2014

Keno commented May 23, 2014

stevengj commented May 23, 2014

stevengj commented May 23, 2014

JeffBezanson commented May 23, 2014

stevengj commented May 23, 2014

stevengj May 23, 2014

Choose a reason for hiding this comment

stevengj May 23, 2014

Choose a reason for hiding this comment

Keno May 23, 2014

Choose a reason for hiding this comment

stevengj May 23, 2014

Choose a reason for hiding this comment

lstagner commented May 23, 2014

lstagner commented May 23, 2014

stevengj commented May 24, 2014

lstagner commented May 24, 2014

stevengj commented May 28, 2014