CONTRIBUTOR_GUIDE.md

This guide is for developers who want to contribute code to the project, or who need to modify or debug it.

If you simply want to use MathLive in your project, see the Getting Started guide.

Getting Started: Setting up Your Development Environment

The project uses NPM scripts for its build system. The package.json file and the scripts/ directory contain the definitions of the build scripts.

To get started developing:

  1. Install Node.js on your dev machine (this will also install npm). The LTS version is recommended.
  2. In your shell, type:
$ git clone https://github.com/arnog/mathlive
$ cd mathlive
$ npm ci

Depending on your system setup, you may need to run as admin, in which case use sudo npm ci or equivalent.

The npm ci command installs in the mathlive/node_modules directory all the Node modules necessary to build and test the MathLive SDK.

Once the installation is successful, you can use the following commands:

# Make a local build and watch source files for changes.
# Run a local server (http://localhost:8080/test/smoke/)
$ npm start


# Make a local development build in the `dist/` directory
$ npm run build


# Run test suite
$ npm test


# Calculate the code coverage and output to `coverage/`
$ npm test coverage


# Create a production build to `dist/`.
$ npm run build production

During development, keep npm start running. A build is triggered whenever a source file is updated. Note, however, that changes to .less files do not trigger a rebuild: you will need to stop and restart the npm start command.

Submitting changes

Run the test suite with npm test and the linter with npm run lint to make sure your changes are ready to submit, then open a PR against the main branch.

Troubleshooting

If you are getting build errors after updating your repo, your node_modules/ directory may need to be updated. Run:

$ npm ci

Deploy / Publish

Note: only maintainers of the project should deploy to NPM.

You will need a GH_PUBLIC_TOKEN env variable set up with a "Personal Access Token" to publish the GitHub Release.

Go to GitHub > [user] > Settings > Developer Settings > Personal Access Tokens to create one and set it as a shell env global variable.

To publish a new version of the SDK:

# Increase the version number of the SDK
$ npm version [major | minor | patch]

# Publish to NPM
$ npm publish

These commands will:

  1. Increment the version number of the SDK and create a corresponding git tag
  2. Update the CHANGELOG with the current version number
  3. Publish a GitHub release
  4. Publish to NPM

(See scripts/version.sh, which is invoked by npm version)

Note on versioning: Use the semver convention for versions:

  • npm version patch: bug fixes and other minor changes. Last number of the version is incremented, e.g. 1.2.41 → 1.2.42
  • npm version minor: new features which don't break existing features. Middle number of the version is incremented, e.g. 1.2.42 → 1.3.0
  • npm version major: changes which break backward compatibility of the API. Increment the first number, e.g. 1.3.56 → 2.0.0
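
The three rules above can be sketched as a small function. This is an illustration only: bumpVersion is a hypothetical helper, not part of the MathLive build scripts, and it assumes a plain x.y.z version string with no pre-release suffix.

```typescript
// Hypothetical helper, for illustration only. It mirrors the effect of
// `npm version [major | minor | patch]` on a plain x.y.z version string.
type Bump = 'major' | 'minor' | 'patch';

function bumpVersion(version: string, bump: Bump): string {
  const match = /^(\d+)\.(\d+)\.(\d+)$/.exec(version);
  if (match === null) {
    throw new Error(`Not a plain x.y.z version: ${version}`);
  }
  const major = Number(match[1]);
  const minor = Number(match[2]);
  const patch = Number(match[3]);

  switch (bump) {
    case 'major':
      // Breaking change: increment the first number, reset the others
      return `${major + 1}.0.0`;
    case 'minor':
      // New feature: increment the middle number, reset the last one
      return `${major}.${minor + 1}.0`;
    case 'patch':
      // Bug fix: increment the last number only
      return `${major}.${minor}.${patch + 1}`;
  }
}
```

With this sketch, bumpVersion('1.2.42', 'minor') returns '1.3.0': the lower-order numbers are reset whenever a higher-order number is incremented.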

Code Structure

The MathLive SDK consists of the following key directories:

  • css/ the stylesheets and fonts

  • sounds/ the default sound files

  • src/core is what is necessary to handle the LaTeX: parsing it and displaying it, as a general layout engine

  • src/core-atoms contains the code about the specific "kind" of atoms that can be displayed (fractions, operators, groups, etc...)

  • src/core-definitions contains the code for the specific LaTeX commands that are known by MathLive

  • src/editor-model is all the code that keeps track of the "state" of the mathfield (its model), including the content and selection, and the code necessary to modify it (insert/remove, etc...)

  • src/editor contains utilities to handle user interaction, including handling of keybinding and shortcuts, localization, text-to-speech, virtual keyboard, and "commands" that users can use programmatically to change the state of the mathfield.

  • src/editor-mathfield is the outer layer that handles the user input, and interaction with the DOM. This section is a bit messy, and arguably some components could be moved elsewhere (the remote virtual keyboard support, for example, is in there; it should probably be in src/editor).

  • src/public contains the exposed API of MathLive, including the MathfieldElement wrapper class.

  • the dist/ directory contains executable build artifacts. If a file named "DEVELOPMENT_BUILD" is present in the directory, the content of the directory is suitable only for development purposes, not for production. This means the files have not been minified and compiled, and include additional .map files referencing back to the source file for ease of debugging.

The content of the dist/ directory is entirely generated as part of the build process. No other directory should contain intermediate files generated as part of the build process.

Language and Coding Style

MathLive is written in TypeScript.

The project uses the prettier tool to enforce a consistent formatting style and naming conventions. The tool will be run automatically before commits. You can also run it manually using npm run lint.

The code base attempts to follow these general guidelines:

  • Consistency All code in the codebase should look as if it had been written by a single person. Don't write code for yourself, but for the many people who will read it later.
  • Clarity before performance Write code that is easy to read, and avoid obscure constructs that may obfuscate the code to improve performance. For example, RegEx are crazy fast in all modern browsers, and trying to roll out your own pattern matching will result in more code and less performance. If you think something could be made faster, use http://jsben.ch/ to try out options in various browsers and compare the results. You might be surprised.
  • Follow Postel's Law, the Robustness Principle "Be conservative in what you do, be liberal in what you accept from others". For example, functions that are invoked internally do not need to check that the input parameters are valid. However, public APIs should check the validity of parameters, and behave reasonably when they aren't.
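
As a sketch of the last guideline, the example below contrasts an internal helper, which trusts its callers, with a public entry point, which validates its input and falls back to a reasonable default. The names scaleInternal and setFontScale are hypothetical and are not MathLive APIs.

```typescript
// Hypothetical names, for illustration only; none of these are MathLive APIs.

// Internal helper: callers inside the codebase guarantee that the arguments
// are valid, so it does no checking of its own.
function scaleInternal(size: number, factor: number): number {
  return size * factor;
}

// Public entry point: accepts loosely-typed input and behaves reasonably
// when it is invalid, by keeping the base size instead of throwing.
function setFontScale(input: unknown, baseSize: number = 16): number {
  // Be liberal in what is accepted: a numeric string is fine too
  const factor = typeof input === 'string' ? parseFloat(input) : input;
  if (typeof factor !== 'number' || !isFinite(factor) || factor <= 0) {
    return scaleInternal(baseSize, 1); // Invalid input: keep the base size
  }
  return scaleInternal(baseSize, factor);
}
```

Keeping the checks at the public boundary means each value is validated once, on the way in, rather than in every internal function it passes through.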

Bundling

The TypeScript code is compiled to JavaScript by the tsc compiler. When doing a production build, the JavaScript is further minified with terser, then bundled into a single file with esbuild. The CSS files are minified with postcss.

Browser Support

MathLive is designed for the modern web. Supporting older browsers not only complicates the effort involved in building new features, it is also an insecure practice that should not be encouraged.

In this context, modern means the latest two releases of Chrome, Edge, Safari and Firefox. Both desktop and mobile are supported.

Note that the HTML quirks mode is not supported. This means that the host page should use the standards mode, indicated by a <!doctype html> directive at the top of the page.

Note that the HTML page should use the UTF-8 encoding. Use a server header or a <meta charset="UTF-8"> tag in the page if necessary.
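
The two requirements above can be checked at runtime with the standard document.compatMode and document.characterSet properties. The sketch below is an illustration only: checkHostPage and DocumentInfo are hypothetical names, not part of MathLive.

```typescript
// Minimal shape of the two standard `Document` properties used below, so
// that the check can also be exercised outside a browser.
interface DocumentInfo {
  compatMode: string; // 'CSS1Compat' in standards mode, 'BackCompat' in quirks mode
  characterSet: string; // For example 'UTF-8'
}

// Hypothetical helper, not part of MathLive: returns a list of problems
// with the host page, or an empty list if the page looks suitable.
function checkHostPage(doc: DocumentInfo): string[] {
  const problems: string[] = [];
  if (doc.compatMode !== 'CSS1Compat') {
    problems.push('Page is in quirks mode: add <!doctype html> at the top.');
  }
  if (doc.characterSet.toUpperCase() !== 'UTF-8') {
    problems.push('Page is not UTF-8: add <meta charset="UTF-8">.');
  }
  return problems;
}
```

In a browser, the global document object can be passed directly, as in checkHostPage(document).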

Accessibility - A11Y

Rendering

MathLive renders math using HTML and CSS. Digits, letters and math symbols are displayed in <span> tags with the necessary CSS styling to display them in the right place. In addition, rules (lines) such as the fraction line, are rendered using CSS borders. In a few rare cases, SVG is used to render some decorations, such as the annotations of the \enclose command.

The rendered math is not purely graphical, and as such can be accessed by screen readers.

Alternate renditions

However, in addition to the "visual" HTML+CSS representation that MathLive outputs, it can also generate alternate renditions, including:

  • LaTeX: a string of LaTeX code equivalent to the formula.
  • Spoken Text: a text representation of the formula as someone would speak it, for example: f(x) = x^2 → "f of x equals x squared"
  • Annotated Spoken Text: as above, but in addition prosody hints are inserted for a more natural rendition by text to speech systems (breathing pauses, variation in pitch, etc...).

Those alternate renditions are exposed as an aria-label attribute, or as an element that is not visually rendered, but visible to screen readers.

Speech

Although MathLive works with screen readers, math is its own language, so MathLive has its own built-in text-to-speech renderer. With the speech interface it is possible to:

  • read the current group (numerator or subscript, for example)
    • Mac: Ctrl + Command + Down
    • Windows/Linux/ChromeOS: Ctrl + Alt + Down
  • read what's before or after the selection
    • Mac: Ctrl + Command + Left/Right
    • Windows/Linux/ChromeOS: Ctrl + Alt + Left/Right
  • read the parent of the current group
    • Mac: Ctrl + Command + Up
    • Windows/Linux/ChromeOS: Ctrl + Alt + Up
  • read the current selection
    • Mac: Ctrl + Command + Shift + Down
    • Windows/Linux/ChromeOS: Ctrl + Alt + Shift + Down

With these convenient keyboard shortcuts, it is possible to aurally navigate and understand even complex formulas.

Input and navigation

MathLive supports multiple modalities for input: in addition to pointer devices (mouse, trackpad, touch screen), MathLive has an extensive set of keyboard shortcuts that allow navigation and editing of the most complex formulas. Every operation is possible without the use of a pointing device.

Conversely, it is possible to enter commands and complex mathematical symbols using only a pointing device: the command bar can be invoked by tapping a round toggle button displayed to the right of the formula. The command bar offers large buttons that act as a virtual keyboard, but offer contextual operations depending on the current selection, and the content around it. Those buttons are easy to use on touch screens and for users of alternative pointing devices.

Architecture

The core of MathLive is a rendering engine that generates HTML (and SVG) markup. This engine uses the TeX layout algorithms because of their quality. Given the same input, MathLive will render pixel for pixel (or very close to it) what TeX would have rendered.

To do so, it makes use of a web version of the fonts used by TeX, which are included in the dist/fonts/ directory.

Although the rendering engine follows the TeX algorithms, MathLive also has an in-memory data structure to represent a math expression while it is being edited (a tree of Atoms).

MathLive is divided into two main components:

  • Core: handles rendering of LaTeX to HTML markup
  • Editor: handles the user interaction with the formula, using Core for the rendering.

Core

Core takes a LaTeX string as input. A lexer converts the string into Token[] which are then passed on to a parser. The parser uses the information from modes-* to parse the tokens depending on the current mode (text, math, etc...). The LaTeX commands are defined in definitions-*, and used by the parser to properly interpret the commands it encounters and turn them into Atom[].

An Atom is an elementary layout unit. For example, a genfrac Atom can lay out a "generalized fraction", that is, something with a numerator and denominator, optionally a bar separating the two, and optionally some opening and closing fences. It is used by the \frac command, but also by \choose, \pdiff and others.

The Atom[] are then turned into Box[] which are virtual markup elements.

Eventually, the Box[] get rendered into HTML/SVG markup.

Core Architecture

Lexer

The lexer converts a string of TeX code into tokens that can be digested by the parser.
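
To make the idea concrete, here is a toy tokenizer. It is a minimal sketch, not MathLive's lexer: it assumes a token can be represented as a string, drops all whitespace (which is only appropriate in math mode), and ignores comments, parameters and other TeX subtleties.

```typescript
// Toy tokenizer, for illustration only. Representing a token as a string
// is an assumption made for this sketch.
type Token = string;

function tokenize(latex: string): Token[] {
  // A command is a backslash followed by letters, or by a single
  // non-letter character. Any other non-whitespace character is a
  // one-character token.
  const pattern = /\\[a-zA-Z]+|\\[^a-zA-Z]|[^\s\\]/g;
  const tokens = latex.match(pattern);
  return tokens === null ? [] : tokens;
}
```

With this sketch, tokenize('\\frac{1}{x}') produces seven tokens: the \frac command, followed by {, 1, }, {, x and }.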

Parser

The parser turns a stream of tokens generated by the lexer into math atoms. Those atoms then can be rendered into boxes, or back into LaTeX or into spoken text.

Box

A box is a virtual DOM node that is used to represent an element displayed in a web page: a symbol such as x or =, an open brace, a line separating the numerator and denominator of a fraction, etc...

The basic layout strategy is to calculate the vertical placement of the boxes and position them accordingly, while letting the HTML rendering engine position and display the horizontal items. When horizontal adjustments need to be made, such as additional space between items, the CSS margins are adjusted.

Boxes can be rendered to HTML markup with Box.toMarkup() before being displayed on the page.
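
The sketch below illustrates this strategy with a toy box class. ToyBox, ToyBoxOptions and escapeHtml are hypothetical names, and the generated markup is an assumption made for illustration: the real Box class has many more properties and produces different output.

```typescript
// Toy stand-in for a box, for illustration only.
interface ToyBoxOptions {
  classes?: string;
  verticalShift?: number; // In em, computed by the layout code
  marginLeft?: number; // In em, a horizontal adjustment
}

function escapeHtml(text: string): string {
  return text.replace(/&/g, '&amp;').replace(/</g, '&lt;').replace(/>/g, '&gt;');
}

class ToyBox {
  private readonly content: string | ToyBox[];
  private readonly options: ToyBoxOptions;

  constructor(content: string | ToyBox[], options: ToyBoxOptions = {}) {
    this.content = content;
    this.options = options;
  }

  toMarkup(): string {
    const styles: string[] = [];
    // The vertical placement is set explicitly...
    if (this.options.verticalShift !== undefined) {
      styles.push(`position:relative;top:${this.options.verticalShift}em`);
    }
    // ...while horizontal adjustments are expressed as CSS margins, and the
    // browser takes care of the rest of the horizontal layout.
    if (this.options.marginLeft !== undefined) {
      styles.push(`margin-left:${this.options.marginLeft}em`);
    }
    const classAttr = this.options.classes ? ` class="${this.options.classes}"` : '';
    const styleAttr = styles.length > 0 ? ` style="${styles.join(';')}"` : '';
    const body =
      typeof this.content === 'string'
        ? escapeHtml(this.content)
        : this.content.map((child) => child.toMarkup()).join('');
    return `<span${classAttr}${styleAttr}>${body}</span>`;
  }
}
```

A box is either a leaf holding some text or a container of other boxes, so calling toMarkup() on the root box yields the markup for the whole tree.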

Atom

An atom is an object representing a mathematical symbol, for example x, 1, a fraction, a delimiter, etc...

There are several different classes of Atom (subclasses of the base Atom class). Each class represents a different layout algorithm (a different way of generating boxes in its render() method) as well as a different way to generate LaTeX to represent the atom (in its serialize() method).

It can be of one of the following classes:

  • Atom: the base class is used for the simplest symbol, e.g. x, 1, \alpha
  • AccentAtom: a diacritic mark above a symbol
  • ArrayAtom: "environments" in TeX parlance, a matrix, vector or other array-like structure
  • BoxAtom: a decoration around a "nucleus", including a color background, lines, etc...
  • DelimAtom and SizedDelimAtom: delimiters and extensible delimiters
  • see src/core-atoms for more.
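
The relationship between a base class and its subclasses can be sketched as follows. ToyAtom and ToyFractionAtom are hypothetical classes, not the ones in src/core-atoms. To keep the example short, render() returns a markup string directly rather than boxes, and the CSS class names are made up.

```typescript
// Toy classes, for illustration only.
class ToyAtom {
  readonly value: string;

  constructor(value: string) {
    this.value = value;
  }

  // Layout: the simplest symbols are displayed as is
  render(): string {
    return `<span>${this.value}</span>`;
  }

  // LaTeX output: the simplest symbols are their own LaTeX
  serialize(): string {
    return this.value;
  }
}

class ToyFractionAtom extends ToyAtom {
  readonly above: ToyAtom[];
  readonly below: ToyAtom[];

  constructor(above: ToyAtom[], below: ToyAtom[]) {
    super('\\frac');
    this.above = above;
    this.below = below;
  }

  // A different layout algorithm: numerator stacked over denominator
  render(): string {
    const num = this.above.map((atom) => atom.render()).join('');
    const den = this.below.map((atom) => atom.render()).join('');
    return `<span class="frac"><span class="num">${num}</span><span class="den">${den}</span></span>`;
  }

  // A different way to generate LaTeX: a command with two arguments
  serialize(): string {
    const num = this.above.map((atom) => atom.serialize()).join('');
    const den = this.below.map((atom) => atom.serialize()).join('');
    return `\\frac{${num}}{${den}}`;
  }
}
```

Because both methods recurse through the children, the same tree can be turned either into markup or back into LaTeX.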

Editor

The mathfield is the object handling the user interaction and driving the rendering of the formula into the document.

It makes use of several subcomponents (virtual-keyboard, undo, etc...) to handle specific aspects of the user interaction. It makes changes to the formula by issuing basic editing commands to the model, such as insert() and delete(), and by modifying the selection.

The model keeps track of the state of the formula, including its content (a tree of Atom) and the selection, and interacts with the core to turn the Atom into Box and into markup.

Editor Architecture

Model

The Model class encapsulates the operations that can be done to a tree of atoms, including adding and removing content and keeping track of and modifying an insertion point and selection.
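
The sketch below shows the kind of operations involved. It is a deliberately simplified illustration: ToyModel is a hypothetical class that works on a flat list of symbols, whereas the real Model operates on a tree of atoms and has a much richer API.

```typescript
// Toy model over a flat list of symbols, for illustration only.
class ToyModel {
  private atoms: string[] = [];
  // The selection is the range [start, end) of offsets. When start and end
  // are equal, the selection is collapsed to an insertion point.
  private start = 0;
  private end = 0;

  get content(): string {
    return this.atoms.join('');
  }

  get selection(): [number, number] {
    return [this.start, this.end];
  }

  setSelection(start: number, end: number = start): void {
    const clamp = (n: number): number => Math.max(0, Math.min(n, this.atoms.length));
    const a = clamp(start);
    const b = clamp(end);
    this.start = Math.min(a, b);
    this.end = Math.max(a, b);
  }

  // Replace the selection with new symbols, then place the insertion point
  // right after the inserted content
  insert(symbols: string[]): void {
    this.atoms.splice(this.start, this.end - this.start, ...symbols);
    this.start += symbols.length;
    this.end = this.start;
  }

  // Delete the selection or, if it is collapsed, the symbol before the
  // insertion point
  delete(): void {
    if (this.start === this.end && this.start > 0) this.start -= 1;
    this.atoms.splice(this.start, this.end - this.start);
    this.end = this.start;
  }
}
```

Keeping the content and the selection in the same object means every editing operation can leave the selection in a consistent state, for example collapsed after newly inserted content.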

Mathfield

The Mathfield class is a user interface element that captures the keyboard and pointing device events, and presents an appropriate user experience.

It uses a Model to manipulate the in-memory representation of the math expression being edited.