Skip to content
This repository has been archived by the owner on Mar 25, 2024. It is now read-only.

Commit

Permalink
v0.3.2, prepared the first public release of the arXiv paragraph clas…
Browse files Browse the repository at this point in the history
…sification dataset (to be announced)
  • Loading branch information
dginev committed Jun 3, 2019
1 parent cb68bab commit 15b3db7
Show file tree
Hide file tree
Showing 2 changed files with 3 additions and 3 deletions.
2 changes: 1 addition & 1 deletion Cargo.toml
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[package]
name = "llamapun"
version = "0.3.1"
version = "0.3.2"
edition = "2018"
authors = ["Deyan Ginev <[email protected]>", "Jan Frederik Schaefer <[email protected]>"]

Expand Down
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ The **llamapun** library hosts common _language and mathematics processing_ algo
[![Build Status](https://travis-ci.org/KWARC/llamapun.svg?branch=master)](https://travis-ci.org/KWARC/llamapun)
[![API Documentation](https://img.shields.io/badge/docs-API-blue.svg)](http://kwarc.github.io/llamapun/llamapun/index.html)
[![license](http://img.shields.io/badge/license-GPLv3-blue.svg)](https://raw.githubusercontent.com/KWARC/llamapun/master/LICENSE)
![version](https://img.shields.io/badge/version-0.3.1-orange.svg)
![version](https://img.shields.io/badge/version-0.3.2-orange.svg)
---
At its core, **llamapun** is a [Rust](http://rust-lang.org/) implementation that aims at minimal footprint and optimal runtime, in order to safely scale to corpora of millions of documents and tens of billions ot tokens.

Expand Down Expand Up @@ -43,7 +43,7 @@ Requires **stable** rust, starting from `rustc 1.34.0 (91856ed52 2019-04-10)`.

* **Additional included examples**
* math-aware corpus token models, via DNM plain text normalization
* math-aware AMS-labeled dataset generation
* math-aware dataset extraction for "statement classification" of paragraphs
* "node footprint" statistics for corpora, e.g. [informing the MathML4 effort](https://github.com/mathml-refresh/mathml/issues/55#issuecomment-475916070)
* track sibling words to inline references in scientific articles, [informing LaTeXML development](https://github.com/brucemiller/LaTeXML/issues/1043#issuecomment-478249149)

Expand Down

0 comments on commit 15b3db7

Please sign in to comment.