servo/futf

Name: futf

Owner: Servo

Description: Handling fragments of UTF-8 in Rust

Created: 2015-03-28 22:33:57.0

Updated: 2018-01-22 18:45:49.0

Pushed: 2017-02-14 19:50:13.0

Homepage: null

Size: 30

Language: Rust

GitHub Committers

UserMost Recent Commit# Commits

Other Committers

UserEmailMost Recent Commit# Commits

README

futf

Build Status

futf is a library for flexible UTF-8, or UTF-8 fragments. I don't know. Check out the API documentation.

Anyway, it takes an index into a byte buffer and tells you things about the UTF-8 codepoint containing that byte. It can deal with incomplete codepoint prefixes / suffixes at the ends of a buffer, which is useful for incremental I/O. It can also handle UTF-16 surrogate code units encoded in the manner of CESU-8 or WTF-8.

This is a low-level helper for tendril that might be useful more generally.


This work is supported by the National Institutes of Health's National Center for Advancing Translational Sciences, Grant Number U24TR002306. This work is solely the responsibility of the creators and does not necessarily represent the official views of the National Institutes of Health.