tokenizer

A grammar describes the syntax of a programming language and is often written in Backus-Naur form (BNF). A lexer (also called a tokenizer) performs lexical analysis, turning text into a stream of tokens. A parser takes those tokens and builds a data structure such as an abstract syntax tree (AST). The parser is concerned with context: does the sequence of tokens fit the grammar? The front end of a compiler combines a lexer and a parser built for a specific grammar.
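As a rough illustration of the lexer/parser split described above, here is a minimal Python sketch for a tiny arithmetic language. The grammar, the `Token` type, the `tokenize` function, and the `Parser` class are all hypothetical names invented for this example; none of them come from the repositories listed on this page. The assumed grammar, in BNF-like notation, is `expr ::= term (("+" | "-") term)*`, `term ::= factor (("*" | "/") factor)*`, `factor ::= NUMBER | "(" expr ")"`.

```python
import re
from typing import Iterator, NamedTuple


class Token(NamedTuple):
    kind: str   # "NUMBER" or "OP"
    text: str   # the exact characters matched


# Hypothetical token rules for the toy grammar; ERROR catches anything else.
TOKEN_RE = re.compile(r"""
    (?P<NUMBER>\d+)
  | (?P<OP>[+\-*/()])
  | (?P<SKIP>\s+)
  | (?P<ERROR>.)
""", re.VERBOSE)


def tokenize(text: str) -> Iterator[Token]:
    """Lexer: turn raw text into tokens, dropping whitespace."""
    for m in TOKEN_RE.finditer(text):
        kind = m.lastgroup
        if kind == "SKIP":
            continue
        if kind == "ERROR":
            raise SyntaxError(f"unexpected character {m.group()!r}")
        yield Token(kind, m.group())


class Parser:
    """Recursive-descent parser: checks tokens against the grammar
    and builds an AST of nested (operator, left, right) tuples."""

    def __init__(self, tokens):
        self.tokens = list(tokens)
        self.pos = 0

    def peek(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else None

    def advance(self):
        tok = self.peek()
        self.pos += 1
        return tok

    def parse(self):
        node = self.expr()
        if self.peek() is not None:
            raise SyntaxError(f"unexpected token {self.peek().text!r}")
        return node

    def expr(self):
        # expr ::= term (("+" | "-") term)*
        node = self.term()
        while (tok := self.peek()) and tok.text in ("+", "-"):
            self.advance()
            node = (tok.text, node, self.term())
        return node

    def term(self):
        # term ::= factor (("*" | "/") factor)*
        node = self.factor()
        while (tok := self.peek()) and tok.text in ("*", "/"):
            self.advance()
            node = (tok.text, node, self.factor())
        return node

    def factor(self):
        # factor ::= NUMBER | "(" expr ")"
        tok = self.advance()
        if tok is None:
            raise SyntaxError("unexpected end of input")
        if tok.kind == "NUMBER":
            return int(tok.text)
        if tok.text == "(":
            node = self.expr()
            closing = self.advance()
            if closing is None or closing.text != ")":
                raise SyntaxError("expected ')'")
            return node
        raise SyntaxError(f"unexpected token {tok.text!r}")
```

With these definitions, `Parser(tokenize("1 + 2 * 3")).parse()` yields the nested tuple `("+", 1, ("*", 2, 3))`. The lexer accepts any sequence of valid tokens, so an input such as `1 2` tokenizes without complaint; it is the parser that rejects it, because the token sequence does not fit the grammar.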

Here are 765 public repositories matching this topic...

theseer / tokenizer

Chevrotain / chevrotain

natasha / natasha

lovit / soynlp

mathewsanders / Mustard

ikawaha / kagome

no-context / moo

cbaziotis / ekphrasis

smoothnlp / SmoothNLP

BLKSerene / Wordless

open-korean-text / open-korean-text

jflex-de / jflex

glayzzle / php-parser

CogComp / cogcomp-nlp

alvations / sacremoses

lionsoul2014 / friso

timtadh / lexmachine

lydell / js-tokens

neurosnap / sentences

taishi-i / nagisa
