Regular expression libraries: (main topic: regular expression)
- cl-irregsexp - cl-irregsexp is a fast regular expression library with a lispy alternative to traditional syntax for text matching
- cl-ppcre - CL-PPCRE is a portable, Perl-compatible regular expression library by Edi Weitz
- pregexp - pregexp: Portable Regular Expressions for Scheme and Common Lisp
- recursive-regex - recursive-regex is a library that extends CL-PPCRE so that named capture groups in a regular expression dispatch to custom matcher functions and named-expression patterns
- Regex - Regex is a full-featured regular expression compiler and matching engine written by Michael Parker
- terse-ppcre - TERSE-PPCRE aims to make manipulating CL-PPCRE regular expression parse trees easier and more succinct
- The Regex Coach - The Regex Coach is a graphical Common Lisp application which can be used to experiment with (Perl-compatible) regular expressions interactively
- bintype - BINTYPE is a specification-driven parser generator for binary formats
- cl-opossum - CL-Opossum is a Common Lisp implementation of a Parsing Expression Grammar parser generator
- cl-parsec - cl-parsec is an experiment to create the analog of Haskell's Parsec library (an alternative to parser generators) in ANSI CL
- cl-peg - Cl-peg is a PEG packrat parser generator by John Leuner
- CL-Yacc - CL-Yacc is a LALR(1) parser generator for Common Lisp, somewhat like Zebu or LALR.cl.
- com.nklein.parser-generator - com.nklein.parser-generator is a parser generator that generates SAX-based XML parsers for Lisp or Objective-C and creates all of the necessary data types for your internal structures based on your input file
- de.setf.atn-parser - de.setf.atn-parser is an ATN-based BNF -> Common Lisp LR(*) parser generator
- dso-parse - This is a simple PEG (parsing-expression grammar) parser generator, aimed mostly at parsing text but capable of parsing other structures as well
- ebnf-parser - ebnf-parser is an EBNF (ISO/IEC 14977) parser generator
- ESRAP - ESRAP -- a packrat parser generator for Common Lisp
- esrap-peg - Esrap-PEG is a parser generator; it takes files with portable (language-agnostic) PEG notation and produces Esrap rules to parse this grammar
- FUCC - FUCC is a universal parser generator for Common Lisp, currently implementing LR0, SLR, LALR, LR1 and LL grammars
- hh-parse - hh-parse is an LALR(1) parser generator written in Common Lisp
- LALR - LALR is a LALR(1) parser generator available at the CMU AI repository.
- Meta - A recursive-descent parser DSL that is a simpler alternative to parser generators
- meta-sexp - meta-sexp is a META parser generator using LL(1) grammars with s-expressions
- metapeg - Metapeg is a PEG parser generator created by John Leuner
- monkeylib-parser - monkeylib-parser is a parser generator loosely based on Henry Baker's META paper
- parser-combinators - An implementation of parser combinators, an alternative to parser generators
- rdp - COM.INFORMATIMAGO.RDP is a simple Recursive Descent parser generator
- yid - yid (Yacc Is Dead) is a parser generator based on Brzozowski's derivative from regular expressions to context-free grammars
- Zebu - Zebu is a LALR(1) parser generator similar to YACC, but for Common Lisp
- cl-lex - cl-lex is a set of Common Lisp macros for generating lexical analyzers automatically
- DEFLEXER - The LEXER package implements a lexical-analyzer-generator called DEFLEXER, which is built on top of both REGEX and CLAWK
- dso-lex - dso-lex allows lexers to be defined using regular expressions a la cl-ppcre
- graylex - graylex offers a means to do string operations on input streams without slurping all input at once by using Common Lisp Gray Streams, fixed-sized and flexible buffers
- charseq - CHARSEQ provides a shareable, (simple-array character *) typed string
- cl-netstrings - cl-netstrings is a tiny CL library for handling netstrings
- cl-string-match - CL-STRING-MATCH provides implementations of substring (subsequence) search algorithms
- aspellisp - Aspellisp is a crude CFFI interface for GNU Aspell
- changed-stream - changed-stream is a Lisp library for non-destructively changing streams by inserting or deleting characters at a position
- CL-CSV - CL-CSV is a library to parse and write csv (comma-separated-values) files
- CL-DIFFLIB - CL-DIFFLIB is a Lisp library for computing differences between sequences
- cl-heredoc - cl-heredoc is an implementation of "here documents" that allow the user to
- cl-inflector - cl-inflector is a Common Lisp library allowing easy singularization / pluralization of English words
- cl-interpol - CL-INTERPOL modifies the reader so that you can have interpolation of strings similar to Perl or Unix Shell scripts
- CLAWK - CLAWK is an AWK text manipulation programming language implementation embedded into Common Lisp, by Michael Parker
- diff - DIFF is a simple asdf-installable text library which can compute unified-style or context-style diffs between two files
- format-setf - The Common Lisp equivalent of scanf()
- guess - guess is a library for guessing the Japanese encoding of text (Gauche's algorithm)
- html-encode - html-encode is a small library for encoding text in various HTML-savvy formats
- Levenshtein - The Levenshtein Distance algorithm finds the minimum number of operations needed to transform one string into the other, where an operation is an insertion, deletion, or substitution of a single character
- monkeylib-prose-diff - monkeylib-prose-diff is a diff program optimized for comparing text files containing prose
- Montezuma - Montezuma is a text search engine for Common Lisp
- persistent-variables - Persistent-variables is a convenience library that makes it easy to serialize and deserialize variables
- PorterStemmer - The Porter Stemmer is a text stemming algorithm by Martin Porter
- read-csv - Read-csv is a library for reading csv (comma-separated-values) files
- sequence-search-replace - A library for search and replace on sequences, which makes it useful on text
- Soundex - The Soundex algorithm indexes words by their sound when pronounced in English, for example to account for differences in spelling
- vana-inflector - A Common Lisp library to easily pluralize and singularize English words.
- vas-string-metrics - vas-string-metrics provides the Jaro, Jaro-Winkler, Levenshtein, and normalized Levenshtein string distance/similarity metrics for text analysis
- CAPTURED-STREAM - CAPTURED-STREAM is a small Common Lisp library for viewing streams as sequences
- flexi-streams - FLEXI-STREAMS is a library which implements "virtual" bivalent streams that can be layered atop real binary/bivalent streams
- Gray streams - "Gray Streams" are a generic function wrapping of the COMMON-LISP streams in the standard library, allowing for further specialization by end users
- gzip-stream - gzip-stream is a simple wrapper around salza which gives CL users gzip compression and decompression in the form of streams (gzip-input-stream and gzip-output-stream)
- odd-streams - ODD-STREAMS implements binary streams with "odd" byte sizes
- simple-stream - Simple-streams are Franz's proposal for a Gray streams replacement
- trivial-bit-streams - Trivial-bit-streams implements flexible buffered bit streams
- trivial-gray-streams - trivial-gray-streams provides an extremely thin compatibility layer for Gray streams
- Francis Leboutte's functions to compute frequencies of characters, digrams and trigrams from a text file (function names and comments in French - some comments in English).
- SPLIT-SEQUENCE. Part of Common Lisp Utilities.
- CL-BibTeX is a replacement for the BibTeX program.
- CL-DATA-FORMAT-VALIDATION - generic interface for parsing and formatting data
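
The Levenshtein entry above describes the distance as the minimum number of single-character insertions, deletions, or substitutions needed to turn one string into another. As a rough, language-neutral illustration of that definition, here is a minimal Python sketch using the usual two-row dynamic-programming table. The function name `levenshtein` is hypothetical and is not the API of the Levenshtein or vas-string-metrics libraries listed here.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    # previous[j] is the distance between the prefix of a processed
    # so far (initially empty) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        # Turning the first i characters of a into "" takes i deletions.
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca -> cb (or match)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
```

For "kitten" and "sitting" this gives 3: two substitutions and one insertion. Keeping only two rows of the table holds memory proportional to the length of the second string, which is one common design choice; the libraries above may well do it differently.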
See also the pages for Regular Expression, XML libraries, HTML Parsers, Lisp Markup Languages, document formats, Unicode support, Unicode and Lisp