Fork me on GitHub
GitHub CPAN
Last Update: 2018-11-10T20:19:53
Avatar

Helmut Wollmersdorfer

Repositories

  • Description: Set::Similarity - Similarity measures for sets
  • Stars: 3
  • Forks: 0
  • Language: Perl
  • Description: Guess script from text using iso15924 codes
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Support of Unicode CLDR locales in Perl
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Longest Common Subsequence implemented with Bit-Vectors
  • Stars: 1
  • Forks: 1
  • Language: Perl6
  • Description: scripts reporting scores and statistics
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: NicePim is a successor of IcePIM Product Information Management (PIM)
  • Stars: 1
  • Forks: 2
  • Open Issues: 3
  • Language: Perl
  • Description: Transliterate Yiddish from Hebrew to Latin script
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Check file names
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Bit vector implementation of Longest Common Subsequence (LCS)
  • Stars: 1
  • Forks: 0
  • Language: C
  • Description: cpants tools
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: re-OCR selected books
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: A composable RESTful JSON API to DBIx::Class schemas using roles and Web::Machine
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Interface to ICU from perl
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Access to the Unicode Common Locale Data Repository XML database from perl through a simple API
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. -- THIS IS THE BLEEDING-EDGE EXPERIMENTAL VERSION - FOR THE LATEST STABLE VERSION SEE http://ilk.uvt.nl/ucto --
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: Perl toolchain docs, specs, guidelines, etc.
  • Stars: 0
  • Forks: 0
  • Description: Levenshtein using bit vectors
  • Stars: 0
  • Forks: 0
  • Description: Guess langauge from Text using top 1000 words
  • Stars: 0
  • Forks: 1
  • Open Issues: 3
  • Language: Perl
  • Description: Test::More, Test::Simple and Test::Builder Perl modules for writing tests
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)
  • Stars: 0
  • Forks: 0
  • Description: Tesseract ocr training data for Danish written in fraktur script and a few other languages
  • Stars: 0
  • Forks: 0
  • Language: Shell
  • Description: Parse Bio Taxon Names
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Talks GPW 2016
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Perl interface to SimString
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: SimString::Wrapper - Wrap simstring command-line interface
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A Python Implementation of Simhash Algorithm
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Set::Similarity::CosinePP - implemented using pure Perl sparse Vectors
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Set::Similarity::CosinePDL - implemented using PDL
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Similarity measures for sets using fast bit vectors (BV)
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Sequence alignment algorithms including check-pointing
  • Stars: 0
  • Forks: 0
  • Description: The scraperJSON standard for defining web scrapers as JSON objects
  • Stars: 0
  • Forks: 0
  • Description: Perl QA Hackathon 2015
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowdsourcing applications.
  • Stars: 0
  • Forks: 0
  • Language: Java
  • Description: Open source Farsi OCR, اوسی‌آر متن‌باز فارسی
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Perl module to use the Common Local Data Repository from the Unicode Consortium
  • Stars: 0
  • Forks: 0
  • Open Issues: 1
  • Language: Perl
  • Description: Look what you can do at the terminal! A collection of Perl6 one liners
  • Stars: 0
  • Forks: 0
  • Description: Parse PHP in Perl
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Calculate all possible LCSs (Longest Common Subsequences)
  • Stars: 0
  • Forks: 1
  • Language: Perl6
  • Description: An OpenType, TrueType, WOFF, and WOFF2 parser in JavaScript
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Ocular is a state-of-the-art historical OCR system.
  • Stars: 0
  • Forks: 0
  • Language: Java
  • Description: Python-based OCR package using recurrent neural networks.
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
  • Stars: 0
  • Forks: 0
  • Language: Shell
  • Description: Process file formats used by Tesseract
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Latin language (nature, biology) ground truth
  • Stars: 0
  • Forks: 0
  • Description: process the hOCR file format
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: OCR English (Bio, Natur) ground truth and testfiles
  • Stars: 0
  • Forks: 0
  • Description: German language (nature, biology) ground truth
  • Stars: 0
  • Forks: 0
  • Description: A place for notes, plans, etc
  • Stars: 0
  • Forks: 0
  • Description: The Net::Dict module for Perl, which talks the DICT protocol (RFC 2229)
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Web-presence rewritten
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: display Server and Perl environment
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Mojolicious ❤️ Reveal.js
  • Stars: 0
  • Forks: 0
  • Language: CSS
  • Description: Mojolicious plugin to define form fields in a json file
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: abstract forms for Mojolicious and DBIx::Class
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: display DataBase Information
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: CMS based on Mojolicious
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • http://modernizr.com
  • Description: Modernizr is a JavaScript library that detects HTML5 and CSS3 features in the user’s browser.
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Lua 5.1 Parser and LUIF to Lua 5.1 Transpiler in barebones SLIF
  • Stars: 0
  • Forks: 0
  • Language: Lua
  • Description: Parse any language you can describe in BNF -- Release 2
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl OO Interface to Uniforum Message Translation
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Mirror of Locale-CLDR-LDML on urth.org
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Mirror of Locale-CLDR on urth.org
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Parse a word into scored known vs unknown parts
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: LCS implemented in XS
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: Allow differences in the comparison of elements
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Longest Common Subsequence implemented with Bit-Vectors
  • Stars: 0
  • Forks: 1
  • Language: Perl
  • Description: Guess the language of text using top 1000 wordlists
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Stand-alone language identification system
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Tools for extracting labels from specimen scans from and for tesseract OCR
  • Stars: 0
  • Forks: 0
  • Language: Shell
  • Description: A standalone and lightweight C library
  • Stars: 0
  • Forks: 0
  • Language: C
  • Description: HTML-CMS - Keep It Simple and Stupid
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Test package for case duplicates
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A test distribution using similar filenames
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Automatically exported from code.google.com/p/isri-ocr-evaluation-tools
  • Stars: 0
  • Forks: 0
  • Language: C
  • Description: Similarity of icons
  • Stars: 0
  • Forks: 0
  • Description: Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: code to remove "noise" from hOCR output of Tesseract OCR.
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Tag hierarchy extraction
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Grapheme::Ngram - build N-grams respecting Unicode Grapheme Cluster Bounderies
  • Stars: 0
  • Forks: 0
  • Open Issues: 1
  • Language: Perl
  • Description: German Perl Workshop 2015
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: Courses, rating, rounds, statistics
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Generic Environment for Context-Aware Correction of Orthography
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: A checklist to the wasps of Peru (Hymenoptera, Aculeata)
  • Stars: 0
  • Forks: 0
  • Description: 3i - Cicadellinae Database
  • Stars: 0
  • Forks: 0
  • Description: Repository for Frequency Word List Generator and processed files
  • Stars: 0
  • Forks: 0
  • Language: C#
  • Description: Font::TTF::Scripts perl module
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Font::TTF Perl Module
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Interface to OpenType fonts
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: stack trace visualizer
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: Improved File::ShareDir for Perl
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: files and code related to the Early Modern OCR Project (eMOP) at the IDHMC
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: Opinionated and Unobtrusive distribution builder
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl Devel::NYTProf
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Easily create DBIx::Class fixtures.
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Object Oriented Localization Tool For Perl
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Quick and effortless CRUD (create/read/update/delete) operations based on database tables
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A module that gives easy access to the coverage test results of CPAN modules from the CPAN Cover service
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Common library for searching CPAN indexes
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Find cut-n-pasted Perl code
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A small C++ implementation of LSTM networks, focused on OCR.
  • Stars: 0
  • Forks: 0
  • Language: Jupyter Notebook
  • Description: Nagios plugin for Nasdeluxe V2.04.06a
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A web-based editor for Tesseract box files
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Similarity measures for bags
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Archive Devel::Cover test coverage reports
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • http://aloha-editor.com
  • Description: World’s most advanced Editor gives you a complete new experience when editing. It’s faster than existing technology and offers unprecedented opportunities.
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Align sequences using XS
  • Stars: 0
  • Forks: 0
  • Description: Example Cpp-Dist with MakeMaker and CppGuess to test the tool chain
  • Stars: 0
  • Forks: 0
  • Language: C++