Fork me on GitHub
GitHub CPAN
Last Update: 2020-09-19T22:30:48
Avatar

Helmut Wollmersdorfer

Coderwall Badges

Komodo Dragon Velociraptor Velociraptor 3 Walrus Charity

Repositories

  • Description: Set::Similarity - Similarity measures for sets
  • Stars: 3
  • Forks: 0
  • Language: Perl
  • Description: Transliterate Yiddish from Hebrew to Latin script
  • Stars: 2
  • Forks: 0
  • Language: Perl
  • Description: Bit vector implementation of Longest Common Subsequence (LCS)
  • Stars: 1
  • Forks: 0
  • Language: C
  • Description: Check file names
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Glyph names for font makers
  • Stars: 1
  • Forks: 0
  • Description: NicePim is a successor of IcePIM Product Information Management (PIM)
  • Stars: 1
  • Forks: 1
  • Open Issues: 3
  • Language: Perl
  • Description: Scripts for AustrianNewspapers
  • Stars: 1
  • Forks: 1
  • Language: HTML
  • Description: scripts reporting scores and statistics
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Longest Common Subsequence implemented with Bit-Vectors
  • Stars: 1
  • Forks: 1
  • Language: Perl6
  • Description: Support of Unicode CLDR locales in Perl
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Guess script from text using iso15924 codes
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Levenshtein using bit vectors
  • Stars: 1
  • Forks: 0
  • Language: Perl
  • Description: Example Cpp-Dist with MakeMaker and CppGuess to test the tool chain
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: Advent Of Code 2019
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A C++ implementation of the aho corasick pattern search algorithm
  • Stars: 0
  • Forks: 0
  • Description: Align sequences using XS
  • Stars: 0
  • Forks: 0
  • http://aloha-editor.com
  • Description: World’s most advanced Editor gives you a complete new experience when editing. It’s faster than existing technology and offers unprecedented opportunities.
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Archive Devel::Cover test coverage reports
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: NewsEye / READ OCR training dataset from Austrian Newspapers
  • Stars: 0
  • Forks: 0
  • Description: Similarity measures for bags
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Bibliothekarische Organisationen und Personen auf GitHub
  • Stars: 0
  • Forks: 0
  • Description: A web-based editor for Tesseract box files
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Nagios plugin for Nasdeluxe V2.04.06a
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A small C++ implementation of LSTM networks, focused on OCR.
  • Stars: 0
  • Forks: 0
  • Language: Jupyter Notebook
  • Description: Find cut-n-pasted Perl code
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: OTOBO code quality checks.
  • Stars: 0
  • Forks: 0
  • Description: Common library for searching CPAN indexes
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A module that gives easy access to the coverage test results of CPAN modules from the CPAN Cover service
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Quick and effortless CRUD (create/read/update/delete) operations based on database tables
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Simple code build dashboard
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Object Oriented Localization Tool For Perl
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Easily create DBIx::Class fixtures.
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl Devel::NYTProf
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A dictionary based decompounder that recognizes compound words like 'Herrenschuh' and splits them into its individual parts, e.g. 'Herren' and 'Schuh'.
  • Stars: 0
  • Forks: 0
  • Language: PHP
  • Description: Opinionated and Unobtrusive distribution builder
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: OTOBO Administration Manual
  • Stars: 0
  • Forks: 0
  • Description: OTOBO installation tutorial
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: OTOBO user and Agent manual
  • Stars: 0
  • Forks: 0
  • Description: files and code related to the Early Modern OCR Project (eMOP) at the IDHMC
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: C port of Eudex hashing algorithm
  • Stars: 0
  • Forks: 0
  • Description: A blazingly fast phonetic reduction/hashing algorithm.
  • Stars: 0
  • Forks: 0
  • Description: Improved File::ShareDir for Perl
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: stack trace visualizer
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: Interface to OpenType fonts
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Font::TTF Perl Module
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Font::TTF::Scripts perl module
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Repository for Frequency Word List Generator and processed files
  • Stars: 0
  • Forks: 0
  • Language: C#
  • Description: 3i - Cicadellinae Database
  • Stars: 0
  • Forks: 0
  • Description: A checklist to the wasps of Peru (Hymenoptera, Aculeata)
  • Stars: 0
  • Forks: 0
  • Description: Generic Environment for Context-Aware Correction of Orthography
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Git Source Code Mirror - This is a publish-only repository and all pull requests are ignored. Please follow Documentation/SubmittingPatches procedure for any of your improvements.
  • Stars: 0
  • Forks: 0
  • Description: Courses, rating, rounds, statistics
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: German Perl Workshop 2015
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: Grapheme::Ngram - build N-grams respecting Unicode Grapheme Cluster Bounderies
  • Stars: 0
  • Forks: 0
  • Open Issues: 1
  • Language: Perl
  • Description: Tag hierarchy extraction
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: code to remove "noise" from hOCR output of Tesseract OCR.
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Similarity of icons
  • Stars: 0
  • Forks: 0
  • Description: Automatically exported from code.google.com/p/isri-ocr-evaluation-tools
  • Stars: 0
  • Forks: 0
  • Language: C
  • Description: A test distribution using similar filenames
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: approximate and phonetic matching of strings
  • Stars: 0
  • Forks: 0
  • Description: Test package for case duplicates
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: HTML-CMS - Keep It Simple and Stupid
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A standalone and lightweight C library
  • Stars: 0
  • Forks: 0
  • Language: C
  • Description: Tools for extracting labels from specimen scans from and for tesseract OCR
  • Stars: 0
  • Forks: 0
  • Language: Shell
  • Description: Stand-alone language identification system
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Guess the language of text using top 1000 wordlists
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.
  • Stars: 0
  • Forks: 0
  • Description: Longest Common Subsequence implemented with Bit-Vectors
  • Stars: 0
  • Forks: 2
  • Language: Perl
  • Description: Allow differences in the comparison of elements
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: LCS implemented in XS
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: lcstest
  • Stars: 0
  • Forks: 0
  • Language: C
  • Description: Spiro is the creation of Raph Levien. It simplifies the drawing of beautiful curves. (Migrated here from libspiro.sourceforge.net on 2013-04-20)
  • Stars: 0
  • Forks: 0
  • Language: C
  • Description: CISTEM Stemmer for German
  • Stars: 0
  • Forks: 1
  • Language: Perl
  • Description: Parse a word into scored known vs unknown parts
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Mirror of Locale-CLDR on urth.org
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Mirror of Locale-CLDR-LDML on urth.org
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl OO Interface to Uniforum Message Translation
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Parse any language you can describe in BNF -- Release 2
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Lua 5.1 Parser and LUIF to Lua 5.1 Transpiler in barebones SLIF
  • Stars: 0
  • Forks: 0
  • Language: Lua
  • Description: Matrix Inversion by an Algorith of Ahmad Farooq and Khan Hamid
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • http://modernizr.com
  • Description: Modernizr is a JavaScript library that detects HTML5 and CSS3 features in the user’s browser.
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: CMS based on Mojolicious
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: display DataBase Information
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: abstract forms for Mojolicious and DBIx::Class
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Mojolicious plugin to define form fields in a json file
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Mojolicious ❤️ Reveal.js
  • Stars: 0
  • Forks: 0
  • Language: CSS
  • Description: display Server and Perl environment
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Web-presence rewritten
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: The Net::Dict module for Perl, which talks the DICT protocol (RFC 2229)
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A place for notes, plans, etc
  • Stars: 0
  • Forks: 0
  • Description: German language (nature, biology) ground truth
  • Stars: 0
  • Forks: 0
  • Description: OCR English (Bio, Natur) ground truth and testfiles
  • Stars: 0
  • Forks: 0
  • Description: OCR Ground Truth Resources
  • Stars: 0
  • Forks: 0
  • Description: Ergonomic line-by-line transcription of scanned text.
  • Stars: 0
  • Forks: 0
  • Description: OCR GT tools implemented with Mojolicious
  • Stars: 0
  • Forks: 0
  • Language: CSS
  • Description: process the hOCR file format
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: Latin language (nature, biology) ground truth
  • Stars: 0
  • Forks: 0
  • Description: Process file formats used by Tesseract
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
  • Stars: 0
  • Forks: 0
  • Language: Shell
  • Description: Python-based OCR package using recurrent neural networks.
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Ocular is a state-of-the-art historical OCR system.
  • Stars: 0
  • Forks: 0
  • Language: Java
  • Description: An OpenType, TrueType, WOFF, and WOFF2 parser in JavaScript
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: OTOBO is one of the most flexible web-based ticketing systems used for Customer Service, Help Desk, IT Service Management. https://www.otobo.de/
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: OTOBO Docker and Docker Compose files.
  • Stars: 0
  • Forks: 0
  • Description: Calculate all possible LCSs (Longest Common Subsequences)
  • Stars: 0
  • Forks: 1
  • Language: Perl6
  • Description: Parse PHP in Perl
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: generate perfect hashes (alpha)
  • Stars: 0
  • Forks: 0
  • Language: C
  • Description: Look what you can do at the terminal! A collection of Perl6 one liners
  • Stars: 0
  • Forks: 0
  • Description: Perl module to use the Common Local Data Repository from the Unicode Consortium
  • Stars: 0
  • Forks: 0
  • Open Issues: 1
  • Language: Perl
  • Description: :dromedary_camel: List of resources about Perl
  • Stars: 0
  • Forks: 0
  • Description: Open source Farsi OCR, اوسی‌آر متن‌باز فارسی
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowdsourcing applications.
  • Stars: 0
  • Forks: 0
  • Language: Java
  • Description: The CIS language aware OCR document error profiler
  • Stars: 0
  • Forks: 0
  • Description: LCS using Bitvectors in Python
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: Perl QA Hackathon 2015
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: Manuals, lexica, OCR test data for PoCoTo and the profiler
  • Stars: 0
  • Forks: 0
  • Description: The scraperJSON standard for defining web scrapers as JSON objects
  • Stars: 0
  • Forks: 0
  • Description: Sequence alignment algorithms including check-pointing
  • Stars: 0
  • Forks: 0
  • Description: Similarity measures for sets using fast bit vectors (BV)
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Set::Similarity::CosinePDL - implemented using PDL
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Set::Similarity::CosinePP - implemented using pure Perl sparse Vectors
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: A Python Implementation of Simhash Algorithm
  • Stars: 0
  • Forks: 0
  • Language: Python
  • Description: SimString::Wrapper - Wrap simstring command-line interface
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl interface to SimString
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: An interpolating spline based on spirals
  • Stars: 0
  • Forks: 0
  • Language: C
  • Description: Talks GPW 2016
  • Stars: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Parse Bio Taxon Names
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Tesseract ocr training data for Danish written in fraktur script and a few other languages
  • Stars: 0
  • Forks: 0
  • Language: Shell
  • Description: Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)
  • Stars: 0
  • Forks: 0
  • Description: Test::More, Test::Simple and Test::Builder Perl modules for writing tests
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Guess langauge from Text using top 1000 words
  • Stars: 0
  • Forks: 1
  • Open Issues: 3
  • Language: Perl
  • Description: Text::Levenshtein::BVXS - fast implementation using bit vectors
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl toolchain docs, specs, guidelines, etc.
  • Stars: 0
  • Forks: 0
  • Description: Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. -- THIS IS THE BLEEDING-EDGE EXPERIMENTAL VERSION - FOR THE LATEST STABLE VERSION SEE http://ilk.uvt.nl/ucto --
  • Stars: 0
  • Forks: 0
  • Language: C++
  • Description: Access to the Unicode Common Locale Data Repository XML database from perl through a simple API
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: Interface to ICU from perl
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: a clean C library for processing UTF-8 Unicode data
  • Stars: 0
  • Forks: 0
  • Description: A composable RESTful JSON API to DBIx::Class schemas using roles and Web::Machine
  • Stars: 0
  • Forks: 0
  • Language: Perl
  • Description: re-OCR selected books
  • Stars: 0
  • Forks: 0
  • Language: HTML
  • Description: cpants tools
  • Stars: 0
  • Forks: 0
  • Language: JavaScript