Fork me on GitHub
GitHub CPAN
Last Update: 2017-06-25T02:38:59
Avatar

Helmut Wollmersdorfer

Repositories

  • Description: Set::Similarity - Similarity measures for sets
  • Watchers: 3
  • Forks: 0
  • Language: Perl
  • Description: Bit vector implementation of Longest Common Subsequence (LCS)
  • Watchers: 1
  • Forks: 0
  • Language: C
  • Description: Transliterate Yiddish from Hebrew to Latin script
  • Watchers: 1
  • Forks: 0
  • Language: Perl
  • Description: NicePim is a successor of IcePIM Product Information Management (PIM)
  • Watchers: 1
  • Forks: 1
  • Open Issues: 2
  • Language: Perl
  • Description: scripts reporting scores and statistics
  • Watchers: 1
  • Forks: 0
  • Language: Perl
  • Description: Longest Common Subsequence implemented with Bit-Vectors
  • Watchers: 1
  • Forks: 1
  • Language: Perl6
  • Description: Support of Unicode CLDR locales in Perl
  • Watchers: 1
  • Forks: 0
  • Language: Perl
  • Description: Guess script from text using iso15924 codes
  • Watchers: 1
  • Forks: 0
  • Language: Perl
  • Description: re-OCR selected books
  • Watchers: 0
  • Forks: 0
  • Language: HTML
  • Description: Example Cpp-Dist with MakeMaker and CppGuess to test the tool chain
  • Watchers: 0
  • Forks: 0
  • Language: C++
  • http://aloha-editor.com
  • Description: World’s most advanced Editor gives you a complete new experience when editing. It’s faster than existing technology and offers unprecedented opportunities.
  • Watchers: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Archive Devel::Cover test coverage reports
  • Watchers: 0
  • Forks: 0
  • Language: HTML
  • Description: Similarity measures for bags
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: A web-based editor for Tesseract box files
  • Watchers: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Nagios plugin for Nasdeluxe V2.04.06a
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: A small C++ implementation of LSTM networks, focused on OCR.
  • Watchers: 0
  • Forks: 0
  • Language: Jupyter Notebook
  • Description: Find cut-n-pasted Perl code
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Common library for searching CPAN indexes
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: A module that gives easy access to the coverage test results of CPAN modules from the CPAN Cover service
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Quick and effortless CRUD (create/read/update/delete) operations based on database tables
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Object Oriented Localization Tool For Perl
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Easily create DBIx::Class fixtures.
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl Devel::NYTProf
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Opinionated and Unobtrusive distribution builder
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: files and code related to the Early Modern OCR Project (eMOP) at the IDHMC
  • Watchers: 0
  • Forks: 0
  • Language: C++
  • Description: Check file names
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Improved File::ShareDir for Perl
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: stack trace visualizer
  • Watchers: 0
  • Forks: 0
  • Language: HTML
  • Description: Repository for Frequency Word List Generator and processed files
  • Watchers: 0
  • Forks: 0
  • Language: C#
  • Description: A checklist to the wasps of Peru (Hymenoptera, Aculeata)
  • Watchers: 0
  • Forks: 0
  • Description: Generic Environment for Context-Aware Correction of Orthography
  • Watchers: 0
  • Forks: 0
  • Language: Python
  • Description: Courses, rating, rounds, statistics
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: German Perl Workshop 2015
  • Watchers: 0
  • Forks: 0
  • Language: HTML
  • Description: Grapheme::Ngram - build N-grams respecting Unicode Grapheme Cluster Bounderies
  • Watchers: 0
  • Forks: 0
  • Open Issues: 1
  • Language: Perl
  • Description: Tag hierarchy extraction
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: code to remove "noise" from hOCR output of Tesseract OCR.
  • Watchers: 0
  • Forks: 0
  • Language: Python
  • Description: Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
  • Watchers: 0
  • Forks: 0
  • Language: Python
  • Description: Automatically exported from code.google.com/p/isri-ocr-evaluation-tools
  • Watchers: 0
  • Forks: 0
  • Language: C
  • Description: A test distribution using similar filenames
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Test package for case duplicates
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: HTML-CMS - Keep It Simple and Stupid
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: A standalone and lightweight C library
  • Watchers: 0
  • Forks: 0
  • Language: C
  • Description: Tools for extracting labels from specimen scans from and for tesseract OCR
  • Watchers: 0
  • Forks: 0
  • Language: Shell
  • Description: Stand-alone language identification system
  • Watchers: 0
  • Forks: 0
  • Language: Python
  • Description: Guess the language of text using top 1000 wordlists
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Longest Common Subsequence
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Longest Common Subsequence implemented with Bit-Vectors
  • Watchers: 0
  • Forks: 1
  • Language: Perl
  • Description: Allow differences in the comparison of elements
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: LCS implemented in XS
  • Watchers: 0
  • Forks: 0
  • Language: C++
  • Description: Parse a word into scored known vs unknown parts
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Mirror of Locale-CLDR on urth.org
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Mirror of Locale-CLDR-LDML on urth.org
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl OO Interface to Uniforum Message Translation
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Parse any language you can describe in BNF -- Release 2
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Lua 5.1 Parser and LUIF to Lua 5.1 Transpiler in barebones SLIF
  • Watchers: 0
  • Forks: 0
  • Language: Lua
  • http://modernizr.com
  • Description: Modernizr is a JavaScript library that detects HTML5 and CSS3 features in the user’s browser.
  • Watchers: 0
  • Forks: 0
  • Language: JavaScript
  • Description: CMS based on Mojolicious
  • Watchers: 0
  • Forks: 0
  • Language: JavaScript
  • Description: display DataBase Information
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: abstract forms for Mojolicious and DBIx::Class
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Mojolicious plugin to define form fields in a json file
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Mojolicious ❤️ Reveal.js
  • Watchers: 0
  • Forks: 0
  • Language: CSS
  • Description: display Server and Perl environment
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Web-presence rewritten
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: The Net::Dict module for Perl, which talks the DICT protocol (RFC 2229)
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: A place for notes, plans, etc
  • Watchers: 0
  • Forks: 0
  • Description: German language (nature, biology) ground truth
  • Watchers: 0
  • Forks: 0
  • Description: OCR English (Bio, Natur) ground truth and testfiles
  • Watchers: 0
  • Forks: 0
  • Description: process the hOCR file format
  • Watchers: 0
  • Forks: 0
  • Language: HTML
  • Description: Latin language (nature, biology) ground truth
  • Watchers: 0
  • Forks: 0
  • Description: Process file formats used by Tesseract
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
  • Watchers: 0
  • Forks: 0
  • Language: Shell
  • Description: Python-based OCR package using recurrent neural networks.
  • Watchers: 0
  • Forks: 0
  • Language: Python
  • Description: Ocular is a state-of-the-art historical OCR system.
  • Watchers: 0
  • Forks: 0
  • Language: Java
  • Description: Calculate all possible LCSs (Longest Common Subsequences)
  • Watchers: 0
  • Forks: 1
  • Language: Perl6
  • Description: Parse PHP in Perl
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Look what you can do at the terminal! A collection of Perl6 one liners
  • Watchers: 0
  • Forks: 0
  • Description: Perl module to use the Common Local Data Repository from the Unicode Consortium
  • Watchers: 0
  • Forks: 0
  • Open Issues: 1
  • Language: Perl
  • Description: Open source Farsi OCR, اوسی‌آر متن‌باز فارسی
  • Watchers: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowdsourcing applications.
  • Watchers: 0
  • Forks: 0
  • Language: Java
  • Description: Perl QA Hackathon 2015
  • Watchers: 0
  • Forks: 0
  • Language: HTML
  • Description: The scraperJSON standard for defining web scrapers as JSON objects
  • Watchers: 0
  • Forks: 0
  • Description: Sequence alignment algorithms including check-pointing
  • Watchers: 0
  • Forks: 0
  • Description: Similarity measures for sets using fast bit vectors (BV)
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Set::Similarity::CosinePDL - implemented using PDL
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Set::Similarity::CosinePP - implemented using pure Perl sparse Vectors
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: A Python Implementation of Simhash Algorithm
  • Watchers: 0
  • Forks: 0
  • Language: Python
  • Description: SimString::Wrapper - Wrap simstring command-line interface
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Perl interface to SimString
  • Watchers: 0
  • Forks: 0
  • Language: C++
  • Description: Talks GPW 2016
  • Watchers: 0
  • Forks: 0
  • Language: JavaScript
  • Description: Parse Bio Taxon Names
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Tesseract ocr training data for Danish written in fraktur script and a few other languages
  • Watchers: 0
  • Forks: 0
  • Language: Shell
  • Description: Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)
  • Watchers: 0
  • Forks: 0
  • Description: Test::More, Test::Simple and Test::Builder Perl modules for writing tests
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Levenshtein using bit vectors
  • Watchers: 0
  • Forks: 0
  • Description: Perl toolchain docs, specs, guidelines, etc.
  • Watchers: 0
  • Forks: 0
  • Description: Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. -- THIS IS THE BLEEDING-EDGE EXPERIMENTAL VERSION - FOR THE LATEST STABLE VERSION SEE http://ilk.uvt.nl/ucto --
  • Watchers: 0
  • Forks: 0
  • Language: C++
  • Description: Access to the Unicode Common Locale Data Repository XML database from perl through a simple API
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: Interface to ICU from perl
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: A composable RESTful JSON API to DBIx::Class schemas using roles and Web::Machine
  • Watchers: 0
  • Forks: 0
  • Language: Perl
  • Description: cpants tools
  • Watchers: 0
  • Forks: 0
  • Language: JavaScript