Showing posts with label Text Processing. Show all posts
Showing posts with label Text Processing. Show all posts

Monday, July 05, 2010

XOM : An XML object model. It is a tree-based API for processing XML with Java that simultaneously supports streaming

XOM is an XML object model. It is a tree-based API for processing XML with Java that simultaneously supports streaming. In many use-cases, it can process arbitrarily large documents with effectively constant memory sizes. It strives for correctness, simplicity, and performance, in that order. XOM supports XSLT, Canonical XML, and XInclude.
  • Licenses : LGPL

Sunday, July 04, 2010

JSesh : An editor for ancient Egyptian hieroglyphic texts


JSesh is an editor for ancient Egyptian hieroglyphic texts. It can export the text into picture formats, such as WMF files for easy inclusion in word processors. JSesh can also be used as a library for other projects concerning ancient Egyptian.
  • Licenses : LGPL
  • Operating Systems : OS Independent
  • Implementation : Java

Wednesday, May 12, 2010

OpenGrok : A fast and usable source code search and cross reference engine

OpenGrok is a fast and usable source code search and cross reference engine. It helps you search, cross-reference, and navigate your source tree. It can understand various program file formats and version control histories like Mercurial, Bazaar, Git, ClearCase, Perforce, SCCS, RCS, CVS, or Subversion. In other words, it lets you grok (profoundly understand) the source.
  • Licenses : CDDL
  • Implementation : Java

Xinha : A cross browser, cross platform Embedded WYSIWYG HTML Editor


Xinha is a cross browser, cross platform Embedded WYSIWYG HTML Editor. Although it was originally a stop-gap version of the htmlArea Version 3.0rc1 editor during a period of inactivity in that project, it is now a fully fledged fork. Xinha includes a number of bugfixes and many enhancements over htmlArea, and is in active development by a small and growing number of developers.
  • Operating Systems : OS Independent
  • Implementation : JavaScript, Perl, PHP

Tuesday, January 05, 2010

UJO Framework offers a special architecture of beans different from conventional JavaBeans


UJO Framework offers a special architecture of beans different from conventional JavaBeans. The useful features are XML persistence of an object tree, simple introspection, easy collaboration with Swing tables, and very light weight.
  • Licenses : Apache 2.0
  • Operating Systems : OS Independent
  • Implementation :

Sunday, November 15, 2009

OmegaT+ : A Computer-Assisted Translation (CAT) tools platform

OmegaT+ is a Computer-Assisted Translation (CAT) tools platform. It includes a translation processor with translation memory and projects support, a bitext aligner, and a TMX validator. It has various other tools to process documents for translation.
  • Licenses : Eclipse, GPL
  • Operating Systems : OS Independent
  • Implementation : Java

Wednesday, October 07, 2009

DocBook Doclet creates DocBook XML and class diagrams from Javadoc comments, converts HTML to DocBook, and transfoms DocBook XML into various output formats. It consists of a complete DocBook distribution containing schemas and the DocBook XSL stylesheets. It also integrates Apache FOP as the XSL:FO processor. A Swing application is used to customize the doclet and most of the DocBook XSL parameters and to start the transformations.
  • Licenses : Freeware
  • Implementation : Java

Saturday, October 03, 2009

xslet : A collection of XSLT applets


xslet is a collection of XSLT applets. It includes "xslbook", which displays a book-style document in Web browsers without any other transformation tools, and "xslui", which displays Web pages with modularized XML files in Web browsers without any other transformation tools.
  • Licenses : LGPL
  • Operating Systems : OS Independent
  • Implementation : XSL/XSLT, JavaScript

Thursday, October 01, 2009

Luke : A handy development and diagnostic tool for Apache Lucene


Luke is a handy development and diagnostic tool for Apache Lucene. It accesses existing Lucene indexes and allows you to display and modify their contents in several ways. A user can browse by document number or by term, view documents, copy them to the clipboard, retrieve a ranked list of the most frequent terms, execute a search and browse the results, analyze search results, selectively delete documents from the index, reconstruct the original document fields, edit them, and reinsert them into the index, optimize indexes, and much more. Luke can also be extended through plugins.
  • Operating Systems : OS Independent
  • Implementation : Java

Wednesday, September 16, 2009

XINS : A technology used to define, create, and invoke remote APIs

XINS is a technology used to define, create, and invoke remote APIs. XINS is specification-oriented. When API specifications are written (in XML), XINS will transform them to HTML-based documentation and Java code for both the client-side and the server-side. The communication is based on HTTP. XINS competes with the complex SOAP technology. Main design goals include simplicity, scalability, and testability. XINS is not only a specification technology, but also an application development framework. It offers transaction logging, unique log documentation, and active code generation.
  • Licenses : BSD Revised
  • Operating Systems : OS Independent
  • Implementation : Java

Monday, August 17, 2009

jbookshelf : An electronic book collection organizer and reader

jbookshelf is an electronic book collection organizer and reader. It supports collecting plain files (text, HTML, PDF, etc.), has basic collection search, fulltext collection search (planned), internal viewers for plain text, HTML, RTF, and PDF, notes and citations, book categories, FB2 support (planned), and portability (removable drives support).
  • Licenses : GPLv3
  • Operating Systems : OS Independent
  • Implementation : Java 6, Swing, SwingX, PDFBOX, Cobra

Tuesday, August 04, 2009

Sketsa SVG Editor : A vector drawing application based on SVG

Sketsa SVG Editor is a vector drawing application based on SVG. It allows users to create vector graphics that can be scaled and printed at any resolution, without losing detail or clarity. It features various tools for optimizing content creation, and includes a property palette, a DOM editor, a source editor, a resource editor, SVG-specific shape tools, transformation tools, and additional illustration tools. It uses SVG as its native file format.
  • Licenses : Shareware
  • Operating Systems : OS Independent
  • Implementation : Java

Thursday, July 09, 2009

XINS : A technology used to define, create, and invoke remote APIs. XINS is specification-oriented

XINS is a technology used to define, create, and invoke remote APIs. XINS is specification-oriented. When API specifications are written (in XML), XINS will transform them to HTML-based documentation and Java code for both the client-side and the server-side. The communication is based on HTTP. XINS competes with the complex SOAP technology. Main design goals include simplicity, scalability, and testability. XINS is not only a specification technology, but also an application development framework. It offers transaction logging, unique log documentation, and active code generation.
  • Licenses : BSD Revised
  • Operating Systems : OS Independent
  • Implementation : Java

Tuesday, June 30, 2009

FBReaderJ : An e-book reader for the Android platform

FBReaderJ is an e-book reader for the Android platform. It is a clone of the FBReader book reader written in Java by the same authors. FBReaderJ supports several e-book formats: oeb, epub, and fb2. Direct reading from zip, tar, and gzip archives is supported.
  • Licenses : GPL
  • Implementation : Java