|
BFO adds text extraction to PDF Library
London, England, 27 October 2005, - BFO (Big Faceless Organization), a global supplier of java reporting solutions, strengthens the acclaimed Big Faceless PDF Library with the addition of text and image extraction.
The 2.6.2 release adds the ability to extract text and bitmap images from PDF documents, as well as index the PDF using the Apache Lucene search engine. The library extracts and indexes text in Unicode from the form fields, annotations and document metadata as well as the document body, and at roughly 50 pages a second for large documents.
Speed and accuracy of text extraction coupled with the existing features of the PDF Library makes it a wise choice for developers involved in data mining, content management systems and form processing environments. As well as being beneficial in settings that require the ability to search or extract text from large numbers of PDF files.
Text and image extraction requires the Big Faceless PDF Library Extended Edition plus Viewer license, which can be downloaded from BFO's website.
About BFO: BFO is a leading global provider of Java based reporting solutions founded in 1998. They produce a stable of robust Java components for the international B2B market. Such components include Report Generator, Graph and PDF Library. Report Generator comprises both Libraries and converts XML to PDF documents. Using JSP, ASP or similar technology, it is possible to create dynamic PDF reports as quickly and easily as HTML.
Company: BFO
|
| Related press releases |
BFO adds text extraction to PDF Library [2005-10-27 00:00:00]
London, England, 27 October 2005, - BFO (Big Faceless Organization), a global supplier of java reporting solutions, strengthens the acclaimed Big Faceless PDF Library with the addition of text and ima...
|
|
BFO boosts its portfolio with Java PDF Viewer [2005-08-24 00:00:00]
London, England, 24 August 2005 - Big Faceless Organization (BFO), an industry leader in Java software development, are pleased to announce the arrival of an exciting new product from their software ...
|
|
Mesa Dynamics Releases Trapeze 1.1 With PDF-to-HTML Conversion [2004-08-31 00:00:00]
Mesa Dynamics today released Trapeze 1.1, a significant upgrade to its PDF text extraction utility for the Macintosh. Trapeze 1.1 features improved functionality and introduces the ability to convert ...
|
|
Improves User Experience with Linearized PDFs [2008-07-24 05:46:41]
London, England, 25 July 2008, - Big Faceless Organization (BFO) has released version 2.10.3 of their Java PDF Library, featuring substantial improvements across the board.
For the first time the P...
|
|
Batch export PDF form data to CSV or XML file format. [2009-11-14 00:59:28]
A-PDF Form Data Extractor is a simple utility program that lets you batch export PDF form data to CSV or XML file format. It provide a visual form fields extraction rule editor to verify and define wh...
|
|
Lingobit Extractor offers hardcoded strings extraction to resources [2009-10-09 04:02:31]
October 1, 2009: Lingobit Technologies has released a first version of its new tool for software localization, Lingobit Extractor. The tool solves a huge part of internationalization task by extractin...
|
|
VintaSoftTwain.NET Library v5.0 has been released. [2009-05-12 02:32:21]
VintaSoftTwain.NET Library is a pure .NET Library which allows to control work of scanners, cameras and any other TWAIN devices. With this library you can fully control the image acquisition process, ...
|
|
Witzend Search Library™ Component Gains New Features [2008-06-04 19:25:26]
May, 2008 -- Witzend Software has released an important update of the Witzend Search Library, component software that adds sophisticated search capabilities to any Win32 application or Web page. In it...
|
|
QR Code added to PDF Library [2005-07-26 00:00:00]
London, England, 26 July 2005 - Big Faceless Organization (BFO), provider of worldwide Java software solutions, are delighted to announce the release of version 2.4.3 of their award winning PDF Libra...
|
|
.com Solutions Inc. Releases FmPro Migrator 2.24 Enterprise Edition for MacOS X ... [2004-10-07 00:00:00]
.com Solutions Inc., a developer of multi-platform database migration and development tools, has released FmPro Migrator 2.24 for MacOS X and Windows with a new repeating fields extraction feature for...
|
|
|
|
| Annotated Chinese Reader |
Click to Display Annotated Chinese Stream. Online Chinese-English dictionary.
Convenient tool for reading Chinese text. HanZi definition, parts decomposition, stroke order, pronounciation - All is just 2 clicks away! Also used to Listen to web. |
|
| PSPad editor |
PSPad editor is a programmers editor with support for multiple syntax highlighting profiles. It comes with a hex editor, CP conversion, text differences, templates, macros, spellcheck option, auto-completion, Code Explorer and much more. |
|
| EF Find |
EF Find is a powerful search program. Look for files, text, HEX sequences and regular expressions inside 7-Zip, ACE, ARC, ARJ, BZIP2, CAB, CPIO, GZIP, IMG, ISO (ISO9660), LHA, RAR, RPM, SFX, SQX, TAR, TBZ, TGZ, ZIP, Zip64, ZOO archives. |
|
| AlbumMe |
AlbumMe is a software that is easy to use to create flash slideshow from your digital photos, complete with ready-to-use animated templates, stunning transition effect, text captions, music etc. |
|
| Count My Text! |
DEMO this easy to use utility software that gives you accurate complete text count information. It counts all characters and allows you to optimize text by stripping excess code. Ideal for SEO, Webmaster, Forums, Classifieds, Press Releases. WIN/MAC |
|
| Directory Report |
Directory printer. Print to a Printer, Text, Excel, XML or HTML file. Find duplicate files. Find duplicate directories. Multiple file rename. Multiple file change date. Multi file change owner. Shows file owner. Print cyclic redundancy checksum CRC |
|
| Belkasoft Universal IM History Extractor Pro |
Lost in your Internet Messenger history? Not a problem any more!
Belkasoft Universal IM History Extractor Pro allows you to extract your IM
history into such formats as plain text, HTML and XML. |
|
|