PDF text to XML extraction within website

$250-750 USD

In progress
Posted over 13 years ago

Paid on delivery
I need a web application that extracts text from PDF pages (magazine pages) into XML format.

* Extraction of text from one or more PDF pages.
* The final result needs to be formatted HTML text.

I also require editing capabilities. I do not want to extract just the plain text; the text needs to keep a certain format.

* The extracted text does NOT need to have the same visual format as the source PDF text. It is enough if just the text is extracted. I need to retain similar formatting: body text should remain body text, and headlines should be recognizable as headlines. It is enough to distinguish between 2 or 3 different font sizes (headline, paragraph, ...). The extracted text only needs to have one font.

BUT: the text needs to be formatted according to the PDF, meaning:
- Text shall stay text
- Headlines shall stay headlines

This needs to be recognized automatically to a certain degree. I want to keep the required user interaction as low as possible. There are tools that can analyze the font and the text size during extraction, and you need to use one of them. These could be tools such as:
[login to view URL]
[login to view URL]
I am open to other suggestions too, though. I will then purchase the required license for the final application.

* The user shall be able to modify the extracted text, e.g. add blank lines to it or increase the font size of selected text, and save the changes again.
* The user shall be able to select an area of the extracted text and add a unique ID tag to it, so that this area can be accessed later through its ID.
* The images of a page also need to be extracted (reduced to a fixed maximum size) and placed at the end of the extracted page.

Plus: very simple user management is required.

Server: I am not tied to a certain type of server (it can be Apache or Windows).

An example can be provided to each bidding developer.
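The headline/paragraph recognition described above could be approached roughly as follows. This is only a minimal sketch, not a finished design: it assumes the chosen extraction tool (the ones named in the posting are hidden behind "[login to view URL]") can deliver each text run together with its font size. The `(text, font_size)` tuple format, the function names `classify_spans` and `to_html`, and the `tolerance` parameter are all hypothetical and exist only for illustration. The font size that carries the most characters is treated as body text, the largest size above it as a headline, and anything in between as a sub-headline.

```python
from collections import Counter
from html import escape


def classify_spans(spans, tolerance=0.5):
    """Label (text, font_size) spans as headline, subheadline or paragraph.

    `spans` is a hypothetical input format: whatever PDF tool is used would
    need to be adapted to produce these tuples.
    """
    if not spans:
        return []

    # The font size covering the most characters is assumed to be body text.
    weight = Counter()
    for text, size in spans:
        weight[round(size, 1)] += len(text)
    body_size = weight.most_common(1)[0][0]

    # The largest size clearly above the body size counts as the headline.
    larger = [s for s in weight if s > body_size + tolerance]
    top_size = max(larger, default=None)

    labelled = []
    for text, size in spans:
        size = round(size, 1)
        if size <= body_size + tolerance:
            kind = "paragraph"
        elif size == top_size:
            kind = "headline"
        else:
            kind = "subheadline"
        labelled.append((kind, text))
    return labelled


def to_html(labelled):
    """Render labelled spans as simple HTML, escaping the text content."""
    tags = {"headline": "h1", "subheadline": "h2", "paragraph": "p"}
    return "\n".join(
        f"<{tags[kind]}>{escape(text)}</{tags[kind]}>" for kind, text in labelled
    )


# Made-up sample data standing in for one extracted magazine page.
sample_spans = [
    ("City Guide", 24.0),
    ("Berlin has many museums & galleries.", 10.0),
    ("Where to eat", 16.0),
    ("Try the street food.", 10.0),
]
sample_html = to_html(classify_spans(sample_spans))
```

Because the result is ordinary HTML with only two or three element types, the later editing steps (adding blank lines, enlarging selected text, attaching an ID to a selected area) could operate on the markup directly. Real magazine layouts will need more than a size threshold, so some manual correction by the user should still be expected.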
Project ID: 887342

About the project

9 proposals
Remote project
Active 13 years ago


About the client

Berlin, Germany
5.0
62
Payment method verified
Member since Feb 4, 2009
