Aspose.HTML for Java is a comprehensive HTML processing library to perform a vast number of manipulation and management tasks within multi-platform applications. A developer can use the API to render, generate, modify, extract data, and convert HTML document formats without any 3rd Party component. It also supports many other standard file-formats, including SVG, EPUB, MHTML, and Markdown, and rendering to XPS, Image, and PDF file formats.
Aspose.HTML Input/Output Format
Aspose.HTML Input Format
Aspose.HTML Output Format
- Images: TIFF, JPEG, PNG, BMP
Convert HTML to Image, PDF, and Other Formats
The Aspose.HTML API allows developers to convert HTML files to PDF, Image, and many other formats with a few lines of code in their Java applications.
Markdown is a plain-text-formatting syntax markup language and is often used as the preferred format for readme files and documentation ad it allows writing in an easy-to-write and read format. The Aspose.HTML API provides a flexible Markdown Converter that provides 2-way conversion between Markdown and HTML. The converter library includes predefined a set of rules for converting HTML to Markdown using the pure Markdown syntax, GitLab Flavored Markdown modification, or configuring rules in a bespoke manner.
Electronic Web and Books Archives
The Electronic Books (EPUB) and Web Archive (MHTML) formats are supported out-of-the-box by the Aspose.HTML API. It provides high fidelity rendering or MHTML and EPUB files to the supported output formats including, Image, XPS, and PDF file formats.
Web scraping, Web Harvesting, Web Crawling, or Web Data Extraction is the processing of extracting web data from a website. Whilst Aspose.HTML does not support Web Scraping natively, a developer can use the Aspose.HTML API to build a bespoke Web Scraping Solution.