The Portable Document Format (PDF) is a file format developed by Adobe in the 1990s to present documents, including text formatting and images. This is a list of links to articles on software used to manage Portable Document Format (PDF) documents. Wikipedia pages can be exported and saved as PDF files: from the left sidebar, under Print/export, select Download as PDF.
PDF/A is an ISO-standardized version of the Portable Document Format (PDF) specialized for use in the archiving and long-term preservation of electronic documents. The Portable Document Format was created in the early 1990s by Adobe Systems and remained a proprietary format until it was released as an open standard in 2008. It is a file format for storing documents on a computer, created to make it easier to exchange documents.
If you plan to download Wikipedia Dump files to one computer and use an external USB flash drive or hard drive to copy them to other computers, then you will run into the 4 GB FAT32 file size limit.
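One way around the FAT32 limit is to split a large dump into parts before copying and to join the parts again on the target computer. The following is a minimal sketch of that idea, not a recommended tool: the function names and the ".partNNN" naming scheme are hypothetical, and the limit is taken to be 4 GiB minus one byte.

```python
# FAT32 stores a file's size in 32 bits, so one file must stay below 4 GiB.
FAT32_MAX_FILE_BYTES = 2**32 - 1


def split_file(path, part_bytes=FAT32_MAX_FILE_BYTES, buffer_bytes=1024 * 1024):
    """Split `path` into numbered parts of at most `part_bytes` bytes each.

    Returns the part paths in order. The ".partNNN" suffix is a
    hypothetical convention chosen for this sketch.
    """
    if part_bytes <= 0 or buffer_bytes <= 0:
        raise ValueError("part_bytes and buffer_bytes must be positive")
    part_paths = []
    with open(path, "rb") as src:
        index = 0
        while True:
            # Read the first block of the next part; stop at end of file.
            chunk = src.read(min(buffer_bytes, part_bytes))
            if not chunk:
                break
            part_path = f"{path}.part{index:03d}"
            written = 0
            with open(part_path, "wb") as dst:
                while chunk:
                    dst.write(chunk)
                    written += len(chunk)
                    remaining = part_bytes - written
                    if remaining == 0:
                        break
                    chunk = src.read(min(buffer_bytes, remaining))
            part_paths.append(part_path)
            index += 1
    return part_paths


def join_files(part_paths, out_path, buffer_bytes=1024 * 1024):
    """Concatenate the parts, in the given order, back into one file."""
    with open(out_path, "wb") as dst:
        for part_path in part_paths:
            with open(part_path, "rb") as src:
                while True:
                    chunk = src.read(buffer_bytes)
                    if not chunk:
                        break
                    dst.write(chunk)
```

Reading through a small buffer keeps memory use low regardless of the dump size. The parts must be joined in the same order in which they were written.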
If you seem to be hitting the 2 GB limit, try using wget version 1. You can also resume an interrupted download, for example with wget -c.

Suppose you are building a piece of software that at certain points displays information that came from Wikipedia.
If you want your program to display the information in a different way than can be seen in the live version, you'll probably need the wikicode that is used to enter it, instead of the finished HTML.
Also, if you want to get all the data, you'll probably want to transfer it in the most efficient way possible. Requesting rendered pages is time consuming both for you and for the Wikipedia servers. To access any article in XML, one at a time, access Special: Read more about this at Special:

Please be aware that live mirrors of Wikipedia that are dynamically loaded from the Wikimedia servers are prohibited. Please see Wikipedia:Mirrors and forks. Please do not use a web crawler to download large numbers of articles.
Aggressive crawling of the server can cause a dramatic slow-down of Wikipedia. You can run SQL queries on the current database dump using Quarry, as a replacement for the disabled Special:Asksql page.

See also: Database layout. An SQL file is provided for initializing a MediaWiki database. The XML schema for each dump is defined at the top of the file and is also described in the MediaWiki export help page.

Putting one of these dumps on the web unmodified will constitute a trademark violation. They are intended for private viewing in an intranet or desktop installation.

Kiwix is by far the largest offline distribution of Wikipedia to date. As an offline reader, Kiwix works with a library of contents stored as ZIM files.

Aard Dictionary is an offline Wikipedia reader. It does not display images.

The wiki-as-ebook store provides ebooks created from a large set of Wikipedia articles, with grayscale images for e-book readers.

The wikiviewer plugin for Rockbox permits viewing converted Wikipedia dumps on many Rockbox devices. It needs a custom build and a conversion of the wiki dumps, following the instructions published for the plugin. The conversion recompresses the file and splits it into 1 GB files and an index file, all of which need to be in the same folder on the device or micro SD card.

Browsing a wiki page is just like browsing a Wiki site, but the content is fetched and converted from a local dump file on request from the browser.

XOWA is a free, open-source application that helps download Wikipedia to a computer.
It provides access to all of Wikipedia offline, without an internet connection. It is currently in the beta stage of development, but is functional.

WikiFilter is a program which allows you to browse over dump files without visiting a Wiki site.

WikiTaxi is an offline reader for wikis in MediaWiki format. It enables users to search and browse popular wikis like Wikipedia, Wikiquote, or WikiNews without being connected to the Internet. WikiTaxi works well with different languages such as English, German, and Turkish, but has a problem with right-to-left language scripts. WikiTaxi does not display images. Only two files are required for reading with WikiTaxi. Copy them to any storage device (memory stick or memory card) or burn them to a CD or DVD and take your Wikipedia with you wherever you go.

BzReader is an offline Wikipedia reader with fast search capabilities. It requires version 2 of the Microsoft .NET Framework.

MzReader by Mun works with BzReader, though it is not affiliated with it, and allows further rendering of wikicode into better HTML, including an interpretation of the monobook skin. It aims to make pages more readable.
It requires Microsoft Visual Basic 6. That is, it builds a wiki farm that the user can browse locally.

The addition of transparency to PDF was done by means of new extensions that were designed to be ignored in products written to earlier versions of the PDF specification. As a result, files that use a small amount of transparency might view acceptably in older viewers, but files making extensive use of transparency could be viewed incorrectly in an older viewer without warning.
The transparency extensions are based on the key concepts of transparency groups, blending modes, shape, and alpha. The model is closely aligned with the features of Adobe Illustrator version 9. The blend modes were based on those used by Adobe Photoshop at the time, and the formulas for calculating them have since been published. The concept of a transparency group in the PDF specification is independent of existing notions of "group" or "layer" in applications such as Adobe Illustrator. Those groupings reflect logical relationships among objects that are meaningful when editing those objects, but they are not part of the imaging model.
PDF files may contain interactive elements such as annotations, form fields, video, 3D and rich media. Interactive forms come in two formats, AcroForms and XFA, which coexist in the PDF specification today. Alongside the standard PDF action types, interactive forms (AcroForms) support submitting, resetting, and importing data. The "submit" action transmits the names and values of selected interactive form fields to a specified uniform resource locator (URL).
AcroForms can keep form field values in external stand-alone files containing key:value pairs. The Forms Data Format (FDF) can be used when submitting form data to a server, receiving the response, and incorporating it into the interactive form. It can also be used to export form data to stand-alone files that can be imported back into the corresponding PDF interactive form.
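The export and re-import of field values can be illustrated with XFDF, the XML-based counterpart of FDF. This is a minimal sketch under stated assumptions: the element layout (xfdf, fields, field with a name attribute, value) and the namespace URI are assumed here and should be checked against the XFDF specification, the function names are hypothetical, and only flat, non-empty text fields are handled.

```python
import xml.etree.ElementTree as ET

# Namespace assumed for XFDF documents; verify against the specification.
XFDF_NS = "http://ns.adobe.com/xfdf/"


def fields_to_xfdf(fields):
    """Serialize a flat {field name: value} mapping as an XFDF string."""
    ET.register_namespace("", XFDF_NS)  # emit XFDF as the default namespace
    root = ET.Element(f"{{{XFDF_NS}}}xfdf")
    fields_el = ET.SubElement(root, f"{{{XFDF_NS}}}fields")
    for name, value in fields.items():
        field_el = ET.SubElement(fields_el, f"{{{XFDF_NS}}}field", {"name": name})
        value_el = ET.SubElement(field_el, f"{{{XFDF_NS}}}value")
        value_el.text = str(value)  # values are stored as text
    return ET.tostring(root, encoding="unicode")


def xfdf_to_fields(xml_text):
    """Parse an XFDF string back into a {field name: value} mapping."""
    root = ET.fromstring(xml_text)
    ns = {"x": XFDF_NS}
    result = {}
    for field_el in root.findall("x:fields/x:field", ns):
        value_el = field_el.find("x:value", ns)
        result[field_el.get("name")] = value_el.text if value_el is not None else None
    return result
```

Because every value is written as text, numbers come back as strings after a round trip. A real exchange file would normally also identify the PDF form the data belongs to, which this sketch leaves out.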
In addition, XFDF does not allow the spawning, or addition, of new pages based on the given data, as can be done when using an FDF file.

Technically speaking, tagged PDF is a stylized use of the format that builds on the logical structure framework introduced in an earlier version of PDF.
Tagged PDF defines a set of standard structure types and attributes that allow page content (text, graphics, and images) to be extracted and reused for other purposes.

Layers, or as they are more formally known, Optional Content Groups (OCGs), refer to sections of content in a PDF document that can be selectively viewed or hidden by document authors or consumers. This capability is useful in CAD drawings, layered artwork, maps, multi-language documents, and so on. Basically, it consists of an Optional Content Properties Dictionary added to the document root. This dictionary contains an array of Optional Content Groups (OCGs), each describing a set of information and each of which may be individually displayed or suppressed, plus a set of Optional Content Configuration Dictionaries, which give the status (Displayed or Suppressed) of the given OCGs.
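The relationship between the groups and the configuration dictionaries can be modelled in a few lines. This is only an illustrative data model: the class and attribute names are hypothetical and do not correspond to the actual dictionary keys of the PDF specification.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class OptionalContentGroup:
    """One layer, identified here only by its name."""
    name: str


@dataclass
class OptionalContentConfig:
    """One configuration: a default state plus per-group overrides."""
    name: str
    base_state: bool = True  # True means displayed, False means suppressed
    overrides: dict = field(default_factory=dict)  # group name -> bool

    def is_displayed(self, group):
        return self.overrides.get(group.name, self.base_state)


@dataclass
class OptionalContentProperties:
    """The document-level structure: all groups plus all configurations."""
    groups: list
    configs: list

    def visible_groups(self, config_name):
        config = next(c for c in self.configs if c.name == config_name)
        return [g.name for g in self.groups if config.is_displayed(g)]
```

For a multi-language document, for example, one configuration could display an "English" group and suppress a "German" group, while a second configuration does the opposite, with a shared "Artwork" group displayed in both.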
A PDF file may be encrypted for security, or digitally signed for authentication. The standard security provided by Acrobat PDF consists of two different methods and two different passwords: a user password and an owner password. The user password encrypts the file, while the owner password does not, instead relying on client software to respect the restrictions it sets. An owner password can easily be removed by software, including some free online services. Even without removing the password, most freeware or open source PDF readers ignore the permission "protections" and allow the user to print or make copies of excerpts of the text as if the document were not limited by password protection.
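The reason these restrictions are weak is that they amount to flags the reader is asked to honor. The sketch below decodes such a flag word and shows a reader that chooses to respect it. The bit positions (3 for printing, 4 for modifying, 5 for copying, 6 for annotating, counting from 1 at the low-order bit) are assumed from the standard security handler's permission table and should be verified against the PDF specification; the function names are hypothetical.

```python
# Assumed 1-based bit positions in the permission flag word.
PERMISSION_BITS = {
    "print": 3,
    "modify": 4,
    "copy": 5,
    "annotate": 6,
}


def decode_permissions(p_value):
    """Map a signed 32-bit permission integer to {action: allowed}."""
    unsigned = p_value & 0xFFFFFFFF  # view the signed value as raw bits
    return {
        name: bool(unsigned & (1 << (bit - 1)))
        for name, bit in PERMISSION_BITS.items()
    }


def conforming_reader_allows(p_value, action, opened_as_owner=False):
    """Decision made by a reader that chooses to honor the flags.

    Nothing in the file format enforces this check; a reader that skips
    it can print or copy regardless of the flags.
    """
    if opened_as_owner:
        return True
    return decode_permissions(p_value)[action]
```

The check lives entirely in the reader's own code, which is why software that ignores it behaves as if the document were unrestricted.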
There are a number of commercial solutions that offer more robust means of information rights management. Not only can they restrict document access but they also reliably enforce permissions in ways that the standard security handler does not. Where a file carries signed permissions, the signature is used to validate that the permissions have been granted by a bona fide granting authority. For example, Adobe Systems grants permissions to enable additional features in Adobe Reader, using public-key cryptography. Adobe Reader verifies that the signature uses a certificate from an Adobe-authorized certificate authority.
Any PDF application can use this same mechanism for its own purposes. PDF files can have file attachments which processors may access and open or save to a local filesystem. PDF files can contain two types of metadata. The first is stored in the optional Info trailer of the file. A small set of fields is defined, and can be extended with additional text values if required.
This method is deprecated in PDF 2.0. The second type allows metadata to be attached to any stream in the document, such as information about embedded illustrations, as well as to the whole document (by attaching to the document catalog), using an extensible schema. PDFs may be encrypted so that a password is needed to view or edit the contents. PDF files may also contain embedded DRM restrictions that provide further controls that limit copying, editing or printing.
These restrictions depend on the reader software to obey them, so the security they provide is limited. PDF documents can contain display settings, including the page display layout and zoom level. Adobe Reader uses these settings to override the user's default settings when opening the document.
Anyone may create applications that can read and write PDF files without having to pay royalties to Adobe Systems; Adobe holds patents to PDF, but licenses them for royalty-free use in developing software complying with its PDF specification.
PDF files can be created specifically to be accessible for disabled people. Some software can automatically produce tagged PDFs, but this feature is not always enabled by default. Adding tags to older PDFs and those that are generated from scanned documents can present some challenges. One of the significant challenges with PDF accessibility is that PDF documents have three distinct views, which, depending on the document's creation, can be inconsistent with each other.
The three views are (i) the physical view, (ii) the tags view, and (iii) the content view. The physical view is what is displayed and printed (what most people consider a PDF document). The tags view is what screen readers and other assistive technologies use to deliver a high-quality navigation and reading experience to users with disabilities. The content view is based on the physical order of objects within the PDF's content stream and may be displayed by software that does not fully support the tags view, such as the Reflow feature in Adobe's Reader.

PDF attachments carrying viruses have been discovered. The first such virus was activated with Adobe Acrobat, but not with Acrobat Reader. From time to time, new vulnerabilities are discovered in various versions of Adobe Reader, prompting the company to issue security fixes.
Other PDF readers are also susceptible. One aggravating factor is that a PDF reader can be configured to start automatically if a web page has an embedded PDF file, providing a vector for attack.
On March 30, security researcher Didier Stevens reported an Adobe Reader and Foxit Reader exploit that runs a malicious executable if the user allows it to launch when asked.
A PDF file is often a combination of vector graphics, text, and bitmap graphics. Two PDF files that look similar on a computer screen may be of very different sizes. For example, a high resolution raster image takes more space than a low resolution one. Typically higher resolution is needed for printing documents than for displaying them on screen. Other things that may increase the size of a file are embedding full fonts, especially for Asian scripts, and storing text as graphics.
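The effect of resolution on size can be made concrete with a small calculation. This sketch estimates the uncompressed size of a raster image from its printed dimensions and resolution; the function name is hypothetical, three bytes per pixel (8-bit RGB) is an assumption, and real PDF files usually compress images, so actual sizes will be smaller.

```python
def raster_bytes(width_in, height_in, dpi, bytes_per_pixel=3):
    """Uncompressed size in bytes of an image covering the given area.

    Assumes square pixels and, by default, 8-bit RGB (3 bytes per pixel).
    """
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    return width_px * height_px * bytes_per_pixel


# An 8 x 10 inch image at a screen-like and a print-like resolution.
screen_size = raster_bytes(8, 10, 100)  # 800 x 1000 pixels
print_size = raster_bytes(8, 10, 300)   # 2400 x 3000 pixels
print(screen_size, print_size, print_size / screen_size)
```

Because the pixel count grows with the square of the resolution, tripling the resolution makes the uncompressed image nine times larger.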
PDF viewers are generally provided free of charge, and many versions are available from a variety of sources. Raster image processors (RIPs) are used to convert PDF files into a raster format suitable for imaging onto paper and other media in printers, digital production presses and prepress, in a process known as rasterisation.

Adobe Acrobat is one example of proprietary software that allows the user to annotate, highlight, and add notes to already created PDF files. The freeware Qiqqa can create an annotation report that summarizes all the annotations and notes one has made across a library of PDFs. There are also web annotation systems that support annotation in PDF and other document formats. In cases where PDFs are expected to have all of the functionality of paper documents, ink annotation is required.

Examples of PDF software offered as online services include Scribd for viewing and storing, Pdfvue for online editing, and Thinkfree and Zamzar for conversion. The company released an upgrade to their Harlequin RIP with the same capability. Agfa-Gevaert introduced and shipped Apogee, the first prepress workflow system based on PDF. The Preview application can display PDF files.

The Free Software Foundation once counted among its high priority projects "developing a free, high-quality and fully functional set of libraries and programs that implement the PDF file format and associated technologies to the ISO standard." Poppler is based on the Xpdf code base.
There are also commercial development libraries available, as listed in List of PDF software.

It is included in a number of projects such as Firefox, a Chromium extension, et cetera.

In the Range section, All exports the entire document. The process and dialogs are the same for Writer, Calc, Impress, and Draw, with a few minor differences mentioned in this section. The PDF Options dialog opens.