Events Add an event Speakers Talks Collections
 
Electronic Document Conference 2019
June 17, 2019, Seattle, WA, USA
Electronic Document Conference 2019
Request Q&A
Request Q&A
Video
I didn't know PDF could do that!
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free
Add to favorites
115
I like 0
I dislike 0
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free
  • Description
  • Transcript
  • Discussion

About the talk

Matt Kuznicki, PDF Association | Electronic Document Conference 2019

About speaker

Matt Kuznicki
Chairman of the Board at PDF Association

Matt Kuznicki has been actively involved with PDF products and technologies since 2000. He is a recognized expert in technical PDF and imaging matters and an active participant in PDF standards activities as well as the application of PDF and other document technologies to solve a variety of business problems. Matt joined the PDF Association board in 2014. Of his role as chairman, Matt comments: ”I am committed to helping the PDF Association serve its membership; both as a technical resource for PDF and the various standards associated with PDF, as well as an advocate for PDF technologies and how these can help drive value, ensure a platform for reliable communication and to make the world a better place.”

View the profile
Share

I'd like to take the next little bit to talk about some of the things that PDF enables that you may not be aware of. My hope is that by the end of this you will learn at least one thing that you can come back that you could take back and say I didn't know PDF could do that. I didn't know PDF had this capability. PDF it's big standard. It contains multitude. The session is not going to be PDF specification trivia. We're not going to Parker's and jpeg2000 stream if that's what you're hoping for. I'm sorry.

Okay, sweet lost one person already instead. We think of PDF as a document format, which it is and we think of documents and pieces of information that we author that we publish that we share some times that we collaborate a PDF has a number of capabilities related to working together on documents working together through the process of updating documents of authoring document and a preserving documents in the long-term. We're going to break this into for broad topics security keeping information safe collaboration, which is you and I and

others working together interoperability, which will be your tools in my tools working together information and information exchange, which is about our computers understanding each other's date going to give an important disclaimer things and I will talk about just because PDF can do it does not need your software cat. Think I'll be talking about a number of things that there may not be any implementations that take full advantage of capabilities, but I'm hoping to inspire you to think about rallying for bringing some of these

to Market. I am at kuzmicki. I cheer the PDF Association. I've been involved in the PDF developer Community since the year 2010 the year 2,000 sounded futuristic PDF was futuristic back in 2009 how it is established technology. I've been involved in the developer Community since 2012. Remember when that felt like it was a long time ago, but maybe not. And I have had for almost all this time interest in promoting the widespread adoption of PDF and its capabilities are promoting interoperability.

So let's start off with barely like topic. It's not about saving the world or at least let's talk about a pedia has helped to save the world just a little bit. Estimated that more than 10 trillion PDFs have been made since PDF was created. If we think maybe PDF has prevented 10% of those from being printed trillion documents that haven't gone through a printer. These are just one page documents in pages that have been saved. Still a trillion sheets of paper. What is a

trillion sheets of paper? That's north of 50 million trees saved by PDF. What's 50 million trees that is enough trees. If we planted them that would take up the entire Seattle area the city of Seattle million trees. We might take up most of the whole Metro Seattle. Are you removing the trees back? That's how many trees we may have saved with PDF if not more. If you want to think about stacking paper in dreams, really tightly this would actually go around the earth more than two and a half times. I would look kind of weird to have a ring of

paper around the earth nevermind twice, but we prevented with PDF enough pieces of paper being printed that if we wanted to we could do this, you know, what else we could do we could make one PDF that held all trillion of these pages. There's no longer 10gb file size limit in PDF. There's no longer 200 in by 200 in page size limit. We can cram all these trillion pages that we have prevented from being printed into one PDF file might be hard to find a drive to store

right now, but we can do it. We can also even take all of these assuming they're A4 letter size Pages or not too far off. We could have posed all of these on one PDF sheet. If we wanted going to be hard to find a printer to actually roll that she threw but we can do it. PDF page sizes are big enough. We might be able to impose all of the PDFs that are estimated to have been made in all of time on to one sheet. Not many people know the PDF do this, but

you can just don't try it with your implementation. Please more seriousness, which save the world. So let's secure the world or at least the world. How about that? We're all familiar with PDF document security. We're familiar with password security and being prompted for passwords to open PDF. What we may not appreciate as much is the standardized mechanisms behind security that allow a document that you create in Microsoft Word that requires a password to be opened up interviewer like foxes PDF viewer, for example, the interoperability aspect and not being

locked into one implementation. This mechanism we know it allows users to save the state actions that we shouldn't be allowed to do. Audrey needs used to be mandatory. But as of 30 mm 1.2 come after you anymore, but I still I'll be happy to PDF Association urge you to honor those. It was PDF to low. We gain aes-256 compatibility which is generally thought of as resistant or able to withstand Quantum cryptographic attacks. So if you put a good password on your PDF now and you use a PDF to a

security you should be able to fend off those weird quantum computers a bit later on. But this is a security that we are costing to Securities so much more than just password protection. Security is also a public-private key Security in PDF that lets you create a PDF and only allow for specific individuals to be able to open this PDF not just specific individuals to open it. You can give different individuals different permission for the spot. You can I can say that use array

should only be allowed to view this document user b c and d should be allowed not just to open but to edit and to change this document again. This is a workflow that takes advantage of interoperability standards supporting implementation in Adobe Acrobat for example, specify a specific individuals for a document and in a compatible viewer open those up and only Those individuals can open the document document embedded data streams. We can use Crypt filters to embed to to encrypt only parts of document which means the different

parts of PDF inside a PDF can be encrypted through different mechanisms. This allows workflow such as online access checking such as digital Rights Management built into PDF allows for example, storing thumbnail images on a PDF page and requiring authentication to see a high-resolution version of those. This allows for better protection than what's in the PDF specification. Of course all of the tools in your workflow need to be aware of your implementation. But PDF allows you to do this PDF

even allows you to embed a document in your PDF to let her use her know that they need a proprietary or a supporting workflow or implementation. Security is not just about protection, but it's also about authentication that many of us know the digital signatures are common in. You stir signing agreements for specifying a a legal agreement what we might not be aware of is that digital signatures do not need to be visible invisible digital signatures are in very handy tool for putting in the ability to

detect tampering with files without a flying that that signature constitutes a legal agreement. We can store in PDF via the document security storm and it's time-stamped dictionary the information that we need to do this validation in this tampering protection not just today but far into the future and we can use the saw digital signatures whether they are visible or not to the include the ability to State. What is a permissible modification to a PDF before it should be considered tampered with Example form-filling commenting you're able to

digitally sign a PDF and to state that users should be allowed to fill in form without triggering a security warning. speaking of users PDF enables collaboration enables people to work together Of one of the coolest features of PDF that I think is not well appreciated is its standardized annotation for the fact that various annotations markup commenting digital signatures other forms of annotations are well-defined means that we can pass documents from person to person we can bark these

up we can send these back. We we know this. FDF forms data format and it's XML Varian X FDF allow us to be able to pass annotations and comments back and forth from person to person without having to keep passing a whole PDF document. We can just pass this dfdf the information about a patient back and forth collaborate in real-time just through passing this information instead of whole documents. As you would need to do it, for example other file formats if we can pass comments and in markup, but we can also pass templates

for when somebody wants to add pages to a PDF what added pages should look like we can use this not just to think of people typing on a keyboard adding pages. Also use this to think about supporting automated page additions from template. So automated statement generation, for example from a common agreed-upon template that you and I and others we may have come to agree that this is our template now, we can automate the generation a statement sort of like format documents from that template. PDF has an incremental

update capability, which I think most of us are aware of. Less well-known is that this supports versioning of PDF content that you can save different versions of PDF you can use this to support editing and modification workflows. What's even less well-known is that these updates do not need to be one after the other in a linear Che UPF actually support the branching models of of incremental update. So that contributors to work from a common Baseline. We don't need to pass along

a PDF every time somebody yet. It's if we can let people edit it on their own NPD of has the information to be able to reconcile these addicts in these updates. Incremental updates also allow the creation of PDFs that are standards-compliant with more than one version of PDF. So for example, if you have a PDF that you want to support PDF 200 capabilities, but you also want to make sure that earlier systems that don't support PDF to low can handle these versioning is an easy way to

bring out a PDF 17 document and then with an update transform the content that you want to be pedia 200 enhanced into is now PDF to a Word document. Incremental updates allow these we know that they allow storing and accessing versions of working draft revisions of documents surround and we've seen how to maximize compatibility tables working together. PDF also enables our tools to work together. PDF is an open data platform. I think something that many of us have heard a lot about will hear more about that panel this afternoon, but Associated files which came about in

PDF a 3pt of 200 and their file specification dictionary now allow embedded files in PDF along with the information about what these files me and these files can be attached not Justa documents, but two pages to content streams to specific images and other streams in a PDF so we can store Source Pages for PDF documents PDF Pages such as a word file that I might have used for authoring a PDF in a PDF itself. Track that out for editing repurposing and workflows. We can take a metadata strength and weakness Associated schema DTD or

other characteristics. So that programs know how to process this metadata. We can provide alternate versions alternate format Renditions for tag PDF content. We can embed math. Ml for example directly in with tag PDF for extraction for those work clothes. They can take advantage of that. And we can connect up content and stag PDF elements to these packages of data. This all comes together to allow the use of PDF is open data container. Likewise the standardization of annotation formats and form data

formats that I talked about allows not just his people work collaboratively on editing and working a document but it allows different tools from different vendors to do the same. This enables foreclosed such as using Adobe Acrobat to propose redactions in a document. And bringing these to a different program say I want to have an approval workflow these redactions before they get a plot. Standardization allows this interoperability between these different programs. We can pre-populate form data from one program before it's presented to

user or we can populated at the users. The user environment at rendition time they can open update and they can save in their preferred environment and transfer information back in a standardized way to different Tools in different formats. Again, we can we can allow multiple people to work on PDF collaboratively in real pie. Not just in the same ecosystem, but across different tools from different vendors now. PDFs in is I touched on it here to different PDF standards and version. So PDF A3 file could be a valid PDF 1-7 file and viewed

appropriately. This is pretty intuitive. I talked about how you can use version updates to make PDFs Backward Compatible and forward compatible. A PDF scan here to multiple substandard there multiple variants at the same time PDF A3 file to be a valid PDF you a one file can be archivable and accessible. We could also be a document requirements what documents tell tools and workflows and users what PDF features they need to support for an Optimum experience. The information

exchange PDF also allows computers to work together. So document Parts allow grouping collections of pages together in a single unit so that I can repurpose these units. This comes from variable and transactional world where I might need to combine different pieces of a finished work together, but it also allows specifying alternate viewing order is processing order in for PDF processors and for extracting specific collections of PDF Pages out for specific using conditions. Logical content allows were close to Define their own semantics for the structure

in PDF most of us if we're familiar with logical content. We're familiar with it from TAG PDF what we may not realize that tag PDF is just one implementation semantic structure the PDF standard allows implementers and communities to make their own semantic structure. They can use what's to find in PDF for structure, but not required to do so. Workflow structures content semantically, however, it wishes with logical content. This makes it fairly trivial

to represent XML and Coatings in PDF or other Mark of languages in PDF semantic structure through logical content. We can even put multiple attributes from SATs on structure for maximum interoperability. So I may have mathml attributes. I may have HTML attributes as well on the same content for use with different processors. That's complicated tag. PDF is a standardized use of logical content is well known for accessibility enablement. But it also enables transforming Pages for alternate forms for alternate

formats and for repurposing information and for mapping this into other definitions for other work floats. That's a that's a lot of data and I'm short time. But PDF let you express all this and let's programs and computers work together to exchange data logical content is separate from visual content on a PDF page it's related, but it's a much richer. It's a much richer way to describe content. It can be ordered differently than visual ordering. It can have different semantics and content

extraction as is best for a given workflow. So PDF Pages their visual appearance design for exchange with other humans logical content is designed for computers to speak together and their own native languages. That was a quick tour wrap up. DDF is well known for Reliable visual presentation. Last known has been tedious capabilities for collaboration for interoperability for information exchange. I hope that you've learned something about how we can use

PDF to work together or powerful piece features of PDF the support different use cases. Tool vendors implementation can use these to work together interoperability. So I'm going to end with a call to action for those vendors those implementers in this room. I'm going to say let's work together to work together. But that say, thank you.

Cackle comments for the website

Buy this talk

Access to the talk “I didn't know PDF could do that!”
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free

Access to all the recordings of the event

Get access to all videos “Electronic Document Conference 2019”
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free
Ticket

Interested in topic “Business Management”?

You might be interested in videos from this event

October 5 - 7, 2020
Online
62
4.86 K
brand communities, cmx, community platform, community relations, networking, swot, virtual event

Similar talks

Dov Isaacs
Principal Scientist at Adobe
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free
Aleksy Jones
Head Of Core Systems at PDFTron
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free
Aleksy Jones
Head Of Core Systems at PDFTron
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free

Buy this video

Video
Access to the talk “I didn't know PDF could do that!”
Available
In cart
Free
Free
Free
Free
Free
Free
Free
Free

Conference Cast

With ConferenceCast.tv, you get access to our library of the world's best conference talks.

Conference Cast
839 conferences
34097 speakers
12891 hours of content