About the Document data extraction and manipulation in Studio Web training

This course is designed to equip learners with the skills and knowledge required to automate workflows involving document data extraction and manipulation Through this course, you’ll understand the different types of PDFs—native PDFs and image-based PDFs—and how to handle each type effectively using Studio Web. You’ll learn how to configure and use key activities like Extract PDF Text and Extract Document Data to extract data such as text, tables, and fields from documents. Additionally, you’ll be introduced to activities such as Write Cell, Write Range, and Read Range to process and structure the extracted data, preparing it for use in reports, Excel files, or other applications.

The course also focuses on a practical use case. You’ll implement an end-to-end automation workflow that extracts data from scanned PDFs and populates Excel workbooks in Studio Web. Along the way, you’ll explore real-world scenarios, such as processing invoices, contracts, and forms, to apply the concepts learned and solve common business problems.

Learning prerequisites

To learn the fundamentals of Studio Web and how to build automation workflows, we would recommend you start with the following and then pursue this course:

Build your first automation in Studio Web.

Repetitive and rule-based tasks in Studio Web.

Data validation and processing in Studio Web.

Email and communication management in Studio Web.

Audience

The Document data extraction and manipulation course is perfect for a wide range of users, from beginners with little to no coding experience to experienced professionals, business users, and citizen developers.

Agenda

The full agenda covers:

Differences between native and scanned PDFs and their respective data extraction methods.

Learn to configure and use Extract PDF Text and Extract Document Data activities for efficient data retrieval.

Process and organize extracted data using Write Cell, Write Range, and Read Range activities.

Implement an end-to-end automation use case to solve real-world business problems.

Learning objectives

At the end of the Document data extraction and manipulation in Studio Web course, you should be able to:

Differentiate between native PDFs and image-based PDFs and identify appropriate data extraction methods for each.

Use and configure Extract PDF Text and Extract Document Data activities to extract relevant data.

Process and structure the extracted data for further use in workflows.

Implement a complete use case that extracts data from PDFs and populates workbooks in Studio Web.

Apply the skills learned to solve common business problems, such as processing invoices, contracts, or forms.

Document data extraction and manipulation in Studio Web

About the Document data extraction and manipulation in Studio Web training

Learning prerequisites

Audience

Agenda

Learning objectives