Data Farm Quick-Start: Transform data
Transform data
Once you have data access to a dataset, you can further explore the dataset by adding it to a workflow. The Redivis workflow interface allows you to develop scalable, reproducible, and self-documenting data pipelines as you filter, transform, and merge data from across Redivis.
➊ To begin, select the Analyze in workflow button. From the drop-down, you can add the dataset to an existing workflow, or create a new workflow.
➋ This will bring the dataset into the Redivis workflow interface. Select the dataset icon to view the tables in the dataset. You can select a table to view its variables and summary statistics. In this case, we will select the articles_1977 table.
➌ To manipulate the data, click the Transform node underneath the table icon. This will open a panel on the right side of the page where you can:
1) apply certain transformations, and
2) select variables to include in the output.
➍ In this example, I want to filter the data to only include articles from the Business section. For my future reference, I rename the transform "Filter by Business section" and add a Filter step.
➎ I specify the rules for my Filter step (section LIKE 'business'), and select four variables for inclusion in my output: section, title, authors and paragraphs. Then I select the Run button.
➏ My output appears as a new table in the visual query editor. I can also see more information about my output on the right side of the page.
- Create account
- Browse or search for data
- Apply for access to restricted data
- Transform data
- Analyze data with notebooks
Official Redivis Docs
See Transform Nodes for more information.
- Last Updated: Feb 10, 2025 4:59 PM
- URL: https://guides.library.stanford.edu/data-farm-quick-start
- Print Page