RandomWord
The RandomWord transformer populates specified database columns with random words. Ideal for simulating textual content, enhancing linguistic datasets, or anonymizing text in databases.
Parameters
Name | Description | Default | Required | Supported DB types |
---|---|---|---|---|
column | The name of the column to be affected | | Yes | text, varchar |
keep_null | Indicates whether NULL values should be preserved | false | No | - |
Description
The RandomWord transformer replaces the values of a designated database column with randomly generated words. It is particularly useful for creating text datasets for development, testing, or educational purposes. No language needs to be specified, which keeps the transformer versatile and easy to use.
Example: Populate random words for the content table
This example demonstrates configuring the RandomWord transformer to populate the tag column in the content table with random words. It is a straightforward approach to adding varied textual data for tagging or content categorization.
- schema: "public"
  name: "content"
  transformers:
    - name: "RandomWord"
      params:
        column: "tag"
        keep_null: false
In this setup, the tag column will be updated with random words for each entry, replacing any existing non-NULL values. If keep_null is set to true, existing NULL values in the column will remain unchanged, maintaining data integrity for records where textual data is not applicable.
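To illustrate the keep_null behavior described above, the following sketch shows a variant of the same configuration with keep_null set to true. It reuses the schema, table, and column names from the example and changes only that one parameter. The comments reflect the behavior stated in this section rather than independently verified output.

```yaml
- schema: "public"
  name: "content"
  transformers:
    - name: "RandomWord"
      params:
        column: "tag"
        # Assumption based on the parameter description above:
        # rows where tag is NULL are left as NULL,
        # while non-NULL values are replaced with random words.
        keep_null: true
```

This variant may be preferable when NULL carries meaning in the tag column, for example when it marks records that were never tagged.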