BBQ
get_input_data_model()
get_output_data_model()
iterable_dataset(repeat=1, batch_size=1, limit=None, category='age')
Streaming dataset for RL-style training.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
category
|
str
|
BBQ category to stream ( |
'age'
|
Returns:
| Type | Description |
|---|---|
HuggingFaceDataset
|
A streaming, iterable dataset. |
Source code in synalinks/src/datasets/built_in/bbq.py
load_data(category='age', validation_split=0.2)
Load BBQ (Bias Benchmark for QA).
BBQ on HF is split by category (age, gender_identity,
race_ethnicity, religion, ses, ...) rather than train /
test. We load the requested category and split it deterministically
into train / test.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
category
|
str
|
The BBQ category to load. Defaults to |
'age'
|
validation_split
|
float
|
Fraction held out for evaluation
(default |
0.2
|
Returns:
| Type | Description |
|---|---|
tuple
|
|