IFEval
get_input_data_model()
get_output_data_model()
iterable_dataset(repeat=1, batch_size=1, limit=None, split='train')
Streaming dataset for RL-style training.
Returns:
| Type | Description |
|---|---|
HuggingFaceDataset
|
A streaming, iterable dataset. |
Source code in synalinks/src/datasets/built_in/ifeval.py
load_data(validation_split=0.2)
Load IFEval (Instruction-Following Eval).
HF ships only a train split (~541 prompts), so we split it
deterministically into train / test. The benchmark is rule-based;
the gold response is the prompt itself, intended to be used
with an LM-as-judge reward.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
validation_split
|
float
|
Fraction held out for evaluation
(default |
0.2
|
Returns:
| Type | Description |
|---|---|
tuple
|
|