t5-vae-wiki / merge_datasets.py
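import datasets
import pandas as pd

# Read each of the 10 segment Arrow files and convert it to a DataFrame.
dfs = []
for i in range(10):
    dfs.append(
        datasets.ArrowReader.read_table(f'segment_{i}/dataset.arrow').to_pandas()
    )

# Concatenate the segments under a fresh 0..n-1 index and write one CSV.
# Note: to_csv writes that index as the first, unnamed column by default.
full_df = pd.concat(dfs, ignore_index=True)
full_df.to_csv('dataset.csv')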