Skip to content

Release data generation scripts of GraphGen, for generating QA datasets on Hugging Face #30

@NielsRogge

Description

@NielsRogge

Hi @ChenZiHong-Gavin 🤗

I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work on Arxiv and was wondering whether you would like to submit it to hf.co/papers to improve its discoverability. If you are one of the authors, you can submit it at https://huggingface.co/papers/submit.

The paper page lets people discuss about your paper and lets them find artifacts about it (your data generation scripts for creating datasets for QA tasks, for instance), you can also claim the paper as yours which will show up on your public profile at HF, add Github and project page URLs.

It'd be awesome to also release the data generation scripts/example scripts to make it easier for researchers to generate training data for their models with it. This would allow people to load your framework directly from 🤗 Datasets, so that people can do:

from datasets import load_dataset

dataset = load_dataset("your-hf-org-or-username/your-generation-script")

See here for a guide: https://huggingface.co/docs/datasets/loading.

Besides that, there's the dataset viewer which allows people to quickly explore the first few rows of the data in the browser.

Let me know if you're interested/need any help regarding this!

Cheers,

Niels
ML Engineer @ HF 🤗

Metadata

Metadata

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions