Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision

Jinyan Su*, Peilin Yu*, Jieyu Zhang, Stephen H. Bach (*: Co-first)

This repository host the code for the preliminary experiments in ENLSP-23 Workshop Paper "Structure Discovery in Prompted Weak Superviison" and full experiments in IEEE BigData 23 Paper "Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision". This paper explores a modular approach to leverage Large Language Models to obtain structural information for prompted weak supervision setup.

Citation

If you find our work helpful, please consider citing the following paper:

@misc{
    syzb2023leverage,
    title={Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision}, 
    author={Jinyan Su and Peilin Yu and Jieyu Zhang and Stephen H. Bach},
    year={2023},
    booktitle = {IEEE BigData}, 
}

Instructions

Create environment and run experiments

conda create --name WS_env python=3.8 
conda activate WS_env
pip install -r requirements.txt

bash run.sh # run bash file

Dataset Source

Name	Task	# class	# Prompted LFs	# train	# validation	# test	source
Spouse	relation classification	2	11	22254	2811	2701	Github repo of Snorkel tutorial
Youtube	spam clasification	2	10	1586	120	250	Google drive link shared in WRENCH benchmark
SMS	spam clasification	2	73	4571	500	500	Github repo of Snorkel tutorial

Embed the LFs

The embedding of the LFs are in the './data/' directory with the data, you can also get the embedding yourself with the following command:

cd get_LF_embedding/
bash run_embedding.sh

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.idea		.idea
assets		assets
data		data
get_LF_embedding		get_LF_embedding
utils		utils
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision

Citation

Instructions

Create environment and run experiments

Dataset Source

Embed the LFs

About

Releases

Packages

Languages

BatsResearch/su-bigdata23-code

Folders and files

Latest commit

History

Repository files navigation

Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision

Citation

Instructions

Create environment and run experiments

Dataset Source

Embed the LFs

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages