wep

css grep'ing responses using goquery and playwright. works by going to url, waiting for network to idle, then extracting css-selector query content from page.

install with : go install github.com/reallygoodprogrammer/wep@latest

examples:

# extract all div elements from site.com
wep -u "https://site.com" div

# extract all text within a-tag elements containing an href link
# that are children of span elements within div elements with class 
# 'content'
wep -u "https://site.com" "div.content > span > a[href]"

# extract all inner content from h1 elements with class 'title'
# and p elements with class 'post' from urls in urls.txt file
wep "h1.title, p.post" < urls.txt

# extract all src attribute values from img elements
wep -a src img < urls.txt

Usage of wep:
  -a string
    	extract from attribute instead of inner content
  -c int
    	concurrency level (default=3) (default 3)
  -headless
    	run in headless mode
  -l string
    	read from local file path instead of making a request
  -s	read html data from standard input
  -t float
    	timeout for requests (default=10) (default 10)
  -u string
    	site url for request

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

wep

examples:

About

Uh oh!

Releases

Packages

Languages

reallygoodprogrammer/wep

Folders and files

Latest commit

History

Repository files navigation

wep

examples:

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages