servlet: wayback machine #6

Open
wants to merge 5 commits into
base: main

evacchi
Contributor

@evacchi evacchi commented Dec 13, 2024

A simple tool to query the archive.org Wayback Machine using the API described at https://archive.org/help/wayback_api.php.

Requires domain access on archive.org.
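
For reference, the availability lookup documented on that page can be sketched roughly as follows. This is an illustrative sketch only, not the servlet's actual code: the helper names `build_availability_url` and `closest_snapshot` are hypothetical, and the HTTP call itself is left out since in the servlet it would go through the host's allowed-domain mechanism.

```python
import json
from typing import Optional
from urllib.parse import urlencode

# Endpoint documented at https://archive.org/help/wayback_api.php
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def build_availability_url(url: str, timestamp: Optional[str] = None) -> str:
    """Build the GET URL for the availability API (hypothetical helper).

    `timestamp` is a prefix of YYYYMMDDhhmmss, e.g. "20140101".
    """
    params = {"url": url}
    if timestamp:
        params["timestamp"] = timestamp
    return f"{AVAILABILITY_ENDPOINT}?{urlencode(params)}"


def closest_snapshot(response_body: str) -> Optional[dict]:
    """Return the `closest` snapshot entry from a JSON response, if any."""
    data = json.loads(response_body)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest
    return None


if __name__ == "__main__":
    print(build_availability_url("theonion.com", "20140101"))
```

The `url` field of the returned snapshot points at web.archive.org, which is why the example below also needs domain access there for the fetch tool.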

Example

Combined with the fetch tool, with domain access on web.archive.org:

– what did the Onion home page look like on the closest date to 2014?

Based on the Wayback Machine snapshot from January 1, 2014, The Onion's homepage featured several notable headlines and sections:
Top Stories:

"The Onion's Tips For Hosting A New Year's Eve Party"
"Cousins Meaner This Year"
"Relatives Gather From Across The Country To Stare Into Screens Together"

Other Key Sections:

A "Our Annual Year: 2013" retrospective section
Commentary section featuring articles like "What Do You Get For The Man Who Already Has The Complete Fleetwood Mac Discography?"
American Voices section discussing "2013 In Entertainment"
News sections covering Politics, Local, Entertainment, Business
Video content including "The Onion Looks Back At 'It's A Wonderful Life'" review

The site had a classic news website layout with multiple columns featuring different types of content, including news articles, commentary, videos, and special features. The design included social media links, a newsletter signup, and navigation menus for different content sections.
Would you like me to pull out any specific details about a particular section or article from this snapshot?

Signed-off-by: Edoardo Vacchi <[email protected]>
@nilslice
Member

nilslice commented Jan 23, 2025

Should we get this merged up & published?

@evacchi
Contributor Author

evacchi commented Jan 24, 2025

I suppose it doesn't do much, so we can also close it. I was thinking that, for these GET-oriented APIs, it would be cool if we could "specialize" the fetch servlet instead: provide a way to configure a fetch servlet as an additional tool, instructed with how the GET URL should be constructed (basically what this PR does), like I did for the task with Unsplash. For example, you tell the LLM "example.com/$somekeyword where $somekeyword is..." and/or transform it into a schema.
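
A rough sketch of what that "specialized fetch" idea could look like, assuming a `$name` placeholder syntax in the URL template. Nothing here is an existing servlet API: `template_to_schema` and `render_url` are hypothetical names, and the generated schema is just one plausible shape for a tool input.

```python
import re
from urllib.parse import quote

# Assumed placeholder syntax: $name, as in "example.com/$somekeyword"
PLACEHOLDER = re.compile(r"\$([A-Za-z_][A-Za-z0-9_]*)")


def template_to_schema(template: str, descriptions: dict) -> dict:
    """Derive a JSON-Schema-like input description from a URL template."""
    # dict.fromkeys de-duplicates while keeping first-seen order
    names = list(dict.fromkeys(PLACEHOLDER.findall(template)))
    return {
        "type": "object",
        "properties": {
            name: {"type": "string", "description": descriptions.get(name, "")}
            for name in names
        },
        "required": names,
    }


def render_url(template: str, args: dict) -> str:
    """Substitute percent-encoded argument values into the template."""
    return PLACEHOLDER.sub(
        lambda m: quote(str(args[m.group(1)]), safe=""), template
    )


if __name__ == "__main__":
    template = "https://example.com/$somekeyword"
    print(template_to_schema(template, {"somekeyword": "a search keyword"}))
    print(render_url(template, {"somekeyword": "new york"}))
```

The schema would be what the LLM sees as the tool's input, and the rendered URL is what gets handed to the plain fetch servlet, so a tool like this PR's reduces to a template plus placeholder descriptions.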
