Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Resources should be future-proofed #3

Open
1 of 2 tasks
notpresident35 opened this issue Sep 5, 2022 · 2 comments
Open
1 of 2 tasks

Resources should be future-proofed #3

notpresident35 opened this issue Sep 5, 2022 · 2 comments
Labels
good first issue Good for newcomers help wanted Extra attention is needed

Comments

@notpresident35
Copy link
Owner

notpresident35 commented Sep 5, 2022

  • - Several resources are twitter threads, which are unstable for a whole host of reasons (tweets can be deleted, accounts get suspended or deleted all the time, etc). For longevity, extract all links and any relevant text from these threads and store them in an archival-friendly format.

  • - Several resources are youtube playlists, which are unstable for similar reasons to twitter threads. For longevity, extract all video links from each youtube playlist to a separate file (preferably markdown) and link them beneath the playlist, just in case a playlist gets lost. A tutorial on how to extract a youtube playlist: https://dtomoffcpa.medium.com/youtube-playlist-to-linked-list-in-excel-why-not-3a96297e980c

The internet archive is your best friend and is a great place to start, though it is not a silver bullet as sub-links are sometimes missed in the archival process.

@notpresident35 notpresident35 changed the title Twitter threads should be archived Resources should be future-proofed Sep 5, 2022
@notpresident35 notpresident35 added help wanted Extra attention is needed good first issue Good for newcomers labels Sep 5, 2022
@Arzenar
Copy link

Arzenar commented Sep 8, 2022

The simplest method, which is also most likely used like this in scientific papers, is to save Twitter threads via PDF. AFAIK, there is no external tool for this, but there are Twitter bots like "unrollthread" or "threadreaderapp" that do this job. Maybe one of them will help.

Of course, all linked web pages should also be saved as PDFs in case they are sooner or later no longer operated, the domain expires or are purged.

It would be extremely helpful if a page or thread is deleted, it remains as a PDF here in this repository. But I have no idea if that would be legally okay.

Helpful links:
https://techpp.com/2021/10/27/how-to-save-twitter-threads-guide/
https://www.businessinsider.com/guides/tech/how-to-save-a-webpage-as-a-pdf-on-windows

@notpresident35
Copy link
Owner Author

Good lead - printing the output from threadreader as a pdf is fitting for archival purposes. Archived a thread for testing purposes; this will do perfectly for improving the longevity of this resource.

notpresident35 added a commit that referenced this issue Sep 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants