thread-keeper-automator

Automated Twitter URL processing for the excellent thread-keeper from the Harvard Library Innovation Laboratory

  1. Deploy the thread-keeper to your server
  2. Download an archive of your Twitter account
  3. Change the extension of the /twitter-xxxx-xxxx/data/tweets.js file from .js to .json and validate it (see the sketch after the JSON excerpt below)
  4. Extract the .tweet.id and .tweet.created_at from the file:
  {
    "tweet" : {
      "edit_info" : {
        "initial" : {
          "editTweetIds" : [
            "1593006617125326848"
          ],
          "editableUntil" : "2022-11-16T22:52:28.811Z",
          "editsRemaining" : "5",
          "isEditEligible" : false
        }
      },
      "retweeted" : false,
      "source" : "<a href=\"https://mobile.twitter.com\" rel=\"nofollow\">Twitter Web App</a>",
      "entities" : {
        "hashtags" : [ ],
        "symbols" : [ ],
        "user_mentions" : [
          {
            "name" : "CRKN RCDR",
            "screen_name" : "CRKN_RCDR",
            "indices" : [
              "3",
              "13"
            ],
            "id_str" : "813991658",
            "id" : "813991658"
          },
          {
            "name" : "McGill Library",
            "screen_name" : "McGillLib",
            "indices" : [
              "36",
              "46"
            ],
            "id_str" : "21223663",
            "id" : "21223663"
          }
        ],
        "urls" : [ ]
      },
      "display_text_range" : [
        "0",
        "140"
      ],
      "favorite_count" : "0",
      "id_str" : "1593006617125326848",
      "truncated" : false,
      "retweet_count" : "0",
      "id" : "1593006617125326848",
      "created_at" : "Wed Nov 16 22:22:28 +0000 2022",
      "favorited" : false,
      "full_text" : "RT @CRKN_RCDR: 🗺In partnership with @McGillLib, we have added approximately 22,000 digitized Canadian maps to the Canadiana Collection. It’…",
      "lang" : "en"
    }
  },
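
In the Twitter archive, data/tweets.js usually begins with a JavaScript assignment (something like window.YTD.tweets.part0 = ) rather than bare JSON, so renaming the file alone may not be enough. A minimal sketch for step 3, treating that prefix as an assumption about your particular archive:

sed 's/^window.YTD.tweets.part0 = //' tweets.js > tweets.json

jq empty tweets.json

The jq empty call is just a validity check: it prints nothing if the file parses as JSON and an error otherwise.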

Extraction example using jq:

jq '.[].tweet.id' tweets.json > tweetsID.json

jq '.[].tweet.created_at' tweets.json > tweetsdates.json

and combine the two outputs into a CSV . . .
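
If you prefer to skip the intermediate files, jq's @csv filter can write both fields to a CSV in one pass (the output filename here is just an example):

jq -r '.[].tweet | [.id, .created_at] | @csv' tweets.json > tweets.csv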

  5. Clean up the .csv (see example.csv with added columns for date sorting). I split the CSV into multiple sheets and pulled tweets by year (a command-line alternative is sketched below this list).

  6. Put the .csv in the same directory as the Python script.

  7. Execute the script:

python ffHead.py (GUI) or python ffHeadless.py (headless)
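
For the by-year split mentioned in step 5, a plain grep on the year inside created_at is a low-tech alternative to spreadsheet sheets (assuming a CSV like the one produced above; repeat per year):

grep '+0000 2022' tweets.csv > tweets-2022.csv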

  8. Combine the PDFs. I used Ghostscript here:

gs -q -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -sOutputFile=COMBINEDPDFS.pdf *.pdf

(Combining files will likely break the signatures applied to each PDF) . . . this is something to be explored further.

  9. Sign the new combined document (if you like). I have been using open-pdf-sign.
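
A sketch of what an open-pdf-sign run might look like; the jar name, flags, and certificate files below are assumptions, so check the open-pdf-sign documentation for the exact options:

java -jar open-pdf-sign.jar --input COMBINEDPDFS.pdf --output COMBINEDPDFS-signed.pdf --certificate certificate.crt --key key.pem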
