-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Technology Simply Explained #25
Comments
Step 1: Download text file. Remove leading spaces. Remove all blank lines. Remove intro Gutenberg info. Concat all lines. Force line breaks at every period. |
Sample text using the Linux
|
Some goals:
|
Kind of a great preface. Must be re-worked somehow. I'm thinking the formula can be:
|
looking for suggestions of how to generate diagrams or funny pictures. ideally, all images of obscure machines. |
Example output now. Should definitely add diagrams, and clean up design and maybe placement of illustrations.
|
Still polishing:
|
Project repo lives here. I'm actively working on it. |
I like this. Keep it up!
Well, since this has made me think about it... (1a) Patent applications often contain diagrams, and after time they fall into the public domain. Apparently NASA released some of these in 2016, but the link to their "searchable database" is broken. One problem remains: extracting the relevant images. I worked on a tool for this a couple of NaNoGenMo's back, using Python and pillow, but I don't recall how well I got it to work. (1b) Scans of old books about machines on e.g. archive.org would work just as well as patent applications. I haven't looked though. (2) You could also do a search for "machine" on one of those free vector clipart sites. For example: https://free-images.com/search/?q=machine&cat=vr . You definitely get mixed results that way. Which might actually be a plus. But it's hard to control the aesthetic. (3) And then there's graphviz. This is a command-line tool that takes a file in |
Could also check Flickr Commons of free images: For example searching British Library for "machine": |
Relevance of drawings in patent data can partially be resolved with careful
use of classification codes, which are hierarchical and can be extremely
specific. That said, predicting classification codes from word frequencies,
while trivial (it's basically just TFIDF), requires a lot of processing & a
lot of patent data (which may not be well-structured depending on where you
get it from), so I'd recommend starting off with classifications and
generating both text and images from them, if you're using patent data.
Structured patent data differentiates between line drawings & other kinds
of images. (Source: I work with patent data for my day job.)
…On Fri, Nov 5, 2021 at 5:21 AM Hugo van Kemenade ***@***.***> wrote:
Could also check Flickr Commons of free images:
- https://www.flickr.com/commons
For example searching British Library for "machine":
-
https://flickr.com/search/?user_id=12403504%40N02&view_all=1&text=machine
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#25 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AADXUGINEHEJEY2GTQ3N4VLUKOOZ3ANCNFSM5HFL4GXA>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
|
Thanks for all these tips, folks! I spent time going over them and appreciate your insights. Splitting the image generation into two goals:
For the diagrams, the graphviz idea is cool but i'm already familiar with Processing and kind of enjoying my random diagrams I've been making so far. So will keep doing that. Now for images I had trouble getting usable images from the NASA patents and for sure it's a bummer that only 5 years out their link to public domain'ed patents doesn't even work anymore and I can't find on their website. I guess I also need to decide if this will result in a printed book, and if so, maybe need higher quality images. I'm looking at the Flickr.com No Known Copyright Restrictions (some may be NSFW) and maybe images tagged something like industrial on Pexels. I might use that as a start and sub-select 500 to 1000 images, cull out duds, and run some filtering on selections, something like:
|
Ok, here's some simple code, whipped up as a test in a jiffy from previous code sketching, and example generated images. First 3 images use threshholding to convert to black and white, and latter two do not. Trying to decide if i like these or if they are too oblique. Perhaps I should also try using a bucket of machine parts like gears, ratchets, etc and then randomly assembling those together? might be the next try. |
Some updates. I now have a working book, but I think I want it to be a bit funnier while also making more logical sense. Here's some screenshots to show some pages. I think massaging the output will be the hardest. The first part was primarily a coding challenge, and now I get into the 'aesthetics' and form of the output in more detail. |
I have a complete beautiful book compiled. Two big changes have allowed me to finish up:
Yes, I finally installed pdflatex which I'd been avoiding for years because it has 546MB of dependencies, which seems insane since all it's supposed to do is add the ability to convert to pdf in pandoc. Half a gigabyte for that? Nuts. Aside from the fact that it's probably 520megabytes too big, I'm glad I finally installed it because it made a big difference. Using the markdown to pdf converter presents a beautiful looking book. Here is one example of compiled book of Technology Simply Explained (PDF). I'll probably make some more. And potentially do a print run with a nice looking cover. But I'll consider this complete now. |
Wow! This reminds me of undergraduate engineering labs where they give you some piece of equipment and the operation manual. The headings were hopeful, and I can see how useful this manual will be in the future. Great arrangement of informative diagrams near relevant instructions, and helpful caveats. Any engineering student would be lucky to have such a comprehensive tome. Makes a great gift for the person who's always fixing everyone else's things. I'm putting this right next to my Machinery's Handbook 👍 |
hahah thanks angela! |
A book of things that are quite complicated explained in simple terms. (just kidding, completely confusing)
Source text:
Gas and Oil Engines SIMPLY EXPLAINED
The text was updated successfully, but these errors were encountered: