Gather codefags

Name
Email
Subject
Comment *
File	Select/drop/paste files here
Password	(Randomized for file and post deletion; you may also set your own.)
* = required field	[▶ Show post options & limits] Confused? See the FAQ.

Embed	(replaces files and can be used instead)
Options	Do not bump (you can also write sage in the email field) Spoiler images (this replaces the thumbnails of your images with question marks)
Allowed file types:jpg, jpeg, gif, png, webm, mp4, pdf Max filesize is 16 MB. Max image dimensions are 15000 x 15000. You may upload 5 per post.

File (hide): 49d469eed840973⋯.gif (606.65 KB, 800x792, 100:99, pepe_matrix.gif) (h) (u)

▶Gather codefags Anonymous 07/18/18 (Wed) 01:04:13 fa4f25 (5) No.2193326 [Watch Thread][Show All Posts]

I discovered that the [Quest for Searchability] thread slid off the catalog today and decided to bake a new thread with a more general codefag focus. I've got a few things I've been thinking about and I've realized I'm probably ready for some help. I'll post more about what I'm thinking later.

Here's the current bread search links that are functional:

- https://www.resignation.info/scripts/8chan/search.php

- q-questions.info/research-tool.php

- qanon.news/archives

Gather brother codefags!

▶Anonymous 07/18/18 (Wed) 02:13:33 fa4f25 (5) No.2194137>>2194480 >>2196679

File (hide): 72ac2463299f859⋯.jpg (128.72 KB, 515x414, 515:414, Slap-Fight-IPA.jpg) (h) (u)

File (hide): 88edc3c0a7e6fa3⋯.png (85.79 KB, 644x404, 161:101, ClipboardImage.png) (h) (u)

File (hide): 08b4e0e7aba612d⋯.gif (130.52 KB, 220x268, 55:67, doit.gif) (h) (u)

mmmbeer

So here's da ting. I was looking at my archives today trying to speed it up, and was checking here when I noticed the Quest for Searchability thread slid off. I looked at my catalog and behold, I didn't have it.

Nice.

I traced the problem down and fixed it but, it was however, too late for the Searchability thread on my end. <slap/> Simple bug and I wasn't paying attention. Maybe germanarchiveanon has it. Maybe nobody cares. If was moving kinda slow anyways.

I think that I've just about got it wired in now.

- Improved thread targeting system, and automated breads selection to shave the ~9min scrape back down to about ~2min.

- Targeted a bunch of threads, including the notables archives from COMMS

- Added handy shading to the archives that contain Q posts

- New JSON links from the archives direct with no messy api fumblage.

So I've been thinking.

There is an immense amount of data right at our fingertips if only we could bend it to our wills. How do we take the data that we have accumulated and put it to it's best use - together? No way one person can sort thru all the data. You would need a team or software. Ideally both… and that's where (you) come in.

We need to build the platform to spread the message.

How?

▶Anonymous 07/18/18 (Wed) 02:48:04 fa4f25 (5) No.2194480>>2199358

File (hide): 81a2fddfd96d650⋯.png (37.22 KB, 274x275, 274:275, ClipboardImage.png) (h) (u)

File (hide): 67eeef35c886bb7⋯.png (515.01 KB, 887x800, 887:800, ClipboardImage.png) (h) (u)

>>2194137

I've been thinking about this.

Codeflower visualization

https://www.redotheweb.com/CodeFlower/

Plan is to run a worker process on the server to generate this thing. I'm looking to show the links between breads. With so many links and breads it may turn into mayhem.

Probably look something more the second image

https://ramblings.mcpher.com/Home/excelquirks/gassites/d3nodefocus

The search thing here bothers me.

I've been thinking about running a worker thread on the server to create a search index. Who knows how big that would be. The bottom line is that the search is only as up to date as your index. So adding 1 and regenerating the index is no good. It should probably be a daily thing. Who knows how big that would be. It's way too big to use Lunr for I think.

I guess now I've got to find out.

▶Anonymous 07/18/18 (Wed) 06:40:08 6a62a7 (1) No.2196679>>2198748

>>2194137

so no laughing at me cause I'm no digageek BUT could you create a 'search box' to cover all posts relating to the topic searched from the scope of all the links containing Q stuff, crumbs, map, research, wherever any & all the gathered information has been saved. So a search for "Brennan" would look in those places, hit on whatever sources include his name; video, Q proofs, research with sauce, information gathered on him designated to links for notables, or within different loaves etc. Seems to me that would be quickest and links to any further info can be expanded without losing the search parameters. My experience is (unpracticed for a few years) but EXCEL spreadsheet idea. Like I said I'm not a program writer but I have received bonuses for logical thinking. K.I.S.S. theory mostly, I recognize not everything is that simple though. thinking about it, all the names and topics have been gathered pretty much in one place they're sorted into eras, operations, connections, the links the search needs to cover, are all established.

hope this makes cents imma couple a puffs into next years 420

▶Anonymous 07/18/18 (Wed) 14:09:29 fa4f25 (5) No.2198748

>>2196679

Search relevancy is the secret sauce behind a search. It's what make some better than others.

Have tried any of the search tools linked above?

Here's what I discovered working on the search. It was all in the old bread search thread.

On my site the current archive search covers everything on my site and uses the googl algos to determine relevancy. You could read up on how everybody thinks is done, but goog never says. It returns reasonably accurate results, but only as full breads, not as individual posts.

The Q posts search on my site uses a local install of Lunr. It searches ONLY the q drops and can be tweaked to provide better results. The Lunr search is straight text, no images/image names etc. Post name, Subject, Dates, trips, and post text are all included in the index. Search results are returned as individual posts.

I was trying to build out a search that could be local to my archive, but with tweakable results returned in individual posts. In doing that I ran into several problems:

1) The index MUST be prebuilt. My archive has nearly 3000 breads. Due to the way the archive is pure json (no current database), each bread must be read in. My tests showed that it took about 5-10 minutes to read everything in depending on system load.

2) Search results from the engines I looked into (Lunr, Solr, elasticsearch, Sphinx, Lucerne etc) all return results with the ID of the result + rank, not the actual result. You have to code it to return the results themselves. This opened up other issues in using the API for this. 5000 resul post id's would ultimately be bassed via the querystring into the API. Not possible. The querystring can't be that long. Fail.

3) There is a limit in the size of the json data than can be effectively sent to a client for js use. 10MB seems to be the upper limit without killing off the browser. 3000 breads * 751 posts = 2,253,000 total posts. I figure 10MB of results is about 5,000 results avg. The current breadarchive.json is about 10MB due to json being a text based key/value object format. You can see it in action with this search on 'awan'

qanon.news/api/smash?search=awan&xml=true Result is in xml for magical readibility. This is a stright text search of the text node only, no Lunr, no database.

4) A javascript search is therefore never going to work because there is simply too much data. (Lunr is out) The search must be done with a better server based technology (Solr, elasticsearch, Sphinx, Lucerne etc). This issue then is having to install it as a service which requires dedicated hosting which comes with a more substantial pricetag. $30-50/month. My current hosting cost me $12/year. Unfortunately I'm not richanon.

5) Getting everything into a database and using plain text search would move us towards a better solution, but it has it's own issues. DB size, DB platform, and result accuracy. I'm pretty sure the other 2 searches linked above are using this technology. The big search engines will all plug into pretty much any database available or even json.

Everything indicates we've got to use a real search engine to do what we want.

So all this leads me to a couple different conclusions. Yeah I can do a search using a big engine - locally.

So for me I've got 2 options. I can set up to take donations/ads and try and raise enough money to finance the hosting needed to provide the search.

Anons can set up sacrificial lamb server(s) on their home network available to the internet that has one of the big engines on it. Several of the big engines support distributed type topology so we could have say 5 different search servers to provide redundancy/speed. 1 is always a single point of failure. The price is right, but it does come with some risks.

Until I can resolve this engine/hosting issue it's a no go for me. As usual it boils down to money. I've got ideas, know-how and time, but not much in the way of financial wealth.

I think I follow what you are asking about, but tell me more. I'm open to all ideas.

▶Anonymous 07/18/18 (Wed) 15:29:36 fa4f25 (5) No.2199358

File (hide): b663a6e6b1414ab⋯.png (418.39 KB, 954x972, 53:54, ClipboardImage.png) (h) (u)

File (hide): d6582b19f2f64a4⋯.png (289.43 KB, 960x500, 48:25, ClipboardImage.png) (h) (u)

>>2194480

What about something like this for the notables?

https://bl.ocks.org/mbostock/4063550

Radial Tidy Tree

https://bl.ocks.org/mbostock/3cba6ac2fac09e5483bf6c1fade733be

Interesting lightning bolt idea.

https://bl.ocks.org/mbostock/e6f9e160585c153fa5ec543bd12b81e9

This tree of life thing looks cool

https://bl.ocks.org/mbostock/c034d66572fd6bd6815a

>>>/comms/1536

Having the links here like this is rich data trolling ground. I figure the bakers have already sorted thru the shitposting clownery, and presented us with a tidy archive of links. Crawl those up and down. Find common words… Seems like it would get us a pretty long way.

Who wants to try?

>>>/comms/1536

qanon.news/api/bread/1536

/qresearch/ - Q Research Board★

First time on QResearch? 8chan? Click here, newfag.

General

WebM

Theme

User JS

Do not paste code here unless you absolutely trust the source or have read it yourself!

Favorites

Customize Formatting

Filters