
/hydrus/ - Hydrus Network

Bug reports, feature requests, and other discussion for the hydrus network.

New user? Start here ---> http://hydrusnetwork.github.io/hydrus/

Current to-do list has: 1,182 items

Current big job: finishing login and domain managers and overhauling the downloader



277248 (4) No.7432

windows

zip: https://github.com/hydrusnetwork/hydrus/releases/download/v285/Hydrus.Network.285.-.Windows.-.Extract.only.zip

exe: https://github.com/hydrusnetwork/hydrus/releases/download/v285/Hydrus.Network.285.-.Windows.-.Installer.exe

os x

app: https://github.com/hydrusnetwork/hydrus/releases/download/v285/Hydrus.Network.285.-.OS.X.-.App.dmg

tar.gz: https://github.com/hydrusnetwork/hydrus/releases/download/v285/Hydrus.Network.285.-.OS.X.-.Extract.only.tar.gz

linux

tar.gz: https://github.com/hydrusnetwork/hydrus/releases/download/v285/Hydrus.Network.285.-.Linux.-.Executable.tar.gz

source

tar.gz: https://github.com/hydrusnetwork/hydrus/archive/v285.tar.gz

I had a great week. There are several fixes and improvements, and the local booru is working again.

network menu and other highlights

I have created a new 'network' menu in the main gui window. It takes possession of several things that used to be under 'services' (including 'manage subscriptions') and will be the new home for the various gui related to the ongoing downloader overhaul.

The edit subscription panel now has a 'paste queries' button that lets you add queries en masse from a newline-separated list on the clipboard.
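As a rough illustration of what pasting a newline-separated list involves, here is a minimal Python sketch. The function name `parse_pasted_queries` and the de-duplication behaviour are assumptions made for the example, not the client's actual code.

```python
def parse_pasted_queries(clipboard_text, existing=()):
    """Split newline-separated text into new, de-duplicated query strings."""
    seen = set(existing)
    new_queries = []
    for line in clipboard_text.splitlines():
        query = line.strip()
        if not query or query in seen:
            continue  # skip blank lines and queries we already have
        seen.add(query)
        new_queries.append(query)
    return new_queries


pasted = "blue eyes\n\n  red hair  \nblue eyes\r\nsamus aran"
print(parse_pasted_queries(pasted, existing=["samus aran"]))
# -> ['blue eyes', 'red hair']
```

Blank lines, surrounding whitespace, and repeats are dropped so a messy clipboard still yields a clean list.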

You can now set 'manage_file_urls' and 'get_similar_to_x' (for similar file search) shortcut actions under the 'media' shortcut set.

An issue where a December->January bandwidth calculation was giving errors should be fixed!
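The post does not say exactly what went wrong, but a common cause of this kind of error is computing "next month" as `month + 1`, which gives month 13 in December. A minimal Python sketch of a rollover-safe calculation, with hypothetical function names:

```python
import datetime


def start_of_next_month(now):
    """Return midnight on the first day of the month after `now`."""
    if now.month == 12:
        # December rolls over into January of the following year
        return datetime.datetime(now.year + 1, 1, 1)
    return datetime.datetime(now.year, now.month + 1, 1)


def seconds_until_next_month(now):
    """Seconds left in the current month, e.g. for a monthly bandwidth estimate."""
    return (start_of_next_month(now) - now).total_seconds()


print(start_of_next_month(datetime.datetime(2017, 12, 10, 16, 59)))
# -> 2018-01-01 00:00:00
```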

As some people are still getting hit by the old listctrl sort crash bug, I've turned off column sorting on all the old listctrls. Please feel free to now try to sort listctrls as much as you like--if the control is the new one, it will sort correctly, and if it isn't, nothing will happen. I will continue to update the last instances of the old control in the future. There are about 15 left to do now.

local booru

The local booru finally has its manage and review services panels back! These were replaced with placeholders in a big service rewrite a while ago, and it has taken a while to catch up on this cleanup job. They work essentially as they used to, but I updated a couple of things, like their buttons work a little better, and there's a little status text to tell you if the service isn't currently running. I apologise for the delay.

The local booru basically works as it used to before the change. This was always an experimental thing, but if you were ever into it, please give it another go and let me know your thoughts about possible features for future iterations. I expect a future version of this could be much more dynamic and permit searches and other high-CPU browsing and so on, perhaps in tandem with a broader client-API.

url class stuff

This is still only for advanced users.

As well as splitting the http header and url class panels apart into separate dialogs, I've added a new network->manage class links. It connects the defined url classes to other things. At the moment, it only allows you to show/hide different classes on the media viewer, but in future, it will connect them to known parsers on the new downloading engine.

Any url that is not matched to a url class by the domain manager will currently be shown in the media viewer by default, but I think I will add a bunch of url classes for the existing downloaders in the coming weeks and change this to be 'hide' by default.

full list

- added 'network' main gui menu and moved network stuff from services to it

- split the new domain manager stuff up into separate dialogs and menu entries on the new network menu

- manage url classes dialog now lists url type in the listctrl

- added url class links info (which will permit client-specific settings and downloader mappings for url classes) to the domain manager

- wrote a 'url class links' dialog and added it to the new network menu (only the 'display on media viewer' part works atm)

- the domain manager now filters urls on the media viewer depending on whether they have a url match and are set to display in the new links panel

- updated the local booru service code to the new service system

- the local booru's shares can be reviewed again under review services

- the local booru's port and bandwidth settings can be set again under manage services

- the different gui parts of the local booru are updated to new controls

- fixed a local booru 404 reporting error

- the edit subscription panel now has a 'paste queries' button that lets you add queries en masse

- added 'manage_file_urls' to shortcuts system

- added several 'get_similar_to_x' actions to the shortcuts system

- the manage upnp dialog now initialises its mappings on another thread and fails better when UPnP mappings cannot be fetched

- connection and readtimeout network exceptions are now recognised more specifically by the client. subscriptions will only delay for an hour on one of these exceptions

- improved the resilience of the HF login code after wake from sleep (when networking is often not available for a couple of seconds)

- like the recent subscription query randomisation update, subscriptions themselves are now synced in random order (this stops a subscription named 'aadvark' always getting first bite into available bandwidth)

- fixed import for jpegs that have unparsable exif data

- fixed a bug in 'next month' bandwidth estimate calculation when the month is December, wew

- fixed some logic that was setting max page at 165 rather than the intended 200 and added a dialog that even lets you engage the debug override at 200 if you are insane

- all audio mime detection and duration parsing is now done through ffmpeg. hsaudiotag is no longer needed to run the program

- since the old listctrl sort crash bug is still hitting some people, I've disabled sort on the old listctrl class. feel free to try to sort any listctrls now, no matter your situation. I will continue replacing the old class with the new working class over time

- updated another listctrl

- a ton of misc controller options/manager access refactoring

- cleared out some old code

- moved Time controls to their own file and added velocity and checker options stuff as well

- wrote a new edit panel for single controls and updated the time delta button to use it

- misc refactoring
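The random-order subscription sync mentioned in the list above can be sketched in a few lines of Python. This is only an illustration; `sync_order` is a hypothetical name and the client's real scheduling code is not shown in this post.

```python
import random


def sync_order(subscription_names, rng=random):
    """Return the names in a fresh random order, leaving the input untouched."""
    order = list(subscription_names)
    rng.shuffle(order)
    return order


names = ["aadvark", "middle sub", "zebra"]
print(sync_order(names))  # order varies from run to run
```

Because the order changes on every sync, no single subscription is always first in line for the available bandwidth.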

next week

I am not sure. I plan to reserve a big wx update in the last two weeks of the year, so I only have two weeks left before the year is done. I'd like to have something neat ready in that time, so I'll review what I have and push either towards a simple working login system or maybe some first custom parsers, perhaps for the thread watcher. I'll also continue to push on small issues and improvements as normal.

Also, we should hit 🎉200 million mappings🎉 on the public tag repository this week. Congratulations, everyone! The PTR has grown far larger than I ever expected (those 200m mappings are for 2.2m different tags and 11m files), and I'm really happy with and thankful for everyone's contributions to this milestone. Things have been running a lot better since the round of 'compactions' that cut the size of client.mappings.db (we now run about 32 bytes per mapping including hash and tag definitions, which I am very happy with), and now we are seeing different pressures, particularly at the CPU and workflow level. In the coming year, I expect to add some sort of 'clearing out crap' mechanism to the tag repository code so we can more solidly fix mis-parsed junk and filter things you do not care about and reduce CPU load in accessing this very large store of tags.

If you looked at one tag every second for eight solid hours a day, it would take nineteen years to see every tag on the PTR!
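The figures above can be checked with some quick arithmetic in Python. The nineteen-year number works out when "tag" is read as one of the 200 million mappings (the 2.2m distinct tags alone would take under three months at the same pace).

```python
mappings = 200_000_000
bytes_per_mapping = 32

# Storage: about 32 bytes per mapping, including hash and tag definitions
total_gb = mappings * bytes_per_mapping / 10**9
print(f"{total_gb:.1f} GB")  # -> 6.4 GB

# Viewing time: one mapping per second, eight hours a day
seconds_per_day = 8 * 60 * 60
years = mappings / seconds_per_day / 365.25
print(f"{years:.1f} years")  # -> 19.0 years
```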

e18d5f (2) No.7437>>7474

>The edit subscription panel now has a 'paste queries' button that lets you add queries en masse from a newline-separated list on the clipboard.

Cool

Can you add that to the tag censorship/sibling/parent windows?


09dc9c (1) No.7440


I made an alternate Linux build for people having trouble with the official build. A few things are still weird, but overall it works better for me than the official. Run from source works slightly better, but pip is a headache. Built on Linux Mint 18, so it should also work on Ubuntu 16.04.

If you want open externally to work, you will have to manually set the open program for each mimetype in options->files and trash. For some reason, when Hydrus invokes xdg-open it opens a program in WINE.

https://www.mediafire.com/file/eo34b2yy1od465o/hydrus-285.tar.gz

I'll post one of these every week until the official Linux build works on my machine. If anything is broken, tell hydrus_dev (and thank him while you're at it). He was kind enough to give me his build command, but I don't know Python. If something is broken, I'm probably just as clueless as you are.


b6a9f5 (9) No.7442>>7443 >>7474

how do i update my version without fucking up and losing my database? its probably easier than i think but i dont want to fuck it up.

also hydev, do you know about a program called Visipics? its used to detect duplicate image files and is incredibly proficient at it


359fa6 (1) No.7443>>7444 >>7445

>>7442

Install/extract the new version into the same folder as your old version

Hydrus already has a better dupe finder than Visipics


b6a9f5 (9) No.7444>>7445

>>7443

>Hydrus already has a better dupe finder than Visipics

what about that warning of "not doing dupe finding with more than 100 pictures"? im still new to this and i don't want to freeze the client or something


b6a9f5 (9) No.7445>>7447

>>7444

>>7443

honestly i dont understand it, it only works when i try to find duplicates for a single specific file, if i try a big duplicate search in a group of files, it displays nothing


593892 (10) No.7447>>7448 >>7452

>>7445

There's no "or" yet in hydrus. When you search for more than one thing, it searches for files with all of those things.

If you search for files in common with a group of files, it only displays files that are duplicates of all those files.

There's a duplicate filter though, if you want to run it, just use the shortcut ctrl+t -> right arrow key -> left arrow key.
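The "all of those things" behaviour described above is a set intersection. A minimal Python sketch, using a made-up tag index and a hypothetical `search_all` helper rather than anything from the client itself:

```python
def search_all(index, tags):
    """Return the files that carry every tag in `tags` (AND semantics)."""
    tag_sets = [index.get(tag, set()) for tag in tags]
    if not tag_sets:
        return set()
    return set.intersection(*tag_sets)


# Hypothetical index: tag -> set of files carrying that tag
index = {
    "samus aran": {"a.png", "b.png", "c.png"},
    "metroid": {"b.png", "c.png"},
    "blue eyes": {"c.png", "d.png"},
}

print(sorted(search_all(index, ["samus aran", "metroid"])))
# -> ['b.png', 'c.png']
print(sorted(search_all(index, ["samus aran", "metroid", "blue eyes"])))
# -> ['c.png']
```

Each extra search term can only shrink the result, which is why a "similar to" search over a whole group of files tends to come back empty.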


b6a9f5 (9) No.7448>>7449 >>7450 >>7452

>>7447

i got that running but it doesn't show any duplicates whatsoever, what am i doing wrong here?


593892 (10) No.7449>>7450 >>7455

File: 7d1d8fb1433c34e⋯.png (81.18 KB, 1920x1080, 16:9, ClipboardImage.png)

>>7448

This page, right?

Set the search distance lower.

Personally, I like 14. It shows a bunch of unrelated pics, but it also gets the more different versions of the same pics. Though you don't have to go that low.
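To see why a larger search distance brings in more (and less related) pairs, here is a small Python sketch. It assumes the distance is a Hamming distance between fixed-length perceptual hashes, which this thread does not spell out; the 16-bit hash values and function names are invented for the example.

```python
def hamming_distance(hash_a, hash_b):
    """Count the bit positions where two integer hashes differ."""
    return bin(hash_a ^ hash_b).count("1")


def potential_pairs(hashes, max_distance):
    """Return every pair of names whose hashes differ by at most max_distance bits."""
    names = sorted(hashes)
    pairs = []
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            if hamming_distance(hashes[first], hashes[second]) <= max_distance:
                pairs.append((first, second))
    return pairs


# Made-up 16-bit hashes; real perceptual hashes are longer
hashes = {
    "original.png": 0b1111000011110000,
    "resized.png": 0b1111000011110001,  # 1 bit away from original.png
    "cropped.png": 0b1111000000001111,  # 8 bits away from original.png
}

print(potential_pairs(hashes, 0))  # -> []
print(potential_pairs(hashes, 1))  # -> [('original.png', 'resized.png')]
print(len(potential_pairs(hashes, 8)))  # -> 3
```

A distance of 0 only matches identical hashes, while each step up admits looser matches and, eventually, false positives.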


593892 (10) No.7450>>7455

>>7448

>>7449

Also, click

>show random pairs

To show a random group of similar pics

or

>launch filter

To start mass filtering them manually

The page doesn't show anything by itself.


b6a9f5 (9) No.7452>>7455

>>7448

>>7447

i set the range to "very similar", and i know for certain that i have quite a few dupes. then i did it again on distance 8 (it said speculative) and its still not showing me shit.

i mean i know im probably doing it wrong, but this is definitely not better than visipics for its intended purpose at least. I'm in no hurry though, the amount of dupes is too minimal to do anything on my available disk space.


593892 (10) No.7455>>7456

>>7452

Did it ever show anything? Try >>7450

It doesn't show anything by default until you choose one of the options.

Also, post a pic of the duplicates screen like >>7449 , I want to see if it says anything.


b6a9f5 (9) No.7456>>7457 >>7459

File: e5fb3b9e52883e4⋯.jpg (55.14 KB, 1007x609, 1007:609, Capture.JPG)

>>7455

here you go.

when i launch the filter it starts opening pairs of pictures that don't even look remotely similar, and thats not what i want. i want the groups of pictures the program considers dupes to appear in the window to the right side, if possible, like visipics does.

if its not available yet then its no problem, i'll just keep using visipics until it is, i just dont know if i'm doing it wrong or not.


593892 (10) No.7457>>7458

>>7456

>194.552 potential pairs

M8, that almost 200 potential duplicate pics.

>when i launch the filter it starts opening pairs of pictures that don't even look remotely similar, and thats not what i want.

If you want pictures that look more alike, set the distance to lower.

> i want the groups of pictures the program considers dupes to appear in the window to the right side, if possible, like visipics does.

You can't do that, only the filter. If you want it you could try suggesting it in the suggestion thread though. It sounds relatively easy to do, so devanon might do it.


593892 (10) No.7458

>>7457

>M8, that almost 200 potential duplicate pics.

*almost 200k potential duplicate pairs


593892 (10) No.7459>>7460

File: ca68042edcf4df2⋯.png (23.16 KB, 875x430, 175:86, Untitled.png)

>>7456

Also, don't forget to reset your queue.

I accidentally added 180k false positives to the queue, so it had a lot of dumb shit in it.

This is what I'm going to do now, to fix a couple mistakes.

>Click on the grind thing, then reset duplicates.

>Set distance to 0

>launch filter and go through the current queue

>increase distance by 1

>go through the filter

>keep increasing and going through it until you feel like there's no duplicates anymore

>???

>profit


b6a9f5 (9) No.7460>>7461

>>7459

thanks, but that sounds like a royal pain in the ass. im gonna stick to visipics

i only uploaded about 14k pictures so far, so its impossible to have that many dupes.


593892 (10) No.7461>>7462

>>7460

I just did that to test my threshold -seems it goes up to 8 before getting false positives-, you don't have to do that. You can just reset the duplicates and then go with a number like 4 for the distance.

If you have already tagged stuff, I recommend doing the hydrus filter. Alternatives get properly marked as so and if two pics are duplicates the one you choose gets the tags from both.

That said, there isn't much to gain if you haven't yet.


b6a9f5 (9) No.7462>>7463

>>7461

i dont archive pictures unless i have 2 or more tags, usually series and creator/character name is what i go with.

so yeah, i have a ton of tags, and i download stuff from e621 which brings like 30 tags per picture.

the dupe finder works very differently from what im used to, so i dunno, it just shows random pictures in each "pair" regardless of which distance i set, i've tried at 0, at 2, and at 8.


593892 (10) No.7463>>7464

File: ebb5710bf510bfe⋯.webm (523.49 KB, 1280x720, 16:9, a.webm)

>>7462

You need to reset the duplicates first.

It keeps the queue saved until you either go through it or reset it. If there's a lot of false positives, it's better to reset it

Change the distance to 0 and do this, it should work.


b6a9f5 (9) No.7464>>7465

File: 24f5652e5b1bb60⋯.jpg (107.41 KB, 1236x509, 1236:509, Capture.JPG)

>>7463

finally, its working now. it actually pops pictures i have more than 1 copy of in the big window to the side, just the way i wanted.

though it only does it 1 "set" of dupes at a time, its still pretty good.

thanks for your help anon, im gonna update to 285 as soon as i finish unfucking that 2400-pictures tumblr mess


593892 (10) No.7465>>7466

>>7464

>554 potential pairs

Yeah, it's working now.

I recommend setting it to 4 after that, then 8, to get the more different ones.

Also, I'd recommend using the filter, it's faster that way.


6e33bd (1) No.7466

>>7465

I've actually found that after 5, using the filter becomes more trouble than it's worth. When every image has 15 dupes it's picking up, it's far easier to highlight five at a time and manually select them all as non-duplicates at a glance, rather than at 2 or 3, where you do need to stare pretty carefully to see the precise difference between two images.


74278c (5) No.7467>>7474

Ok, So I realized through needing to do some things that my tag repositories have not been syncing for I want to say at least 3 months possibly more.

I remember why I stopped syncing in the first place now, this shit takes fucking forever.

So, a suggestion.

Like with import folders, is there any way you can have the tag repository syncing paused, and then manually start it up? I could see doing this when I plan on sleeping for 12+ hours thanks to some sleep aids, but holy shit, I have wanted to go through images for the last 4-6 hours but this shit is still syncing.

If I could start the process on my terms that would be greatly helpful.


277248 (4) No.7474

>>7437

Sure; I'll add it to the list. I'll try and write a 'wrapper' control that does the job well on its own, and then I can replace all the other 'textbox into list' instances across the program with a unified interface.

>>7442

Hey, I am sorry this has been confusing. It looks like these other Anons have helped, but you might like to check out my help here:

http://hydrusnetwork.github.io/hydrus/help/index.html

There is also a copy in the client, under help->help. It tells you how to update an install and then goes up to advanced topics like the duplicate filter. Let me know if you still find anything confusing.

>>7467

Please check the settings under file->options->maintenance and processing. If processing is slow on your machine, I recommend not allowing it to kick in during idle time. Either tell it to run only on shutdown for a certain period, or disable it entirely and then run it manually overnight or whenever is otherwise convenient. To run it manually, make sure 'advanced mode' is turned on under the help menu and then open up the tag repo on services->review services. It should have a 'process now' button that will start it whenever you like.

The PTR has grown and grown, and many users on HDDs are feeling the strain. Running on an SSD, if you have that available, typically runs much better. I did this a month or so ago and am very happy--I went from like 600 rows/s to 30,000.
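To put those two processing rates in perspective, here is some back-of-the-envelope Python. The 200 million row total is an assumption borrowed from the PTR mapping count in the release post; the real number of rows a client has to process may differ.

```python
hdd_rows_per_second = 600
ssd_rows_per_second = 30_000

speedup = ssd_rows_per_second / hdd_rows_per_second
print(f"speedup: {speedup:.0f}x")  # -> speedup: 50x

rows_to_process = 200_000_000  # assumption: one row per PTR mapping
hdd_hours = rows_to_process / hdd_rows_per_second / 3600
ssd_hours = rows_to_process / ssd_rows_per_second / 3600
print(f"HDD: {hdd_hours:.1f} hours, SSD: {ssd_hours:.1f} hours")
# -> HDD: 92.6 hours, SSD: 1.9 hours
```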

You might also like to check this page, which has some advice (like running defrags) on keeping the client running fast:

http://hydrusnetwork.github.io/hydrus/help/reducing_lag.html

It also talks about how to create 'profiles' that you can send to me to examine if anything is running unreasonably slowly.


74278c (5) No.7478>>7491

have a recurring issue with windows 7 where the ui hangs for a bit, and then the client get stuck in the forefront till I restart it. don't know if there is anything you can do on your end with this.


74278c (5) No.7479>>7485 >>7491

Just thought of a new feature

So I have been having issues with dealing with large pages of images, my largest being 46k images and a bane to my existence, i have gotten half way through it before, but a crash saw that endeavor killed off, so I split the tab into about 10 FAR more manageable parts.

so here is my thinking, is there a way to add a split section to when you right click on a tab, this would bring up 2 sub menus, one would be number of pages, and another would number of images, pages obviously going from 1-20 pages cant imagine anything that would need to be split up more than that even in my most fucked up pages, and images would go from 100-500 in 100 image increments, 500-2500 in 250 image increments and 3000-10000 in 1000 image increments

Even if it's just the easier pages implementation (at least I think pages would be easier) this could be very helpful to large image imports and the like where processing tons of files needs to happen but getting through pages is kind of hard due to the scope.

I would also recommend that this option automatically starts a page of pages for all the parts.
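The two proposed split modes (by number of pages, or by number of images per page) could look something like the Python sketch below. This only illustrates the suggestion and is not an existing client feature; all function names are hypothetical.

```python
def split_by_size(files, chunk_size):
    """Split into consecutive pages of at most chunk_size files each."""
    return [files[i:i + chunk_size] for i in range(0, len(files), chunk_size)]


def split_into_pages(files, page_count):
    """Split into page_count pages whose sizes differ by at most one."""
    base, extra = divmod(len(files), page_count)
    pages, start = [], 0
    for i in range(page_count):
        end = start + base + (1 if i < extra else 0)
        pages.append(files[start:end])
        start = end
    return pages


def chunk_size_menu():
    """Sizes from the proposal: 100s up to 500, 250s up to 2500, 1000s from 3000 to 10000."""
    return (list(range(100, 500, 100))
            + list(range(500, 2501, 250))
            + list(range(3000, 10001, 1000)))


big_page = list(range(46_000))
print([len(p) for p in split_into_pages(big_page, 10)])  # ten pages of 4600
print(len(split_by_size(big_page, 500)))  # -> 92
```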


3979f8 (1) No.7481>>7491

I'm getting this error when I try to download some files:


2017/12/10 16:59:05:
2017/12/10 16:59:05: Exception:
2017/12/10 16:59:05: TypeError: argument of type 'NoneType' is not iterable
  File "/usr/lib/python2.7/threading.py", line 774, in __bootstrap
    self.__bootstrap_inner()
  File "/usr/lib/python2.7/threading.py", line 801, in __bootstrap_inner
    self.run()
  File "/opt/hydrus/include/HydrusThreading.py", line 230, in run
    callable( *args, **kwargs )
  File "/opt/hydrus/include/ClientImporting.py", line 5533, in _THREADWorkOnThread
    self._CheckThread( page_key )
  File "/opt/hydrus/include/ClientImporting.py", line 5103, in _CheckThread
    HydrusData.PrintException( e )
  File "/opt/hydrus/include/HydrusData.py", line 931, in PrintException
    trace_list = traceback.format_stack()

I think I've got this in 284 version(I was using 283 for two weeks and everything was alright). I'm using aur arch linux build.


e18d5f (2) No.7485>>7488

>>7479

A "split one page into multiple pages" and "combine multiple pages into one page" option would be really good. Seconding this post.

"combine pages" could be used as an or-search if you open a search page for each tag then combine them


74278c (5) No.7488

>>7485

The combine is happening, but I think splitting may be easier than combining, so fingers crossed on that one happening sooner rather than later. It makes going through the tabs I split so much easier when it's smaller sections and not a big monolithic clusterfuck dump. That said, I'm at 161 tabs and 175 causes massive instability for me, so I've got to go through a fuck ton before I can split further. Some of the tabs are 4000~, which isn't much by my standards, but it's just so much easier to plow through 500~ image tabs that I think I'm going to end up moving everything to that size, so long as I'm able to, before I begin processing.


277248 (4) No.7491>>7494

File: 6377b4c7704eab2⋯.webm (2.03 MB, 432x768, 9:16, 6377b4c7704eab264d0ad71c9….webm)

>>7481

Thank you for this report. I am not sure what is happening here. This is a slightly rarer and older bit of error handling code, and it isn't telling me enough to know the precise problem. I have updated this for tomorrow's release. Please try it again in v286 when convenient and let me know the updated error you see--there should be an error trace as well as the stack trace here.
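For context on the "error trace as well as the stack trace" remark: the reported log ends in `traceback.format_stack()`, which describes where the reporting call was made, not where the exception was raised. The Python sketch below shows the difference using standard-library calls only. `check_thread` is a hypothetical stand-in for the failing code, and none of this is the client's actual fix.

```python
import traceback


def check_thread(thread_info):
    # Hypothetical stand-in for the failing check: `in` on None raises TypeError
    return "posts" in thread_info


def describe_current_exception():
    """Call from inside an except block; returns (stack_trace, error_trace)."""
    # Stack trace: the chain of calls leading to this reporting function
    stack_trace = "".join(traceback.format_stack())
    # Error trace: the frames where the exception was actually raised
    error_trace = traceback.format_exc()
    return stack_trace, error_trace


try:
    check_thread(None)
except TypeError:
    stack_trace, error_trace = describe_current_exception()

print(error_trace)
```

Only the error trace names `check_thread`, the function that actually failed, which is the detail missing from the report above.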

>>7478

Can you tell me a bit more information about your usage? Do you have many importers running, or otherwise a lot of pages open? What are your settings like under options->maintenance and processing?

>>7479

Aha, I now see you have some big pages open. This is likely part of your laggy gui. I recommend users with older computers try not to have more than about 10k files open at any one time.

A good way to go through new imports in smaller chunks is to set up a normal search page like this:

system:inbox

system:limit=200

This lets you find and deal with batches of imports in fast chunks, which keeps memory use low and the ui snappy.

If you want to process one particular job like this, try adding an 'explicit tag' as you import. Something like 'big download on 14th sept' to the 'local tags' tag domain (or just ctrl+a->f3 when the import is complete). You can then find these files again, again in batches of 200 or so, by doing the above search and just adding the 'big download…' tag.

That said, I do want to add more actions to improve 'processing' workflow at the ui level. I will keep the thought of 'split all this shit into smaller bits' in mind. There will be important new workflow-friendly session controls out tomorrow, which I am hoping to get feedback on so we can plan more of this in future.


74278c (5) No.7494>>7496

>>7491

as of right now

161 tabs, 250-300k images open across all of them, 1 experimental 'lets see how subscriptions work' sub, and a good deal (or at least there used to be a good deal) of 404 thread watchers. They were easy enough to process, so I made my way through them to free up some overhead.

last time I fresh booted, everything worked fine, no issues at all with importing or playback, but with this session, I get near constant hiccups, and I have 2 other sessions saved that are basically the same as this thanks to a massive import. ram use for the program right now sits between 8.5gb and 10gb.

Most of the problems I encounter are caused due to this session being out of hand.

On a side note, ctrl+alt+esc allows the window to lose its focus again.

also, apart from the initial 5000 images, I think the rest of the 1.35 million images are all in the inbox. I was keeping the images there so I would go through and tag them then move them to the archive, but that plan is out the fucking window, especially that I think ratings is more useful for my needs in its current implementation.

as for my computer

1700 ryzen, 32gb of ram, 512gb nvme ssd that the database and thumbnails are on, 8tb hdd for import folders, a 4tb hdd that files are moved to, and possibly a 3tb hdd that is exclusively images, not sure if thats happening yet or not.

very hard for me to classify my computer as old.

and yea, I think it was somewhere between 50 or 60 thousand images that the program had issues last time I did that testing, the program has honestly come such a long way since then, those points where shit slows down are probably outdated.


277248 (4) No.7496

>>7494

Right--I figured with Win 7 you had an older machine. I am not totally sure, but I think some of the older Windows have lower gui object limits (it means 'a program can't have more than x thousand window objects'), which is the main cause for the ~165 limit atm. I believe the proper name for this, if you would like to look into it for Win7, is GDIProcessHandleQuota in the registry. Win10 is like 10,000, but I think earlier is like 3,000. Obviously be careful if you mess around with this stuff.

But there are other problems beyond GDI count, which is probably what is slowing your bad session. Every import page has two threads and a bunch of other shit that I want to reduce.

I have more overhead reduction today and hope to do more in the coming months. As you are a heavy user, your continued feedback would be appreciated.



