
/hydrus/ - Hydrus Network

Bug reports, feature requests, and other discussion for the hydrus network.


New user? Start here ---> http://hydrusnetwork.github.io/hydrus/

Current to-do list has: 829 items

Current big job: finishing and polishing duplicate system



1460bb No.5719

windows

zip: https://github.com/hydrusnetwork/hydrus/releases/download/v255/Hydrus.Network.255.-.Windows.-.Extract.only.zip

exe: https://github.com/hydrusnetwork/hydrus/releases/download/v255/Hydrus.Network.255.-.Windows.-.Installer.exe

os x

app: https://github.com/hydrusnetwork/hydrus/releases/download/v255/Hydrus.Network.255.-.OS.X.-.App.dmg

tar.gz: https://github.com/hydrusnetwork/hydrus/releases/download/v255/Hydrus.Network.255.-.OS.X.-.Extract.only.tar.gz

linux

tar.gz: https://github.com/hydrusnetwork/hydrus/releases/download/v255/Hydrus.Network.255.-.Linux.-.Executable.tar.gz

source

tar.gz: https://github.com/hydrusnetwork/hydrus/archive/v255.tar.gz

links

I had a great week, getting a lot done. There are many improvements to the duplicate system and some important notes for anyone who downloads from gelbooru.

you can now move tabs!

Right-click on a page tab on the main gui and hit 'move left/right' to move it left and right!

I can't believe I didn't think of this before, but it was a hell of a lot easier than writing custom drag and drop code!

the gelbooru downloader is not healthy

Like several other downloaders in the program, the gelbooru parser is now also having trouble. I fixed some gelbooru ssl and redirect problems this week, but the gelbooru guys themselves have also changed how their gallery thumbnail URLs work in a way that is not helpful to hydrus, particularly for subscriptions.

The gelbooru downloader page seems to fetch files and tags fine, although the urls associated with the files are not great.

For subscriptions, I have attempted to mitigate the url problem in the update code–you will receive a popup on update if you are affected–but if you had a gelbooru subscription fire in the past week (it then likely had an error due to the above redirect problem), you should check it in the manage subscriptions dialog. If the sub has twice as many urls as it should, you should go into the subscription, click the icon button that launches the list of urls and then either 'skip' or just delete the new ones.

If you have too many subs to fix or are just not confident playing around with this stuff, just pause all your gelbooru subscriptions. I will put more work into this next week, which I suspect will include a retroactive purge of these new ugly redirect urls.

This is another reason to overhaul the downloader engine, which will then be more flexible and easier to fix when these things happen. I will start it in a few weeks, as soon as I am done with the current duplicate stuff.

duplicate stuff

I've polished the duplicate filter substantially. Several miscellaneous buttons and unusual workflow cases now work properly, and shift-click should drag media around. Also, the background will brighten and darken as you flick between the pair to give better feedback, and the pairs it presents you will be selected faster and more intelligently.

And as an experiment, I've made it so pairs that have the same resolution will stay at the same zoom and pan when you flick between them. I'm really pleased with how this works, so I may extend it to more classes of pair, like those with the same aspect ratio.
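A minimal sketch of the same-resolution rule described above, assuming a simple zoom-plus-pan view state. The names `Media`, `ViewState`, and `view_state_after_flick` are made up for illustration and are not taken from the hydrus source.

```python
from dataclasses import dataclass


@dataclass
class Media:
    width: int
    height: int


@dataclass
class ViewState:
    zoom: float = 1.0
    pan_x: int = 0
    pan_y: int = 0


def view_state_after_flick(current: Media, target: Media, state: ViewState) -> ViewState:
    """Keep zoom and pan only when both files share the same resolution."""
    if (current.width, current.height) == (target.width, target.height):
        # Same pixel grid, so the existing view lines up exactly on the other file.
        return state
    # Different resolution: fall back to a default view (hypothetical behaviour).
    return ViewState()
```

Extending this to pairs with the same aspect ratio would mean scaling the zoom by the width ratio rather than reusing it unchanged.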

full list

- the duplicate filter now supports shift+left-click to drag, like the archive/delete filter (this remains hardcoded for now)

- if a pair in the dupe filter has the same resolution, they will now maintain zoom and pan when switching back and forth (I might increase this to work for same ratio as well, let me know how it works in the real world)

- the duplicate filter will show a lighter/darker background colour as you scroll the current pair

- the way lighter/darker and alternate lighter/darker colours are calculated is now centralised and should be more reliable in edge cases

- improved the dupe filter pair selection algorithm–it now chooses pairs more reliably under edge case conditions and prioritises decision-groups that have high potential decision value. it should also run a little faster

- increased the dupe filter batch size to 250, let's see if it causes any problems

- the close button on the dupe filter's top hover window now works

- fixed the duplicate filter disappearing pairs that were skipped (meaning you could not go back to revisit them)

- fixed a frequent deadobject error when the dupe filter closes

- the shutdown 'maintenance due' test is less sensitive to dupe search tree rebalancing, which typically only takes half a second

- the archive/delete filter now uses the new shortcuts system for both keyboard and mouse input

- the archive/delete filter now intercepts archive or delete commands from different sources ('media' shortcuts, top hover frame button presses) more reliably and converts them into filter actions (hence moving on to the next file)

- you can now move pages one to the left or right from their menu!

- on the top-right hover window (and the background underneath), ratings are now on the top, and hence will always be in the same location as you scroll through your media regardless of known urls, remote location, or inbox status

- autocomplete searches for tags with apostrophes, quote marks, braces, brackets and parentheses should be more reliable

- urls are now associated with files through the same 'content' pipeline as tags and ratings and so on

- gui-level media is now aware of the 'new url content update' event and will update and redraw itself appropriately

- fixed deviant art nsfw parsing, but it might not hold for long. proper fix here is to wait for the downloader overhaul

- networking engine now uses the 'requests' module's CA .pem (which the overhaul will be moving to anyway), which should reduce the frequency of ssl verify failures (gelbooru on the relatively new 'Let's Encrypt' CA had this problem for many users)

- fixed the networking engine redirect parsing for gelbooru's unusual location header. unfortunately, gelbooru is still giving mickey-mouse garbage redirect urls from its main thumbnail pages, for which in this engine there is no immediate fix

- neighbouring .txt tags will now be properly cleaned and sibling-collapsed in the path tagging dialog

- neighbouring .txt tags will now be properly sibling-collapsed in the import folder workflow

- making a media viewer borderless fullscreen and back will now recenter the media (previously, the current drag delta was not reset, so this frequently put media off-screen)

- fixed a serious issue where the media viewer could lock the client up on opening with a video if its gui options set 'remember size' to false

- tag import options objects will cleanse themselves of missing services on options save/client shutdown

- manage tags now defaults to cross-referencing 'my files' on the 'local tags' domain, where 'remote' (i.e. deleted) files' tags are not useful

- reduced memory use when importing large pngs with transparency

- improved adminside petition processing gui reporting

- servers will now cap the size of their mapping petitions so as not to overwhelm the admin processing them (they now won't be both >20 tags and >1000 total row weight)

- the media viewer's manage tags frame now listens for content updates from outside, so if you alter an in-view file's tags (such as with a shortcut key), the manage tags dialog will update as it happens

- moved a number of the buttons on the top hover windows to the new unified internal command engine (which the new shortcut system also uses)

- disk cache maintenance uses fewer resources but now occurs in the foreground (and should hence more reliably maintain the cache)

- critical repository service id lookup errors will now automatically reset the repository's processing cache and better inform the user of what has happened. if you see this, please let me know the details and how this error fired in the real world

- updated and reinstated the ipfs service panel in review services

- updated some ipfs service code for the new service system

- misc refactoring and cleanup

- more cleanup and deletion of redundant old pubsub command code

- updated ubuntu build machine to 17.04 and opencv 3.2
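
The petition cap in the list above (not both >20 tags and >1000 total row weight) could be sketched roughly like this. It assumes 'row weight' means the number of files mapped to each tag, which the changelog does not spell out, and `chunk_petition` is a hypothetical name rather than the server's actual code.

```python
MAX_TAGS = 20
MAX_WEIGHT = 1000


def chunk_petition(tag_counts):
    """Split (tag, file_count) pairs into petitions.

    A petition may exceed one limit, but a new one is started before it
    would exceed both the tag limit and the weight limit at once.
    """
    chunks = []
    current = []
    weight = 0
    for tag, count in tag_counts:
        too_many_tags = len(current) + 1 > MAX_TAGS
        too_heavy = weight + count > MAX_WEIGHT
        if current and too_many_tags and too_heavy:
            chunks.append(current)
            current = []
            weight = 0
        current.append(tag)
        weight += count
    if current:
        chunks.append(current)
    return chunks
```

With thirty tags of 100 files each this yields a petition of 20 tags followed by one of 10, while 25 tags with one file each stay together because the total weight is small.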

next week

I have 41 things left to do for the duplicate work, and I want to get that to 0 as soon as possible. Beyond some more gelbooru triage, I want to focus on duplicate stuff and a few bugs that I didn't have time to get to this week.

I'm behind on messages, so I'll make some time to catch up as well.

933163 No.5720

File: 0871987a759ac86⋯.png (18.88 KB, 1683x968, 153:88, screen.1494462341.png)

Currently @ 6.1 mil items added into Hydrus. Everything's still smooth sailing.

Keep up the good work!


63b657 No.5721

hopefully simple feature request:

saving downloader tabs

specifically, i sometimes download a lot of chan threads at once, and have to manually set up the tag settings and "ignore deleted files" options and so on for each tab every time… because session-saving doesn't include the tagging/etc options on a downloader that has not yet been activated.

essentially i want to be able to save and restore six un-triggered downloader tabs, *with* their tagging/deleted settings intact


9df39a No.5722

>>5719

Nice! I think I was one of the people who originally suggested tab shuffling, this is a simple and convenient solution. I'd also been thinking about a way to show the duplicate scrolling easier, and changing background tint a little sounds ideal.

Question, I know that when you mark a file as better it commutes the tags of the one to the other, but is there any way to add tags during the comparison process to the list that will then be added to the better of the two? A lot of times a worse-quality image will have a watermark or URL that's not present on a higher quality image, but may yet be the original source, and it would be nice to have a way to get rid of the lower res image but preserve that metadata as a tag. Since I can't right-click to edit tags like in a normal viewer pane, I'm a little lost.


ae87b7 No.5724

File: 4caae109e154248⋯.jpg (99.76 KB, 350x310, 35:31, 4caae109e154248384891b271f….jpg)

>>5720

Jesus christ.


e3de1f No.5725

>>5720

So what is all of that? Porn?


d465e7 No.5726

>>5719

I'm currently downloading from gelbooru (not using a subscription) and haven't had any problems so far. What are some of the problems others have been having?


4e54bd No.5727

File: dc80df47a3708df⋯.jpg (11.8 KB, 241x230, 241:230, dc80df47a3708dfe3cc3ee257c….jpg)

>>5720

My god. Do you actually tag stuff or just rely on the PTR and the tag look up? Which tag ranks the highest?


b90091 No.5728

>>5726

It works now, but they changed what the thumbnails link to on gallery pages. They now link to sketchy redirect links (assumedly to facilitate ads) which hydrus couldn't resolve. It can now, but with the feature that shows URLs of files, there's no way to tell if those URLs will work in the future. Similarly, subscriptions rely on urls never changing, so with the change they suddenly think all the images are new (because they have a different url). Basically, your gelbooru subscriptions will suddenly try to redownload themselves entirely.
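
A small sketch of why this happens, assuming a subscription decides what is new by comparing gallery urls against the set of urls it has already seen. The function name and the example.com urls are placeholders, not gelbooru's or hydrus's real ones.

```python
def find_new_urls(seen_urls, gallery_urls):
    """Return the gallery urls the subscription has not recorded before."""
    return [url for url in gallery_urls if url not in seen_urls]


# Urls recorded by earlier subscription runs (placeholder values).
seen = {"https://example.com/post/1", "https://example.com/post/2"}

# The same two posts, first with unchanged urls, then behind new redirect urls.
unchanged = ["https://example.com/post/1", "https://example.com/post/2"]
redirected = ["https://example.com/redirect?to=1", "https://example.com/redirect?to=2"]

print(find_new_urls(seen, unchanged))
print(find_new_urls(seen, redirected))
```

The first call finds nothing new, while the second reports both posts as new even though the files behind them are the same, which is the doubling of urls described in the release post.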


b3bde9 No.5729

Is there a way to disable the bg color change when filtering dupes? That stark change outside the images makes it more difficult to spot subtle changes within them.


d465e7 No.5730

>>5728

Thanks brah.


1d955b No.5731

File: 432b16f08d743e3⋯.png (38 KB, 188x268, 47:67, 089041958235.png)

>>5720

s-six million?!


1bc186 No.5732

>>5720

My 50k micropenis salute you


ae87b7 No.5733

Bug: shortcuts bound to two actions (say, "c" to "set tag "catgirl"" and "set tag "cute"") do not work, only the newest one is saved when you click "apply". At least in the same shortcut set.


9df39a No.5735

Minor feature request OP:

If convenient, can you make the default minimum tab width a little smaller or better yet add an options box to set the size by integer or something? I know it's inconvenient to have near-blank tabs too tiny for most users, but working mainly off a 1336x768 screen I often find myself having to shift the tab bar back and forth with the directionals just to see the tabs I want.

>>5733

That might be by design to avoid your commands getting FUBAR. Have you considered assigning "cute" as a subtag of "catgirl" since you seem to want all that are tagged one to be tagged the other anyways? I have rating:NSFW set up to automatically get added when I enter board:(certain boards) tags.


933163 No.5740

>>5725

90% is.

For the most part it's the image entire databases of the following:

Gelbooru as of May 1st 2017(This took about a month to add, with around 100k files being added daily)

MyFigureCollection 2016

e621 2015

Rule34Hentai as of April 2017

Rule34Paheal 2015

HentaiFoundry 2015

And then around 500+ pixiv artists. There's about 20 other boorus/sites too but they have less than 300k images. It probably won't grow much bigger than this.

>>5727

>Do you actually tag stuff or just rely on the PTR and the tag look up?

I don't tag anything myself. I make tag archives of the sites I rip, which already contain their tags, so nearly every image should be tagged (unless it wasn't tagged on the site)

>Which tag ranks the highest

As in like across all my files? I've no idea since I have tag counts and such disabled


933163 No.5741

>>5740

>image entire databases

Entire image databases*

Also I haven't had the time to run a dupe check, so that'll probably trim down 200k files or so.


ae87b7 No.5742

>>5735

>That might be by design to avoid your commands getting FUBAR. Have you considered assigning "cute" as a subtag of "catgirl" since you seem to want all that are tagged one to be tagged the other anyways?

I don't want that. I just want to do it sometimes for when I scrape an artist that makes cute catgirl pics. Not all catgirl pictures are cute.


9df39a No.5744

>>5742

Maybe a different key for each tag then? They could be right next to each other. I'm not sure Hydrus supports running macros from a single keypress yet.


4f93c0 No.5747

Can you add this https://github.com/ipfs/go-ipfs/issues/3397#issuecomment-284337564 as an option for ipfs in the next release? It could be enabled when you check the checkbox in ipfs service settings.


1460bb No.5752

File: cd3cb363f2cfdba⋯.png (8.87 KB, 300x318, 50:53, cd3cb363f2cfdba0e93ab4748c….png)

File: 9c9d73a2a2a0671⋯.gif (421.57 KB, 640x480, 4:3, 9c9d73a2a2a0671c6687b13b58….gif)

>>5720

Incredible. I'm glad it works for you!

Do you have to straddle multiple hard drives, or does it all fit into a 4TB or whatever?

Please don't hesitate to report slowdown now or in the future. You are definitely a supreme edge case, so you will experience bottlenecks far before us mortals.

>>5721

Thank you for this report. I'll look into this this week, as them not saving those details is not intended.

If it is any help, you can set the defaults of those options under options->default tag import options and options->files and trash.

>>5722

F3 or any other shortcut you have set up to launch manage tags should still work, and it'll even follow the media it is pointed at as you scroll between the pair. You can also just double click the tag hover window on the left.

Unfortunately, you can't edit tags after the decision as the new pair is loaded, so you can't easily edit the tag merge finely. Perhaps that is something to add to the 'custom' action, but I'm not sure if I have time to add the extra gui in this iteration. Maaaybe a 'open manage tags for both after the decision' checkbox?

>>5726

>>5728

I think I am going to purge these shit urls this week and write a special hook just for gelbooru to resolve them back to proper urls. That'll be a feature of the new downloader engine, so I might as well write the underlying code now, when we need it.

>>5729

Sure, I'll add this as an option.

>>5733

>>5742

>>5744

Unfortunately, the shortcut system doesn't support multiple actions per shortcut. It will in a future iteration, particularly by you assigning a script of commands (like 'set this rating, add this tag, archive it, then move to the next file'). Please assign your tags to different shortcuts for now, and if you have many different tags to add, see if having multiple custom shortcut sets works for you.
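
For illustration only, a sketch of what multiple actions per shortcut might look like if each key mapped to an ordered list of commands instead of a single one. `ShortcutSet` and the action tuples are hypothetical and do not reflect the current shortcut system.

```python
from collections import defaultdict


class ShortcutSet:
    """Maps a key to an ordered list of actions instead of a single action."""

    def __init__(self):
        self._bindings = defaultdict(list)

    def bind(self, key, action):
        # Appending keeps earlier bindings instead of overwriting them.
        self._bindings[key].append(action)

    def actions_for(self, key):
        # .get avoids creating an empty entry for keys that were never bound.
        return list(self._bindings.get(key, []))


shortcuts = ShortcutSet()
shortcuts.bind("c", ("set tag", "catgirl"))
shortcuts.bind("c", ("set tag", "cute"))
```

Pressing "c" would then run both tag actions in the order they were bound, which is the behaviour asked for in >>5733.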

>>5735

Is this the page tabs on the main gui frame, or on a ListCtrl (the table-like control with rows and columns) somewhere else in the program?

Actually, if you can take a screenshot, that'd be great to see the sort of bad-fitting ratio we are talking about. A lot of this code is a mess (and not all mine), but I am absolutely happy to try to improve this situation for smaller-screen users.


1460bb No.5753

>>5747

Sure, I will try to fit this in. I want to do some other IPFS work as well. If I can't do that option this week, I'll make sure to have it in for v257.


3fd050 No.5754

Getting an interesting bug; with the duplicate filter up, I can't move to the B image from the A. I click the button, but nothing happens. I'm on Arch Linux, fyi


9a1674 No.5755

Do the known urls in the dupe filter show up on the white background? Either they don't or every single image that ends up as B in mine has no known urls.


93cd6b No.5759

>>5752

>Maaaybe a 'open manage tags for both after the decision' checkbox?

Not him, but that would be much appreciated.


067cae No.5760

File: 8631d7c043366b6⋯.png (533.55 KB, 803x866, 803:866, client_2017-05-13_10-59-27.png)

Hi, I was wondering if it would be possible to add an exclude/include tag to the duplicate system.

I have about 5000 pictures of visual novel rips, and they are the only things that show up on the duplicate page if I use "show me a random pair", since they're so many and extremely similar except for the text on the picture.


1bc186 No.5761

After resolving all the possible dupes, Hydrus will spam a popup window indefinitely. Forgot exactly what it said, but basically "no more dupes".

Why not add some more buttons to the top hover window? Open manage tags, remove all tags, for example. Just to make it easier for newbies, also someone like me who mostly uses that hover window instead of hotkeys.

Also the ability to select multiple images in the main browser and set them as alts to one another would be useful and help speed up the dupe checking process.


d3a8c6 No.5764

>>5761

>Also the ability to select multiple images in the main browser and set them as alts to one another would be useful and help speed up the dupe checking process.

That's a feature I would enjoy quite a lot considering the many variation sets I fetch from pixiv.


4f93c0 No.5765

>>5753

Thank you


7ed8d0 No.5766

Some kind of automated system to delete one of the two images set as an "Exact duplicate" would be grand. Look at how Dupeguru does its picture sort, allowing you to set the smaller of two images first or the smallest filesize. Having that be done automatically instead of trying to go through your list of exact dupes and picking one or the other, but simply based on which of them is xier or yier than the other would be very cool.


33b2e0 No.5768

>>5720

>archive SIX MILLION

>inbox SEVEN

Good man. What kind of scrapers do you have running?


9df39a No.5791

Since the last update, some 4chan threads have been giving me this for a few files, in one case it was for like 20+ files in a thread but usually just one or two:

TypeError: buffer object expected… (Copy note to see full error)
Traceback (most recent call last):
  File "include\ClientImporting.py", line 2817, in _WorkOnFiles
    HG.client_controller.WriteSynchronous( 'content_updates', service_keys_to_content_updates )
  File "include\HydrusController.py", line 386, in WriteSynchronous
    return self._Write( action, HC.LOW_PRIORITY, True, *args, **kwargs )
  File "include\HydrusController.py", line 117, in _Write
    result = self._db.Write( action, priority, synchronous, *args, **kwargs )
  File "include\HydrusDB.py", line 824, in Write
    if synchronous: return job.GetResult()
  File "include\HydrusData.py", line 1668, in GetResult
    raise e
DBException: TypeError: buffer object expected
Database Traceback (most recent call last):
  File "include\HydrusDB.py", line 517, in _ProcessJob
    elif job_type in ( 'write' ): result = self._Write( action, *args, **kwargs )
  File "include\ClientDB.py", line 9471, in _Write
    elif action == 'content_updates': result = self._ProcessContentUpdates( *args, **kwargs )
  File "include\ClientDB.py", line 6621, in _ProcessContentUpdates
    hash_id = self._GetHashId( hash )
  File "include\ClientDB.py", line 3426, in _GetHashId
    result = self._c.execute( 'SELECT hash_id FROM hashes WHERE hash = ?;', ( sqlite3.Binary( hash ), ) ).fetchone()
TypeError: buffer object expected

wat do?


953dc7 No.5793

>>5791

It's currently failing like that on all already-deleted pictures. It effectively does not impact the user because it's just marking them as failed rather than "already deleted".


135556 No.5800

>>5720

I only have ~1/3 of that now. Would be more if everything worked smoothly, but I get UI lockups when generating phashes and so on.

Didn't get to add more.

> >>5752

> Do you have to straddle multiple hard drives, or does it all fit into a 4TB or whatever?

Since he didn't answer, my set is actually under 1TB. His might still easily fit on a single 4TB drive, yea.

And it's not like 8x6TB or whatever requires particularly specialized hardware or a fortune. Just throw ten 6TB drives into a pretty standard Linux / big tower machine with one or two add-on SATA cards, put RAID6 over it all, and you've got 8 drives' worth of storage without much trouble in setting it up or operating it.
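
The arithmetic behind that, as a quick sketch: RAID6 spends two drives' worth of space on parity, so usable capacity is the drive count minus two, times the drive size. The helper name is made up for illustration.

```python
def raid6_usable_tb(num_drives, drive_tb):
    """Usable capacity of a RAID6 array built from equally sized drives."""
    if num_drives < 4:
        raise ValueError("RAID6 needs at least four drives")
    # Two drives' worth of capacity goes to parity.
    return (num_drives - 2) * drive_tb


print(raid6_usable_tb(10, 6))  # ten 6TB drives -> 48, i.e. 8 x 6TB
```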

As a small remark: At a bigger LAN party around 15 years ago, there was a guy who had a stack of 40 drives on simple rails, and he apparently had left another 20 or 40 or whatever at home. All connected to bigger 3ware enterprise cards. [That was fairly impressive to look at, even though it's not really clever - they even apparently performed pretty shit under his Win 2k or whatever it was back then.]


1460bb No.5807

File: 89ad104abb9f095⋯.jpg (1.38 MB, 2164x4145, 2164:4145, 89ad104abb9f095ea6e7a7a1d5….jpg)

>>5754

Thank you for this report. I am not sure what was causing this–which button were you clicking?

I have made some more changes and improvements to the duplicate filter this week. I am guessing you were clicking the >| button, which actually performs 'skip to next pair'. If this is what happened, this will be more obvious in the new version. Also, pairs will not necessarily start with A from now on (they'll start with the best guess for 'best').

>>5755

I think this is working properly, so I think what you are seeing is an accident. I will look into this more tomorrow, when I test the new filter on my real world client.

>>5759

I don't have time to write nice gui, but I'll figure something out to achieve this.

>>5760

I'll be adding thumbnail right-click duplicate manual decisions as I finish this off. At that point, you should be able to eliminate all those decisions more quickly by going:

ctrl+a->right-click->dupe stuff->these are all alternates

Let me know how it works for you!

>>5761

Thank you for this report about the popup spam. I think I have it fixed for tomorrow, but please let me know if it still gives you trouble.

I like the idea of adding more clickable commands to the hover windows. I'm concerned about running out of space, but perhaps I can make them individually optional.

Also, if it is any help, you can double-click the left hover window to open manage tags.

>>5761

>>5764

Yeah, this will happen in the next couple of weeks!

>>5766

Tomorrow's release will tell you which file is larger/has more tags/and a couple of other properties. It'll also try to put the 'better' image first. Maybe this will make it easier to determine better/worse quickly?
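
One way such a 'better first' guess could work, as a rough sketch: compare the two files on a few properties in a fixed priority order. The properties and their order (pixel count, then file size, then tag count) are assumptions for illustration, not the filter's actual rule.

```python
from dataclasses import dataclass


@dataclass
class FileInfo:
    name: str
    width: int
    height: int
    size_bytes: int
    num_tags: int


def quality_key(f):
    # Tuples compare element by element: pixels first, then size, then tags.
    return (f.width * f.height, f.size_bytes, f.num_tags)


def order_pair(a, b):
    """Return the pair with the file guessed to be 'better' first."""
    return (a, b) if quality_key(a) >= quality_key(b) else (b, a)
```

An automatic decision like the one asked for in >>5766 could reuse the same key, but a guess like this is easy to get wrong for recompressed or upscaled files, so showing it to the user first seems the safer default.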

>>5791

>>5793

Thank you for this report. I apologise–I messed up some url-associating code last week. This is fixed for tomorrow. It only affected 'already deleted' files, so if you had those set to be excluded anyway, this was ultimately mostly harmless.
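
For anyone curious about the traceback in >>5791: the error is raised by `sqlite3.Binary( hash )`, which rejects anything that is not a bytes-like value. The thread does not say what value was actually passed, so the sketch below only shows a defensive version of that kind of lookup in Python 3. The query is copied from the traceback; the guard and the function name are assumptions.

```python
import sqlite3


def get_hash_id(cursor, file_hash):
    """Look up a hash_id, failing early with a clear message on bad input."""
    if not isinstance(file_hash, (bytes, bytearray)):
        # Without this check, sqlite3.Binary raises a bare TypeError.
        raise ValueError("expected a bytes hash, got %r" % (file_hash,))
    row = cursor.execute(
        'SELECT hash_id FROM hashes WHERE hash = ?;',
        (sqlite3.Binary(file_hash),),
    ).fetchone()
    return None if row is None else row[0]
```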



