[Release] OfflineBay v2 - Open source and No more Java dependency
#41
(Jul 16, 2018, 13:48 pm)0zz0 Wrote: I firmly believe TPB is the greatest thing since Roman aqueducts and I fully support them Big Grin

Haha. Couldn't agree more.  Tongue
Reply
#42
Hello! I can't believe I stumbled upon this only now!

I put the Pirate Bay dump on ipfsearch, a decentralized search engine and have some questions about how OfflineBay works.

How do you use DHT to get seed/leech counts? I thought that there is close to no way to finding this out accurately and fast, and for generating my index, I use the trackers that Pirate Bay includes by default in their magnet links. Without being in the DHT swarm for long, you can pretty quickly (quickly by distributed systems standards, slowly compared to trackers Big Grin ) find one peer with the files, but not the definitive peer count.

Do you use the NoSQL database only for metadata and not for indexing the dump? From skimming through the code, it seems that it's sequentially reading the whole CSV file (or something CSV that you're parsing) and searching for the wanted words, but I am absolutely unsure, the code is pretty hard to read...

Where can I find the code that handles the tokenization? I have no control over the tokenization on the querying side -- there is the Porter Stemmer, but I'd like to see how have you handled all the edge cases about torrent naming like (h264), dots instead of spaces, etc.

How do you feel about the rewrite to JS now? ipfsearch has a library for index generation that is written in Typescript (typed JS). Now that I look on the code I have written for index generation, it would've been faster to just write my own library for index generation for ipfsearch in something what supports threading, is much more elegant and has runtime type checks, like Go (I love Go! and Java is ok too). Would you tell younger you to rewrite it to Javascript?

Now I'm looking back at this it seems like there are a lot of questions Big Grin ... I'd be super happy if you could answer all this, thanks in advance!

If you want to check out the Pirate Bay dump: https://ipfsearch.xyz/?index=/ipns/12D3K...XpKteg2zMx
If you want to read more about ipfsearch: https://ipfsearch.xyz/

P.S.: I'm not the developer behind ipfsearch.xyz, I only created the index. Yes, code for generating the index is not open source yet, but it sure will, one day! ipfsearch.xyz is open source.
Reply
#43
(Aug 28, 2018, 17:38 pm)urbanguac Wrote: I thought that there is close to no way to finding this out accurately and fast


There isn't, and there never will be.
Reply
#44
anyone know of any mirrors where i can get database dumps for offlinebay?
Reply
#45
Spasibo.

A wonderful program, understandably a bit rought around the edges, that I am happy to have, and hoped would never need.

Times change.

Is there any advantages to compiling this in Node? Have it on the system....
Reply
#46
i cant find the torrent dump.
Reply
#47
http://uj3wazyk5u4hnvtk.onion/static/dum..._2017.html
Reply
#48
(Jun 04, 2018, 06:37 am)techtac Wrote:
Show Content

Hello Was wondering how i can create my own CSV database to only have my own personal torrents?
I don't understand where you get the base64 code for the torrents?
Reply
#49
(Jan 19, 2019, 18:44 pm)CKkio23 Wrote: Hello Was wondering how i can create my own CSV database to only have my own personal torrents?
I don't understand where you get the base64 code for the torrents?

Just follow the same header structure as the TPB dump. You can check the source code to find how it's done. It's a simple base64 decrypt function.
Reply
#50
Hey, my question I got torrent from this csv dump file , example below :

2019-Apr-13 22:04:04;K4t83wWiKQ4p2S+ZIi49jNrV64I=;"Adobe Media Encoder CC 2018 v22.0.1.64 (x64) + Crack";10596222

and how to decode this "K4t83wWiKQ4p2S+ZIi49jNrV64I=" on magnet url ? I tried to decode via online tools (base 64 decode) .. but I got incorrect result, some chinsese characters, not correct magnet url .. please help, thank you.
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Java Course Materials RobertX 6 9,874 Sep 05, 2024, 02:36 am
Last Post: samui
  how can i learn to use Java efficiently Blue_Bon 10 11,821 Feb 26, 2024, 08:41 am
Last Post: RobertX
  Question About the Concept of Abstracts in Java RobertX 4 5,635 Jan 17, 2024, 07:50 am
Last Post: gulshan212
  Regarding to Purpose for .class files in Java ankitdixit 1 11,726 Mar 10, 2022, 14:26 pm
Last Post: RobertX
  Popularity of Java in Today's World RobertX 6 37,884 Feb 02, 2020, 18:05 pm
Last Post: onlytorrents



Users browsing this thread: 6 Guest(s)