Non-threaded

Forums » The Lounge » Read Thread

A place to sit back, hang out, and make monkey noises about anything you'd like.

CYScraper: a CYS forum searching tool

23 hours ago
Commended by Ogre11 on 3/9/2026 10:06:08 PM
AKA "the type of shit Mousecore would rather do than their contest entry." One thing I noticed about this quirky site is that the forums don't have a search feature! Maybe it's just a side effect of the place's age. I noticed that people usually just used Google searches set to only scrawl CYS for results, which is definitely an option. Out of sheer boredom and lack of motivation for my contest entry, I decided it would be kind of cool to have a dedicated CYS search. So that's basically what I did this entire weekend. Here is CYScraper. Like a Google search, you can query anything directly from CYS' forums, but CYScraper lets you filter results based on author, excluded author (all posts made NOT by an author), forum(s), and date before/after/on, as well as if you want to search only post replies vs. only threads (i.e. thread titles). While I'm no Google algorithm engineer, I did try to create a VERY basic relevancy scoring algorithm that ranks posts by default based on a few factors. You can also do a strict query (i.e. if you search for "heir", you will ONLY see "heir" and never "their") by wrapping your search in "quotations." Oh, hey, and there's dark mode too. Some Things to Consider - THIS IS AN IMPORTANT ONE. The data I have on hand for people to search is very limited. I didn't want to scrape all of the CYS forums in one sitting because I didn't want to get my IP flagged or literally blow up the website. Right now, the data you can query through is any post made within the timeline given on the CYScraper site, as well as any post on the front page of any forum (so as an example even if the Reading Corner's first page has some threads from beyond the timeline, you can still search through those on CYScraper). Over time, if people see this as a legitimate tool, I will collect more data from further back to put into the search service. - New posts are not automatically entered into the CYScraper database as soon as they are posted. They have to be scraped by my own service first in order to be queried on CYScraper. I am still in the process of figuring out an acceptable automatic scraping schedule that toes the line between having a good amount of fresh data vs. not harassing the site with requests. (I know it says it has a schedule on the CYScraper site itself, but that was an artifact from when I was still brainstorming how the hell I wanted to do this and I'm too lazy to update my entire repo just to remove that one sentence.) - With that being said, the majority of performance/optimization decisions are made with favor towards CYS. CYS a small site and I don't need or want to flood its servers excessively just for a performance buff on my end, even if that means the tool gets data slower/less frequently. - Downtime on the CYScraper page itself is definitely possible while I'm still figuring out its hosting configurations. - The mobile version doesn't look super pretty. I will be going back to test the UI specifically on mobile, but it should look good on Chromium and Firefox desktop browsers. So yeah, just a funky little project I made to test my Python skills and to solve an incredibly niche problem. Go ham, let me know if you find any problems, and if you have any suggestions, I'm all ears.

CYScraper: a CYS forum searching tool

20 hours ago
This is fucking dope.

CYScraper: a CYS forum searching tool

20 hours ago
Thanks Chris :)

CYScraper: a CYS forum searching tool

20 hours ago
I search for lol, fag and got 47 results. That seems reasonable, considering the limited sample data.

CYScraper: a CYS forum searching tool

20 hours ago
I'm testing a recent scrape from all the way back to March 2019, so once I check that the data is valid from that push, you will surely get more lol, fags.

CYScraper: a CYS forum searching tool

19 hours ago
Searched up End saying "fag" and saw posts from 2017, did you go further back for posts from specific members or something?

CYScraper: a CYS forum searching tool

19 hours ago
Nah I'm too lazy for that. The scraping cutoff date determines "freshness" by the last post in the thread rather than the thread's creation date. So if there's a thread from 2017 that got bumped in late 2025, it got picked up. The reason I kept the rest of the thread (regardless of those post dates) instead of only the posts within a certain date is so that I don't end up with "impartial" or half-scraped threads in the system already. Just makes it easier for my scraper to more optimally skip threads that are already determined to be stale, which means less work for the scraper and less requests to handle on CYS' end. Also more shit for you to search through. Everyone wins.

CYScraper: a CYS forum searching tool

20 hours ago
This is really awesome Mousecore. You deserve all the comms. This will be a really useful archivest tool as you expand the data set. It will certainly help out EndMaster when he does the next CYS History update, which makes it automatically noteworthy.

CYScraper: a CYS forum searching tool

20 hours ago
That was actually one of my inspirations for the tool haha. I mean I'm sure the noteworthy stuff is just kept in memory but it might be nice to use for a quick receipts lookup.

CYScraper: a CYS forum searching tool

19 hours ago

This is already better than the old search function this place once had.

Yep, this place did have a search function once, but it sucked a lot so it just got ripped out and everyone was told to use Google instead

CYScraper: a CYS forum searching tool

19 hours ago
Commended by TharaApples on 3/9/2026 11:30:01 PM
Oh damn that's nuts. Well I'll definitely be keeping this running if it's going to be useful for everyone. It costs me a whopping total of $0 to host and run anyway. Glad it can do something good :)

CYScraper: a CYS forum searching tool

19 hours ago

Damnnnnnnn.

CYScraper: a CYS forum searching tool

19 hours ago

Might as well pin this.

CYScraper: a CYS forum searching tool

19 hours ago
Aw thanks End!

CYScraper: a CYS forum searching tool

19 hours ago

You did so good! You're getting a community contributor trophy. 

CYScraper: a CYS forum searching tool

18 hours ago
Holy shit! Thank you so much Thara!

CYScraper: a CYS forum searching tool

18 hours ago

Yes. But now since it's in your trophy description. You literally have to host it forever, even if it eventually costs a million dollars in the dystopian cyberpunk future. 

It's both a blessing and a curse.

Ha.

CYScraper: a CYS forum searching tool

18 hours ago
So I knocked out a few bugs with the scraper tonight that didn't like the concept of pagination. I'll be rerunning a scrape tomorrow to grab data back to March 2019. Periodically I'll keep you guys posted whenever there's a new data drop, as I'm only going back a few years at a time, but I think over half a decade of CYS forum data is enough toys to play with in the meantime.

CYScraper: a CYS forum searching tool

18 hours ago
Oh this is cool.

The forum used to have a search, but it was buggy to the point of uselessness, so they just removed it a few years ago since Google existed. Not everyone knows how to tell Google to search a specific site though (I can never remember the exact syntax either), so that wasn't ideal either. Haven't really had a chance to play around with this yet but I'm looking forward to it.

CYScraper: a CYS forum searching tool

18 hours ago
This is really cool Mousecore! Well done! I'm excited to see this continuously be updated and improved. It looks really cool! I already searched up my name and it's been a really fun trip down memory lane to scroll through all the threads I posted in

Also, congrats on your new trophy!

CYScraper: a CYS forum searching tool

15 hours ago
Wow, very nicely done.

Edit: Now that it exists it should maybe there could be a wrapped on the site that calls it.

CYScraper: a CYS forum searching tool

15 hours ago
Looks and works great, maybe when one day somebody gets access to the mythical code base it'll be possible to integrate it into the site directly. That'd be something.

Also it is a crime that a masterpiece such as Boobs!!! has basically no mentions in the dataset.

CYScraper: a CYS forum searching tool

11 hours ago

It should absolutely be possible to call the scraper's backend externally. Although obviously if the codebase is going to be modified, just building a search in-house and thereby retiring CYScraper would be optimal.

Anyways that at least gives me the idea of a CYScraper extension that could host a frontend display to the scraper backend right on the forum page. This is once the scraper is actually where I want it to be though and not in its infancy stage

CYScraper: a CYS forum searching tool

11 hours ago
True the scraper part of CYScraper would be pretty redundant then, but the core search engine could be ported in. Especially if people get used to the inferface.

Far as the extension goes that is a pretty cool idea, but yeah better to leave it for once everything else is at a point where you're fully happy with it.

CYScraper: a CYS forum searching tool

13 hours ago
Thank you. This is awesome.

CYScraper: Roadmap and Considerations

10 hours ago
Commended by TharaApples on 3/10/2026 2:49:41 PM
So I figured I would write this here so it's more visible. Please do not directly reply to this as I will be updating this post. Scrape Progress 2019-2026 DONE 2016-2019 2014-2016 2012-2014 2010-2012 2007-2010 all of 2002-2007

Feature Roadmap

  • Full scraping of the CYS site: this is the work in progress right now. The scraper will always catch new posts and any updates to existing posts up to nearly two months ago, but any data further back needs to be configured to be scraped. I am only going a few years at a time not only to avoid overwhelming the site, but to give people incrementally larger datasets to work with rather than make everyone wait for one massive data dump. It's also just easier on my end in case something fails or gets corrupted. The post-Discord era is a little easier as a lot of communication presumably went to Discord, but in eras where the forums were significantly more active pre-Discord (i.e. early 2010s), I may only scrape for data 2-3 years at a time. And of course, if there were any edits to a thread/post beyond the ~2 month mark, just necro it and it will be caught by the scraper.
  • Routine new post scraping: the scraper is pretty lightweight and unintrusive when running small scrapes against the CYS servers. Once we got the full forum index scrawled, I will be creating an automated scraping routine that will only check for new posts in the past 24 hours and any updated posts in the past ~2 months. This scraping routine will run at 12:00 am EST daily so it's at a relatively unobtrusive time, and it probabably would take no more than 3-4 minutes at most.
  • Sorting: this is an obvious one. I would like to accomplish this relatively soon.
  • More ways to query: exclude a phrase from the list, lazy/optional searches, multiple query searches, etc.
  • Working mobile version: mobile is technically usable but it looks absolutely terrible. I will be pushing an update dedicated to fixing the mobile UI at some point.
  • DB optimization: admittedly this tool is held together with duct tape and blood. Over time I'm probably going to want to change this to make the DB more efficient.
  • CYScraper extension: it would be another neat project to have an extension that puts the CYScraper searchbar and filter dropdown right on the forum main page. I'll have to work out the details once the OG search page is where I want it to be. One of my larger concerns with this is making sure it works fine in tangent with Brad's very popular CYS extension, which I also use.

Other Considerations

As it says on the site, CYScraper is an unofficial tool. While I have access to all the scraped data, at the end of the day, those posts are the site's posts. With that being said, privacy is a big thing to keep in mind. Admins, if there are any posts/entire threads that the scraper picks up that need to be deleted (for example, the scraper runs while there's a heated gamer-word frenzy going on in the Lounge that's going to all be deleted by the mods), please contact me with the post/thread IDs and I will delete them. I can also modify individual entries if you only need some parts of a thread or post removed. This is also a big reason why images/embeds are not saved in the scraper, in the off chance that it picks up something extremely unsafe (think highly illegal content). Additionally, these posts are your posts too. If at any point, for privacy concerns, you would like to not have any of your posts scraped by the CYScraper, please contact me and I will add your data to an exclusion list and delete any previous posts/threads in the CYScraper system if necessary. I might point and laugh though, but I won't outright say no. And as if it needs to be said, no I am not selling or distributing your post data. I am a one-man show that's doing this for fun outside of their full-time job and irl obligations. I literally could not care less.

CYScraper: a CYS forum searching tool

9 hours ago

This sounds amazing, and I wish it wasn't blocked for me. I look forward to testing it at home!

CYScraper: a CYS forum searching tool

9 hours ago
This is amazing work. I'm sure it'll be plenty useful when the Secret Santa event rolls around again. XD

CYScraper: a CYS forum searching tool

6 hours ago
This is really cool. Glad to see you got the trophy & points for this, it's absolutely deserved

CYScraper: a CYS forum searching tool

4 hours ago

This is fucking amazing. Good job!

CYScraper: a CYS forum searching tool

57 minutes ago
Expect a bit of downtime right now, I am currently launching a version with the full March 2019-March 2026 scrape.