One of the best things about reddit was looking for answers or other users with the same problem as you, and since Google didn’t really help with that anymore and instead insisted on giving you business results, the best practice was to put your search terms in followed by ‘reddit’ and you’d find your answer.

  • marsara9@lemmy.world
    link
    fedilink
    arrow-up
    38
    ·
    edit-2
    1 year ago

    I’m working on a specialized search engine just for the fediverse. https://github.com/marsara9/lemmy-search

    If anyone wants to help out, feel free to reach out, but I hope to have something ready to release soon.

    The idea with my version is that it’ll search as much of Lemmy / the fediverse as it can and you can select the preferred instance that you want to open any link with.

    • qisope@lemmy.world
      link
      fedilink
      arrow-up
      11
      ·
      edit-2
      1 year ago

      If you are looking to return relevant, well ranked results based on freeform queries you’d be better indexing into something like elasticsearch. Otherwise you’ll be reinventing solutions to well understood problems, like stemming as a very basic example.

      • marsara9@lemmy.world
        link
        fedilink
        arrow-up
        7
        ·
        1 year ago

        For the initial release the search is still fairly basic, but A LOT better than the built in search here.

        Right now I just look for IF the individual words match ANY of the words in the post title or body and then rank based on the number of upvotes that the post has.

        Future versions may look at using elastic search, etc… But for MVP it just looks for the number of hits + the score of the post as I assume the higher the score the more trustworthy the post, and obviously the more matches that to your query the more relevant the post is.

    • Confetti@lemmy.world
      link
      fedilink
      arrow-up
      3
      ·
      edit-2
      1 year ago

      Some name ideas off the top of my head if you would like some:

      • Findmmy (find me)
      • lemscover
      • discovermmy (discover me)
      • Lemlens
      • Seachiverse
      • Findiverse
      • Fedisearch (I like this one) taken sadly

      Edit: please ignore if I posted multiple times something weird is going on haha

    • Excel@lemmy.megumin.org
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 year ago

      How is this different from just searching for posts on the original “seed instance”? Presumably you’re crawling through everything on all of the instances that it’s aware of, as opposed to the Lemmy built-in search which would only search communities that have a subscriber?

  • rtxn@lemmy.world
    link
    fedilink
    arrow-up
    8
    ·
    edit-2
    1 year ago

    I recently saw that someone was making a keyword search engine that works across the fediverse. I’ll try to find the project.

    edit: found it, unsurprisingly it’s called lemmy-search. Although it only seems to work on Lemmy instances.

      • sadreality@kbin.social
        link
        fedilink
        arrow-up
        1
        ·
        1 year ago

        Google gutted their search over last few years.

        No, sandar I don’t want 69 pages of SEO optimized trash

        • GammaScorpii@lemmy.worldOP
          link
          fedilink
          arrow-up
          1
          ·
          1 year ago

          Yeah I meant just a neat way of searching for user posts.

          Google probably ingrained a little so that’s what I gravitate to, but if there are better ways of searching that’s helpful too. Having a search engine do it is good because you might come across old forum posts as well as Reddit, but over the years Reddit just became more prevalent. Obviously these new federated sites won’t yet have all the usable content, but I’m wondering if the process will be the same once they do.

  • rm_dash_r_star@lemm.ee
    link
    fedilink
    arrow-up
    6
    ·
    1 year ago

    They’re getting indexed, but search rankings are so low they’re buried. If you put <search term> site:<server> you get post results. For example lemmy site:lemmy.world

  • marcar@lemmy.world
    link
    fedilink
    arrow-up
    5
    ·
    1 year ago

    This would obviously be good for promoting Lemmy which I’m 100% all for.

    But from a privacy point of view, I also feel mods should be able to stop indexing or choose which engines can index for their specific communities and also users at a user should be able to control it. I understand that engines could ignore this, but I doubt the big ones would…

    I think I read that individual instances already can choose whether to be indexed or not, I could be wrong there

    • yaomtc@kbin.social
      link
      fedilink
      arrow-up
      1
      ·
      1 year ago

      Why are people using a site named after the place they purposefully left with just one letter changed?

      • Moka@kbin.social
        link
        fedilink
        arrow-up
        1
        ·
        1 year ago

        Presumably because reddit itself has a lot of positivity and memories attached to it for a lot of people - it wasn’t the site that people wanted to leave, but rather the ceo and staff behind it.

    • MinusPi (she/they)@pawb.social
      link
      fedilink
      arrow-up
      1
      ·
      1 year ago

      I don’t know about others but I used to just add “reddit” to each of my searches. Wouldn’t adding “Lemmy” instead do the same thing eventually?

        • tqgibtngo@kbin.social
          link
          fedilink
          arrow-up
          2
          ·
          1 year ago

          As a newcomer, I’ve visited 3 Lemmy sites: Beehaw. Lemmy.world, and a custom instance. I noticed that they each have page footers that contain: Join Lemmy. If the same is true of many Lemmy instances, I can add Lemmy (or, with quotation marks, “Join Lemmy”) in a Google query. — (Note: Top matches might not always be best matches on the originating instance, or sometimes the best matches might be hidden until I click “repeat the search with the omitted results included.” And of course sometimes I won’t get any match because the target hasn’t been indexed by Google.)

    • Elkaki123@vlemmy.net
      link
      fedilink
      arrow-up
      0
      ·
      1 year ago

      Would this be corrected naturally by people using feddit as a search term more or does google have to manually patch this things?

        • chaorace@lemmy.sdf.org
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          I don’t understand. I looked at your screenshot again and the search field seems to show feddit.de: Musk. This is not the site: syntax. What I suggested was Musk site:feddit.de. Am I missing something?

          • albert180@feddit.de
            link
            fedilink
            English
            arrow-up
            0
            ·
            edit-2
            1 year ago

            The site: is feddit.de: and after that follows the search query. It works that way too, and it’s less work to type. Try it out by yourself

            • chaorace@lemmy.sdf.org
              link
              fedilink
              English
              arrow-up
              0
              ·
              1 year ago

              I tried it myself and they’re not similar at all. site: is handled specially through Google’s advanced search syntax while the other approach is no different from a normal keyword. Please refer to the below images with attention to the result counts:

              It’s fine if you don’t want to use the syntax, but using it would solve your problem with keyword autocorrect and properly filter your results to only the website you’ve asked for.

  • theactualmitch@lemmy.mitchday.com
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 year ago

    I’ve been experimenting with my instance and google does index my lemmy.mitchday.com page. If you search ‘lemmy mitch day’, you get my page right up top and a few motorhead fans named mitch down below.

    My experiment involves trying to SEO a vanity domain I’ve had for years and only used for email. Since July 4, the page ranking for my general site has steadily climbed. The little bit of traffic from other instances and a handful of subscriptions seems to be impacting the ranking.

  • sadreality@kbin.social
    link
    fedilink
    arrow-up
    1
    ·
    1 year ago

    Does fediverse instances even show up in search engines?

    There is no money for parasites here so I wouldn’t expect them to send anyone here.

  • hawkwind@lemmy.management
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    1 year ago

    Being decentralized will make it harder to just use “search + reddit” because you don’t know if it’s “search + lemmy.world” or “search + beehaw” or “search + kbin.” Also, each admin is in charge of their own Robots.txt.