Yandex caught scraping Google SEO code

Tech experts think Yandex and Google could share a lineage

When you purchase through links on our site, we may earn an affiliate commission.Here’s how it works.

As TechRadar Proreportedearlier in January 2023, a formerYandexemployee with a “political” motive has allegedly leaked a wide-ranging repository of source code for many of the web portal’s products, potentially shedding light on the dark art ofsearch engine optimization.

BleepingComputerreports the employee leaked git sources totalling 44.7GB of files, containing “all of” Yandex’s source code except for its anti-spam rules, that were obtained in July 2022.

The raw source code won’t be of interest to everyone,Search Engine Land’s report that 17,854 search ranking factors have been uncovered as part of the leak should be of interest to any person, business or publication looking to see their pages ranked highly in search engines.

Yandex leak SEO insights

Yandex leak SEO insights

Apartial list of factors ranked by the Yandex search enginefrom one file in the codebase, shared by CEO ofSEOconsultancy MOG Media Martin MacDonald, does shed some light on the aspects of copy that Yandex applies weight to.

PerRussian Search News, these include PageRank and several aspects of links such as age and relevancy, the perceived relevance of copy, host-reilability, and innate preferences towards specific sites with perceived authority, such as Wikipedia.

A deeper, longer, more technical dive bySearch Engine Landalso shows that this priority also includes a “NEWS_AGENCY_RATING”, allowing Yandex’ search engine to show preference to certain news organizations.

Others include the number of unique visitors, percentages of organic traffic, and average domain rankings across queries.

Are you a pro? Subscribe to our newsletter

Are you a pro? Subscribe to our newsletter

Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!

However, it’s perhaps melodramatic, or a little desolate, for MacDonald to describe it as “the most interesting thing to have happened in SEO in years.”

While the leaked codebase certainly offers a raft of insights, it’s worth noting that many websites will be looking to rank well onGoogleover Yandex, purely because the former is far better known.

Both companies have shared web engineers over the years, Yandex does use many of Google’s open source technologies, such as TensorFlow and BERT, and references to Google data appear in the leaked codebase.

However, Search Engine Land’s deep dive argues that the Yandex leak can give general insight into the anatomy of a modern search engine, but, per Russian Search News, many of the Yandex’ leaked ranking search factors go unused, or are officially considered depreciated.

Firefox bins Russian search engines over misinformation fears

How to learn search engine optimization with SEO Panel

We’ve also listed the best online marketing services right now

Even the technical deep dive admits many of Google (the search engine’s) known aspects, such as its crawler and index systems, differ from Yandex’.

All of this, combined with the age of the leaked codebase, makes it unclear as to how assumptions over how Yandex and Google may both rank pages will fare.

Luke Hughes holds the role of Staff Writer at TechRadar Pro, producing news, features and deals content across topics ranging from computing to cloud services, cybersecurity, data privacy and business software.

7 myths about email security everyone should stop believing

Best Usenet client of 2024

Philips Hue vs Govee: choose the right smart lights for you