# robots.txt for https://www.ricardo.ch/ User-agent: * # English Pages until release Disallow: /en/ # Selling Form Disallow: /de/list Disallow: /fr/list Disallow: /it/list # 28.04.2016 Disallow: /dataservice/ # CMS Pages Disallow: /pages/ Disallow: /ajax/ # Archived Article Disallow: /viewitem.aspx # Category XML Pages Disallow: /*feed.xml # Legacy and new online shop Disallow: /online-shop/ Disallow: /shop/ # New ratings pages Disallow: /ratings/ # Legacy French Pages Disallow: /pages/*/fr.php # Disallow commercial bots we don't like User-agent: psbot User-agent: Yandex User-agent: PetalBot User-agent: Mail.RU_Bot User-agent: MegaIndex User-agent: Baiduspider User-agent: 360Spider User-agent: Yisouspider User-agent: Bytespider User-agent: Sogou web spider User-agent: Sogou inst spider User-agent: proximic User-agent: ADmantX User-agent: Ahrefs User-agent: Seekport Crawler User-agent: SEMrushBot User-agent: BLEXBot User-agent: MJ12bot User-agent: dotbot Disallow: / # Disallow static resources and API endpoints crawling # User agent names for Google AdsBot can be found here : https://support.google.com/webmasters/answer/1061943?hl=en # Instruction for OnCrawl bot can be found here : http://help.oncrawl.com/en/articles/2767653-oncrawl-crawler-how-does-the-oncrawl-bot-find-and-crawl-pages#:~:text=OnCrawl%20follows%20all%20instructions%20to,will%20apply%20to%20your%20crawl. User-agent: AdsBot-Google-Mobile User-agent: AdsBot-Google User-agent: OnCrawl Disallow: /marketplace-spa/api/ Disallow: /api/mfa/ Disallow: /api/browser-statistics/ Disallow: /assets/search/