Seznam metaoznak

🏷️

equiv: content-type content: text/html;charset=utf-8
name: generator content: Geany 1.36

Whois informacije

📄

domain_name: cyjon.net

update_date: 2018-07-22T20:48:05Z

update_time: 1532292485

creation_date: 2003-08-18T14:14:39Z

creation_time: 1061216079

expiration_date: 2027-08-18T14:14:39Z

Surovi podatki Whois

📋

            Domain Name: CYJON.NET
Registrar: NameCheap, Inc.
Domain Status: client transfer prohibited
Creation Date: 2003-08-18T14:14:39Z
Registry Expiry Date: 2027-08-18T14:14:39Z
Updated Date: 2018-07-22T20:48:05Z
Name Server: NS1.ASMALLORANGE.COM
Name Server: NS2.ASMALLORANGE.COM
REGISTRAR Contact: NameCheap, Inc.
>>> Last update of RDAP database: 2026-01-09T17:53:25Z

Robots.txt

🤖

			#
# robots.txt
#
# This file is to prevent the crawling and indexing of certain parts
# of your site by web crawlers and spiders run by sites like Yahoo!
# and Google. By telling these "robots" where not to go on your site,
# you save bandwidth and server resources.
#
# This file will be ignored unless it is at the root of your host:
# Used:    http://example.com/robots.txt
# Ignored: http://example.com/site/robots.txt
#
# For more information about the robots.txt standard, see:
# http://www.robotstxt.org/wc/robots.html
#
# For syntax checking, see:
# http://www.sxw.org.uk/computing/robots/check.html

User-agent: *
Crawl-delay: 10
# Directories
Disallow: /includes/
Disallow: /misc/
Disallow: /modules/
Disallow: /profiles/
Disallow: /scripts/
Disallow: /themes/
# Files
Disallow: /CHANGELOG.txt
Disallow: /cron.php
Disallow: /INSTALL.mysql.txt
Disallow: /INSTALL.pgsql.txt
Disallow: /install.php
Disallow: /INSTALL.txt
Disallow: /LICENSE.txt
Disallow: /MAINTAINERS.txt
Disallow: /update.php
Disallow: /UPGRADE.txt
Disallow: /xmlrpc.php
# Paths (clean URLs)
Disallow: /admin/
Disallow: /comment/reply/
Disallow: /filter/tips/
Disallow: /logout/
Disallow: /node/add/
Disallow: /search/
Disallow: /user/register/
Disallow: /user/password/
Disallow: /user/login/
# Paths (no clean URLs)
Disallow: /?q=admin/
Disallow: /?q=comment/reply/
Disallow: /?q=filter/tips/
Disallow: /?q=logout/
Disallow: /?q=node/add/
Disallow: /?q=search/
Disallow: /?q=user/password/
Disallow: /?q=user/register/
Disallow: /?q=user/login/

############
# End of Drupal-generated robots.txt
############

############
# This list is taken from parts of http://www.pflanzenfreunde.com/robots.txt
##############

# Some bots are known to be trouble, particularly those designed to copy
# entire sites. Please obey robots.txt.
User-agent: sitecheck.internetseer.com
Disallow: /

User-agent: Zealbot
Disallow: /

User-agent: MSIECrawler
Disallow: /

User-agent: SiteSnagger
Disallow: /

User-agent: WebStripper
Disallow: /

User-agent: WebCopier
Disallow: /

User-agent: Fetch
Disallow: /

User-agent: Offline Explorer
Disallow: /

User-agent: Teleport
Disallow: /

User-agent: TeleportPro
Disallow: /

User-agent: WebZIP
Disallow: /

User-agent: linko
Disallow: /

User-agent: HTTrack
Disallow: /

User-agent: Microsoft.URL.Control
Disallow: /

User-agent: Xenu
Disallow: /

User-agent: larbin
Disallow: /

User-agent: libwww
Disallow: /

User-agent: ZyBORG
Disallow: /

User-agent: Download Ninja
Disallow: /

#
# Sorry, wget in its recursive mode is a frequent problem.
# Please read the man page and use it properly; there is a
# --wait option you can use to set the delay between hits,
# for instance.
#
User-agent: wget
Disallow: /

#
# The 'grub' distributed client has been *very* poorly behaved.
#
User-agent: grub-client
Disallow: /

#
# Doesn't follow robots.txt anyway, but...
#
User-agent: k2spider
Disallow: /

#
# Hits many times per second, not acceptable
# http://www.nameprotect.com/botinfo.html
User-agent: NPBot
Disallow: /

# A capture bot, downloads gazillions of pages with no public benefit
# http://www.webreaper.net/
User-agent: WebReaper
Disallow: /

############
# End of stuff from http://www.pflanzenfreunde.com/robots.txt
############

##########
# From here below is the stuff I created (cyjon.net)
###########

#"Mozilla/5.0 (compatible; SEOkicks-Robot +http://www.seokicks.de/robot.html)"
# This one just felt fishy to me
User-agent: SEOkicks
Disallow: /
User-agent: SEOkicks-Robot
Disallow: /

#Sistrix crawler, pulls WAY too many pages WAY too fast. Ignores robots.txt
User-agent:SISTRIX
Disallow: /

# Semi-legitimate bot, but the company CEO is a jackass on bot protection forums
User-agent: MJ12bot
Disallow: /

Iskalne fraze - Google

🔍

Položaj	Fraza	Stran	Izrezek
8	2 better eating	/node/324	Smarter Food Serving 6 мая 2012 г. — DESCRIPTION: Sims will be more logical when deciding how many servings to lay out on the table. Updated to fix a problem when serving ...

Dodatne storitve

💎

cyjon.net

cyjon.net Posnetek zaslona spletnega mesta

Seznam metaoznak

Whois informacije

Surovi podatki Whois

Robots.txt

Položaji v Google

Iskalne fraze - Google

Dodatne storitve

Платеж успешен!

Оплата за услугу

Plačilo

Войти в аккаунт

Зарегистрироваться через Telegram

Как это работает:

Код подтверждения

Завершите регистрацию в Telegram

Подтверждение Email