Thank you for the answers. I see a light in the dark.
That the robots.txt is for directing search engines is easy for me to understand....
O.K. for google you may say disallow for the cgi-bin or other directories.
But
DF6IH writes that he is running his sites without robots.txt.
That means that the spiders are spidering every folder. Thats right? Or even not because they have no order to crawl anything.
And if we use this for phpwcms for example, the spiders are crawling every folder (allow *), but there are just the php-files and no html-like content.
So what's the intention? Are the spiders "seeing" the php website like we do? Means the spiders are seeing just the content?
For example - there is no use for spiders to crawling the FCKEditor subfolders...
And what means the words in the robot.txt files ?
DF6IH writes:
Kompetenz in Präzisionsgewindespindeln, Fein- und Trapezgewindespindeln, gewindeschleifen,
Are these some keywords of the content? So maybe i should bring them in my r*.txt file, too.
my robots.txt file looks like:
Code: Select all
User-agent:*
Disallow: /cgi-bin/
Disallow: /logs/
Disallow: /config/
Disallow: /include/
Disallow: /img/
Disallow: /phpwcms_ftp/
Disallow: /picture/
Disallow: /phpwcms_code_snippets/
If I understand you, i should change it to allow all folders but cgi-bin, config..
Thank you for teaching me. But you see, if i ask a question, there are many more coming up after i read your answers.
The site
http://www.blahblahblah.de you told before is an internet site from a moderator.
Anyway, I wish yo all a happy christmas...

[/code][/quote]