Nuke Cops :: View topic - Performance

Pole
Nuke Cadet
Joined: Nov 03, 2004
Posts: 2
Posted: Wed Nov 03, 2004 6:37 am

Heya

I have been having dire performance problems since adding rewrite rules to my .htaccess file.

My site gets about 15K-20K hits a day. The load sits between 0.10 and 1.1, fluctuating widely. Before the .htaccess and rewrites it sat at pretty much 0.00 - 0.03.
I need to optimise Apache further. The first step is to eliminate all the .htaccess files, so Apache does not have to look for one in every directory along the path on each request. This alone should drastically improve performance.
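
For reference, a minimal sketch of switching off .htaccess lookups in httpd.conf. The directory path is an assumption; substitute the site's real document root.

```apache
# Assumed document root; replace with the real path.
<Directory "/var/www/html">
    # With AllowOverride None, Apache stops looking for .htaccess files
    # in this directory and everything beneath it.
    AllowOverride None
</Directory>
```

Anything the .htaccess files used to set (rewrite rules, access controls) then has to live in httpd.conf, and changes there only take effect after the server is restarted or reloaded.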


I would love to move them all into my main httpd.conf file.
I can migrate the .htaccess contents into it no problem.
The site links get rewritten, but they don't work: every page comes back as not found.
I guess I need to yank the condition rules out of header.php and put them in httpd.conf as well.

Now how, or what's the syntax for that?
Does anyone have any experience in doing this?
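
One likely cause of the "not found" pages, sketched under assumptions: in an .htaccess file, mod_rewrite strips the directory prefix before matching, so patterns carry no leading slash. At server level in httpd.conf the pattern is matched against the full URL path, which starts with a slash. The rule below is a hypothetical example, not one taken from this thread.

```apache
# .htaccess form (per-directory context), hypothetical rule:
#   RewriteRule ^article([0-9]+)\.html$ modules.php?name=News&file=article&sid=$1 [L]

# Server-level form in httpd.conf:
RewriteEngine On

# Leading slash added to both the pattern and the target.
# PT passes the result on to normal URL-to-filename mapping as a URL path.
RewriteRule ^/article([0-9]+)\.html$ /modules.php?name=News&file=article&sid=$1 [PT,L]
```

If the site runs inside a `<VirtualHost>` block, `RewriteEngine On` and the rules belong inside that block, because rewrite configuration is not inherited by virtual hosts by default. An alternative is to paste the rules unchanged into a `<Directory>` block for the document root, which is matched the same way as .htaccess but is read once at startup.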

Thanks

Imago
Captain
Joined: Jan 17, 2003
Posts: 629
Location: Europe
Posted: Thu Nov 04, 2004 2:45 am

You would do better to tune your robots.txt and allow only a few search engines (SE) to crawl the site. Or sign up for a better machine. My load is steady at around 0.5, with a dozen Amazon shops rewritten and included in the Nuke Encyclopedia.

_________________
www.vdsp.net | www.indopedia.org | www.orientalia.org | www.indology.net | www.yogadarsana.org | www.husserl.info | www.medicum.net

Pole
Nuke Cadet
Joined: Nov 03, 2004
Posts: 2
Posted: Thu Nov 04, 2004 6:47 am

It's a Dell 2U, 2.4 GHz P4 with 512 MB RAM. :(

Anyway I did some performance tuning and my site performs better now.
Yesterday I received over 140K hits. Page creation times dropped to 4 seconds at times.

I managed to migrate the .htaccess file into my main httpd.conf file, which gave a slight performance increase. I also installed turck-mmcache. This has improved performance quite a bit, most notably on the main pages and some others.

Though I have some questions I'm hoping you can answer for me:

1. Does Apache 2 handle rewrites any better?

2. Is it possible to add a RewriteCond for Googlebot/MSN/Yahoo etc. so that all _other_ IPs/users/agents bypass the rewrite rules and browse the site the traditional, non-rewritten way?

Will this break the site in any way? Also, if Google has crawled the links and a user clicks through from Google to my site, will the .html rewrite work for him? And when he browses the site further, will it all work through .php (no rewriting)?
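
Syntactically, point 2 is possible: a RewriteCond can test the User-Agent header. A rough sketch, assuming the rules live at server level in httpd.conf; the crawler name fragments and the example rule are illustrative assumptions, not a tested list.

```apache
RewriteEngine On

# If the client is NOT one of the listed crawlers, stop rewriting here.
# "-" means no substitution; L ends rule processing for this request.
RewriteCond %{HTTP_USER_AGENT} !(Googlebot|msnbot|Slurp) [NC]
RewriteRule .* - [L]

# Only crawlers reach the rules below (hypothetical example rule).
RewriteRule ^/article([0-9]+)\.html$ /modules.php?name=News&file=article&sid=$1 [PT,L]
```

A RewriteCond applies only to the RewriteRule that immediately follows it, which is why the sketch uses one early exit instead of repeating the condition before every rule. With this in place, a visitor who clicks a .html link in Google's results is not rewritten and would most likely get a "not found" page, so the setup does affect click-throughs. The User-Agent header is also trivially spoofed, so this is a filter of convenience, not a reliable way to identify crawlers.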

Wow, that's a lot.
Hope you can help me out here.
Thanks

Imago
Captain
Joined: Jan 17, 2003
Posts: 629
Location: Europe
Posted: Sat Nov 13, 2004 4:34 am

Hello,

I don't know about Apache 2.

Adding any more rules to .htaccess will probably slow things down further. You don't need different rules for SE and visitors: rewritten URLs without visitors are useless. For me, the problem is quite the opposite. I have to deny Google access to all dynamic links because, in the last month, Google started to index indiscriminately all links of the type /gates.html?name=Module, etc., down to the "reply with quote" links on the forum.

That's a waste of indexing quota (if such a quota exists at all).

Since our problem is Google, put the following lines in robots.txt:

User-agent: googlebot
Disallow: /admin.php
Disallow: /modules.php
Disallow: /gate.html
Disallow: /admin/
Disallow: /images/
Disallow: /includes/
Disallow: /themes/
Disallow: /language/
Disallow: /sutra*.html$
Disallow: /ptopic*.html$
Disallow: /ntopic*.html$
Disallow: /ftopic*asc*.html$

or whatever your rewritten pages are called (post, etc.) in place of sutra.

Putting this line
Disallow: /gate.html

should prevent Google from indexing all dynamic pages.

I don't know about Inktomi. It doesn't seem to understand the * wildcard.

* * *

If you want Google and the other SE to index only the static pages while users browse the dynamic pages, the only solution is this:

1. Generate an extensive list of all the short-URL pages you want the SE to crawl and put it online.

2. Remove all IN and OUT rules from header.php.

Then you will get an effect similar to this:

http://us.researchreports.biz/reports/stress+management
http://us.researchreports.biz/reports/stress+management/2
http://us.researchreports.biz/reports/stress+management/3
http://us.researchreports.biz/reports/stress+management/4
http://us.researchreports.biz/reports/stress+management/5
http://us.researchreports.biz/reports/stress+management/6
http://us.researchreports.biz/reports/stress+management/7
http://us.researchreports.biz/reports/stress+management/8
http://us.researchreports.biz/reports/stress+management/9
http://us.researchreports.biz/reports/stress+management/10
http://us.researchreports.biz/reports/stress+management/11
http://us.researchreports.biz/reports/stress+management/12
http://us.researchreports.biz/reports/stress+management/13
http://us.researchreports.biz/reports/stress+management/14
http://us.researchreports.biz/reports/stress+management/15
etc.

GibsonXXI
Private
Joined: Apr 25, 2004
Posts: 48
Location: United Kingdom
Posted: Fri Mar 11, 2005 8:46 am

Imago wrote:

...Entering any additional rules to .htaccess will probably slow down the performance. You don't need different rules for SE and visitors - rewritten URLs without visitors are useless. For me, the problem is quite the contrary. I have to deny from Google all dynamic links, because, in the last month, Google started to index indiscriminately all links of the type /gates.html?name=Module, etc., down to the reply with quote links on the forum...


Beware, this could be a variant of the Santy worm trying to attack your site using Google's search routines.


_________________
"Sic vis pacem para bellum!"
RAF71_Hornet / GibsonXXI