The Cache: Technology Expert's Forum
 
*
Welcome, Guest. Please login or register.
Did you miss your activation email?
May 21, 2013, 08:59:17 PM

Login with username, password and session length


Pages: [1]
  Print  
Author Topic: getting my deep pages spidered?  (Read 955 times)
svakanda
Expert
****
Offline Offline

Posts: 131



View Profile
« on: August 16, 2009, 02:28:28 PM »

I am trying to entice google into indexing my NEW pages...I submitted a sitemap with 107 of them, and it's been a week now, and only three are indexed.  How do you guys go about getting basic backlinks for indexing?  I used to use joseph tierney's bloggergenerator, and i must say, it worked bloody awesome.  I have bloggers from nearly a year ago that are still up.  But the program seems to have died since then.

Anyone have suggestions for me?  I need this site up and searchable, it's my Real Business.  thanks !
Logged

a ship is safe in the harbor, but that's not what it's for.
vsloathe
vim ftw!
Global Moderator
Lifer
*****
Offline Offline

Posts: 1669



View Profile
« Reply #1 on: August 17, 2009, 08:40:34 AM »

Take a text file and populate it full of your new page links and their anchor text, one per line like:

Quote

Then create a little module that you're going to place all over your site. Something like this (coded in this window so my apologies if there are errors):

Code:
<?php
$arrData 
file('myLinks.txt');
shuffle($arrData);
foreach(
$arrData as $line){
   
$arrParts explode(','$line);
   echo 
'<a href="'.$arrParts[0].'">'.$arrParts[1].'</a>';
}

If you want to get fancier and add more factors into the equation, the sky's the limit pretty much. You just need to link from these new pages from every page on your site so that they get indexed, and make sure that you remove links very slowly.
Logged

hai
Keebler
Rookie
**
Offline Offline

Posts: 18


View Profile
« Reply #2 on: August 17, 2009, 01:37:37 PM »

Take a text file and populate it full of your new page links and their anchor text, one per line like:

Quote

Then create a little module that you're going to place all over your site. Something like this (coded in this window so my apologies if there are errors):

Code:
<?php
$arrData 
file('myLinks.txt');
shuffle($arrData);
foreach(
$arrData as $line){
   
$arrParts explode(','$line);
   echo 
'<a href="'.$arrParts[0].'">'.$arrParts[1].'</a>';
}

If you want to get fancier and add more factors into the equation, the sky's the limit pretty much. You just need to link from these new pages from every page on your site so that they get indexed, and make sure that you remove links very slowly.

I am putting together a site that is going to have upwards of 10 million pages. Would the same thing work?
Logged

No links in signatures please
vsloathe
vim ftw!
Global Moderator
Lifer
*****
Offline Offline

Posts: 1669



View Profile
« Reply #3 on: August 17, 2009, 02:21:40 PM »

Sure, but I'd use a database instead of a flat file in that case.

OH! One important thing to note:

You will want to cache the set of links that you show on a particular page. You can do a shuffle() on them so that it's always 10, 20, 50 random links (whatever # you choose) per page, but you will want to cache that set for that page so that when a spider returns, it doesn't see a completely different set. That would be BAD.
Logged

hai
Keebler
Rookie
**
Offline Offline

Posts: 18


View Profile
« Reply #4 on: August 17, 2009, 05:08:16 PM »

Hmm. I guess ill have to generate the links when I make the pages. I had it set up so that I had a footer.php included in each page. But I wouldnt know how to make it remember what to link to with caching.

So ill have to generate 5 random links each time and save them in the php.
Logged

No links in signatures please
Keebler
Rookie
**
Offline Offline

Posts: 18


View Profile
« Reply #5 on: November 08, 2009, 11:28:58 PM »

Sure, but I'd use a database instead of a flat file in that case.

OH! One important thing to note:

You will want to cache the set of links that you show on a particular page. You can do a shuffle() on them so that it's always 10, 20, 50 random links (whatever # you choose) per page, but you will want to cache that set for that page so that when a spider returns, it doesn't see a completely different set. That would be BAD.

I still need help on this.

How would I cache it?. and what do you mean by using a database
Logged

No links in signatures please
vsloathe
vim ftw!
Global Moderator
Lifer
*****
Offline Offline

Posts: 1669



View Profile
« Reply #6 on: November 09, 2009, 07:22:23 AM »

Just save the set of links for each page to a file or whatever.
Logged

hai
perkiset
Olde World Hacker
Administrator
Lifer
*****
Offline Offline

Posts: 10009



View Profile
« Reply #7 on: November 09, 2009, 09:20:36 AM »

Essentially, you want the entire page image stored in either RAM (preferable) or your database. This then makes presentation of the page very fast, because rather than re-interpreting all the PHP to create the page, it is simply grabbed as a blob of text and spit back out to the user.

It is easiest to do this if you have a single "main.php" (this is what I do) - where all requests come through and the you create a page and send it back. If you don't, it's just a little more elbowgreasy.

In either case, you'll want to store the page image under the name of the current URL. Then, at the front end of processing, ask the cache (RAM or DB) if there is an image already stored. If so, grab it, echo it and quit. If you do not use a main.php methodology, then you'll probably want to include your caching code in the front of every page. The included code would look something like this pseudocode:

if ($buff = $db->singleAnswer("select pagebuff from cachetable where url='{$_SERVER['REQUEST_URI']}'"))
{
   echo $buff
   exit;
}
// since there's no cached version, create the page here...

// now here, push the cached version to the database...
$db->query("replace into cachetable values(url, pagebuff) values('{$_SERVER['REQUEST_URI']}', '$theCreatedPageBuff')");

With this kind of system, you'd also want a cache expiration mechanism, perhaps based on time or usage. There's a bit of work here, but it's an excellent exercise.
Logged

It is now believed, that after having lived in one compound with 3 wives and never leaving the house for 5 years, Bin Laden called the U.S. Navy Seals himself.
Pages: [1]
  Print  
 
Jump to:  

Perkiset's Place Home   Best of The Cache   phpMyIDE: MySQL Stored Procedures, Functions & Triggers
Politics @ Perkiset's   Pinkhat's Perspective   
cache
mart
coder
programmers
ajax
php
javascript
Powered by MySQL Powered by PHP Powered by SMF 1.1.2 | SMF © 2006-2007, Simple Machines LLC
Seo4Smf v0.2 © Webmaster's Talks


Valid XHTML 1.0! Valid CSS!