A lot of data can place a lot of load on a server. As I discussed in a post a few weeks ago Data can translate nicely into cash, so if you have a nice good source of data then take it while you can. Free data is great and can be good to feed to pages, but as its free other people are likely to be using it - upping the likelihood of competition, google thinking your replicating and thus lower traffic. Free data not in the public eye is even better. You have users on your site? a lot of them? Then think how to use that - from users logging in and out, registering, chatting - whatever - use everything you can to produce good sources of information.
Anyway in this post I will discuss ways to manipulate, play with and organise data - in large chunks specifically. Lets say if you have a source with 40k lines of detail, with 10-30 fields for each line, and you want to mass update a few fields.
First the format: My skill base covers SQL so I will discuss with this as the basis, MSSQL is great, if you are using that then updates are easy as its presumably all local, good connection speeds and processor/memory control will allow for easy updating and manipulating. In a web situation Mysql is the obvious choice, and always my recommendation. But there are problems that may potentially crop up with using mysql to feed 40,000+ records, edit/update them and generally manipulate that much data usually when hosted by another company, that is running mysql locally is almost better than mssql for data manipulation.
Shared hosting and even quite a lot of virtual servers are heavily restricted in memory, script running times and processor load. There are as ever ways around this.
or…Trickle updating - When you have thousands of lines to update and you really want to get messy with each line e.g. create long manipulations of all the fields and then update them.
Here’s how I do it:
In essence:
Php and mysql can get round most shared hosting hold backs - with a little elbow grease!!!!
If anyone wants help with mass manipulating data, sql I can offer freelance data warehousing, manipulation and general processing - from back end to crystal reports.
If you like this post on mass data then you may be interested in the following: