Google Clone Script – A Practical Guide

Many of us have tried to build a search engine using search engine scripts or Google clone scripts. This article intends to give you useful information on working with a Google clone script. To build a Google clone, you first need to choose the Google clone script that best fits your needs. You also need a good search engine theme. Your choice of hosting company can also be very important, depending on the Google clone script you have chosen. Below I will try to give you a practical idea of the various components needed to launch your own search engine service online.

Choosing the Right Type of Google Clone Script

Google clone scripts are of two kinds:

Meta Search Engine Scripts

A meta search engine script uses the result databases of third-party search engines and presents the results in a friendly way. It does not perform the far more complicated task of crawling and searching the world wide web itself.

Scripts like Inout Search Engine and K Search are popular meta search engine scripts. Inout Search Engine works with legal API keys from the major search engines, which enables you to run a completely legal meta search engine service.
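To make the meta search idea concrete, here is a minimal Python sketch of the merging step only. It assumes each third-party engine has already returned a ranked list of results. The `merge_results` function, the dictionary layout and the sample data are all hypothetical and are not taken from any of the scripts named above.

```python
from itertools import zip_longest

def merge_results(result_lists):
    """Interleave ranked result lists from several engines, dropping duplicate URLs."""
    merged, seen = [], set()
    # zip_longest walks rank 1 of every engine, then rank 2, and so on.
    for tier in zip_longest(*result_lists):
        for hit in tier:
            if hit is None:  # this engine returned fewer results than the others
                continue
            # Simplification: treat URLs that differ only by a trailing slash as equal.
            key = hit["url"].rstrip("/")
            if key not in seen:
                seen.add(key)
                merged.append(hit)
    return merged

# Made-up sample data standing in for two engines' API responses.
engine_a = [{"url": "https://example.com/", "title": "Example"},
            {"url": "https://example.org/a", "title": "A"}]
engine_b = [{"url": "https://example.com", "title": "Example (duplicate)"},
            {"url": "https://example.net/b", "title": "B"}]

if __name__ == "__main__":
    for hit in merge_results([engine_a, engine_b]):
        print(hit["url"])
```

A real meta search script would also have to call each provider's API within its terms of use and decide how to weigh one engine's ranking against another's. Round-robin interleaving is only one simple choice.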

Search Engine Scripts with Integrated Crawler/Bots

The other type of search engine/Google clone script is designed to work more like the big search engines: it includes the crawling/search bot logic along with the result display logic. These kinds of search engines are powerful, but expensive to run. You also need some understanding of the underlying architecture.

Sphider and Inout Spider are search engines of this nature. Sphider is an older system that works on PHP and MySQL, whereas Inout Spider is built on a powerful search engine architecture that can support distributed computing and distributed data handling. Inout Spider is widely regarded as the best complete search engine script currently available that follows the architecture of Google/Bing.

You may also want to look at another article I have written on this subject. Google Clone – Technology and Architecture Guide gives you a detailed idea of how to build a reliable search engine service.
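The "result display" half of a crawler-based engine usually rests on an inverted index: a map from each word to the pages that contain it. The sketch below is a toy, in-memory illustration in Python and is not how Sphider or Inout Spider are implemented. The function names and the sample pages are made up for this example.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Split text into lowercase alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(pages):
    """Map each word to the set of page URLs that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in tokenize(text):
            index[word].add(url)
    return index

def search(index, query):
    """Return the URLs containing every query word (AND semantics), sorted by URL."""
    words = tokenize(query)
    if not words:
        return []
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return sorted(hits)

# Made-up pages standing in for what a crawler would have stored.
pages = {
    "http://a.example/": "Search engines crawl the web",
    "http://b.example/": "A crawler feeds the search index",
}

if __name__ == "__main__":
    index = build_index(pages)
    print(search(index, "the search"))
```

A production engine would add ranking, stemming and persistent storage on top of this; the sketch only shows why a lookup can be fast once the crawl has been indexed.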

Choosing the Right Theme for your Google Clone Script

Just buying the software may not be enough! If you do not want to use the default look of the software, you may order a theme/template, or you can have your own designer design your search engine script.

Deploying on the Right Server

Depending on which type of search engine/Google clone script you have chosen, you will need to buy the right hosting. For example, most meta search engines like K Search and Inout Search Engine will work in a standard shared hosting environment, while crawler engines like Inout Spider may require much more powerful machines.

Software like Inout Spider is designed to work in a much more powerful distributed environment, and it is always recommended that you choose a hosting company that can provide you with additional nodes in a single network in the future if necessary. This helps you scale the data storage and computation capacity of your search engine as you need it.

 

Disclaimer
By Google clone, I do not mean an exact Google clone. The term Google is used as a synonym for ‘search engine’. This article is intended to help you create a standard search engine like Google, Bing, Yahoo, Baidu etc.






Inoutscripts Nulled – Why You Must Not Use a Nulled Script

Quite often, many of us tend to use nulled copies of software, web scripts and desktop applications. Though all proprietary software companies act against piracy, nulled software is quite easy to find all over the internet. While some web script companies like InoutScripts, Agriya and vBulletin act strongly against piracy, others do not fight it as aggressively. Inoutscripts nulled and vBulletin nulled software are searched for quite often. Though some may find a partially working (fully or partially unencrypted) copy online, there are some serious factors you need to consider before uploading or installing these so-called nulled scripts on your web server. Below I would like to point out some reasons why it is very risky to use vBulletin or Inoutscripts nulled scripts on your server.

1) Nobody Has Ever Run a Successful Online Business On a Nulled Script

Seriously. Ask yourself whether you personally know anybody running a successful online business on a nulled script. You might know many people who successfully run pirated desktop applications or even operating systems, but you are not likely to meet anybody who runs an online business successfully with a nulled/pirated script. The reason is that the internet is too public, and you can be caught as soon as your website is up. If you ever use a pirated/nulled copy of a script, the owner company, your competitors or even your users are going to figure that out sooner or later. Once it is known to the public, that is really the END of your business, for the following reasons.

  • Your users do not want to be associated with an illegal website.
  • Your users will not trust your financial stability if they know that you are using a nulled web script.
  • People hate to be associated with stolen items in general.
  • People do not want to risk their personal information, credit card details, email details etc. by working with a completely insecure company.
  • People do not want to run into legal complications if the script owner files a complaint about the company and its users.

None of us wants to start a business which is short lived. If you are running the business on a nulled script, the very foundation of your business is so weak that it probably cannot survive a month online.

2) You Will be Easily Caught By the Script Owner Company

If you are running a vBulletin or Inoutscripts nulled script, you could be easily tracked by the owner companies. You cannot just assume that you will not be caught because you are invisible among the millions of websites on the world wide web. In fact, it is very easy for most owner companies, as the script may contain specific public codes or private server pings which make it very easy for them to identify illegal use.

3) People (Also Your Competitors) Will Report Your Use

Your users and your competitors will usually report your unauthorised use to the script company if they ever suspect you are not using a legal edition. Your competitors may discuss or refer to your website for various reasons (requesting a feature, a design, a template customisation etc.) with the owner company, which makes it easy for the owner company to identify your illegal use. Any use of vBulletin/Inoutscripts nulled scripts is likely to be reported, as those companies are always open to such reports in order to fight piracy.

4) Inoutscripts Nulled Scripts and vBulletin Nulled Scripts Will be Stopped by Your Server Company

Once illegal use is identified, Inoutscripts or vBulletin can easily report it to your hosting company with sufficient proof. The owner company will hold your hosting company legally liable until it turns off your services. Of course you can try to host the script on another server, but usually these companies will chase you down until you stop completely, or even take legal action against you.

5) A Copyright Infringement Complaint Will be Filed Against You With Your Domain Registrar

Script owner companies, with ample proof, will file copyright infringement complaints against you with your domain registrar. The domain registrar is legally required to look into the complaint. Once the evidence is verified, this can even lead to your services being turned off and your domain name being banned permanently.

6) Inoutscripts/vBulletin May Directly Sue You, Asking Heavy Compensation

Being sued when you have no legal or ethical ground to stand on doesn’t sound too good. It can be really hard on you and can badly damage your personal reputation as well as your business reputation. It can take away all your money too if you are not lucky. Once a suit is filed, Inout Scripts and vBulletin are likely, in almost all cases, to continue with the case and the trial until they win and get enough compensation from you.

The worst part is that they may take legal action just when you are about to start monetizing your service or receiving traffic on your site.

7) Lesser/Compromised Security. You are Always at Risk.

Most owner companies test their scripts against many vulnerabilities and include their premium security modules in their protected code to protect their genuine customers from hacking, unauthorised access, SQL injection etc. A partially or fully nulled script could lose these security modules (for example, the code that prevents SQL injection) during the illegal decryption/decoding phase, which can cause severe damage to your server. The missing security modules may remove your website’s immunity against hacker or virus attacks. For this reason alone, nulled/decoded software is highly risky to use on a web server.
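To show what "code that prevents SQL injection" typically means, here is a minimal Python sketch using the standard `sqlite3` module. It is an illustration only, not the security module of any vendor named here. The `users` table, the function names and the payload are made up for this example.

```python
import sqlite3

def find_user(conn, username):
    # Safe: the ? placeholder passes the value separately from the SQL text,
    # so quotes inside `username` cannot change the structure of the query.
    cur = conn.execute("SELECT id, name FROM users WHERE name = ?", (username,))
    return cur.fetchall()

def find_user_unsafe(conn, username):
    # Vulnerable: the input is pasted straight into the SQL string.
    query = f"SELECT id, name FROM users WHERE name = '{username}'"
    return conn.execute(query).fetchall()

# Throwaway in-memory database with two made-up users.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany("INSERT INTO users (name) VALUES (?)", [("alice",), ("bob",)])

# A classic injection payload: it turns the WHERE clause into "always true".
payload = "x' OR '1'='1"

if __name__ == "__main__":
    print("parameterized:", find_user(conn, payload))
    print("string-built: ", find_user_unsafe(conn, payload))
```

If a nulling process strips or breaks the layer that does this kind of parameter handling, every form on the site can become an entry point like `find_user_unsafe`.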

8) You Cannot Really Trust the Source Where You Found the Nulled Script!

Where did you get the nulled script? Do you really trust the source you got it from? Did you get it from a person you found on a social media site like Facebook? Or did you get it from a forum link that discusses nulled software? Decrypting software, stealing the package and using software without authorisation are serious criminal acts. Nulled scripts and packages usually contain some kind of hack code, either written within the script or kept as a separate file placed in the package with a friendly file name like system.php, config.php, templates.php etc. Such files could contain hack code which you do not understand (it is sometimes encrypted) and which will act as a gateway to your live server. A hacker may access your site through these trap doors and can take all your data, manipulate the login details and steal credit card information, not just from the nulled script, but also from other websites and databases connected to your web server. So you should only use a script that you downloaded/purchased from a trusted source: either directly from the owner company or from an authorised reseller.
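One practical way to confirm that a package really is the file the vendor shipped is to compare its checksum with one the vendor publishes. The Python sketch below assumes the vendor provides a SHA-256 checksum for the download, which is an assumption and not something every script company does. The function names are made up for this example.

```python
import hashlib

def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA-256 digest of a file, read in chunks to keep memory low."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def matches_published_checksum(path, published_hex):
    """True if the file's digest equals the checksum published by the vendor (assumed)."""
    return sha256_of_file(path) == published_hex.strip().lower()
```

A mismatch means the archive is not the file the vendor released. A match only proves the file is unmodified, so it is useful only when the checksum itself comes from the vendor's own site.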

Conclusion

Trying to run a website on a nulled script can be highly destructive and dangerous. It may take away your reputation and/or your money. Using a script found on a forum, on social media or from an untrusted source may also damage your server. So it is highly recommended that you do not upload/deploy a nulled script on a live web server and that you do not attempt to run a website on a nulled script.

 

 

 


Google Clone – Technology and Architecture Guide

Have you ever thought about building a fully featured search engine that works like Google or Bing? Google has emerged as one of the biggest companies on the Internet within a very short span of time. All internet entrepreneurs have probably been amazed by the success of Google as a company. Thinking about the technology, how does Google work so fast and so powerfully? How does Google manage fault tolerance? Where does Google store the data of billions of web pages? Can you create a search engine like Google? If so, how?

Well, if you are thinking about building a search engine like Google, you need to know about various aspects. First of all, building a search engine like Google cannot be done overnight. It takes months or even years to crawl almost the entire web, to store all the data and to rank the results. But usually you should be able to start producing search results within a couple of weeks.

Where do you store the data? Where does Google store the data? Google has a unique NoSQL database called Bigtable where it stores the entire search data. Bigtable runs on top of the Google File System (GFS), Google's own highly reliable distributed file system. This file system supports distributed computing across thousands of nodes attached to the network.
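Bigtable's data model is described by Google as a sparse, sorted map indexed by row key, column key and timestamp. The toy Python class below mimics only that lookup shape in memory, to show why sorted row keys make range scans cheap. The class name `TinyTable` and its methods are made up for this example, and nothing here reflects how Bigtable, HBase or Hypertable are actually implemented.

```python
import bisect

class TinyTable:
    """In-memory toy of a (row, column, timestamp) -> value sorted map."""

    def __init__(self):
        # Keys are kept sorted; the timestamp is negated so newer versions sort first.
        self._keys = []
        self._values = {}

    def put(self, row, column, timestamp, value):
        key = (row, column, -timestamp)
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def get_latest(self, row, column):
        """Return the newest value stored for (row, column), or None."""
        i = bisect.bisect_left(self._keys, (row, column, float("-inf")))
        if i < len(self._keys) and self._keys[i][:2] == (row, column):
            return self._values[self._keys[i]]
        return None

    def scan_rows(self, start_row, end_row):
        """Yield (row, column, timestamp, value) for start_row <= row < end_row."""
        i = bisect.bisect_left(self._keys, (start_row,))
        while i < len(self._keys) and self._keys[i][0] < end_row:
            row, column, neg_ts = self._keys[i]
            yield row, column, -neg_ts, self._values[self._keys[i]]
            i += 1

if __name__ == "__main__":
    table = TinyTable()
    # Reversed host names keep pages of one domain next to each other in sort order.
    table.put("com.example/index", "contents:html", 1, "<html>v1</html>")
    table.put("com.example/index", "contents:html", 2, "<html>v2</html>")
    table.put("org.example/home", "contents:html", 1, "<html>other</html>")
    print(table.get_latest("com.example/index", "contents:html"))
    print(list(table.scan_rows("com.", "com/")))
```

In a real distributed store, contiguous ranges of this sorted key space are split across many machines, which is what lets the table grow by adding nodes.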

What Technology should I use?

You cannot run Google on MySQL. Period. Not even on Oracle, if you are looking for a global-scale service. You need something similar to Bigtable, running on a distributed file system like Google's GFS (Google File System). But GFS and Bigtable are Google's internal technologies and are not open source.

Hadoop: Hadoop is an open-source framework whose distributed file system, HDFS (the Hadoop Distributed File System), works very similarly to Google's own file system, GFS, and it is widely regarded as the BEST distributed file system available now. Hadoop is open source and is continuously researched and developed under Apache! Hadoop is the best file system you can use to run highly scalable, multi-machine applications like search engines, analytics etc. Hadoop helps you connect thousands of nodes together to work as an expandable file system.
http://hadoop.apache.org/ 

HBase: HBase is a NoSQL (Not Only SQL) database that runs on top of Hadoop to store petabytes of data. It is written in Java and is regarded as a reliable database. Like Hadoop, HBase is maintained by Apache!
http://hbase.apache.org/

Hypertable: Hypertable is another NoSQL database which works on Hadoop. It is written in C++, and the Hypertable company claims that its performance is much faster than HBase's. Hypertable support is also very good, and it offers more flexibility in queries compared with HBase.
http://hypertable.com/

So for running a Google clone, you should use either Hadoop + HBase or Hadoop + Hypertable.

What Hardware Should I use?

Of course I understand that you don’t want to start with your own datacenter. Google has its own ever-expanding datacenters around the world. The ideal way to start is to tie up with a datacenter or hosting company that can provide a series of nodes (computers) in a single network. The key reason you need nodes in a single network is that, as you add more nodes to a scalable distributed system in the future, keeping the nodes in the same physical network can significantly improve the performance of your search engine.
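One reason the network between nodes matters is that work and data have to be divided among them. The Python sketch below shows one simple, hypothetical way to assign URLs to nodes by hashing the host name, so that all pages of a site land on the same machine. The function name and the node names are made up, and this is not a description of how Hadoop, HBase or Hypertable place data.

```python
import hashlib
from urllib.parse import urlsplit

def node_for_url(url, nodes):
    """Pick a node for a URL from a stable hash of its host name."""
    host = urlsplit(url).hostname or ""
    digest = hashlib.sha256(host.encode("utf-8")).digest()
    # Use the first 8 bytes as an integer and map it onto the node list.
    return nodes[int.from_bytes(digest[:8], "big") % len(nodes)]

if __name__ == "__main__":
    nodes = ["node-1", "node-2", "node-3"]  # hypothetical machine names
    for url in ["http://example.com/a", "http://example.com/b", "http://example.org/"]:
        print(url, "->", node_for_url(url, nodes))
```

Plain modulo hashing reshuffles most assignments whenever the node count changes, so systems that grow often use consistent hashing or range splitting instead. The sketch only illustrates the basic idea.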

How Can I Code a Google Clone Application?

Here comes the trickiest and most interesting part of your journey to build a Google clone search engine. Even if you choose the right technology and the right infrastructure, your spider won’t be effective enough if the code is not powerful and designed for scalability. I am not able to cover here all the components of the software logic and the algorithms needed to build a spider. Anyway, the diagram below, found on the Inout Spider site, will give you a really good idea of the major components required to build a spider. Inout Spider is a commercial application (widely regarded as a powerful search engine data spider application and a standard Google clone script) which works on Hadoop and Hypertable technologies. So if you cannot code it yourself, I recommend you consider Inout Spider.

 

Google Clone Algorithm of Inout Spider

(Source: http://www.inoutscripts.com/products/inout_spider/)
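As a rough illustration of the crawling side, the Python sketch below runs a breadth-first crawl over a made-up, in-memory link graph instead of the real web. It is a generic textbook crawl loop, not the Inout Spider algorithm. The `crawl` function, the `fetch_links` callback and the `toy_web` data are all hypothetical.

```python
from collections import deque

def crawl(seed, fetch_links, max_pages=100):
    """Breadth-first crawl; returns URLs in the order they were visited."""
    frontier = deque([seed])   # URLs waiting to be fetched
    seen = {seed}              # URLs already queued, to avoid fetching twice
    visited = []
    while frontier and len(visited) < max_pages:
        url = frontier.popleft()
        visited.append(url)
        for link in fetch_links(url):
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return visited

# Stand-in for the web: each page maps to the links found on it.
toy_web = {
    "http://a.example/": ["http://b.example/", "http://c.example/"],
    "http://b.example/": ["http://c.example/", "http://a.example/"],
    "http://c.example/": ["http://d.example/"],
    "http://d.example/": [],
}

def toy_fetch_links(url):
    """Pretend to download a page and extract its links."""
    return toy_web.get(url, [])

if __name__ == "__main__":
    print(crawl("http://a.example/", toy_fetch_links))
```

A real spider replaces `toy_fetch_links` with an HTTP fetcher and HTML parser, respects robots.txt and rate limits, and spreads the frontier across many machines, which is where the distributed storage discussed earlier comes in.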

Summary

Building a search engine like Google is never an easy task, or else we would have seen many more Google clones online. But with the right technology, hardware and software (your own, or commercial applications like Inout Spider), your dream is achievable.

 

Disclaimer

By Google clone, I do not mean an exact Google clone. The term Google is used as a synonym for ‘search engine’. This article is intended to help you create a standard search engine like Google, Bing, Yahoo, Baidu etc.

 
