Archive for October, 2006



Merging galaxies … Wow !!!

www.gif

This Hubble image of the Antennae galaxies the sharpest yet of this merging pair of galaxies. As the two galaxies smash together, billions of stars are born, mostly in groups and clusters of stars. The brightest and most compact of these are called super star clusters. (NASA, ESA/Hubble, and B. Whitmore - Space Telescope Science Institute/Handout/Reuters)

Reference : http://news.yahoo.com/photo/061018/photos_sc/

Spring-JSF integration

Spring is a powerful framework for building enterprise Java applications. JSF is a standards-based technology that can simplify Web development. It is possible to combine the two with surprisingly little effort, allowing developers to take advantage of the best that both Spring and JSF have to offer. Check this useful article By Michael Klaene which demonstrates how to utilize JSF and Spring to build an application. Which also contains a very brief overview of both JSF and Spring .

Spring to Java Server Faces By Michael Klaene

Technorati tags: Spring Framework, JSF 

An Indian Google

Guruji
Guruji.com is the first crawler based search engine for India and India related content. Two Indian Institute of Technology (IIT) Delhi graduates have returned from the Silicon Valley to launch a home-grown search engine with loads of Indian content.Its proprietary algorithm automatically identifies India related content on the web and organizes it in such a way that the users get the most relevant results fast.

Gaurav Mishra, co-founder and coo, guruji.com explains, “90% of Internet search queries are local in nature, and guruji.com will deliver better search results than any other search engine in these instances. For example, if a user types a search “Pizza in Koramangala, Bangalore ” or “Chinese restaurant Juhu, Mumbai” the user will be able to see local business listings as well as articles, reviews, blogs, or any other web references.”

Technorati tags: Google, Guruji 

How Google works?

I think its very interesting to learn, how Google creates the index and the database of the documents. The following are some of the basic steps of this process…

1. Google creates its own version of the Internet, using automated programmes called “Googlebots“, which crawl the web in search of new information. Web sites known to be important and frequently modified are scanned every few minute; sites less frequently updated may be scanned every few weeks.

2. Googlebots feed key information from a Web page to Google’s central network: URL, full text of the page, references to images and other embedded files and specific information the site owner creates about the page, called metadata

3. At central network the information is indexed; every word that could be used in a search query is listed along with information referencing Web sites where the word can be found.

4. The index is broken into “shards” and send to the data centers of the servers wired together- around the world; because centers may have slightly different versions of the index, depending on when they received the last update, users in different places may get slightly different results for the same search.

Searching and ranking

When the people search Google, they are asking the company to find every instance of the term in its index and rank the corresponding documents by their relevance.

1. The user types a search query; the typical query is two or three words which can make finding the most relevant results challenging; roughly one in 10 queries is misspelled

2. Before Google provides any information, it identifies the searcher’s location through his or her Internet Protocol (IP) address. The IP helps speed up the search by sending the request to the nearest data center and allows the Google to identify geagraphically appropriate ads.

3. The query is sent to the central network then redirected to the nearest data center.

4. At the data center, the search item is run through the index; matching terms are sent back to the central network, then to the user with a summary of the webpage, called a “snippet”.

The “SECRET SAUCE”

Google determines which web sites are more relevant to a search item by using its “secret sauce”, a formula that weights more than 200 measurements, such as the number of times the search item appears on a web page, the number of visitors to the page and the Page Rank- the number of sites linking to the page and the popularity of those sites.

Technorati tags: Google

Google Search Tips: Part 2

* The asterisk is a search wildcard. For example, searching for three*mice finds three blind mice, three button mice, etc.* Google search currently has a hard limit of 32 words - that’s keywords and special syntax combined. Search terms after the first 32 words are ignored.

* Google’s Boolean default is AND, which means that, if you enter query words without modifiers, Google will search for all of your query words.

* The Google synonym operator, the ~ (tilde) character, placed in front of any number of keywords in your query, asks Google to include not only exact matches, but also what it thinks are synonyms for each of the keywords. For example, search for ~legal, you will get results for lawyer, attorney, law, etc.

* Google is case insensitive. If you search for Three, three, THREE, even ThrEE, you get the same results.

* Numrange searches for results containing numbers in a given range. Just add two numbers, separated by two periods, with no spaces, into the search box along with your search terms. For example, If you’re looking to spend $800 to $1,000 on a nice 3 to 6 megapixel digital SLR camera, Google for: slr digital camera 3..6 megapixel $800..1000.

* Page size in Google results is never going to be more than 101 KB. That’s because Google doesn’t index more than 101 KB worth of a given web page.

* Google’s define-operator allows you to look up word definitions. For example, [define:css] yields “Short for Cascading Style Sheets” and many more explanations. You can trigger a somewhat “softer” version of the define-operator by entering “what is something”, e.g. [what is css].

* Google searches for all of your words, whether or not you write a “+” before them (I often see people write queries [+like +this], but it’s not necessary). Unless, of course, you use Google’s or-operator. It’s an upper-case [OR] (lower-case won’t work and is simply searching for occurrences of the word “or”), and you can also use parentheses and the “|” character.

Technorati tags: Google

« Previous PageNext Page »


View Lijin Joseji's profile on LinkedIn

Disclaimer

The information on this site is for informational purposes only. The use of any Trademark or Copyrighted material is not intended to infringe Copyright. This blog is intended to be used under a policy of personal and non commercial use.

Adds

Add to Google

Blog Stats

  • 107,964 hits

Categories