Site Search Glossary
The world of site search has its own unique vocabulary, and as a marketing professional you need to be in the know! This search glossary compiles the essential terminology, definitions, and abbreviations you’ll need to have an informed discussion with your IT department when talking about your search platform.
Accessibility: Web accessibility refers to how accessible web pages and content are to people with disabilities. Accessibility is also the practice of inclusion – people who are dependent on using websites need to have access to content regardless of their condition, meaning websites need to be accessible to people with a diverse range of hearing, movement, sight, and cognitive ability.
Actionable Recommendations: the Actionable Recommendations module provides tailored recommendations to help improve your search performance. Based on the current search data configuration and machine learning, these recommendations go beyond traditional analytics by providing a dynamic and individualized approach to search optimization.
Algorithm: An algorithm is a complex mathematical equation that search engines use to match relevant pages within the billions of pages saved, or cached, by the search engine with the search terms employed by Internet users.
Analytics: Search data and information gathered by Cludo and sent to your MyCludo dashboard. Use these insights to further your marketing strategy and efforts.
Artificial Intelligence: See Machine Learning.
API: An application programming interface (API) is a set of subroutine definitions, protocols, and tools for building application software. In general terms, it is a set of clearly defined methods of communication between various software components.
Assistant: The Cludo Assistant is an actionable search overlay that will appear on sites of your choosing.
Asynchronous Content: Asynchronous content is data that is inessential for the browser to initially render a web page. It can be loaded into the web page whenever it is available. For example, consider a web page that displays a user profile. If the user information (i.e. name, profile image, etc.) exists on an external database and is not immediately available to the browser, the browser can send a request for this information to the database and, in the meantime, render the web page that will contain the user information without needing to wait for the database’s response. Once the response is available, the information can be loaded into the already rendered page. This increases the speed at which the page and content is perceived by the end user to be delivered.
Autocomplete: Search query predictions displayed while a user is searching. Also known as word completion or predictive text.
Banners: Visually engaging calls-to-action that can be displayed at the top of the search results page for specific queries.
Bigram matching: Bigrams are word combinations in sentences where each two adjacent words create a bigram. Bigram matching is a language analysis tool which advanced search engines use to find results for multiple word queries that are similar to but not exactly the same as the text in the searchable index.
Boosting: Promoting certain areas of your website while deprioritizing others. Customize relevancy to boost search results that provide more value or decrease the value of less important pages or files.
Canonical tags (canonicalization): A method of telling search engines that a specific URL represents the master copy of a page. Using the canonical tag prevents problems caused by identical or "duplicate" content appearing on multiple URLs. Canonical tags tell search engines which version of a URL you want to appear in search results. (Moz)
Category: A standard field in a crawler that can be set up to identify a specific type of content when crawling.
CMS: A content management system is a computer application that supports the creation and modification of digital content. It typically supports multiple users in a collaborative environment.
CMS agnostic: Software is CMS agnostic if it is developed to integrate with many types of Content Management Systems rather than just one particular CMS.
CSS: Cascading Style Sheets is a style sheet language used for describing the presentation of a document written in a markup language.
Crawler: A crawler (otherwise known as an indexer or spider) is a program that visits websites and reads their pages and other information in order to create entries for a search engine index.
CTR (Clickthrough rate): A ratio within your search analytics that shows how often a visitor clicked on a certain search result or banner.
Dynamic Re-ranking: AI feature that takes advantage of your data to automatically improve search results over time.
Engine: A web search engine is a software system that is designed to search for information.
Enterprise search: Enterprise search is the practice of making content from multiple enterprise-type sources, such as databases and intranets, searchable to a defined audience.
Error 404: An error message displayed by a browser indicating that a resource at an Internet address cannot be found.
Faceted search: A technique that lets users narrow search results through multiple facets.
Facets: Categories within search results.
Federated search: Federated search is an information retrieval technology that allows the simultaneous search of multiple searchable resources.
Filters: Characteristics a visitor can use to refine search results, e.g. by size, color, price, location, etc.
Google Analytics: The analytics service provided by Google to track website traffic and performance. The Cludo integration allows search data to seamlessly be sent to Google Analytics in real-time for further analysis and comparison to other website activity.
Google Custom Search: Google Custom Search (GCS) is an ads-supported product that provides similar capabilities to Google Site Search, including the ability to build custom search engines for sites or pages, or image search for your website. With a few exceptions, the Google Custom Search contains advertising within the search results, as well as Google branding. GCS is the replacement for Google Site Search that was sunset on April 1, 2018.
Google Search Appliance: The Google Search Appliance (GSA) was a rack-mounted device that provided document indexing functionality. GSA runs as a server inside or outside your network. It indexes critical content by “crawling” your company resources (or public web servers) for data so users can locate it using the same search methods offered on the Google Search site. This may include websites, file shares and databases. Google Search Appliance has been discontinued by Google, and ceased to operate in 2019.
Google Tag Manager (GTM): The tag management system powered by Google that allows you to add tags (measurement codes and code fragments) on your website or app. When paired with Cludo's GTM integration, these tags can be used to add personalization to your search results and product recommendations.
H1 Tag: The header tag, or the tag in HTML, will usually be the title of a post, or other emphasized text on the page. It will usually be the largest text that stands out. In most cases, there should only be one h1 tag per web page.
HTML injection attack: HTML injection is a type of injection issue that occurs when a user is able to control an input point and is able to inject arbitrary HTML code into a vulnerable web page.
HTTP header: HTTP header fields are components of the header section of request and response messages in the Hypertext Transfer Protocol (HTTP). They define the operating parameters of an HTTP transaction.
Index: Search engine indexing collects, parses, and stores content, videos, text, images etc. to facilitate fast and accurate information retrieval.
Inline search results: Search results that are opened onto a new web page, separate from the page where the search was conducted.
Input: The location (text field) in which a search query takes place.
Instant suggestions: Display popular or relevant search terms to the user before a search has been completed. With this feature, a list of your chosen search terms is shown to the user when they focus on a Cludo search input. This helps you direct the customers to the content you would like them to see.
Intelligent 404: Cludo’s proprietary module ensures no website visitor is left at a dead-end when hitting a 404 on your website by providing relevant suggested pages.
Intents: Also known as intentions, are common topics and objectives users are looking to find on your website through search.
Intranet: A private network for communication, collaboration and sharing information within an organization.
Keyword match: Keyword match types help control which searches on Google can trigger your ad. So, you could use broad match to show your ad to a wide audience or you could use exact match to hone in on specific groups of customers.
Latent Semantic Indexing (LSI): Latent Semantic Indexing is when a search engine indexes commonly associated groups of words in a document. SEOs refer to these same groups of words as “Long Tail.”
Machine learning: Machine learning is a subset of artificial intelligence that often applies statistical techniques to data, to give computers the ability to “learn” to progressively improve performance on a specific task without being explicitly programmed.
Meta tags: Meta tags are snippets of text that describe a page’s content. They don’t appear in the page itself, but rather in the page’s code.
Misspellings: This tool allows you to add common misspellings to your site’s search function. When you create a misspelling, you connect it to a correctly spelled word. When a user performs a search with the misspelled word, they will receive the results for the correctly spelled word that correspond to that misspelling.
Multi-factor authentication: Cludo MFA is a security mechanism used to enhance the authentication process for accessing the MyCludo platform. MFA adds an extra layer of security beyond using just a username and password to verify the user’s identity.
MyCludo: The dashboard in which you'll find your search data and configurations.
NoIndex: The noindex directive is an often used value in a meta tag that can be added to the HTML source code of a webpage to suggest to search engines to not include that particular page in its list of search results.
NoFollow: The nofollow directive is an often used value in a meta tag that can be added to the HTML source code of a webpage to suggest to search engines not to pass link equity through any links on a given webpage.
One Box: In order to provide answers to specific questions that are as accurate as possible, Google has repeatedly experimented with the display of search results. With One Box, Google has found a way to integrate the results from its vertical search engines in the organic SERPS. The increasingly integrated One Box has drastically reduced the space for organic search results for certain queries.
Open source: Software for which the original source code is made freely available and may be redistributed and modified.
Overlay search results: Search results that are displayed on top of the page where the search was conducted.
Page rankings: Create a list of web pages, rank them, and then assign those pages to a group that contains one or more search words. When those words are searched, the top search results displayed will be the web pages from your list, and they will appear in the order in which you ranked them.
Penetration test: A penetration test, colloquially known as a pen test, is an authorized simulated attack on a computer system, performed to evaluate the security of the system.
Personalization: Personalized search refers to search results that are tailored specifically to an individual’s interests by incorporating information about the individual (previous search behavior, geolocation, demographic, language, etc.) beyond the specific query provided.
Query: A phrase or keyword combination visitors enter within your search function to find things of interest.
Quicklinks: Assign URLs to specific search queries, guiding and skipping the search results page altogether.
Responsive design: An approach to webpage creation that makes use of flexible layouts, flexible images and cascading style sheet media queries. The goal of responsive design is to build webpages that detect the visitor’s screen size and orientation and change the layout accordingly.
Relevance: In the search world, relevance refers to how relevant the search results are to a given search. When determining search result relevance, search engines consider hundreds of factors, including semantic signals, linguistic signals, prominence factors, length and publish date.
Robots.TXT: The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots. The standard specifies how to inform the web robot about which areas of the website should not be processed or scanned. Robots are often used by search engines to categorize websites.
SAML: Security Assertion Markup Language is an open standard for exchanging authentication and authorization data between parties, in particular, between an identity provider and a service provider.
Scoped search: Scoped search allows you to limit search results to a specific section or type of content within the website instead of searching across the whole index.
Search Engine Optimization (SEO): The process of improving the quality and quantity of traffic to a website or a web page from external search engines like Google or Bing.
Section 504: Requires agencies to provide appropriate auxiliary aids in the form of braille & large print forms of materials.
Section 508: Requires federal agencies to ensure that persons with disabilities (both employees and members of the public) have comparable access to and use of electronic information technology.
Search Engine Results Page (SERP): A search engine results page is the page displayed by a web search engine in response to a query by a searcher. The main component of the SERP is the listing of results that are returned by the search engine in response to a keyword query.
Single sign-on: Single Sign-On is a property of an access control system that allows access to multiple independent software systems with only one form of authorization.
Sitemap: A page or structured group of pages which link to user accessible pages on a website. It is intended to improve site usability by clarifying the data structure of the site for the user. Site maps can also be used by web crawlers to efficiently and thoroughly explore a website’s content for indexing.
Social Search Optimization: Social search optimization is just like any search engine optimization strategy in that the desired result is to increase your visibility. Search engines are now placing more importance on the content of these social sites in ranking webpages in their SERP’s.
Static HTML: Static webpages contain the same prebuilt content each time the page is loaded, while the content of dynamic webpages can be generated on-the-fly.
Synonyms: Synonyms are words or phrases having the same or nearly the same meaning as another in the language, such as the words “happy”, “joyful”, and “elated”. Advanced search engines understand synonyms and use this understanding to provide relevant search results without the need for exact key matching.
Voice search: Search via speech-to-text.
Web Content Accessibility Guidelines (WCAG): Standard guidelines on how to create web content that is accessible for people with disabilities.
Web content highlighter: A feature that highlights the search query within the specific landing page the visitor has selected from the search results, making it easier for them to find exactly what they're searching for.
White hat SEO: The usage of optimization strategies, techniques and tactics that focus on a human audience opposed to search engines and completely follows search engine rules and policies. For example, a website that is optimized for search engines, yet focuses on relevancy and organic ranking is considered to be optimized using White Hat SEO practices.
Widgets: A control element in a graphical user interface is an element of interaction, such as a button or a scroll bar.
WordPress: WordPress is a free and open-source content management system based on PHP and MySQL. To function, WordPress has to be installed on a web server, which would either be part of an Internet hosting service or a network host in its own right.
XML: Extensible Markup Language is a simple, very flexible text format derived from SGML, used to make it easy to syndicate or format information using technologies such as RSS.
XML Site Map: A page or structured group of pages which link to user accessible pages on a website. It is intended to improve site usability by clarifying the data structure of the site for the user.
XPath: XPath is a query language for selecting nodes from an XML document. In addition, Xpath may be used to compute values from the content of an XML document.