Kevin Price of the Price of Business show discusses the topic with Thede on a recent interview.
Sandwiched between Juneteenth and Father’s Day this year is a little-known holiday tailor-made for enterprise search: World Productivity Day. Just in time for World Productivity Day is a new enterprise search version.
What’s in the new version?
dtSearch® provides instant concurrent searching across terabytes of mixed online and offline data. One of the most important formats dtSearch supports is PDF. dtSearch previously required a separate “plug in” to highlight hits in PDFs. The new version highlights hits in PDF files directly through annotations, letting dtSearch highlight hits in PDFs without the need for any external software.
Are there other benefits to the new PDF hit-highlighting?
Multicolor hit highlighting now works with PDFs just like other formats: local and remote Microsoft Office files, emails with attachments, compressed files, website data, etc. Take an “any words” search for world or productivity or day. dtSearch can display PDFs and other retrieved files with world, productivity and day all highlighted in yellow. Or dtSearch can highlight world in yellow, productivity in light green, and day in light blue.
So how does dtSearch instantly search terabytes?
dtSearch lets multiple concurrent users instantly search terabytes only after first indexing the data. Indexing stores each unique word and number across the data and the location of each in the data. While a lot of work for the indexer, just point to the folders, email archives and the like to cover and the software will take it from there. So long as the files and emails are visible through the Windows folder system, the files themselves can be local or remote like OneDrive, SharePoint or DropBox.
The indexer can automatically handle issues like a PDF with a PowerPoint extension or a OneNote file with an Access database extension. The indexer can also automatically work with multilevel nested data like an email with a ZIP or RAR attachment with a Word document itself embedding an Excel spreadsheet. For speed, a 64-bit multithreaded indexer offers many times faster indexing.
What about index capacity?
A single dtSearch index can hold up to a terabyte of online and offline data. There are no limits on the number of terabyte indexes that dtSearch can build and instantly concurrently search. While indexing is resource-intensive, searching is resource-light, with no built-in limits on the number of concurrent search threads. Instant multithreaded searching can even continue while indexes automatically update to reflect new content.
And search options?
dtSearch has over 25 search options. Look for World Productivity Day as an exact phrase. Or do an “all words” search for files and emails including all 3 of these words but not necessarily as an exact phrase. Or do an “any words” search finding files with even just one mention of any of these 3 words. Or apply Boolean (and/or/not) searching to find the phrase productivity day in a file or email that has world but not New Year’s Day.
Or do a proximity search for the phrase productivity day within 38 words of world. Or look for productivity day within just 7 words following world. Concept searching extends a query for world to globe or Earth. Fuzzy searching adjusts from 0 to 10 to sift through typographical and spelling errors like productiNity for productivity.
And metadata?
Indexed searching covers all metadata—even obscure metadata that’s very hard to spot viewing a file in its native application. By default, searches cover full-text content plus metadata, but searches can also require that certain terms appear in specific metadata. In addition to word-based searching, full-text and metadata searches can also encompass a number or numeric range or a date or date range. A search for date(March 14, 2026 to August 3, 2026) would pick up June 20, 2026 as well as 6/20/26.
Advanced search options include credit card number identification and hash value generation and search. Notably, for WORLD Productivity Day, dtSearch supports Unicode covering hundreds of international languages. A single file or email can go from English, to a different European language, to a right-to-left language like Arabic or Hebrew, to double-byte Chinese, Japanese or Korean text, and Unicode and dtSearch will track all of that.
How does dtSearch sort search results?
Relevancy ranking prioritizes rarer and denser search terms across indexed data. Take an “any words” search for world or productivity or day. If world and day are common but productivity mentions rare, then productivity hits would get a higher relevance score, with the densest productivity-mentioning files coming out on top. Alternatively, dtSearch supports custom variable term weighting, like giving world a positive weight of 7, productivity a positive weight of 6 and summer a negative weight of 4.
Weightings can apply across all text, or selectively to hits in specific metadata or near the top or bottom of a file. dtSearch can also instantly re-sort by a totally different criterion like file data, file name or file size. In all cases, dtSearch can now display retrieved files with multicolor highlighted hits across PDFs as well as other content.
What about dtSearch’s developer products?
Beyond dtSearch’s “off the shelf” enterprise software, the new PDF hit-highlighting extends to the dtSearch Engine for developers. Running “on premises” or in the cloud such as on Azure or AWS, the dtSearch Engine comes in x64 Intel and ARM64 Windows, Linux and macOS builds. Along with files and emails, developer APIs also support backend databases like SharePoint, SQL and NoSQL including BLOB data. APIs enable faceted search and granular data classification using any number of full-text and metadata parameters.
Final thoughts?
Don’t let World Productivity Day pass you by. Celebrate with a fully-functional 30-day evaluation download from dtSearch.com
About dtSearch®. dtSearch has enterprise and developer products that run “on premises” or on cloud platforms to instantly search terabytes of “Office” files, PDFs, emails along with nested attachments, databases and online data. Because dtSearch can instantly search terabytes with over 25 different concurrent search options, many dtSearch customers are Fortune 100 companies and government agencies. But anyone with lots of data to search can download a fully-functional 30-day evaluation from dtSearch.com
Connect with Elizabeth Thede on social media:
LinkedIn: https://www.linkedin.com/in/elizabeth-thede-4a5a042/